| OLD | NEW |
| 1 en_CA Hunspell Dictionary | 1 en_CA Hunspell Dictionary |
| 2 Thu Feb 26 16:06:27 2015 -0500 [9c3f26a] | 2 Version 2016.01.19 |
| 3 Tue Jan 19 17:07:49 2016 -0500 [a535654] |
| 3 http://wordlist.sourceforge.net | 4 http://wordlist.sourceforge.net |
| 4 | 5 |
| 5 README file for English Hunspell dictionaries derived from SCOWL. | 6 README file for English Hunspell dictionaries derived from SCOWL. |
| 6 | 7 |
| 7 These dictionaries are created using the speller/make-hunspell-dict | 8 These dictionaries are created using the speller/make-hunspell-dict |
| 8 script in SCOWL. | 9 script in SCOWL. |
| 9 | 10 |
| 10 The following dictionaries are available: | 11 The following dictionaries are available: |
| 11 | 12 |
| 12 en_US (American) | 13 en_US (American) |
| 13 en_CA (Canadian) | 14 en_CA (Canadian) |
| 14 en_GB-ise (British with "ise" spelling) | 15 en_GB-ise (British with "ise" spelling) |
| 15 en_GB-ize (British with "ize" spelling) | 16 en_GB-ize (British with "ize" spelling) |
| 16 | 17 |
| 17 en_US-large | 18 en_US-large |
| 18 en_CA-large | 19 en_CA-large |
| 19 en_GB-large (with both "ise" and "ize" spelling) | 20 en_GB-large (with both "ise" and "ize" spelling) |
| 20 | 21 |
| 21 The normal (non-large) dictionaries correspond to SCOWL size 60 and, | 22 The normal (non-large) dictionaries correspond to SCOWL size 60 and, |
| 22 to encourage consistent spelling, generally only include one spelling | 23 to encourage consistent spelling, generally only include one spelling |
| 23 variant for a word. The large dictionaries correspond to SCOWL size | 24 variant for a word. The large dictionaries correspond to SCOWL size |
| 24 70 and may include multiple spelling for a word when both variants are | 25 70 and may include multiple spelling for a word when both variants are |
| 25 considered almost equal. Also, the general quality of the larger | 26 considered almost equal. The larger dictionaries however (1) have not |
| 26 dictionaries may also be less as they are not as carefully checked for | 27 been as carefully checked for errors as the normal dictionaries and |
| 27 errors as the normal dictionaries. | 28 thus may contain misspelled or invalid words; and (2) contain |
| 29 uncommon, yet valid, words that might cause problems as they are |
| 30 likely to be misspellings of more common words (for example, "ort" and |
| 31 "calender"). |
| 28 | 32 |
| 29 To get an idea of the difference in size, here are 25 random words | 33 To get an idea of the difference in size, here are 25 random words |
| 30 only found in the large dictionary for American English: | 34 only found in the large dictionary for American English: |
| 31 | 35 |
| 32 Bermejo Freyr's Guenevere Hatshepsut Nottinghamshire arrestment | 36 Bermejo Freyr's Guenevere Hatshepsut Nottinghamshire arrestment |
| 33 crassitudes crural dogwatches errorless fetial flaxseeds godroon | 37 crassitudes crural dogwatches errorless fetial flaxseeds godroon |
| 34 incretion jalapeño's kelpie kishkes neuroglias pietisms pullulation | 38 incretion jalapeño's kelpie kishkes neuroglias pietisms pullulation |
| 35 stemwinder stenoses syce thalassic zees | 39 stemwinder stenoses syce thalassic zees |
| 36 | 40 |
| 37 The en_US and en_CA are the official dictionaries for Hunspell. The | 41 The en_US and en_CA are the official dictionaries for Hunspell. The |
| 38 en_GB and large dictionaries are made available on an experimental | 42 en_GB and large dictionaries are made available on an experimental |
| 39 basis. If you find them useful please send me a quick email at | 43 basis. If you find them useful please send me a quick email at |
| 40 kevina@gnu.org. | 44 kevina@gnu.org. |
| 41 | 45 |
| 42 If none of these dictionaries suite you (for example, maybe you want | 46 If none of these dictionaries suite you (for example, maybe you want |
| 43 the larger dictionary but only use spelling of a word) additional | 47 the normal dictionary that also includes common variants) additional |
| 44 dictionaries can be generated at http://app.aspell.net/create or by | 48 dictionaries can be generated at http://app.aspell.net/create or by |
| 45 modifying speller/make-hunspell-dict in SCOWL. Please do let me know | 49 modifying speller/make-hunspell-dict in SCOWL. Please do let me know |
| 46 if you end up publishing a customized dictionary. | 50 if you end up publishing a customized dictionary. |
| 47 | 51 |
| 48 If a word is not found in the dictionary or a word is there you think | 52 If a word is not found in the dictionary or a word is there you think |
| 49 shouldn't be, you can lookup the word up at http://app.aspell.net/lookup | 53 shouldn't be, you can lookup the word up at http://app.aspell.net/lookup |
| 50 to help determine why that is. | 54 to help determine why that is. |
| 51 | 55 |
| 52 General comments on these list can be sent directly to me at | 56 General comments on these list can be sent directly to me at |
| 53 kevina@gnu.org or to the wordlist-devel mailing lists | 57 kevina@gnu.org or to the wordlist-devel mailing lists |
| 54 (https://lists.sourceforge.net/lists/listinfo/wordlist-devel). If you | 58 (https://lists.sourceforge.net/lists/listinfo/wordlist-devel). If you |
| 55 have specific issues with any of these dictionaries please file a bug | 59 have specific issues with any of these dictionaries please file a bug |
| 56 report at https://github.com/kevina/wordlist/issues. | 60 report at https://github.com/kevina/wordlist/issues. |
| 57 | 61 |
| 62 IMPORTANT CHANGES INTRODUCED IN 2015.04.24: |
| 63 |
| 64 The dictionaries are now in UTF-8 format instead of ISO-8859-1. This |
| 65 was required to handle smart quotes correctly. |
| 66 |
| 67 IMPORTANT CHANGES INTRODUCED IN 2016.01.19: |
| 68 |
| 69 "SET UTF8" was changes to "SET UTF-8" in the affix file as some |
| 70 versions of Hunspell do not recognize "UTF8". |
| 71 |
| 58 ADDITIONAL NOTES: | 72 ADDITIONAL NOTES: |
| 59 | 73 |
| 60 The NOSUGGEST flag was added to certain taboo words. While I made an | 74 The NOSUGGEST flag was added to certain taboo words. While I made an |
| 61 honest attempt to flag the strongest taboo words with the NOSUGGEST | 75 honest attempt to flag the strongest taboo words with the NOSUGGEST |
| 62 flag, I MAKE NO GUARANTEE THAT I FLAGGED EVERY POSSIBLE TABOO WORD. | 76 flag, I MAKE NO GUARANTEE THAT I FLAGGED EVERY POSSIBLE TABOO WORD. |
| 63 The list was originally derived from Németh László, however I removed | 77 The list was originally derived from Németh László, however I removed |
| 64 some words which, while being considered taboo by some dictionaries, | 78 some words which, while being considered taboo by some dictionaries, |
| 65 are not really considered swear words in today's society. | 79 are not really considered swear words in today's society. |
| 66 | 80 |
| 67 COPYRIGHT, SOURCES, and CREDITS: | 81 COPYRIGHT, SOURCES, and CREDITS: |
| (...skipping 229 matching lines...) Expand 10 before | Expand all | Expand 10 after Loading... |
| 297 FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL GEOFF | 311 FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL GEOFF |
| 298 KUENNING OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, | 312 KUENNING OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, |
| 299 INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, | 313 INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, |
| 300 BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; | 314 BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; |
| 301 LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER | 315 LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER |
| 302 CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT | 316 CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT |
| 303 LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN | 317 LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN |
| 304 ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE | 318 ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE |
| 305 POSSIBILITY OF SUCH DAMAGE. | 319 POSSIBILITY OF SUCH DAMAGE. |
| 306 | 320 |
| 307 Build Date: Thu Feb 26 15:58:07 PST 2015 | 321 Build Date: Tue Jan 19 17:11:05 EST 2016 |
| 308 Wordlist Command: mk-list en_CA 60 | deaccent | 322 Wordlist Command: mk-list --accents=strip en_CA 60 |
| OLD | NEW |