Index: README_en_US.txt |
diff --git a/README_en_US.txt b/README_en_US.txt |
index 4af3001bb41be96e429c32d4e473f5ccad6014f0..67e55514bac16c9aabfda54d938083a3922a5c3a 100644 |
--- a/README_en_US.txt |
+++ b/README_en_US.txt |
@@ -1,36 +1,83 @@ |
-Version 7.1-0 |
-2011-01-06 |
+en_US Hunspell Dictionary |
+Version 2014.08.11 |
+Mon Aug 11 18:23:56 2014 +0200 [be45e88] |
+http://wordlist.sourceforge.net |
-README file for en_US and en_CA Hunspell dictionaries |
+README file for English Hunspell dictionaries derived from SCOWL. |
These dictionaries are created using the speller/make-hunspell-dict |
-script in SCOWL, version 7.1 released on January 6, 2011. |
+script in SCOWL. |
+ |
+The following dictionaries are available: |
+ |
+ en_US (American) |
+ en_CA (Canadian) |
+ en_GB-ise (British with "ise" spelling) |
+ en_GB-ize (British with "ize" spelling) |
+ |
+ en_US-large |
+ en_CA-large |
+ en_GB-large (with both "ize" and "ise" spelling) |
+ |
+The normal (non-large) dictionaries correspond to SCOWL size 60 and, |
+to encourage consistent spelling, generally only include one spelling |
+variant for a word. The large dictionaries correspond to SCOWL size |
+70 and may include multiple spellings of a word when the variants are |
+considered almost equal. The general quality of the larger |
+dictionaries may also be lower, as they are not as carefully checked for |
+errors as the normal dictionaries. |
+ |
+To get an idea of the difference in size, here are 25 random words |
+only found in the large dictionary for American English: |
+ |
+ Bermejo Freyr's Guenevere Hatshepsut Nottinghamshire arrestment |
+ crassitudes crural dogwatches errorless fetial flaxseeds godroon |
+ incretion jalapeño's kelpie kishkes neuroglias pietisms pullulation |
+ stemwinder stenoses syce thalassic zees |
+ |
+The en_US and en_CA dictionaries are the official ones for Hunspell. The |
+en_GB and large dictionaries are made available on an experimental |
+basis. If you find them useful please send me a quick email at |
+kevina@gnu.org. |
+ |
+If none of these dictionaries suit you (for example, maybe you want |
+the larger dictionary but with only one spelling of a word), additional |
+dictionaries can be generated at http://app.aspell.net/create or by |
+modifying speller/make-hunspell-dict in SCOWL. Please do let me know |
+if you end up publishing a customized dictionary. |
+ |
+If a word is not found in the dictionary or a word is there you think |
+shouldn't be, you can look the word up at http://app.aspell.net/lookup |
+to help determine why that is. |
+ |
+General comments on these lists can be sent directly to me at |
+kevina@gnu.org or to the wordlist-devel mailing list |
+(https://lists.sourceforge.net/lists/listinfo/wordlist-devel). If you |
+have specific issues with any of these dictionaries please file a bug |
+report at https://github.com/kevina/wordlist/issues. |
+ |
+ADDITIONAL NOTES: |
The NOSUGGEST flag was added to certain taboo words. While I made an |
honest attempt to flag the strongest taboo words with the NOSUGGEST |
flag, I MAKE NO GUARANTEE THAT I FLAGGED EVERY POSSIBLE TABOO WORD. |
-The list was originally derived from Németh László, however I removed |
+The list was originally derived from Németh László, however I removed |
some words which, while being considered taboo by some dictionaries, |
are not really considered swear words in today's society. |
-You can find SCOWL and friend at http://wordlist.sourceforge.net/. |
-Bug reports should go to the Issue Tracker found on the previously |
-mentioned web site. General discussion should go to the |
-wordlist-devel at sourceforge net mailing list. |
- |
COPYRIGHT, SOURCES, and CREDITS: |
-The en_US and en_CA dictionaries come directly from SCOWL (up to level |
-60) and is thus under the same copyright of SCOWL. The affix file is |
+The English dictionaries come directly from SCOWL |
+and are thus under the same copyright as SCOWL. The affix file is |
a heavily modified version of the original english.aff file which was |
released as part of Geoff Kuenning's Ispell and as such is covered by |
his BSD license. Part of SCOWL is also based on Ispell thus the |
Ispell copyright is included with the SCOWL copyright. |
-The collective work is Copyright 2000-2011 by Kevin Atkinson as well |
+The collective work is Copyright 2000-2014 by Kevin Atkinson as well |
as any of the copyrights mentioned below: |
- Copyright 2000-2011 by Kevin Atkinson |
+ Copyright 2000-2014 by Kevin Atkinson |
Permission to use, copy, modify, distribute and sell these word |
lists, the associated scripts, the output created from the scripts, |
@@ -141,7 +188,7 @@ The 40 level includes words from Alan's 3esl list found in version 4.0 |
of his 12dicts package. Like his other stuff the 3esl list is also in the |
public domain. |
-The 50 level includes Brian's frequency class 1, words words appearing |
+The 50 level includes Brian's frequency class 1, words appearing |
in at least 5 of 12 of the dictionaries as indicated in the 12Dicts |
package, and uppercase words in at least 4 of the previous 12 |
dictionaries. A decent number of proper names is also included: The |
@@ -170,11 +217,11 @@ The 70 level includes Brian's frequency class 0 and the 74,550 common |
dictionary words from the MWords package. The common dictionary words, |
like those from the 12Dicts package, have had all likely inflections |
added. The 70 level also included the 5desk list from version 4.0 of |
-the 12Dics package which is the public domain. |
+the 12Dicts package which is in the public domain. |
The 80 level includes the ENABLE word list, all the lists in the |
ENABLE supplement package (except for ABLE), the "UK Advanced Cryptics |
-Dictionary" (UKACD), the list of signature words in from YAWL package, |
+Dictionary" (UKACD), the list of signature words from the YAWL package, |
and the 10,196 places list from the MWords package. |
The ENABLE package, mainted by M\Cooper <thegrendel@theriver.com>, |
@@ -221,7 +268,7 @@ Accent information was taken from UKACD. |
My VARCON package was used to create the American, British, and |
Canadian word list. |
-Since the original word lists used used in the VARCON package came |
+Since the original word lists used in the VARCON package came |
from the Ispell distribution they are under the Ispell copyright: |
Copyright 1993, Geoff Kuenning, Granada Hills, CA |
@@ -258,4 +305,5 @@ from the Ispell distribution they are under the Ispell copyright: |
ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE |
POSSIBILITY OF SUCH DAMAGE. |
-Build Date: Thu Jan 6 02:31:28 MST 2011 |
+Build Date: Mon Aug 11 18:27:20 CEST 2014 |
+Wordlist Command: mk-list en_US 60 | deaccent |