OLD | NEW |
(Empty) | |
| 1 Version 7.1-0 |
| 2 2011-01-06 |
| 3 |
| 4 README file for en_US and en_CA Hunspell dictionaries |
| 5 |
| 6 These dictionaries are created using the speller/make-hunspell-dict |
| 7 script in SCOWL, version 7.1 released on January 6, 2011. |
| 8 |
| 9 The NOSUGGEST flag was added to certain taboo words. While I made an |
| 10 honest attempt to flag the strongest taboo words with the NOSUGGEST |
| 11 flag, I MAKE NO GUARANTEE THAT I FLAGGED EVERY POSSIBLE TABOO WORD. |
| 12 The list was originally derived from Németh László, however I removed |
| 13 some words which, while being considered taboo by some dictionaries, |
| 14 are not really considered swear words in today's society. |
| 15 |
| 16 You can find SCOWL and friend at http://wordlist.sourceforge.net/. |
| 17 Bug reports should go to the Issue Tracker found on the previously |
| 18 mentioned web site. General discussion should go to the |
| 19 wordlist-devel at sourceforge net mailing list. |
| 20 |
| 21 COPYRIGHT, SOURCES, and CREDITS: |
| 22 |
| 23 The en_US and en_CA dictionaries come directly from SCOWL (up to level |
| 24 60) and is thus under the same copyright of SCOWL. The affix file is |
| 25 a heavily modified version of the original english.aff file which was |
| 26 released as part of Geoff Kuenning's Ispell and as such is covered by |
| 27 his BSD license. Part of SCOWL is also based on Ispell thus the |
| 28 Ispell copyright is included with the SCOWL copyright. |
| 29 |
| 30 The collective work is Copyright 2000-2011 by Kevin Atkinson as well |
| 31 as any of the copyrights mentioned below: |
| 32 |
| 33 Copyright 2000-2011 by Kevin Atkinson |
| 34 |
| 35 Permission to use, copy, modify, distribute and sell these word |
| 36 lists, the associated scripts, the output created from the scripts, |
| 37 and its documentation for any purpose is hereby granted without fee, |
| 38 provided that the above copyright notice appears in all copies and |
| 39 that both that copyright notice and this permission notice appear in |
| 40 supporting documentation. Kevin Atkinson makes no representations |
| 41 about the suitability of this array for any purpose. It is provided |
| 42 "as is" without express or implied warranty. |
| 43 |
| 44 Alan Beale <biljir@pobox.com> also deserves special credit as he has, |
| 45 in addition to providing the 12Dicts package and being a major |
| 46 contributor to the ENABLE word list, given me an incredible amount of |
| 47 feedback and created a number of special lists (those found in the |
| 48 Supplement) in order to help improve the overall quality of SCOWL. |
| 49 |
| 50 The 10 level includes the 1000 most common English words (according to |
| 51 the Moby (TM) Words II [MWords] package), a subset of the 1000 most |
| 52 common words on the Internet (again, according to Moby Words II), and |
| 53 frequently class 16 from Brian Kelk's "UK English Wordlist |
| 54 with Frequency Classification". |
| 55 |
| 56 The MWords package was explicitly placed in the public domain: |
| 57 |
| 58 The Moby lexicon project is complete and has |
| 59 been place into the public domain. Use, sell, |
| 60 rework, excerpt and use in any way on any platform. |
| 61 |
| 62 Placing this material on internal or public servers is |
| 63 also encouraged. The compiler is not aware of any |
| 64 export restrictions so freely distribute world-wide. |
| 65 |
| 66 You can verify the public domain status by contacting |
| 67 |
| 68 Grady Ward |
| 69 3449 Martha Ct. |
| 70 Arcata, CA 95521-4884 |
| 71 |
| 72 grady@netcom.com |
| 73 grady@northcoast.com |
| 74 |
| 75 The "UK English Wordlist With Frequency Classification" is also in the |
| 76 Public Domain: |
| 77 |
| 78 Date: Sat, 08 Jul 2000 20:27:21 +0100 |
| 79 From: Brian Kelk <Brian.Kelk@cl.cam.ac.uk> |
| 80 |
| 81 > I was wondering what the copyright status of your "UK English |
| 82 > Wordlist With Frequency Classification" word list as it seems to |
| 83 > be lacking any copyright notice. |
| 84 |
| 85 There were many many sources in total, but any text marked |
| 86 "copyright" was avoided. Locally-written documentation was one |
| 87 source. An earlier version of the list resided in a filespace called |
| 88 PUBLIC on the University mainframe, because it was considered public |
| 89 domain. |
| 90 |
| 91 Date: Tue, 11 Jul 2000 19:31:34 +0100 |
| 92 |
| 93 > So are you saying your word list is also in the public domain? |
| 94 |
| 95 That is the intention. |
| 96 |
| 97 The 20 level includes frequency classes 7-15 from Brian's word list. |
| 98 |
| 99 The 35 level includes frequency classes 2-6 and words appearing in at |
| 100 least 11 of 12 dictionaries as indicated in the 12Dicts package. All |
| 101 words from the 12Dicts package have had likely inflections added via |
| 102 my inflection database. |
| 103 |
| 104 The 12Dicts package and Supplement is in the Public Domain. |
| 105 |
| 106 The WordNet database, which was used in the creation of the |
| 107 Inflections database, is under the following copyright: |
| 108 |
| 109 This software and database is being provided to you, the LICENSEE, |
| 110 by Princeton University under the following license. By obtaining, |
| 111 using and/or copying this software and database, you agree that you |
| 112 have read, understood, and will comply with these terms and |
| 113 conditions.: |
| 114 |
| 115 Permission to use, copy, modify and distribute this software and |
| 116 database and its documentation for any purpose and without fee or |
| 117 royalty is hereby granted, provided that you agree to comply with |
| 118 the following copyright notice and statements, including the |
| 119 disclaimer, and that the same appear on ALL copies of the software, |
| 120 database and documentation, including modifications that you make |
| 121 for internal use or for distribution. |
| 122 |
| 123 WordNet 1.6 Copyright 1997 by Princeton University. All rights |
| 124 reserved. |
| 125 |
| 126 THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON |
| 127 UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR |
| 128 IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON |
| 129 UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANT- |
| 130 ABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE |
| 131 LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY |
| 132 THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS. |
| 133 |
| 134 The name of Princeton University or Princeton may not be used in |
| 135 advertising or publicity pertaining to distribution of the software |
| 136 and/or database. Title to copyright in this software, database and |
| 137 any associated documentation shall at all times remain with |
| 138 Princeton University and LICENSEE agrees to preserve same. |
| 139 |
| 140 The 40 level includes words from Alan's 3esl list found in version 4.0 |
| 141 of his 12dicts package. Like his other stuff the 3esl list is also in the |
| 142 public domain. |
| 143 |
| 144 The 50 level includes Brian's frequency class 1, words words appearing |
| 145 in at least 5 of 12 of the dictionaries as indicated in the 12Dicts |
| 146 package, and uppercase words in at least 4 of the previous 12 |
| 147 dictionaries. A decent number of proper names is also included: The |
| 148 top 1000 male, female, and Last names from the 1990 Census report; a |
| 149 list of names sent to me by Alan Beale; and a few names that I added |
| 150 myself. Finally a small list of abbreviations not commonly found in |
| 151 other word lists is included. |
| 152 |
| 153 The name files form the Census report is a government document which I |
| 154 don't think can be copyrighted. |
| 155 |
| 156 The file special-jargon.50 uses common.lst and word.lst from the |
| 157 "Unofficial Jargon File Word Lists" which is derived from "The Jargon |
| 158 File". All of which is in the Public Domain. This file also contain |
| 159 a few extra UNIX terms which are found in the file "unix-terms" in the |
| 160 special/ directory. |
| 161 |
| 162 The 55 level includes words from Alan's 2of4brif list found in version |
| 163 4.0 of his 12dicts package. Like his other stuff the 2of4brif is also |
| 164 in the public domain. |
| 165 |
| 166 The 60 level includes all words appearing in at least 2 of the 12 |
| 167 dictionaries as indicated by the 12Dicts package. |
| 168 |
| 169 The 70 level includes Brian's frequency class 0 and the 74,550 common |
| 170 dictionary words from the MWords package. The common dictionary words, |
| 171 like those from the 12Dicts package, have had all likely inflections |
| 172 added. The 70 level also included the 5desk list from version 4.0 of |
| 173 the 12Dics package which is the public domain. |
| 174 |
| 175 The 80 level includes the ENABLE word list, all the lists in the |
| 176 ENABLE supplement package (except for ABLE), the "UK Advanced Cryptics |
| 177 Dictionary" (UKACD), the list of signature words in from YAWL package, |
| 178 and the 10,196 places list from the MWords package. |
| 179 |
| 180 The ENABLE package, mainted by M\Cooper <thegrendel@theriver.com>, |
| 181 is in the Public Domain: |
| 182 |
| 183 The ENABLE master word list, WORD.LST, is herewith formally released |
| 184 into the Public Domain. Anyone is free to use it or distribute it in |
| 185 any manner they see fit. No fee or registration is required for its |
| 186 use nor are "contributions" solicited (if you feel you absolutely |
| 187 must contribute something for your own peace of mind, the authors of |
| 188 the ENABLE list ask that you make a donation on their behalf to your |
| 189 favorite charity). This word list is our gift to the Scrabble |
| 190 community, as an alternate to "official" word lists. Game designers |
| 191 may feel free to incorporate the WORD.LST into their games. Please |
| 192 mention the source and credit us as originators of the list. Note |
| 193 that if you, as a game designer, use the WORD.LST in your product, |
| 194 you may still copyright and protect your product, but you may *not* |
| 195 legally copyright or in any way restrict redistribution of the |
| 196 WORD.LST portion of your product. This *may* under law restrict your |
| 197 rights to restrict your users' rights, but that is only fair. |
| 198 |
| 199 UKACD, by J Ross Beresford <ross@bryson.demon.co.uk>, is under the |
| 200 following copyright: |
| 201 |
| 202 Copyright (c) J Ross Beresford 1993-1999. All Rights Reserved. |
| 203 |
| 204 The following restriction is placed on the use of this publication: |
| 205 if The UK Advanced Cryptics Dictionary is used in a software package |
| 206 or redistributed in any form, the copyright notice must be |
| 207 prominently displayed and the text of this document must be included |
| 208 verbatim. |
| 209 |
| 210 There are no other restrictions: I would like to see the list |
| 211 distributed as widely as possible. |
| 212 |
| 213 The 95 level includes the 354,984 single words, 256,772 compound |
| 214 words, 4,946 female names and the 3,897 male names, and 21,986 names |
| 215 from the MWords package, ABLE.LST from the ENABLE Supplement, and some |
| 216 additional words found in my part-of-speech database that were not |
| 217 found anywhere else. |
| 218 |
| 219 Accent information was taken from UKACD. |
| 220 |
| 221 My VARCON package was used to create the American, British, and |
| 222 Canadian word list. |
| 223 |
| 224 Since the original word lists used used in the VARCON package came |
| 225 from the Ispell distribution they are under the Ispell copyright: |
| 226 |
| 227 Copyright 1993, Geoff Kuenning, Granada Hills, CA |
| 228 All rights reserved. |
| 229 |
| 230 Redistribution and use in source and binary forms, with or without |
| 231 modification, are permitted provided that the following conditions |
| 232 are met: |
| 233 |
| 234 1. Redistributions of source code must retain the above copyright |
| 235 notice, this list of conditions and the following disclaimer. |
| 236 2. Redistributions in binary form must reproduce the above copyright |
| 237 notice, this list of conditions and the following disclaimer in the |
| 238 documentation and/or other materials provided with the distribution. |
| 239 3. All modifications to the source code must be clearly marked as |
| 240 such. Binary redistributions based on modified source code |
| 241 must be clearly marked as modified versions in the documentation |
| 242 and/or other materials provided with the distribution. |
| 243 (clause 4 removed with permission from Geoff Kuenning) |
| 244 5. The name of Geoff Kuenning may not be used to endorse or promote |
| 245 products derived from this software without specific prior |
| 246 written permission. |
| 247 |
| 248 THIS SOFTWARE IS PROVIDED BY GEOFF KUENNING AND CONTRIBUTORS ``AS |
| 249 IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
| 250 LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS |
| 251 FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL GEOFF |
| 252 KUENNING OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, |
| 253 INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, |
| 254 BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; |
| 255 LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER |
| 256 CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT |
| 257 LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN |
| 258 ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE |
| 259 POSSIBILITY OF SUCH DAMAGE. |
| 260 |
| 261 Build Date: Thu Jan 6 02:31:29 MST 2011 |
OLD | NEW |