* landed code for viewing topics in a conference, and for adding a topic

(first workout of HTML Checker code)
* modified the dictionary implementation to use a trie system rather than
  a set of HashSets, and also started using a new, much smaller dictionary
* general bugfixes and cleanup on other items as needed
This commit is contained in:
Eric J. Bowersox
2001-02-06 04:50:04 +00:00
parent 650691c8d2
commit 8bcc80ddd7
39 changed files with 47278 additions and 264907 deletions

View File

@@ -1,28 +1,50 @@
The file "webster2.dict" was produced from the word lists supplied as "web2"
and "web2a.gz" in the /usr/share/dict directory of the Debian "miscfiles"
package (GNU miscfiles version 1.2). The following commands were used to
produce this file:
The file "en-us.dict" is taken from the "american-english" word list supplied
in the Debian "wenglish" package. A small amount of processing was used
to produce this file:
zcat /usr/share/dict/web2a.gz | fgrep -v ' ' > temp.dict
cat /usr/share/dict/web2 temp.dict | tr '[A-Z]' [a-z]' | sort \
| uniq > webster2.dict
rm temp.dict
cat /usr/share/dict/american-english | tr '[A-Z]' '[a-z]' | sort \
| uniq > en-us.dict
"webster2.dict" is to be considered "freely redistributable," and is not
"en-us.dict" is to be considered "freely redistributable," and is not
subject to the MPL as with the rest of Venice code. Herewith is the original
README for the dictionary:
copyright for the dictionary:
--- begin README ---
# $NetBSD: README,v 1.2 1997/03/26 07:14:32 mikel Exp $
# @(#)README 8.1 (Berkeley) 6/5/93
This package is now maintained by Charles Briscoe-Smith <cpbs@debian.org>.
WEB ---- (introduction provided by jaw@riacs) -------------------------
I, Erick Branderhorst <branderhorst@heel.fgg.eur.nl>, took over the
wenglish package but I don't know where the list of words is from. If
it is uptodate or copyrighted or whatever.
Welcome to web2 (Webster's Second International) all 234,936 words worth.
The 1934 copyright has elapsed, according to the supplier. The
supplemental 'web2a' list contains hyphenated terms as well as assorted
noun and adverbial phrases. The wordlist makes a dandy 'grep' victim.
This package was debianized by Herbert Xu herbert@debian.org on
Sat, 8 Feb 1997 22:16:50 +1100.
-- James A. Woods {ihnp4,hplabs}!ames!jaw (or jaw@riacs)
--- end README ---
It was downloaded from http://sunsite.unc.edu/pub/Linux/libs/.
Copyright:
Begin2
Title = /usr/dict/words for Linux (linux.words)
Version = 2
Desc1 = This is word list containing 45402 words. Great care has been
Desc2 = taken to be sure that this word list is free of copyright.
Desc3 = This list is suitable for English language spelling checkers
Desc4 = and as a target for look(1).
Author = Rik Faith
AuthorEmail = faith@cs.unc.edu
Maintainer = Rik Faith
MaintEmail = faith@cs.unc.edu
Site1 = ftp.cs.unc.edu
Path1 = /pub/faith/linux/utils
File1 = linux.words.2.tar.gz
FileSize1 = 140k
Site2 = tsx-11.mit.edu
Path2 = /pub/linux/docs
Site3 = sunsite.unc.edu
Path3 = /pub/Linux/libs
CopyPolicy1 = Free of non-commercial restrictions.
CopyPolicy2 = Free for personal, educational, and research purposes.
Keywords = dict, dictionary, words, wordlist
Entered = Sun Oct 10 19:02:47 1993
EnteredBy = Rik Faith
CheckedEmail = faith@cs.unc.edu
End

45375
etc/en-us.dict Normal file

File diff suppressed because it is too large Load Diff

View File

@@ -65,10 +65,10 @@
</email>
<!-- This section dictates which dictionary files get loaded into the spelling checker's
main dictionary. The default lexicon is Webster's 2nd, with a supplemental list of
main dictionary. The default lexicon is a standard US English one, with a supplemental list of
words provided by Erbo. -->
<dictionary>
<file>/home/erbo/venice/WEB-INF/webster2.dict</file>
<file>/home/erbo/venice/WEB-INF/en-us.dict</file>
<file>/home/erbo/venice/WEB-INF/erbo.dict</file>
</dictionary>

File diff suppressed because it is too large Load Diff