[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: a sh wordlist maker



--- Moayyad Al-Sadi <double_bksp at yahoo dot com> wrote:
> ---------
> Letters="a-z"
> cat file | tr A-Z a-z | sed -e 's/[^$Letters]/ /g' | \
>  tr ' ''\12' | sort | uniq
> ----------
> can this be applied to arabic

Doubtful.  We had always wanted to start an Arabic->English Dictionary
(or an arabic.wordlist), but I'm not sure there is a brute force means
to do that and still capture everything there is to capture.  Our best
bet is to somehow build from Duali (the spellchecker) and accumulate
a list that way.

One other means that people have used in the past (for english) is to
create internet crawlers/spiders that would simply surf the internet
and save off new words they come across that they haven't seen before.

We can utilize both methods, but the real question is when to do this
(ie. when should an Arabic.Wordlist be started) and that conversation
should be taken to the 'doc' list :-)

Salam.

 - Nadim


__________________________________
Do you Yahoo!?
Yahoo! Search - Find what you’re looking for faster
http://search.yahoo.com