Re: questions on the wordlist
- To: Documentation and Translation <doc at arabeyes dot org>
- Subject: Re: questions on the wordlist
- From: sven vahar <aabram at gmail dot com>
- Date: Tue, 19 Apr 2005 12:18:21 +0300
On 4/19/05, Nadim Shaikli <shaikli at yahoo dot com> wrote:
> We couldn't find a proper wordlist to start with and had
> to generate our own (all within the scope of an open license).
I already compared these two wordlists (the Arabeyes one and the
English-Estonian wordlist provided by the Institute of the Estonian
Language) and got 35455 initial matches based on identical (case
ignored) English words in both lists. Not too shabby for a start.
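
A minimal sketch of that kind of comparison, assuming each wordlist has already been read into (English word, translation) pairs. The function names and the sample entries are made up for illustration; they are not taken from either wordlist, and the real files would need their own parsing.

```python
def index_by_english(entries):
    """Group translations under the case-folded English headword."""
    index = {}
    for english, translation in entries:
        key = english.strip().casefold()  # ignore case and stray whitespace
        index.setdefault(key, []).append(translation)
    return index


def match_wordlists(first, second):
    """Return the English words found in both lists, with both translations."""
    a = index_by_english(first)
    b = index_by_english(second)
    # Dictionary key views support set intersection directly.
    return {word: (a[word], b[word]) for word in a.keys() & b.keys()}


# Hypothetical sample data, not real wordlist content.
english_arabic = [("Mountain", "جبل"), ("sea", "بحر"), ("book", "كتاب")]
english_estonian = [("mountain", "mägi"), ("Sea", "meri"), ("forest", "mets")]

matches = match_wordlists(english_arabic, english_estonian)
print(len(matches), "matches")
```

Matching on the exact English string is only a first pass: it pairs up homographs with different senses and misses entries that differ in spelling or word form.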
> I'm unaware of what they use in the Al-Mawrid so if there
> is URL that can shed some light that would be wonderful.
For online lookups I use the Sakhr dictionary at
http://dictionary.sakhr.com/. For example, the lookups for "mountain"
and "sea" give Arabic nouns without an article. As a learner I find
that easier, because I may not know whether the Arabic word given
actually starts with "al" or whether that is just the article. Take
الحيمياء: I cannot tell whether the "al-" is part of the word itself
or is used as an article, with the real word being what follows it.
This is especially confusing when the corresponding English word also
starts with "al", as "alchemy" does in this example. Maybe the
difference for me is that I'm viewing the wordlist not as... well...
a wordlist for native users but rather as a dictionary. I acknowledge
that these are different approaches. For example, a dictionary-suitable
list would ideally include vowel diacritics as well ;-)
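
The "al-" ambiguity can be sketched in a few lines. This is only an illustration, assuming entries are plain Unicode strings; `candidate_headwords` is a hypothetical helper, and without a lexicon to check against it can only list the possibilities, not choose between them.

```python
ARTICLE = "ال"  # alif + lam, the written form of the article "al-"


def candidate_headwords(word):
    """List the forms a learner would have to consider for one entry."""
    candidates = [word]
    # If the entry begins with alif-lam, the bare word may be what follows,
    # or the two letters may belong to the word itself.
    if word.startswith(ARTICLE) and len(word) > len(ARTICLE):
        candidates.append(word[len(ARTICLE):])
    return candidates


print(candidate_headwords("الجبل"))
print(candidate_headwords("بحر"))
```

An entry stored without the article yields a single candidate, which is why the article-free form is easier to look up.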
Anyway, when I have created a decent Estonian-Arabic dictionary/wordlist
based on the Arabeyes wordlist, I'll let you know.