Re: Arabic Normalization
- To: General Arabization Discussion <general at arabeyes dot org>
- Subject: Re: Arabic Normalization
- From: Khaled Hassounah <khassounah at gmail dot com>
- Date: Thu, 05 Oct 2006 15:02:55 -0400
- User-agent: Thunderbird 1.5.0.7 (X11/20060911)
Somewhat late, but normalization is very important, especially for
search.
Mohammed Sameer wrote:
> I guess it can be done with hunspell using some Arabic specific custom code.
> I guess it has been done also. I have no idea whether this is doable with aspell or not.
It has been done? Is anything re-usable?
>
> I've been searching for an Arabic normalization standard but I didn't find. Do you have
> any ? I'd really appreciate that.
>
I don't, but this is mostly a matter of going through the different
possible characters and the contexts in which they can occur. This is the
kind of thing where the Arabeyes wiki would be instrumental.
[hint] now if someone would create the page and allow people to
collaborate on filling it!!! hmmm...
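
To make "going through the different possible characters" a bit more
concrete, here is a rough Python sketch of the kind of folding a search
index might apply. The rules in it (dropping tatweel and the short-vowel
marks, unifying the alef variants, folding alef maksura and teh marbuta)
are only commonly seen choices that I am assuming for illustration, not a
standard, and the name normalize_arabic is made up. It also ignores
context entirely, which a real rule set would have to deal with.

```python
import re

# Hypothetical folding table for search-style matching; these rules are
# assumptions for illustration, not an agreed normalization standard.
_FOLD = str.maketrans({
    "\u0622": "\u0627",  # ALEF WITH MADDA ABOVE -> ALEF
    "\u0623": "\u0627",  # ALEF WITH HAMZA ABOVE -> ALEF
    "\u0625": "\u0627",  # ALEF WITH HAMZA BELOW -> ALEF
    "\u0649": "\u064A",  # ALEF MAKSURA -> YEH (context-blind)
    "\u0629": "\u0647",  # TEH MARBUTA -> HEH (context-blind)
})

# Tatweel (U+0640) plus the marks U+064B..U+0652 (tanween, short vowels,
# shadda, sukun).
_STRIP = re.compile("[\u0640\u064B-\u0652]")


def normalize_arabic(text: str) -> str:
    """Strip tatweel and diacritics, then fold letter variants."""
    return _STRIP.sub("", text).translate(_FOLD)
```

Both the indexed text and the query would go through the same function, so
that a word typed with or without hamza or vowel marks ends up as the same
string. Whether each of these folds is actually desirable is exactly the
sort of thing the wiki page could settle.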
Khaled