Re: more on Duali
- To: Kareem M Darwish <kareem at Glue dot umd dot edu>
- Subject: Re: more on Duali
- From: Mohammed Elzubeir <elzubeir at arabeyes dot org>
- Date: Thu, 15 Aug 2002 22:36:16 -0500
- Cc: developer at arabeyes dot org
- User-agent: Mutt/1.3.28i
On Thu, Aug 15, 2002 at 07:24:54AM -0400, Kareem M Darwish wrote:
> AA,
> You can find both at:
> www.glue.umd.edu/~kareem/research
> If you have more questions just e-mail me.
Okay, I finally got a chance to have a look at stem.pl. That's
basically what I was going to do, but it produced a lot of strange
results. I realize that during the spell-checking process those
inaccuracies are not a problem, since you just test against several
combinations, etc. But for the creation of the dictionary, accuracy
is the first priority.
Now, I can barely read Perl code, and I'm looking at this (line
numbers in brackets):
--stem.pl--
[14] $line =~ s/y/y/g;
[20] $line =~ s/[AAAAAA]/A/g;
--stem.pl--
I'm not sure what that does, if anything. Isn't the normalization
done at the utf82morph level?
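
As displayed, both substitutions replace a character with itself, so they
would change nothing. One possible explanation, and it is only an
assumption, is that the originals contained distinct Arabic letter forms
that were flattened to the same ASCII letter somewhere in transit. The
sketch below uses Python rather than Perl to show what such lines would do
in that case. The code points are illustrative guesses, not the actual
contents of stem.pl, and normalize() is a hypothetical name.

```python
import re

# Assumption: the characters in stem.pl were distinct Arabic letters.
# These code points are illustrative, not taken from stem.pl.
ALEF_VARIANTS = "\u0622\u0623\u0625"  # alef with madda, hamza above, hamza below
ALEF = "\u0627"                       # bare alef
ALEF_MAKSURA = "\u0649"
YEH = "\u064A"


def normalize(line: str) -> str:
    """Fold letter variants into one canonical form (hypothetical helper)."""
    # Same idea as Perl's  $line =~ s/X/Y/g;  with one source character.
    line = line.replace(ALEF_MAKSURA, YEH)
    # Same idea as Perl's  $line =~ s/[XYZ]/A/g;  with a character class.
    line = re.sub("[" + ALEF_VARIANTS + "]", ALEF, line)
    return line


# With identical source and target, as the lines appear here, nothing changes.
unchanged = re.sub("y", "y", "yes")
```

If that guess is right, the two lines would be a second normalization pass
inside stem.pl, which still leaves open whether utf82morph already covers it.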
That's it for now, more to come ;)
later
--
-------------------------------------------------------
| Mohammed Elzubeir | Visit us at: |
| | http://www.arabeyes.org/ |
| Arabeyes Project | Homepage: |
| Unix the 'right' way | http://fakkir.net/~elzubeir/|
-------------------------------------------------------