Re: Proposal for the Basis of a Codepoint Extension to Unicode for the Encoding of the Quranic Manuscripts

To: General Arabization Discussion <general at arabeyes dot org>
Subject: Re: Proposal for the Basis of a Codepoint Extension to Unicode for the Encoding of the Quranic Manuscripts
From: Gregg Reynolds <gar at arabink dot com>
Date: Wed, 22 Jun 2005 02:32:35 -0500
User-agent: Mozilla Thunderbird 1.0.2 (Windows/20050317)

Thomas Milo wrote:

Meor's laudable effort has helped me to return to my original position:
encode graphemes, not glyphs. Keep the tanween graphemically intact, this
will improve searchability. So I recently changed my position regarding
tanween according to the following formula, that I hope this community will
endorse:

tanween = <vowel> <vowel> + [optional] <modifier>

<vowel>=  fatha / dhamma / kasra
<modifier>= tamweem / sequentializer


For backward compatibility,

<vowel> <vowel> = fathatan / dhammatan / kasratan
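
A minimal sketch of how the backward-compatibility rule above might be applied, assuming the existing Unicode marks U+064B through U+0650 stand in for the names in the formula. The function names `decompose_tanween` and `compose_tanween` and the `COMPAT` table are hypothetical, introduced only for illustration; the optional modifier is left out because no codepoint for it is assigned.

```python
# Existing Unicode Arabic marks (names per the Unicode charts).
FATHA, DAMMA, KASRA = "\u064E", "\u064F", "\u0650"
FATHATAN, DAMMATAN, KASRATAN = "\u064B", "\u064C", "\u064D"

# Hypothetical compatibility table: precomposed tanween -> <vowel> <vowel>.
COMPAT = {
    FATHATAN: FATHA * 2,
    DAMMATAN: DAMMA * 2,
    KASRATAN: KASRA * 2,
}


def decompose_tanween(text: str) -> str:
    """Replace each precomposed tanween mark with a doubled vowel mark."""
    return "".join(COMPAT.get(ch, ch) for ch in text)


def compose_tanween(text: str) -> str:
    """Inverse direction: fold each doubled vowel mark into one tanween mark."""
    for single, doubled in COMPAT.items():
        text = text.replace(doubled, single)
    return text


# kitaabun: kaf, kasra, teh, fatha, alef, beh, dammatan
word = "\u0643\u0650\u062A\u064E\u0627\u0628\u064C"
decomposed = decompose_tanween(word)
```

Under this reading a search for the bare vowel marks also matches tanween, which is the searchability gain the formula aims at; the cost is that every consumer must treat a repeated vowel mark as a unit.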

Hmm. In my opinion, it would be both more useful and more accurate historically to simply have a couple of TANWEEN codepoints. If I'm not mistaken, tanween was originally marked using a small nuun and later evolved into the doubled vowel mark.

For example, using Latin-1 characters as stand-ins:

	TANWEEN = �
	TANWEEN IDGHAM = �
	TAMWEEM = %

Examples (x = kha, � = sheen, � = shadda):

	kitaabu�
	xu�ubu� m�usan�ada#u�
	min% ba@d

Now search and sort work much better, and the rendering isn't all that hairy. Edit logic should also be simpler.
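
A rough sketch of the search argument, under loud assumptions: the three codepoints below are Private Use Area placeholders (nothing of the kind is assigned in Unicode), the words are transliterated as in the examples above, and the helper names are hypothetical. It only contrasts the two lookups; it does not model rendering or editing.

```python
import re

# Hypothetical placeholders in the Private Use Area; not assigned codepoints.
TANWEEN = "\uE000"
TANWEEN_IDGHAM = "\uE001"
TAMWEEM = "\uE002"

TANWEEN_MARKS = TANWEEN + TANWEEN_IDGHAM
PROPOSED_MARKS = TANWEEN_MARKS + TAMWEEM


def has_tanween(word: str) -> bool:
    """Dedicated codepoints: a plain per-character membership test."""
    return any(ch in TANWEEN_MARKS for ch in word)


# Doubled-vowel encoding: the same question needs a pair-matching pattern
# (fatha, damma or kasra immediately followed by the same mark).
DOUBLED_VOWEL = re.compile("([\u064E\u064F\u0650])\\1")


def has_doubled_vowel(word: str) -> bool:
    return DOUBLED_VOWEL.search(word) is not None


def search_key(word: str) -> str:
    """Drop the proposed marks so marked and bare forms compare equal."""
    return "".join(ch for ch in word if ch not in PROPOSED_MARKS)


kitaabun = "kitaabu" + TANWEEN
```

With a single mark per tanween, `search_key` can strip it without inspecting neighbouring characters, whereas the doubled-vowel form has to distinguish a pair from two unrelated vowel marks.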

I wouldn't advise equating pairs of vowel marks with tanween marks at the level of encoding design.

-gregg
