Mete Kural wrote:
Basically, what I am suggesting is that you do this intellectual exercise in morphemic encoding design at the markup level, not at the character encoding level. That is where it belongs, and it is partly why initiatives such as TEI and OSIS exist. I suggest that you read up on TEI and OSIS and think about ways to extend them to support detailed text analysis of Arabic.
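
To make the markup-level idea concrete, here is a rough sketch in Python. It wraps the morphemes of one Arabic word in TEI-style `<w>` (word) and `<m>` (morpheme) elements and leaves the characters themselves as ordinary Unicode text. The example word, its segmentation, and the `type` labels are assumptions chosen for illustration, not a scheme prescribed by TEI or OSIS.

```python
import xml.etree.ElementTree as ET

# Hypothetical segmentation of one Arabic word, "والكتاب" ("and the book").
# The type labels are illustrative only; they are not taken from TEI or OSIS.
SEGMENTS = [
    ("و", "conjunction"),
    ("ال", "article"),
    ("كتاب", "stem"),
]


def build_word(segments):
    """Wrap each morpheme in an <m> element inside a single <w> element."""
    word = ET.Element("w")
    for text, kind in segments:
        m = ET.SubElement(word, "m", {"type": kind})
        m.text = text
    return word


def plain_text(word):
    """Recover the unannotated character stream by dropping the markup."""
    return "".join(word.itertext())


word = build_word(SEGMENTS)
markup = ET.tostring(word, encoding="unicode")
```

The point of the sketch is that the morphemic analysis lives entirely in the element structure: stripping the tags with `plain_text(word)` gives back the original word unchanged, so no new characters are needed in the encoding.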