Hello Nadim,
I think I didn't communicate myself efficiently. I am not proposing
that we should use a <tanween+modifier> sequence for tanween with
small meem and assimilated tanween just to save the hassle of
proposing six extra new codepoints to Unicode (although it would
truly be quite a hassle to try to propose six new codepoints). It is
because using a <tanween+modifier> sequence preserves the text's
graphemic integrity better and results in a cleaner encoding. A
fathatan is a fathatan, regardless of whether its pronounciation
changes slightly. An assimilated fathatan or a fathatan with small
meem is still a fathatan, in fact it is just as much fathatan as any
other fathatan. For hundreds of years all of these fathatans were
written the same exact way. In more recent times scribes have decided
to write these two kinds of fathatans slightly differently to cue the
un-educated reciter to pronounce correctly. For that reason the
logical way to encode this is the <fathatan+modifier> sequen ce in
order to preserve the fathatan codepoint. Using a seperate codepoint
will break this graphemic integrity.