"the encoding of the character used for alif (02BE) carries with it an assigned property in the Unicode database of (Lm), putting it into the category of 'Modifier_Letter'..."
Correction to what I wrote there: the code point should be 02BC, not 02BE. The rest still holds. The property data I'm looking at can be found here:

ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.txt
http://www.unicode.org/reports/tr44/#Property_Values
ftp://ftp.unicode.org/Public/UNIDATA/DerivedCoreProperties.txt

Charles
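
P.S. For anyone who would rather check the property than read through the data files, here is a minimal sketch in Python. It assumes the standard-library unicodedata module, whose bundled Unicode version depends on the Python build, so treat it as a quick sanity check rather than a substitute for UnicodeData.txt. The helper name describe is only for illustration.

```python
import unicodedata

def describe(cp):
    """Return (code point label, character name, General_Category) for cp."""
    ch = chr(cp)
    return (f"U+{cp:04X}", unicodedata.name(ch), unicodedata.category(ch))

# Look at both the code point originally cited (02BE) and the corrected one (02BC).
for cp in (0x02BC, 0x02BE):
    print(*describe(cp))
```

Both code points should report the general category Lm (Modifier_Letter), which is why the rest of the earlier statement is unaffected by the correction.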