The Unicode concept of 'grapheme cluster' currently fails to represent the small number of conjuncts that are used in modern Tamil, i.e. kṣa க்ஷ and the two alternative sequences for srī, ஶ்ரீ and ஸ்ரீ. This means that various editing operations, line breaking algorithms, vertical text, etc. are liable to break text at the wrong point when those conjuncts are used. For more details, see the relevant sections.
Indic Layout Requirements provides a grammar for Indian orthographic syllable boundaries that works for the consonant clusters in Tamil that don't use conjuncts.
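To make the break point concrete, here is a rough Python sketch. It does not implement the full UAX #29 rules: `approx_clusters` is a hypothetical helper that simply attaches combining marks to the preceding character, which I believe gives the same result as extended grapheme clusters for these particular strings. `merge_conjuncts` is likewise only an illustration of one possible workaround, hard-coding the three sequences mentioned above. It is not something taken from any spec.

```python
import unicodedata

PULLI = "\u0bcd"  # TAMIL SIGN VIRAMA

# Consonant + pulli + consonant sequences that form the conjuncts
# named in this issue (assumption: no others are needed).
CONJUNCTS = {
    "\u0b95\u0bcd\u0bb7",  # KA + pulli + SSA (kṣa)
    "\u0bb6\u0bcd\u0bb0",  # SHA + pulli + RA (start of srī)
    "\u0bb8\u0bcd\u0bb0",  # SA + pulli + RA (start of alternative srī)
}

def approx_clusters(text):
    """Simplified stand-in for grapheme clusters: base + following marks."""
    clusters = []
    for ch in text:
        if clusters and unicodedata.category(ch).startswith("M"):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters

def merge_conjuncts(clusters):
    """Join a pulli-final cluster to the next one when they form a known conjunct."""
    merged = []
    for cluster in clusters:
        prev = merged[-1] if merged else ""
        if prev.endswith(PULLI) and (prev[-2:] + cluster[0]) in CONJUNCTS:
            merged[-1] += cluster
        else:
            merged.append(cluster)
    return merged

for word in ("\u0b95\u0bcd\u0bb7", "\u0bb8\u0bcd\u0bb0\u0bc0", "\u0baa\u0bcd\u0baa"):
    parts = approx_clusters(word)
    print(len(parts), "->", len(merge_conjuncts(parts)))
```

The last string (ப்ப) is an ordinary consonant cluster with a visible pulli, so it stays as two units, while the two conjuncts are merged into one.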
Specs:
CSS uses the concept of 'typographic character unit', rather than grapheme cluster, in its specs, with the explanation that these cases are beyond the scope of the grapheme cluster concept and that implementations should provide appropriate support.
The first comment in this issue contains text that will automatically appear in the Tamil gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.
Well, those two and ஶ்ரீ also (the Unicode recommended sequence for shri). The text in this section was very old and I made some badly needed edits. (See the 'edited' pulldown for the initial comment.)
r12a changed the title from "Grapheme clusters fail to represent syllabic conjuncts" to "Grapheme clusters fail to represent syllabic conjuncts in Tamil" on May 18, 2021