Skip to content

Can Hudum hypenation points be derived from Cyrillic? #43

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
r12a opened this issue Jan 25, 2022 · 5 comments
Closed

Can Hudum hypenation points be derived from Cyrillic? #43

r12a opened this issue Jan 25, 2022 · 5 comments
Labels
i:line_breaking Line breaking & hyphenation question s:mong Mongolian script

Comments

@r12a
Copy link
Contributor

r12a commented Jan 25, 2022

I raised a set of issues for Gecko, Blink, & Webkit browsers requesting better support for hyphenation in Mongolian.

Gecko folks noted that they don't have hyphenation break point rules for traditional Mongolian script, but they do for Mongolian written in Cyrillic. They asked the question: would it be possible to automatically generate hyphenation rules for traditional from the cyrillic one?

Any thoughts?

@r12a r12a added the question label Jan 25, 2022
@lianghai
Copy link

In order to wrap part of a Hudum word into the next line (not even talking about hyphenation), the layout engine needs to be able to break the word after shaping the whole word. Is that already solved?

@r12a
Copy link
Contributor Author

r12a commented Feb 23, 2022

That has been requested, but first the browser needs to be able to identify potential break points. Then it can check that the hyphenation rendering is typographically correct.

So my question still stands: can Gecko generate a set of hyphenation break point rules for Hudum from the Cyrillic rules?

@lianghai
Copy link

Then it can check that the hyphenation rendering is typographically correct.

I’m just here to make sure you understand that it’s not a matter of being “typographically correct”, but orthographically correct. Breaking a Hudum word before OTL leads to entirely wrong spellings.

@r12a r12a added i:hyphenation Hyphenation (a subset of Line-breaking & hyphenation) i:line_breaking Line breaking & hyphenation labels May 15, 2024
@r12a r12a added s:mong Mongolian script and removed i:hyphenation Hyphenation (a subset of Line-breaking & hyphenation) labels Jun 30, 2024
@Barker0103
Copy link

From my research, while Cyrillic Mongolian hyphenation rules provide a useful foundation, directly generating traditional Mongolian hyphenation rules from them is not straightforward due to differences in script structure, directionality, and syllabic boundaries. However, aligning corresponding morphemes between the scripts could offer a pathway to semi-automated rule generation with linguistic validation.

@r12a
Copy link
Contributor Author

r12a commented May 8, 2025

So i think we have the (not unexpected) answer: No. Thanks.

@r12a r12a closed this as completed May 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
i:line_breaking Line breaking & hyphenation question s:mong Mongolian script
Projects
None yet
Development

No branches or pull requests

3 participants