Language name starts with a upper case #17121

Arthur-Milchior · 2024-09-23T23:54:54Z

The menu that lets users select their language is not yet perfect.
It's hard to know for certain the correct way to display each language
name. The function provided by Android was buggy, so, a year and a
half ago, the language names were hard-coded in #13275. The catch is
that nobody speaks all 93 languages in which we have translations
available.

In the long term, I hope that 92 translators will provide the 92
language strings to use. #17120 should start the process of eventually
fixing this.

In the short term, there is one change we can make that will probably
be, on average, an improvement: using upper case for the first letter
of each name.

If I understand Brayan's comment correctly, this change would be
correct for Portuguese. I can confirm it's correct for French. I can't
promise this won't make things worse for some languages. But if we got
some right previously, it was by accident, and I still hope this is,
on average, an improvement.

The upper cases were obtained by using the "set first letter to upper
case" feature of emacs on each language name.

david-allison

Two are incorrect (to my understanding). I used https://en.wikipedia.org/wiki/List_of_ISO_639_language_codes

Do we have this data anywhere in CLDR?

⚠️ EDIT: https://en.wikipedia.org/wiki/IETF_language_tag disagrees with the above

david-allison · 2024-09-24T00:02:19Z

AnkiDroid/src/main/java/com/ichi2/utils/LanguageUtil.kt

        "Venda" to "ve", // Venda
        "Tiếng Việt" to "vi", // Vietnamese
        "Wolof" to "wo", // Wolof
-        "isiXhosa" to "xh", // Xhosa
+        "IsiXhosa" to "xh", // Xhosa


⚠️ This is incorrect

https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry, which is referenced by your first Wikipedia link, only uses "Xhosa", while the second link does use isiXhosa.

david-allison · 2024-09-24T00:09:18Z

AnkiDroid/src/main/java/com/ichi2/utils/LanguageUtil.kt

-        "հայերեն (Հայաստան)" to "hy-AM", // Armenian (Armenia)
+        "Hrvatski" to "hr", // Croatian
+        "Magyar" to "hu", // Hungarian
+        "Hայերեն (Հայաստան)" to "hy-AM", // Armenian (Armenia)
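
The added `hy-AM` line appears to begin with a Latin `H` rather than the Armenian capital `Հ`. One way to catch this kind of slip is to check which Unicode scripts the letters of a name belong to. A minimal Kotlin sketch, assuming a JVM; `letterScripts` is a hypothetical helper and not part of `LanguageUtil.kt`:

```kotlin
import java.lang.Character.UnicodeScript

// Hypothetical helper: the set of Unicode scripts used by the letters of a name.
// Spaces, parentheses and other non-letters are ignored, so "(...)" suffixes
// do not count as an extra script.
fun letterScripts(name: String): Set<UnicodeScript> =
    name.codePoints().toArray()
        .filter { Character.isLetter(it) }
        .map { UnicodeScript.of(it) }
        .toSet()

// Hypothetical helper: true when a name mixes letters from more than one script.
fun isMixedScript(name: String): Boolean = letterScripts(name).size > 1
```

With this, `isMixedScript` is true for a name that starts with Latin `H` followed by Armenian letters, and false for `"Magyar"` or for a name written entirely in Armenian. Some legitimate names may mix scripts, so a hit is a prompt for review rather than proof of an error.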


david-allison · 2024-09-24T00:12:33Z

AnkiDroid/src/main/java/com/ichi2/utils/LanguageUtil.kt

        "اردو (پاکستان)" to "ur-PK", // Urdu (Pakistan)
-        "o‘zbek" to "uz", // Uzbek
+        "O‘zbek" to "uz", // Uzbek


O‘zbek is not listed on Wikipedia as an endonym

Arthur-Milchior · 2024-09-24T01:56:32Z

I applied your comment.

Honestly, I'm not exactly sure where to get confirmation, apart from waiting for translators to provide feedback through Crowdin. I don't expect to easily find a native Uzbek speaker, for example.

david-allison · 2024-09-24T03:05:46Z

I flagged potential issues. Stick with the CLDR-proposed names unless you've confirmed either way

david-allison

I aim to approve after this mix of reverts and changes

david-allison · 2024-09-25T00:18:20Z

Note: only one comment was addressed

david-allison

LGTM, cheers!

The menu that lets users select their language is not yet perfect. It's hard to know for certain the correct way to display each language name. The function provided by Android was buggy, so, a year and a half ago, the language names were hard-coded in ankidroid#13275. The catch is that nobody speaks all 93 languages in which we have translations available.

In the long term, I hope that 92 translators will provide the 92 language strings to use. ankidroid#17120 should start the process of eventually fixing this.

In the short term, there is one change we can make that will probably be, on average, an improvement: using upper case for the first letter of each name.

If I understand Brayan's comment correctly, this change would be correct for Portuguese. I can confirm it's correct for French. I can't promise this won't make things worse for some languages. But if we got some right previously, it was by accident, and I still hope this is, on average, an improvement.

The upper cases were obtained by using the "set first letter to upper case" feature of emacs on each language name.

íslenska, isiXhosa and isiZulu appear to start with lowercase letters, so these have not been updated.

Fixed: ankidroid#17118

Co-authored-by: David Allison <[email protected]>
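
The rule described in this commit message (upper-case the first letter, except for names that appear to start with a lowercase letter) can be sketched in Kotlin. This is only an illustration under stated assumptions: the PR edited the hard-coded strings by hand, and both `keepLowercase` and `capitalizeLanguageName` are hypothetical names that do not exist in `LanguageUtil.kt`.

```kotlin
import java.util.Locale

// Hypothetical exception list: the names this thread says appear to keep a
// lowercase first letter. Not verified with native speakers.
val keepLowercase = setOf("íslenska", "isiXhosa", "isiZulu")

// Hypothetical helper: title-case the first character of a language name,
// unless the name is a listed exception. Using the Unicode-aware titlecase()
// means cased non-Latin scripts get their own capital letter, and scripts
// without case are returned unchanged.
fun capitalizeLanguageName(name: String): String =
    if (name in keepLowercase) name
    else name.replaceFirstChar { it.titlecase(Locale.ROOT) }
```

For example, `capitalizeLanguageName("français")` gives `"Français"`, while `"isiXhosa"` is returned as is. `Locale.ROOT` is used here for simplicity; languages with special casing rules would need their own locale passed in instead.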

Santali and Sardinian were written in English, rather than their respective languages

These values were obtained using `ULocale`.

Sources (Unicode CLDR):
* `ᱥᱟᱱᱛᱟᱲᱤ` - https://github.com/unicode-org/cldr/blob/731f226f93f95635500bbbadccf96798c23e4c9a/common/main/sat.xml#L365C25-L365C32
* `sardu` - https://github.com/unicode-org/cldr/blob/731f226f93f95635500bbbadccf96798c23e4c9a/common/main/sc.xml#L369C24-L369C29
* Casing rules do not appear in CLDR yet, but I assume that uppercasing the name is OK

Co-authored-by: David Allison <[email protected]>
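
To illustrate how a language's own name can be looked up from locale data, here is a minimal Kotlin sketch. It deliberately swaps in `java.util.Locale` so that it runs on a plain JVM without ICU; the commit itself used `ULocale`, and this is not a claim about how those values were actually produced. `endonym` is a hypothetical helper name.

```kotlin
import java.util.Locale

// Hypothetical helper: the display name of a locale, written in that locale's
// own language, according to whatever locale data the running JDK ships.
// If the JDK has no data for the language, the JDK falls back to another
// representation, so the result still needs human review.
fun endonym(languageTag: String): String {
    val locale = Locale.forLanguageTag(languageTag)
    return locale.getDisplayName(locale)
}
```

Calling `endonym("sat")` or `endonym("sc")` returns whatever the local JDK data provides, which may differ from the CLDR entries linked above. The casing of the result also follows the data, so it does not settle the capitalization question.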

david-allison · 2024-09-26T18:05:51Z

@Arthur-Milchior I've squashed the commits and force-pushed.

I've added additional information to the first commit message, and written the second.

When feasible, could you review my changes and confirm you're happy with them, given that I've set you as the author on both?

david-allison requested changes Sep 24, 2024

david-allison mentioned this pull request Sep 24, 2024

All languages whose name uses the Latin alphabet should probably be upper case. #17118

Open

david-allison requested changes Sep 24, 2024

david-allison approved these changes Sep 26, 2024

david-allison added the squash-merge and Needs Second Approval labels Sep 26, 2024

Arthur-Milchior and others added 2 commits September 26, 2024 18:47

david-allison force-pushed the upper_case branch from 42850c4 to 99e8fce September 26, 2024 18:00

david-allison removed the squash-merge label Sep 26, 2024
