Skip to content

Commit

Permalink
Correct all maps metadata to use a common format
Browse files Browse the repository at this point in the history
  • Loading branch information
webdev778 committed Dec 6, 2023
1 parent 334c919 commit 78ca96c
Show file tree
Hide file tree
Showing 8 changed files with 44 additions and 42 deletions.
2 changes: 1 addition & 1 deletion maps/bgnpcgn-fas-Arab-Latn-1956.imp
Original file line number Diff line number Diff line change
Expand Up @@ -47,7 +47,7 @@ metadata {
- Since maddah (آ), which is placed over alif (ا), nearly always occurs in word-initial position, no .)◌َا( as well as for fatḩah alif )آ( confusion results from the use of ā for alif maddah
- The ligatures لا and لـا represent lām- alif, and should be romanized lā.

special_rules:
implementation_notes:
# TODO: These are not used
- Initial definite articles and prepositions should be capitalized and hyphens should not be used to connect parts of names, e.g., Ash Shāriqah and Tall al Laḩm.
- If any evidence is found for the use of the definite article in a name, the article should be used in the name chosen.
Expand Down
2 changes: 1 addition & 1 deletion maps/elot-ell-Grek-Latn-743-2001-tl.imp
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ metadata {
Reversible transliteration standard, ELOT

notes:
- Transliteration standard (reversible): Clause 3.1, Table 1
- "Transliteration standard (reversible): Clause 3.1, Table 1"
}

# This map has been partially converted by the bin/maps_v1_to_v2 script
Expand Down
2 changes: 1 addition & 1 deletion maps/elot-ell-Grek-Latn-743-2001-ts.imp
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ metadata {
Reversible transliteration standard, ELOT

notes:
- Transcription standard (reversible): Clause 3.1, Table 2
- "Transcription standard (reversible): Clause 3.1, Table 2"
}

# tests copied from iso-ell-Grek-Latn-843-1997-t1
Expand Down
2 changes: 1 addition & 1 deletion maps/iso-ell-Grek-Latn-843-1997-t1.imp
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ metadata {
or Modern Greek. Replaces ISO/R 843.

notes:
- Transliteration of Greek into Latin: Type 1, Clause 3 Table 1
- "Transliteration of Greek into Latin: Type 1, Clause 3 Table 1"
- Equivalent to elot-ell-Grek-Latn-743-2001-ts, the transliteration table of ELOT 743:2001
- Assuming that ou, au, eu transliterations are only intended for historical diphthongs /u, av, ev/, and that όυ, άυ, έυ are not to be transliterated as ou, au, eu
- Introduced casing to digamma and lunate sigma. (Casing was late introduction to character sets for those characters)
Expand Down
2 changes: 1 addition & 1 deletion maps/iso-ell-Grek-Latn-843-1997-t2.imp
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ metadata {
or Modern Greek. Replaces ISO/R 843.

notes:
- Transliteration of Greek into Latin: Type 2, Clause 3 Table 2
- "Transliteration of Greek into Latin: Type 2, Clause 3 Table 2"
- Introduced casing to digamma, yot, and lunate sigma. (Casing was late introduction to character sets for those characters)
}

Expand Down
18 changes: 9 additions & 9 deletions maps/lshk-yue-Hani-Latn-jyutping-1993.imp
Original file line number Diff line number Diff line change
Expand Up @@ -7,15 +7,15 @@ metadata {
name: Jyutping Cantonese Romanisation Scheme
url: https://lshk.org/jyutping
creation_date: 1993-12
description:
- The Linguistic Society of Hong Kong Cantonese Romanisation Scheme, or
known as Jyutping, was designed and proposed by the Linguistic Society of
Hong Kong in 1993. Jyutping is a new Cantonese romanization system which
has many advantages. It is multifunctional, systematic, user-friendly,
compatible with all possible modern Cantonese sounds, and solely based on
alphanumeric characters without any diacritics and strange symbols.
Jyutping can also be used as a Chinese computer input method. Its basic
principles are simple, easy to learn, and professional.
description: |
The Linguistic Society of Hong Kong Cantonese Romanisation Scheme, or
known as Jyutping, was designed and proposed by the Linguistic Society of
Hong Kong in 1993. Jyutping is a new Cantonese romanization system which
has many advantages. It is multifunctional, systematic, user-friendly,
compatible with all possible modern Cantonese sounds, and solely based on
alphanumeric characters without any diacritics and strange symbols.
Jyutping can also be used as a Chinese computer input method. Its basic
principles are simple, easy to learn, and professional.

notes:
- One may need to parse the text in order to generate accurate
Expand Down
50 changes: 26 additions & 24 deletions maps/mext-jpn-Hrkt-Latn-1954.imp
Original file line number Diff line number Diff line change
Expand Up @@ -11,34 +11,36 @@ metadata {
adoption_date: 1954-12-09
# 昭和二十九年十二月九日
description:
jp: |
国語を書き表わす場合に用いるローマ字のつづり方を次のように定める。
The spelling method for Roman characters used when writing Japanese language is as follows.

まえがき
1 一般に国語を書き表わす場合は、第1表に掲げたつづり方によるものとする。
2 国際的関係その他従来の慣例をにわかに改めがたい事情にある場合に限り、第2表に掲げたつづり方によつてもさしつかえない。
3 前二項のいずれの場合においても、おおむねそえがきを適用する。
en: |
The spelling method for Roman characters used when writing Japanese language is as follows.
Preface
1. In general, when the language is written, the spelling shown in Table 1 shall be used.
2. The spelling methods listed in Table 2 can be used only when there is a situation that is difficult to change due to international relations or other conventional practices.
3. In either case of the preceding two paragraphs, the general introduction will apply.

Preface
1. In general, when the language is written, the spelling shown in Table 1 shall be used.
2. The spelling methods listed in Table 2 can be used only when there is a situation that is difficult to change due to international relations or other conventional practices.
3. In either case of the preceding two paragraphs, the general introduction will apply.
original_description: |
国語を書き表わす場合に用いるローマ字のつづり方を次のように定める。

まえがき
1 一般に国語を書き表わす場合は、第1表に掲げたつづり方によるものとする。
2 国際的関係その他従来の慣例をにわかに改めがたい事情にある場合に限り、第2表に掲げたつづり方によつてもさしつかえない。
3 前二項のいずれの場合においても、おおむねそえがきを適用する。

notes:
- jp: はねる音「ン」はすべてnと書く。
en: ン / ん is romanized always n in Kunrei-siki
- jp: はねる音を表わすnと次にくる母音字またはyとを切り離す必要がある場合には、nの次に’を入れる。
en: When it is necessary to separate the sound n from the vowel or y to follow, the apostrophe is added after the n.
- jp: つまる音は、最初の子音字を重ねて表わす。
en: The clogged sound is represented by overlapping the first consonant characters.
- jp: 長音は母音字の上に^をつけて表わす。なお、大文字の場合は母音字を並べてもよい。
en: Long vowels are expressed in Kunrei-siki by placing a circumflex (^) over a vowel. In the case of capital letters, vowel characters may be arranged.
- jp: 特殊音の書き表わし方は自由とする。
en: The way of writing special sounds is free.
- jp: 文の書きはじめ、および固有名詞は語頭を大文字で書く。なお、固有名詞以外の名詞の語頭を大文字で書いてもよい。
en: Begin writing sentences and proper nouns with capital letters. Note that the beginning of nouns other than proper nouns may be written in capital letters.
- ン / ん is romanized always n in Kunrei-siki
- When it is necessary to separate the sound n from the vowel or y to follow, the apostrophe is added after the n.
- The clogged sound is represented by overlapping the first consonant characters.
- Long vowels are expressed in Kunrei-siki by placing a circumflex (^) over a vowel. In the case of capital letters, vowel characters may be arranged.
- The way of writing special sounds is free.
- Begin writing sentences and proper nouns with capital letters. Note that the beginning of nouns other than proper nouns may be written in capital letters.

original_notes:
- はねる音「ン」はすべてnと書く。
- はねる音を表わすnと次にくる母音字またはyとを切り離す必要がある場合には、nの次に’を入れる。
- つまる音は、最初の子音字を重ねて表わす。
- 長音は母音字の上に^をつけて表わす。なお、大文字の場合は母音字を並べてもよい。
- 特殊音の書き表わし方は自由とする。
- 文の書きはじめ、および固有名詞は語頭を大文字で書く。なお、固有名詞以外の名詞の語頭を大文字で書いてもよい。
}

tests {
Expand Down
8 changes: 4 additions & 4 deletions maps/un-ell-Grek-Latn-1987-phonetic.imp
Original file line number Diff line number Diff line change
Expand Up @@ -17,12 +17,12 @@ metadata {

notes:
- Also included in ISO 843:1997, Annex B, Column 5, and ELOT 743:1982, column 5.
- Corrected obvious errors, which occur every time the table has reappeared: χ > x, x > ks, oï > oi.
- The vowels are taken from the specification, but some are controversial: /ɑ ɛ/ but /o/.
- "Corrected obvious errors, which occur every time the table has reappeared: χ > x, x > ks, oï > oi."
- "The vowels are taken from the specification, but some are controversial: /ɑ ɛ/ but /o/."
- Stress is not indicated. (To do so in IPA would require syllabification in preprocessing, since stress is positioned at syllable breaks)
- Followed specification in treating final μπ as b, but final ντ as nd. That distinction is dubious. (In ELOT 743:1982, both d and nd are erroneously marked as initial, and no final is given.)
- τζ is not correctly transcribed as dz: fixed
- not reducing geminated consonants: fixed
- "τζ is not correctly transcribed as dz: fixed"
- "not reducing geminated consonants: fixed"
}

tests {
Expand Down

0 comments on commit 78ca96c

Please sign in to comment.