Skip to content

Commit 6d66b1a

Browse files
chaaklauronaldtse
authored andcommitted
Add bgn-jpn-Hrkt-Latn-1962
There are issues with this specification. * Despite the mentioning of the term "Modified Hepburn in the specification, the handling of ん/ン in this standard follows the Traditional Hepburn: letter m is used before b, m, p. * Pre-reform combinations will clash with modern Japanese transliteration. * There is no discussion on how cross-morpheme double vowel sounds should be handled. Everything not explicitly stated in the specification will be assumed to be inherited from `var-jpn-Hrkt-Latn-hepburn-1954`. Obsolete combinations are handled by post rules, for the sake of completeness.
1 parent 474ea12 commit 6d66b1a

File tree

1 file changed

+255
-0
lines changed

1 file changed

+255
-0
lines changed

maps/bgn-jpn-Hrkt-Latn-1962.yaml

Lines changed: 255 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,255 @@
1+
---
2+
authority_id: bgn
3+
id: 1962
4+
language: jpn
5+
source_script: Hrkt
6+
destination_script: Latn
7+
name: BGN (Modified Hepburn) System
8+
url:
9+
creation_date: 1930
10+
adoption_date: 1962
11+
description: |
12+
The BGN (Modified Hepburn) System for the transliteration of Japanese
13+
has been in use by the Board on Geographic Names since about 1930 and
14+
has been extensively employed in the systematic standardsization of
15+
thousands of geographic names of Japan in romanized form.
16+
17+
notes: |
18+
19+
1. The "tsu" forms (ツ/つ) are also used to indicate a double consonant and
20+
are generally (but not alwyas) written in smaller script or type
21+
slightly to the right of or below the regular line. These characters
22+
are trasnliterated as k before k; s before s or sh; t before t, ts, or
23+
ch; and p before p. Occasionally, when a "ku" (ク/く) or "ki" (キ/き) form
24+
precedes k, the u in ku or the i in ki is dropped.
25+
26+
2. The transliterations in parentheses are used in specific cases when
27+
the kana symbol is known to be so pronounced.
28+
29+
3. The transliteration m is used before b, p, and m.
30+
31+
4. This letter has been added for the use in transliterating foreign
32+
words.
33+
34+
5. The asterisk (*) indicates standard combined forms. Those combined
35+
forms not so marked are rarely used.
36+
37+
----
38+
39+
Additional Notes:
40+
41+
a. Despite the mentioning of the term "Modified Hepburn in the
42+
specification, the handling of ん/ン in this standard is different from
43+
Modified Hepburn. It follows the Traditional Hepburn in that the
44+
letter m is used before b, m, p.
45+
46+
b. This document includes obsolete (pre-reform) combinations.
47+
48+
c. There is no discussion on how cross-morpheme vowel sounds should be
49+
handled.
50+
51+
tests:
52+
# Note: these test cases follow the pre-reform standard.
53+
- source: "けふ"
54+
expected: "kyō"
55+
- source: "ぎうにう"
56+
expected: "gyūnyū"
57+
- source: "きふ" # きふ should always be kifu in Modern Japanese
58+
expected: "kyū"
59+
- source: "ちう"
60+
expected: "chū"
61+
- source: "けう"
62+
expected: "kyō"
63+
64+
map:
65+
66+
inherit: var-jpn-Hrkt-Latn-hepburn-1954
67+
68+
rules:
69+
# Convert ん into m before b, m, p
70+
- pattern: "[んン](?=[ばびぶべぼまみむめもぱぴぷぺぽバビブベボマミムメモパピプペポ])"
71+
result: "m"
72+
postrules:
73+
# Handle obsolete forms
74+
# Note that these forms are present in the rules, but will break
75+
# if used with Modern Japanese
76+
- pattern: "ef?[uo]|iyau"
77+
result: ""
78+
- pattern: "if?u"
79+
result: ""
80+
- pattern: "[ao]f?[uo]"
81+
result: "ō"
82+
- pattern: "iy"
83+
result: "y"
84+
- pattern: "ty"
85+
result: "ch"
86+
- pattern: "dy"
87+
result: "j"
88+
- pattern: "[jz]y"
89+
result: "j"
90+
- pattern: "(?<=[sc])hy"
91+
result: "h"
92+
- pattern: "sy"
93+
result: "sh"
94+
95+
characters:
96+
# ke
97+
# These are listed as alternative pronunciation, but in fact this usage of ヶ
98+
# as the archaic possessive marker is not found in Kana only texts.
99+
# Also it is always typed using the smaller form. (ヶ U+30F6)
100+
"": ["ke", "ga", "ka", "ko"]
101+
"": ["ke", ga", "ka", "ko"]
102+
"": ["ga", "ka", "ko"]
103+
104+
105+
# The Ha-column
106+
# は is still pronounced as wa when used as a particle,
107+
# the alternative pronunciations for the other four kana's are obsolete.
108+
"": ["ha", "wa"]
109+
"": ["hi", "i"]
110+
"": ["fu", "u", "o"]
111+
"": ["he", "e"]
112+
"": ["ho", "o"]
113+
"": ["ha", "wa"]
114+
"": ["hi", "i"]
115+
"": ["fu", "u", "o"]
116+
"": ["he", "e"]
117+
"": ["ho", "o"]
118+
119+
120+
# The Wa-column
121+
# These two kanas below are only used in pre-reform texts.
122+
"" : "i"
123+
"" : "e"
124+
"" : "i"
125+
"" : "e"
126+
127+
# Combined forms
128+
# These are obsolete forms. See Note 5.
129+
# They can be handled by post-rules if ever needed.
130+
# "あう": "ō"
131+
# "あふ": "ō"
132+
# "いふ": "yū"
133+
# "えう": "yō"
134+
# "えふ": "yō"
135+
# "おふ": "ō"
136+
# "かう": "kō"
137+
# "かふ": "kō"
138+
# "がう": "gō"
139+
# "がふ": "gō"
140+
# "きう": "kyū"
141+
# "きふ": "kyū"
142+
# "きやう": "kyō"
143+
# "ぎう": "gyū"
144+
# "ぎふ": "gyū"
145+
# "ぎやう": "gyō"
146+
"くわ": "ka"
147+
"くわう": ""
148+
"ぐわ": "ga"
149+
"ぐわう": ""
150+
"クワ": "ka"
151+
"クワウ": ""
152+
"グワ": "ga"
153+
"グワウ": ""
154+
# "けう": "kyō"
155+
# "けふ": "kyō"
156+
# "げう": "gyō"
157+
# "げふ": "gyō"
158+
# "こふ": "kō"
159+
# "ごふ": "gō"
160+
# "さう": "sō"
161+
# "さふ": "sō"
162+
# "ざう": "zō"
163+
# "ざふ": "zō"
164+
# "しう": "shū"
165+
# "しふ": "shū"
166+
# "しやう": "shō"
167+
# "じう": "jū"
168+
# "じふ": "jū"
169+
# "じやう": "jō"
170+
# "せう": "shō"
171+
# "せふ": "shō"
172+
# "ぜう": "jō"
173+
# "ぜふ": "jō"
174+
# "そふ": "sō"
175+
# "ぞふ": "zō"
176+
# "たう": "tō"
177+
# "たふ": "tō"
178+
# "だう": "dō"
179+
# "だふ": "dō"
180+
# "ちう": "chū"
181+
# "ちふ": "chū"
182+
# "ちやう": "chō"
183+
# "ぢう": "jū"
184+
# "ぢふ": "jū"
185+
# "ぢや": "ja"
186+
# "ぢやう": "jō"
187+
# "ぢゆ": "ju"
188+
# "ぢよ": "jo"
189+
# "ぢよう": "jō"
190+
# "てう": "chō"
191+
# "てふ": "chō"
192+
# "でう": "jō"
193+
# "でふ": "jō"
194+
# "とふ": "tō"
195+
# "どふ": "dō"
196+
# "なう": "nō"
197+
# "なふ": "nō"
198+
# "にう": "nyū"
199+
# "にふ": "nyū"
200+
# "にやう": "nyō"
201+
# "ねう": "nyō"
202+
# "ねふ": "nyō"
203+
# "のふ": "nō"
204+
# "はう": ["hō","ō"]
205+
# "はふ": "hō"
206+
# "ばふ": "bō"
207+
# "ばう": "bō"
208+
# "ぱう": "pō"
209+
# "ぱふ": "pō"
210+
# "ひう": "hyū"
211+
# "ひふ": "hyū"
212+
# "ひやう": "hyō"
213+
# "びう": "byū"
214+
# "びふ": "byū"
215+
# "びやう": "byō"
216+
# "ぴう": "pyū"
217+
# "ぴふ": "pyū"
218+
# "ぴやう": "pyō"
219+
# "へう": "hyō"
220+
# "へふ": "hyō"
221+
# "べう": "byō"
222+
# "べふ": "byō"
223+
# "ぺう": "pyō"
224+
# "ぺふ": "pyō"
225+
# "ほふ": "hō"
226+
# "ぼふ": "bō"
227+
# "ぽふ": "pō"
228+
# "まう": "mō"
229+
# "まふ": "mō"
230+
# "まを": "mō"
231+
# "みやう": "myō"
232+
# "みう": "myū"
233+
# "みふ": "myū"
234+
# "めう": "myō"
235+
# "めふ": "myō"
236+
# "めを": "myō"
237+
# "もふ": "mō"
238+
# "やう": "yō"
239+
# "やふ": "yō"
240+
# "よふ": "yō"
241+
# "らう": "rō"
242+
# "らふ": "rō"
243+
# "りう": "ryū"
244+
# "りふ": "ryū"
245+
# "りやう": "ryō"
246+
# "れう": "ryō"
247+
# "れふ": "ryō"
248+
# "ろふ": "rō"
249+
# "わう": "wō"
250+
# "わふ": "wō"
251+
# "ゑふ": "yō"
252+
# "をう": "ō"
253+
# "をふ": "ō"
254+
255+

0 commit comments

Comments
 (0)