Skip to content

Commit 9c7fd35

Browse files
authored
Add bgnpcgn-aze-Cyrl-Latn-1993
* Add bgnpcgn-aze-Cyrl-Latn-1993 * Added TABLE_OF_CORRESPONDENCES_FOR_AZERBAIJANI.pdf
1 parent 9682f61 commit 9c7fd35

File tree

2 files changed

+103
-0
lines changed

2 files changed

+103
-0
lines changed
Lines changed: 103 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,103 @@
1+
---
2+
authority_id: bgnpcgn
3+
id: 1993
4+
language: aze
5+
source_script: Cyrl
6+
destination_script: Latn
7+
name: AZERBAIJANI TABLE OF CORRESPONDENCES CYRILLIC-ROMAN -- BGN/PCGN 1993 Agreement
8+
url: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/816656/TABLE_OF_CORRESPONDENCES_FOR_AZERBAIJANI.pdf
9+
creation_date: 1993
10+
confirmation date: 2019-06
11+
description: |
12+
Azerbaijani, also known as Azeri, is the official language of the Republic of Azerbaijan. In 1991, the Azerbaijani government adopted the Roman alphabet to replace the existing Cyrillic alphabet. The presentation below provides a table of correspondences between the former Cyrillic alphabet and the current Roman alphabet. When Azerbaijani Roman-alphabet spellings are not available, this table can be used to convert Azerbaijani Cyrillic spellings.
13+
14+
notes:
15+
16+
- The special letter Ə, ə known as schwa, should be reproduced in that form whenever encountered. The characters Ə (Unicode 04D8) and ə (Unicode 04D9) should be used for schwa when writing in the Cyrillic script, but characters Ə (Unicode 018F) and ə (Unicode 0259) should be used when writing in the Roman alphabet. In those instances when it cannot be reproduced, however, the letter Ä ä may be substituted for it (see below).
17+
18+
- The obsolete characters й, э, ю, and я should be romanized ẏ, ė, yu., and ya.
19+
20+
- Unicode values are shown with the uppercase Cyrillic character first, followed by the lowercase character. It is not known whether there exists an uppercase ‘J’ specific to the Cyrillic character set.
21+
22+
- An inventory of letter-diacritic combinations, with their Unicode encoding, in addition to the unmodified letters of the basic Roman script is:
23+
Ğ (U+011E), ğ (U+011F)
24+
Ə (U+018F), ə (U+0259)
25+
İ (U+0130), ı (U+0131)
26+
Ö (U+00D6), ö (U+00F6)
27+
Ü (U+00DC), ü (U+00FC)
28+
Ç (U+00C7), ç (U+00E7)
29+
Ş (U+015E), ş (U+015F)
30+
31+
- The Roman-script columns show only lowercase forms but, when applying the table, uppercase and lowercase Roman letters as appropriate should be used.
32+
33+
tests:
34+
- source:
35+
expected:
36+
37+
map:
38+
characters:
39+
'\u0410' : 'A'
40+
'\u0411' : 'B'
41+
'\u0412' : 'G'
42+
'\u0413' : 'V'
43+
'\u0492' : 'Ğ'
44+
'\u0414' : 'D'
45+
'\u0415' : 'E'
46+
'\u04D8' : 'Ә'
47+
'\u0416' : 'J'
48+
'\u0417' : 'Z'
49+
'\u0418' : 'I'
50+
'\u042B' : 'İ'
51+
'\u0408' : 'Y'
52+
'\u041A' : 'K'
53+
'\u049C' : 'G'
54+
'\u041B' : 'L'
55+
'\u041C' : 'M'
56+
'\u041D' : 'N'
57+
'\u041E' : 'O'
58+
'\u04E8' : 'ö'
59+
'\u041F' : 'P'
60+
'\u0420' : 'R'
61+
'\u0421' : 'S'
62+
'\u0422' : 'T'
63+
'\u0423' : 'U'
64+
'\u04AE' : 'Ü'
65+
'\u0424' : 'F'
66+
'\u0425' : 'X'
67+
'\u04BA' : 'H'
68+
'\u0427' : 'Ç'
69+
'\u04B8' : 'C'
70+
'\u0428' : 'Ş'
71+
72+
'\u0430' : 'a'
73+
'\u0431' : 'b'
74+
'\u0432' : 'v'
75+
'\u0433' : 'g'
76+
'\u0493' : 'ğ'
77+
'\u0434' : 'd'
78+
'\u0435' : 'e'
79+
'\u04D9' : 'ә'
80+
'\u0436' : 'j'
81+
'\u0437' : 'z'
82+
'\u0438' : 'i'
83+
'\u044B' : 'ı'
84+
'\u0458' : 'y'
85+
'\u043A' : 'k'
86+
'\u049D' : 'g'
87+
'\u043B' : 'l'
88+
'\u043C' : 'm'
89+
'\u043D' : 'n'
90+
'\u043E' : 'o'
91+
'\u04E9' : 'ö'
92+
'\u043F' : 'p'
93+
'\u0440' : 'r'
94+
'\u0441' : 's'
95+
'\u0442' : 't'
96+
'\u0443' : 'u'
97+
'\u04AF' : 'ü'
98+
'\u0444' : 'f'
99+
'\u0445' : 'x'
100+
'\u04BB' : 'h'
101+
'\u0447' : 'ç'
102+
'\u04B9' : 'c'
103+
'\u0448' : 'ş'
Binary file not shown.

0 commit comments

Comments
 (0)