Unsupported non-Latin characters
Some Unicode characters are not currently supported in Record Manager and cause a validation error. For information about the Telugu and Sinhala scripts in the context of the Library of Congress scripts expansion project, see Details from OCLC: Library of Congress scripts.
If you need to enter these characters in a bibliographic or authority record in Record Manager:
- Enter the name of the character in square brackets, using the name from the Unicode standard if available (e.g., enter [schwa]).
- You may also enter the hex values provided into Connexion.
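The two workarounds above can be sketched in Python. This is a minimal illustration, not an OCLC tool: the function names are hypothetical, the set of code points is copied from the list on this page, and the hex form `&#x....;` is an assumption about how the hex values are written. Lowercasing the Unicode name inside the brackets is likewise an assumption based on the [schwa] example.

```python
import unicodedata

# Code points listed on this page as currently unsupported.
UNSUPPORTED = {0x0C04, 0x0C3C, 0x0C5D, 0x0C77, 0x0D81, 0x11AB4}


def find_unsupported(text):
    """Return (index, 'U+XXXX') pairs for each listed code point in text."""
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(text)
        if ord(ch) in UNSUPPORTED
    ]


def to_bracketed_names(text):
    """Replace each listed character with its name in square brackets."""
    out = []
    for ch in text:
        if ord(ch) in UNSUPPORTED:
            # Older Python builds may not know newer characters;
            # fall back to the code point in that case.
            name = unicodedata.name(ch, "")
            label = name.lower() if name else f"U+{ord(ch):04X}"
            out.append(f"[{label}]")
        else:
            out.append(ch)
    return "".join(out)


def to_hex_references(text):
    """Replace each listed character with a hex reference (assumed form)."""
    return "".join(
        f"&#x{ord(ch):04X};" if ord(ch) in UNSUPPORTED else ch
        for ch in text
    )


# Example: a string that starts with TELUGU SIGN SIDDHAM (U+0C77).
sample = "\u0C77 abc"
print(find_unsupported(sample))
print(to_hex_references(sample))
```

Running a check like `find_unsupported` before saving a record would show which characters need to be substituted by hand.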
Currently unsupported characters
Telugu - Combining Anusvara Above
Name: TELUGU SIGN COMBINING ANUSVARA ABOVE
Unicode: U+0C04
Hex value: &#x0C04;
Character: ఄ
Telugu - Nukta
Name: TELUGU SIGN NUKTA
Unicode: U+0C3C
Hex value: &#x0C3C;
Character: ఼
Title: తెలుగు-ఉర్దూ ఫ఼ారసీ పదకోశము
Where it appears: In ఖ఼ాన్ (author name) and ఫ఼ారసీ (Fārsī in title)
Telugu - Nakaara Pollu
Name: TELUGU LETTER NAKAARA POLLU
Unicode: U+0C5D
Hex value: &#x0C5D;
Character: ౝ
Title: శ్రీ కృష్ౝదేవమహారాయల ప్రభుత్వము
Where it appears: In కృష్ౝదేవ (archaic spelling of Kṛṣṇadēva)
Telugu - Siddham
Name: TELUGU SIGN SIDDHAM
Unicode: U+0C77
Hex value: &#x0C77;
Character: ౷
Title: ౷ సిద్ధిరస్తు
Where it appears: As an invocation at the beginning of inscriptions
Sinhala - Candrabindu
Name: SINHALA SIGN CANDRABINDU
Unicode: U+0D81
Hex value: &#x0D81;
Character: ඁ
Title: සංස්කෘත-සිංහල ශබ්දකෝෂයඁ
Where it appears: At the end of කෝෂයඁ (kōṣayaṁ)
Canadian Syllabics Nattilik Ha
Name: CANADIAN SYLLABICS NATTILIK HA
Unicode: U+11AB4 (part of the Supplementary Multilingual Plane)
Hex value: &#x11AB4;
Character: 𑪴
Usage: Used in the Nattilik dialect of Inuktitut
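Because U+11AB4 lies outside the Basic Multilingual Plane, it is stored differently from the other characters on this page: as two 16-bit units (a surrogate pair) in UTF-16 and as four bytes in UTF-8. The short Python sketch below illustrates this; the variable names are only for illustration.

```python
ch = "\U00011AB4"  # CANADIAN SYLLABICS NATTILIK HA

# Code points above U+FFFF are outside the Basic Multilingual Plane.
in_bmp = ord(ch) <= 0xFFFF

# UTF-16 encodes such a character as two 16-bit units (a surrogate pair).
utf16 = ch.encode("utf-16-be")
utf16_units = [utf16[i:i + 2].hex().upper() for i in range(0, len(utf16), 2)]

# UTF-8 encodes it as four bytes.
utf8_bytes = [f"{b:02X}" for b in ch.encode("utf-8")]

print(in_bmp)       # False
print(utf16_units)  # ['D806', 'DEB4']
print(utf8_bytes)   # ['F0', '91', 'AA', 'B4']
```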
