Diacritic characters
Amlib can import and export MARC records that contain Unicode characters.
When a record is imported, the characters that cannot be displayed correctly are encoded so there is no loss of data. Any character that is encoded may not display correctly in the Amlib client/server modules but will still correctly display in all browser applications such as NetOPACs.
An encoded diacritic character is easily recognised because it always starts with ‘<d:’ and ends with ‘>’. For example: <d:cf> or <d:0302>
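The encoding described above can be located in exported text with a simple pattern match. The following is a minimal sketch in Python, not part of Amlib; the function name and the exact token shape (two lowercase letters or four hexadecimal digits between ‘<d:’ and ‘>’) are assumptions inferred from the examples in this section.

```python
import re

# Assumed token shape, inferred from the examples in the text:
# "<d:" followed by two lowercase letters or four hex digits, then ">".
DIACRITIC_TOKEN = re.compile(r"<d:([a-z]{2}|[0-9A-Fa-f]{4})>")


def find_diacritic_codes(text: str) -> list[str]:
    """Return the code inside each <d:...> token, in order of appearance."""
    return DIACRITIC_TOKEN.findall(text)
```

Calling `find_diacritic_codes("a<d:gv> and <d:03A8>")` returns the two codes `gv` and `03A8`; text without encoded diacritics yields an empty list.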
A diacritic can be the direct translation of a character such as <d:0391> for the Greek Capital Letter Alpha or it can define the formatting to apply to a character such as <d:aa> for an Acute Accent. All formatting diacritics are applied to the character preceding the diacritic.
Commonly used diacritic characters such as Acute Accents and Breves have been assigned a two-character translation so that they are easy to enter. For example, a<d:gv> will be displayed as à
Diacritic characters that cannot be displayed in the Amlib client/server modules are encoded as a four-digit hexadecimal code. This is the same code used by other companies and conforms to the Unicode standard. For example, <d:03A8> will be displayed as Ψ
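Because the four-digit code is a Unicode code point written in hexadecimal, it can be converted to the character it represents outside Amlib. The sketch below is illustrative only and is not how Amlib itself decodes records; the function name is hypothetical.

```python
import re

# Matches only the four-digit hexadecimal form, for example <d:03A8>.
HEX_TOKEN = re.compile(r"<d:([0-9A-Fa-f]{4})>")


def decode_hex_tokens(text: str) -> str:
    """Replace each <d:XXXX> token with the Unicode character U+XXXX."""
    return HEX_TOKEN.sub(lambda m: chr(int(m.group(1), 16)), text)
```

Two-character codes such as <d:gv> are left untouched by this function, since they are not hexadecimal code points.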
For more information on Unicode visit: http://www.unicode.org/charts/
| Two-character code | Hexadecimal code | Name |
| --- | --- | --- |
| <d:aa> | <d:0301> | Acute Accent |
| <d:ad> | <d:030B> | Double Acute Accent |
| <d:br> | <d:0306> | Breve |
| <d:ca> | <d:030A> | Circle Above |
| <d:cb> | <d:0325> | Circle Below |
| <d:cc> | <d:0313> | High Comma Centred |
| <d:cd> | <d:0310> | Candrabindu |
| <d:ce> | <d:0327> | Cedilla |
| <d:cf> | <d:0302> | Circumflex |
| <d:co> | <d:0315> | High Comma Off Centre |
| <d:da> | <d:0307> | Dot Above |
| <d:db> | <d:0323> | Dot Below |
| <d:dd> | <d:0324> | Diaeresis Below |
| <d:gv> | <d:0300> | Grave Accent |
| <d:hb> | <d:032E> | Half Circle Below |
| <d:hk> | <d:030C> | Hacek (Caron) |
| <d:hl> | <d:0326> | Hook to Left (Comma Below) |
| <d:hr> | <d:0328> | Hook to Right (Ogonek) |
| <d:ht> | <d:0309> | High Tone |
| <d:ll> | <d:FE20> | Left Half of Ligature |
| <d:lr> | <d:FE21> | Right Half of Ligature |
| <d:ma> | <d:0304> | Macron |
| <d:ru> | <d:031C> | Rude |
| <d:tl> | <d:FE22> | Left Half Double Tilde |
| <d:tr> | <d:FE23> | Right Half Double Tilde |
| <d:ud> | <d:0333> | Double Underline |
| <d:um> | <d:0308> | Umlaut |
| <d:ul> | <d:0322> | Underline |
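The relationship between a two-character code, its hexadecimal equivalent and the displayed character can be sketched in Python as follows. This is an illustration under stated assumptions, not Amlib's implementation: the mapping is a small subset copied from the table, the function name is hypothetical, and Unicode NFC normalisation is used here as one way to combine a base letter with the diacritic that follows it.

```python
import re
import unicodedata

# Subset of the table: two-character code -> Unicode combining character.
SHORT_CODES = {
    "aa": "\u0301",  # Acute Accent
    "gv": "\u0300",  # Grave Accent
    "cf": "\u0302",  # Circumflex
    "um": "\u0308",  # Umlaut
    "ce": "\u0327",  # Cedilla
    "ma": "\u0304",  # Macron
    "hk": "\u030C",  # Hacek (Caron)
    "br": "\u0306",  # Breve
}

SHORT_TOKEN = re.compile(r"<d:([a-z]{2})>")


def expand_short_codes(text: str) -> str:
    """Replace known two-character tokens and compose the result (NFC)."""

    def replace(match: re.Match) -> str:
        # Codes missing from the subset are left exactly as written.
        return SHORT_CODES.get(match.group(1), match.group(0))

    combined = SHORT_TOKEN.sub(replace, text)
    # The combining mark follows its base letter, so NFC can merge the
    # pair into a single precomposed character where one exists.
    return unicodedata.normalize("NFC", combined)
```

For example, `expand_short_codes("a<d:gv>")` produces à, matching the example given earlier in this section.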
Amlib Client vs. NetOPACs view
In the Amlib client Catalogue module, nothing is displayed for characters that cannot be shown in Amlib (Alternate Graphic, for example), but various diacritics on Latin-based characters can be shown (as in the Title statement).
In the NetOPACs view, the Unicode characters are displayed.