Component Coredata Examples
Inline USV example #1
Section titled “Inline USV example #1”Instead of writing “U+0634 ش ARABIC LETTER SHEEN” (and trying to figure out how to type the correct character and find the exact Unicode character name), you can just write <Character usv="0634"/> to produce carefully-styled text with USV, character and Unicode character name: ش
You can also specify a character by typing it literally <Character char="á"/> to produce: á
Inline USV example #2
Section titled “Inline USV example #2”From the glossary, using inline HTML:
Section titled “From the glossary, using inline HTML:”Transformation of data to a normal form. For historical reasons,the Unicode Standard allows some characters to have more than oneencoded representation. For example, _á_ may be represented as a single codepoint,<span class='USV'>U+00E1</span> <span class='UnicodeCharName'>LATIN SMALL LETTER A WITH ACUTE</span>, or two codepoints, <span class='USV'>U+0061</span> <span class='UnicodeCharName'>LATIN SMALL LETTER A</span>and <span class='USV'>U+0301</span> <span class='UnicodeCharName'>COMBININGACUTE ACCENT</span>. A normalization scheme is used to standardize the codepointsso that every character is always represented by the same sequence of codepoints.Normalization is described in the Unicode Standard Section 5.7, Normalization.Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, U+00E1 LATIN SMALL LETTER A WITH ACUTE, or two codepoints, U+0061 LATIN SMALL LETTER A and U+0301 COMBINING ACUTE ACCENT. A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.
Using the Character component:
Section titled “Using the Character component:”Transformation of data to a normal form. For historical reasons, the Unicodestandard allows some characters to have more than one encoded representation.For example, á may be represented as a single codepoint, <Character char="á"/>,or two codepoints, <Character char="a"/> and <Character usv="0301"/>.A normalization scheme is used to standardize the codepoints so that everycharacter is always represented by the same sequence of codepoints. Normalizationis described in the Unicode Standard Section 5.7, Normalization.Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, á , or two codepoints, a and ◌́ . A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.
CharacterTable example
Section titled “CharacterTable example”<CharacterTable startUsv="063D" endUsv="063F" caption="Farsi Yeh forms"/>Consider these Farsi Yeh forms:
| USV | Char | Unicode Character Name |
|---|---|---|
| U+063D | ؽ | ARABIC LETTER FARSI YEH WITH INVERTED V |
| U+063E | ؾ | ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE |
| U+063F | ؿ | ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE |