Skip to content

Component Coredata Examples

Instead of writing “U+0634 ش ARABIC LETTER SHEEN” (and trying to figure out how to type the correct character and find the exact Unicode character name), you can just write <Character usv="0634"/> to produce carefully-styled text with USV, character and Unicode character name: ش

You can also specify a character by typing it literally <Character char="á"/> to produce: á

Transformation of data to a normal form. For historical reasons,
the Unicode Standard allows some characters to have more than one
encoded representation. For example, _á_ may be represented as a single codepoint,
<span class='USV'>U+00E1</span>&nbsp;<span class='UnicodeCharName'>
LATIN SMALL LETTER A WITH ACUTE</span>, or two codepoints, <span class='USV'>
U+0061</span>&nbsp;<span class='UnicodeCharName'>LATIN SMALL LETTER A</span>
and <span class='USV'>U+0301</span>&nbsp;<span class='UnicodeCharName'>COMBINING
ACUTE ACCENT</span>. A normalization scheme is used to standardize the codepoints
so that every character is always represented by the same sequence of codepoints.
Normalization is described in the Unicode Standard Section 5.7, Normalization.

Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, U+00E1 LATIN SMALL LETTER A WITH ACUTE, or two codepoints, U+0061 LATIN SMALL LETTER A and U+0301 COMBINING ACUTE ACCENT. A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.

Transformation of data to a normal form. For historical reasons, the Unicode
standard allows some characters to have more than one encoded representation.
For example, á may be represented as a single codepoint, <Character char="á"/>,
or two codepoints, <Character char="a"/> and <Character usv="0301"/>.
A normalization scheme is used to standardize the codepoints so that every
character is always represented by the same sequence of codepoints. Normalization
is described in the Unicode Standard Section 5.7, Normalization.

Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, á , or two codepoints, a and ◌́ . A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.

<CharacterTable startUsv="063D" endUsv="063F" caption="Farsi Yeh forms"/>

Consider these Farsi Yeh forms:

USV Char Unicode Character Name
ؽ
ؾ
ؿ
Farsi Yeh forms