Component Coredata Examples

Inline USV example #1

Instead of writing “U+0634 ش ARABIC LETTER SHEEN” (and trying to figure out how to type the correct character and find the exact Unicode character name), you can just write <Character usv="0634"/> to produce carefully-styled text with USV, character and Unicode character name: U+0634 ش ARABIC LETTER SHEEN

You can also specify a character by typing it literally <Character char="á"/> to produce: U+00E1 á LATIN SMALL LETTER A WITH ACUTE

Inline USV example #2

From the glossary, using inline HTML:

Transformation of data to a normal form. For historical reasons,
the Unicode Standard allows some characters to have more than one
encoded representation. For example, _á_ may be represented as a single codepoint,
<span class='USV'>U+00E1</span>&nbsp;<span class='UnicodeCharName'>
LATIN SMALL LETTER A WITH ACUTE</span>, or two codepoints, <span class='USV'>
U+0061</span>&nbsp;<span class='UnicodeCharName'>LATIN SMALL LETTER A</span>
and <span class='USV'>U+0301</span>&nbsp;<span class='UnicodeCharName'>COMBINING
ACUTE ACCENT</span>. A normalization scheme is used to standardize the codepoints
so that every character is always represented by the same sequence of codepoints.
Normalization is described in the Unicode Standard Section 5.7, Normalization.

Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, U+00E1 LATIN SMALL LETTER A WITH ACUTE, or two codepoints, U+0061 LATIN SMALL LETTER A and U+0301 COMBINING ACUTE ACCENT. A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.

Using the `Character` component:

Transformation of data to a normal form. For historical reasons, the Unicode
standard allows some characters to have more than one encoded representation.
For example, á may be represented as a single codepoint, <Character char="á"/>,
or two codepoints, <Character char="a"/> and <Character usv="0301"/>.
A normalization scheme is used to standardize the codepoints so that every
character is always represented by the same sequence of codepoints. Normalization
is described in the Unicode Standard Section 5.7, Normalization.

Transformation of data to a normal form. For historical reasons, the Unicode standard allows some characters to have more than one encoded representation. For example, á may be represented as a single codepoint, U+00E1 á LATIN SMALL LETTER A WITH ACUTE, or two codepoints, U+0061 a LATIN SMALL LETTER A and U+0301 ◌́ COMBINING ACUTE ACCENT. A normalization scheme is used to standardize the codepoints so that every character is always represented by the same sequence of codepoints. Normalization is described in the Unicode Standard Section 5.7, Normalization.

CharacterTable example

<CharacterTable startUsv="063D" endUsv="063F" caption="Farsi Yeh forms"/>

Consider these Farsi Yeh forms:

USV	Char	Unicode Character Name
U+063D	ؽ	ARABIC LETTER FARSI YEH WITH INVERTED V
U+063E	ؾ	ARABIC LETTER FARSI YEH WITH TWO DOTS ABOVE
U+063F	ؿ	ARABIC LETTER FARSI YEH WITH THREE DOTS ABOVE

Farsi Yeh forms