Character Requirements for African (Arabic script) Orthographies
Developing comprehensive fonts can be quite a challenge. There are over 340 characters in the Arabic Unicode blocks (not including the presentations forms). For many of those characters, there are up to 4 different forms. An Arabic script font can require many, many glyphs.
If a font supports all of the following characters, we believe that most languages using the Arabic script in West Africa would be well supported. Our hope is that all type designers who are designing Arabic script fonts for Africa will be willing to add these characters to their fonts. This would be a great service.
There are comments below about glyph variants that may be needed to support some languages.
The process for developing this list was based on the following:
- Decision to include all Codepage 1252 and MacRoman characters
- Decision to include many characters required for publishing (spaces and dashes)
- Decision to support all languages from West Africa where the character requirements are known
- Decision to support all characters in the Unicode Standard that are indicated as “African”
- Decision to support all characters used by Arabic language (for borrowed words)
Feedback on the character repertoire is welcomed.
| Graphic | USV | Name | Comment |
|---|---|---|---|
| | 0600 | ||
| | 0602 | ||
| ، | 060C | ||
| ◌ؐ | 0610 | ||
| ◌ؑ | 0611 | ||
| ◌ؒ | 0612 | ||
| ◌ؓ | 0613 | ||
| ◌ؔ | 0614 | ||
| ؛ | 061B | ||
| ؞ | 061E | ||
| ؟ | 061F | ||
| ء | 0621 | ||
| آ | 0622 | ||
| أ | 0623 | ||
| ؤ | 0624 | ||
| إ | 0625 | ||
| ئ | 0626 | ||
| ا | 0627 | ||
| ب | 0628 | ||
| ة | 0629 | ||
| ت | 062A | ||
| ث | 062B | ||
| ج | 062C | ||
| ح | 062D | ||
| خ | 062E | ||
| د | 062F | ||
| ذ | 0630 | ||
| ر | 0631 | ||
| ز | 0632 | ||
| س | 0633 | ||
| ش | 0634 | ||
| ص | 0635 | ||
| ض | 0636 | ||
| ط | 0637 | ||
| ظ | 0638 | ||
| ع | 0639 | ||
| غ | 063A | ||
| ـ | 0640 | ||
| ف | 0641 | ||
| ق | 0642 | ||
| ك | 0643 | ||
| ل | 0644 | ||
| م | 0645 | There is a glyph variant for this character which is sometimes used in Africa (see Glyph Variant for U+0645). | |
| ن | 0646 | ||
| ه | 0647 | ||
| و | 0648 | ||
| ى | 0649 | ||
| ي | 064A | ||
| ◌ً | 064B | ||
| ◌ٌ | 064C | ||
| ◌ٍ | 064D | ||
| ◌َ | 064E | ||
| ◌ُ | 064F | There is a glyph variant for this character (see Glyph Variant for U+064F) | |
| ◌ِ | 0650 | There is a glyph variant for this character which is sometimes used in Africa (see Variant Behavior for Kasra (U+0650)) | |
| ◌ّ | 0651 | ||
| ◌ْ | 0652 | There is a glyph variant for this character (see Glyph Variants for U+0652) | |
| ◌ٓ | 0653 | ||
| ◌ٔ | 0654 | ||
| ◌ٖ | 0656 | ||
| ◌ٗ | 0657 | There is a glyph variant for this character (see Glyph Variant for U+0657) | |
| ◌٘ | 0658 | ||
| ◌ٚ | 065A | ||
| ◌ٛ | 065B | ||
| ◌ٜ | 065C | ||
| ◌ٝ | 065D | ||
| ٠ | 0660 | ||
| ١ | 0661 | ||
| ٢ | 0662 | ||
| ٣ | 0663 | ||
| ٤ | 0664 | ||
| ٥ | 0665 | ||
| ٦ | 0666 | ||
| ٧ | 0667 | ||
| ٨ | 0668 | ||
| ٩ | 0669 | ||
| ٫ | 066B | ||
| ٬ | 066C | ||
| ٯ | 066F | ||
| ◌ٰ | 0670 | ||
| ٴ | 0674 | ||
| ٻ | 067B | ||
| پ | 067E | ||
| ڃ | 0683 | ||
| ڄ | 0684 | ||
| چ | 0686 | ||
| ډ | 0689 | ||
| ڊ | 068A | ||
| ڋ | 068B | ||
| ڏ | 068F | ||
| ژ | 0698 | ||
| ڜ | 069C | ||
| ڞ | 069E | ||
| ڟ | 069F | ||
| ڠ | 06A0 | ||
| ڡ | 06A1 | ||
| ڢ | 06A2 | ||
| ڤ | 06A4 | ||
| ڥ | 06A5 | ||
| ڧ | 06A7 | ||
| ڨ | 06A8 | ||
| ک | 06A9 | ||
| ڪ | 06AA | ||
| ڮ | 06AE | ||
| گ | 06AF | ||
| ں | 06BA | ||
| ۀ | 06C0 | ||
| ۆ | 06C6 | ||
| ۈ | 06C8 | ||
| ۉ | 06C9 | ||
| ی | 06CC | ||
| ې | 06D0 | ||
| ۑ | 06D1 | ||
| ے | 06D2 | ||
| | 06DD | There is a glyph variant for this character (see Glyph Variants for U+06DD) | |
| ۥ | 06E5 | ||
| ◌ۨ | 06E8 | ||
| ݐ | 0750 | ||
| ݑ | 0751 | ||
| ݒ | 0752 | ||
| ݓ | 0753 | ||
| ݔ | 0754 | ||
| ݕ | 0755 | ||
| ݖ | 0756 | ||
| ݗ | 0757 | ||
| ݘ | 0758 | ||
| ݚ | 075A | ||
| ݛ | 075B | ||
| ݝ | 075D | ||
| ݞ | 075E | ||
| ݟ | 075F | ||
| ݠ | 0760 | ||
| ݡ | 0761 | ||
| ݣ | 0763 | ||
| ݤ | 0764 | ||
| ݥ | 0765 | ||
| ݦ | 0766 | ||
| ݧ | 0767 | ||
| ݪ | 076A | ||
| ࢠ | 08A0 | ||
| ࢡ | 08A1 | ||
| ࢢ | 08A2 | ||
| ࢣ | 08A3 | ||
| ࢤ | 08A4 | ||
| ࢥ | 08A5 | ||
| ࢦ | 08A6 | ||
| ࢧ | 08A7 | ||
| ࢨ | 08A8 | ||
| ࢩ | 08A9 | ||
| ◌ࣴ | 08F4 | ||
| ◌ࣵ | 08F5 | ||
| ◌ࣶ | 08F6 | ||
| ◌ࣷ | 08F7 | ||
| ◌ࣸ | 08F8 | ||
| ◌ࣹ | 08F9 | ||
| ◌ࣺ | 08FA | ||
| ◌ࣻ | 08FB | ||
| ◌ࣼ | 08FC | ||
| ◌ࣽ | 08FD | ||
| ◌ࣾ | 08FE | ||
| | 202A | ||
| | 202B | ||
| | 202C | ||
| | 202D | ||
| | 202E | ||
| | 2060 | ||
| | 206C | ||
| | 206D | ||
| ﴾ | FD3E | ||
| ﴿ | FD3F | ||
| ﷲ | FDF2 |
Codepage 1252 and Mac-Roman merger
Section titled “Codepage 1252 and Mac-Roman merger”We believe that any Arabic script font should include a core set of Roman characters for mixed script use as well as for common punctuation requirements in Arabic script text. The characters in the two lists below are our recommendation for a basic set of characters. The combined lists include the union of Windows CP1252 and Mac-Roman.
Characters often used in Arabic script
Section titled “Characters often used in Arabic script”These are characters often required in regular Arabic text (not necessarily mixed script). Because these are normally used in Arabic text, these characters should be designed to match the design style of the Arabic script characters listed above.
| Graphic | USV | Name | Comment |
|---|---|---|---|
| 0020 | |||
| ! | 0021 | ||
| " | 0022 | ||
| & | 0026 | ||
| ' | 0027 | ||
| ( | 0028 | ||
| ) | 0029 | ||
| + | 002B | ||
| , | 002C | ||
| - | 002D | ||
| . | 002E | ||
| / | 002F | ||
| 0 | 0030 | ||
| 1 | 0031 | ||
| 2 | 0032 | ||
| 3 | 0033 | ||
| 4 | 0034 | ||
| 5 | 0035 | ||
| 6 | 0036 | ||
| 7 | 0037 | ||
| 8 | 0038 | ||
| 9 | 0039 | ||
| : | 003A | ||
| ; | 003B | ||
| < | 003C | ||
| = | 003D | ||
| > | 003E | ||
| ? | 003F | ||
| @ | 0040 | ||
| [ | 005B | ||
| \ | 005C | ||
| ] | 005D | ||
| { | 007B | ||
| | | 007C | ||
| } | 007D | ||
| « | 00AB | ||
| » | 00BB | ||
| × | 00D7 | ||
| ÷ | 00F7 | ||
| | 200C | ||
| | 200D | ||
| | 200F | ||
| ‘ | 2018 | ||
| ’ | 2019 | ||
| “ | 201C | ||
| ” | 201D | ||
| ‹ | 2039 | ||
| › | 203A | ||
| ◌ | 25CC |
Other characters
Section titled “Other characters”These are other characters from Codepage 1252 and/or Mac-Roman used in mixed script (Latin and Arabic script) text and publishing.
| Graphic | USV | Name | Comment |
|---|---|---|---|
| # | 0023 | ||
| $ | 0024 | ||
| % | 0025 | ||
| * | 002A | ||
| A | 0041 | ||
| B | 0042 | ||
| C | 0043 | ||
| D | 0044 | ||
| E | 0045 | ||
| F | 0046 | ||
| G | 0047 | ||
| H | 0048 | ||
| I | 0049 | ||
| J | 004A | ||
| K | 004B | ||
| L | 004C | ||
| M | 004D | ||
| N | 004E | ||
| O | 004F | ||
| P | 0050 | ||
| Q | 0051 | ||
| R | 0052 | ||
| S | 0053 | ||
| T | 0054 | ||
| U | 0055 | ||
| V | 0056 | ||
| W | 0057 | ||
| X | 0058 | ||
| Y | 0059 | ||
| Z | 005A | ||
| ^ | 005E | ||
| _ | 005F | ||
| ` | 0060 | ||
| a | 0061 | ||
| b | 0062 | ||
| c | 0063 | ||
| d | 0064 | ||
| e | 0065 | ||
| f | 0066 | ||
| g | 0067 | ||
| h | 0068 | ||
| i | 0069 | ||
| j | 006A | ||
| k | 006B | ||
| l | 006C | ||
| m | 006D | ||
| n | 006E | ||
| o | 006F | ||
| p | 0070 | ||
| q | 0071 | ||
| r | 0072 | ||
| s | 0073 | ||
| t | 0074 | ||
| u | 0075 | ||
| v | 0076 | ||
| w | 0077 | ||
| x | 0078 | ||
| y | 0079 | ||
| z | 007A | ||
| ~ | 007E | ||
| 00A0 | |||
| ¡ | 00A1 | ||
| ¢ | 00A2 | ||
| £ | 00A3 | ||
| ¤ | 00A4 | ||
| ¥ | 00A5 | ||
| ¦ | 00A6 | ||
| § | 00A7 | ||
| ¨ | 00A8 | ||
| © | 00A9 | ||
| ª | 00AA | ||
| ¬ | 00AC | ||
| | 00AD | ||
| ® | 00AE | ||
| ¯ | 00AF | ||
| ° | 00B0 | ||
| ± | 00B1 | ||
| ² | 00B2 | ||
| ³ | 00B3 | ||
| ´ | 00B4 | ||
| µ | 00B5 | ||
| ¶ | 00B6 | ||
| · | 00B7 | ||
| ¸ | 00B8 | ||
| ¹ | 00B9 | ||
| º | 00BA | ||
| ¼ | 00BC | ||
| ½ | 00BD | ||
| ¾ | 00BE | ||
| ¿ | 00BF | ||
| À | 00C0 | ||
| Á | 00C1 | ||
| Â | 00C2 | ||
| Ã | 00C3 | ||
| Ä | 00C4 | ||
| Å | 00C5 | ||
| Æ | 00C6 | ||
| Ç | 00C7 | ||
| È | 00C8 | ||
| É | 00C9 | ||
| Ê | 00CA | ||
| Ë | 00CB | ||
| Ì | 00CC | ||
| Í | 00CD | ||
| Î | 00CE | ||
| Ï | 00CF | ||
| Ð | 00D0 | ||
| Ñ | 00D1 | ||
| Ò | 00D2 | ||
| Ó | 00D3 | ||
| Ô | 00D4 | ||
| Õ | 00D5 | ||
| Ö | 00D6 | ||
| Ø | 00D8 | ||
| Ù | 00D9 | ||
| Ú | 00DA | ||
| Û | 00DB | ||
| Ü | 00DC | ||
| Ý | 00DD | ||
| Þ | 00DE | ||
| ß | 00DF | ||
| à | 00E0 | ||
| á | 00E1 | ||
| â | 00E2 | ||
| ã | 00E3 | ||
| ä | 00E4 | ||
| å | 00E5 | ||
| æ | 00E6 | ||
| ç | 00E7 | ||
| è | 00E8 | ||
| é | 00E9 | ||
| ê | 00EA | ||
| ë | 00EB | ||
| ì | 00EC | ||
| í | 00ED | ||
| î | 00EE | ||
| ï | 00EF | ||
| ð | 00F0 | ||
| ñ | 00F1 | ||
| ò | 00F2 | ||
| ó | 00F3 | ||
| ô | 00F4 | ||
| õ | 00F5 | ||
| ö | 00F6 | ||
| ø | 00F8 | ||
| ù | 00F9 | ||
| ú | 00FA | ||
| û | 00FB | ||
| ü | 00FC | ||
| ý | 00FD | ||
| þ | 00FE | ||
| ÿ | 00FF | ||
| ı | 0131 | ||
| Œ | 0152 | ||
| œ | 0153 | ||
| Š | 0160 | ||
| š | 0161 | ||
| Ÿ | 0178 | ||
| Ž | 017D | ||
| ž | 017E | ||
| ƒ | 0192 | ||
| ˆ | 02C6 | ||
| ˇ | 02C7 | ||
| ː | 02D0 | ||
| ˘ | 02D8 | ||
| ˙ | 02D9 | ||
| ˚ | 02DA | ||
| ˛ | 02DB | ||
| ˜ | 02DC | ||
| | 200E | ||
| ‐ | 2010 | ||
| ‑ | 2011 | ||
| ‒ | 2012 | ||
| – | 2013 | ||
| — | 2014 | ||
| ― | 2015 | ||
| ‚ | 201A | ||
| „ | 201E | ||
| † | 2020 | ||
| ‡ | 2021 | ||
| • | 2022 | ||
| … | 2026 | ||
| ‰ | 2030 | ||
| ⁄ | 2044 | ||
| ⁴ | 2074 | ||
| € | 20AC | ||
| ™ | 2122 | ||
| | FEFF | ||
|  | FFFC | ||
| � | FFFD |
This article formerly appeared on ScriptSource.