Arabic

Script details
Section titled “Script details”See all script details: code, region, status and more
| Code | Arab |
| Script type | abjad |
| Region | Middle Eastern |
| Status | Current |
| Direction | RTL bidirectional |
| Baseline | bottom |
| Case | no |
| White space | between words |
| Complex behaviors | diacritics, contextual forms, complex positioning, required ligatures |
| OpenType code | arab |
| ISO 15924 Number | 160 (right-to-left alphabetic) |
Script description
Section titled “Script description”Arabic writing is the second most broadly-used script in the world, after the Latin alphabet.
Read the full description…
It descended from the Nabataean abjad, itself a descendant of the Phoenician script, and has been used since the 4th century for writing the Arabic language. Since the words of the Prophet Muhammed can only be written in Arabic, the Arabic script has traveled far and wide with the spread of Islam and came to be used for a number of languages throughout Asia, Africa and the Middle East. Many of these are non-Semitic languages, so employ very different sound systems from spoken Arabic, and as a result the script has had to be adapted and is used slightly differently by speakers of different languages. Many African languages use an Arabic-based transcription system called Ajami, which is different from the original Arabic script. Romance languages such as Mozarabic or Ladino are also sometimes written in a modified Arabic script, called Aljamiado.
Many variations on the script have developed over time and space, but these can be broadly classified into two groups; an angular kufic style which was originally used for stone inscriptions and which commonly employs no diacritics, and the naskh style which is more commonly used, more rounded in form, and governed by a set of principles regulating the proportions between the letters. There are a number of variant styles included in this group, including those used in Arabic calligraphy.
Arabic letters are read from right to left. The script is an abjad; only the consonants are required to be written. The basic set of letters consists of twenty-eight consonants, although some languages use many more letters than this. Some letters can represent a consonant or a long vowel, depending on the context. An additional set of diacritics exists for writing short vowels, but its use is optional. Conventionally, for writing the Arabic language, long vowels are written and short vowels are omitted. Where a vowel is not written, readers of the script must use their knowledge of the language and its phonology to insert the appropriate vowel sound. Diacritics also exist for marking gemination (consonant lengthening).
Arabic is a unicameral script; there is no upper and lower case. It is also obligatorily cursive, that is, all the letters in a word must be connected wherever possible. Some letters ([a] ا, [d] ﺩ, [ð] ﺫ, [r] ﺭ, [z] ﺯ and [w] ﻭ) cannot always be joined. Each letter has three contextual forms depending on whether it appears at the beginning, in the middle, or at the end of a word, as well as a basic stand-alone shape which is used when it appears at the end of a word and is preceded by one of the letters listed above which cannot join on the left side. Many letters look broadly similar to one another, differing only in the placement of one or more dots above or below the letter. For example, the letters representing [ħ], [g] and [x] are all the same, except that [ħ] is unmarked, [g] has a dot in the loop of the letter and [x] has a dot above it. These dots are called i’jam and form an integral part of the letter.
A number of ligatures are used in handwriting, but only one [l] + [a] is compulsory ﻻ. A ligature is commonly used for writing the word Allah ‘God’ .ﷲ
Latin punctuation is commonly used, with a few exceptions. The Arabic comma, question mark, and percent sign are script-specific, and there is also an Arabic triple-dot mark. In addition, there are script-specific honorific marks which may be placed above a person’s name in order to confer honor or a blessing upon them. There are also Koranic annotation signs, mostly to provide guidance in chanting and singing sacred text.
The Arabic script employs two sets of numbers, Standard and Eastern Arabic. Latin numbers, which derive from a medieval set of Arabic numbers, are also used, particularly in North Africa. Like Arabic letters, numbers are written from right to left, but with the highest value on the left, as with Latin numbers, so they must be read from left to right. There is also a numeral system known as abjad numerals, in which each of the 28 letters of the Arabic abjad is assigned a numeric value of units, tens or hundreds. These are combined to create larger numbers.
Languages that use this script
Section titled “Languages that use this script”| Language | Writing System Code | Writing System Status | SLDR/CLDR locale | Regional variants |
|---|---|---|---|---|
| Aari | aiw-Arab | in use | aiw-Arab-ET (Ethiopia) | |
| Aceh | ace-Arab | in use | ace-Arab-ID (Indonesia) | |
| Adyghe | ady-Arab | obsolete | ady-Arab-RU (Russian Federation) | |
| Aer | aeq-Arab | in use | aeq-Arab-PK (Pakistan) | |
| Afar | aa-Arab | obsolete | aa-Arab-ET (Ethiopia) | |
| See complete list | ||||
Unicode status
Section titled “Unicode status”In The Unicode Standard, Arabic Script implementation is discussed in Chapter 9 Middle East-I: Modern and Liturgical Scripts, in UAX 53 Unicode Arabic Mark Rendering, and in UTN 46 Rendering the Allah Ligature. The Arabic (Nastaliq variant) script uses characters from the Arabic script Unicode blocks.
Other:
Resources
Section titled “Resources”Related articles
- Complex Behaviors - An overview of various kinds of complex behaviors
- Different kinds of hamza characters in Arabic script
- Direction - Description and examples of layout direction
- Glyph Variant for U+0645 - Description of how some languages using arabic script differ in their usage and presentation of U+0645 Arabic Character Meem
- Graphite kerning for right-to-left scripts
- Justification - Description and examples of justification techniques
- OpenType - Features and lookups
- Paragraph Layout - Issues in paragraph layout
- The Unicode Bidirectional Algorithm - The "bidi" algorithm describes how software should process text that contains both left-to-right and right-to-left sequences of characters.
External links
- Arabic (ScriptSource)
- Arabic script (Wikipedia)
- Arabic Alphabet (Omniglot)
- ALA-LC Romanization Tables
- Arabic and Persian Gap Analysis (W3C)
- Arabic character picker (r12a.io)
- Developing OpenType Fonts for Arabic Script (Microsoft)
- DIN 31635 (Arabic script transliteration) (Wikipedia)
- Harir - Reducing Noise in Arabic Script (I Love Typography)
- Inventory Saraiki (PHOIBLE)
- Ishida: List of terms to find letter combinations (r12a.io)
- Ishida: Notes on Scripts, Orthographies and Characters, Lists of Terms (r12a.io)
- Kashmiri Language
- Kyrgyz Alphabet (Wikipedia)
- Saraiki alphabet (Wikipedia)
- Saraiki vowel
- Script comparison table (r12a.io)
- The Persian Alphabet
- Unicode character pickers (r12a.io)
Bibliography
- Abba, Tahir. “Ajamization of Knowledge: The Kanuri Experience”. Conference of Language and Identity in Africa, Cairo University, 2015-04-05, https://www.academia.edu/21905366/Ajamization_of_Knowledge_The_Kanuri_Experience.
- Abdelhadi, Souad, et al. “Perceptual load in the reading of Arabic: Effects of orthographic visual complexity on detection”. Writing Systems Research, vol. 3, no. 2, 2011, http://www.tandfonline.com/doi/abs/10.1093/wsr/wsr014.
- Al-Baba, Kamel. Rouh Al-Khatt Al-Arabi. Dar El-Ilm Lil Malayin, 1994.
- Bauer, Thomas. “Arabic Writing”. The World's Writing Systems, edited by Peter T. Daniels and William Bright, Oxford University Press, 1996, pp. 559-564.
- Beeston, Alfred. The Arabic Language Today. Georgetown University Press, 2006.
- Benatia, Mohamed Jamal Eddine, et al. “Arabic text justification”. TUGboat, 2006, https://www.tug.org/TUGboat/tb27-2/tb87benatia.pdf.
- Bondarev, Dmitry. “An Archaic Form of Kanuri/Kanembu: A Translation Tool for Qur'Anic Studies”. Journal of Qur'anic Studies, 2006, pp. 143-153, https://www.academia.edu/282596/An_Archaic_Form_of_Kanuri_Kanembu_A_Translation_Tool_for_QurAnic_Studies.
- Bondarev, Dmitry. “Islamic Education and Ample Space Layout in West African Islamic Manuscripts”. The Arts and Crafts of Literacy: Manuscript Cultures in Muslim Sub-Saharan Africa, De Gruyter, 2017, https://www.academia.edu/38873196/Islamic_Education_and_Ample_Space_Layout_in_West_African_Islamic_Manuscripts.
- Bondarev, Dmitry. “Multiglossia in West African manuscripts: The case of Borno, Nigeria”. Manuscript Cultures: Mapping the Field, edited by Jörg B. Quenzer, et al., De Gruyter Mouton, 2014, https://www.academia.edu/715694/Multiglossia_in_West_African_manuscripts_The_case_of_Borno_Nigeria.
- Bondarev, Dmitry. “The Language of the Glosses In the Bornu Quranic Manuscripts”. Bulletin of the School of Oriental and African Studies, University of London, vol. 69, no. 1, 2006, pp. 113-140, https://www.academia.edu/282594/The_Language_of_the_Glosses_In_the_Bornu_Quranic_Manuscripts.
- Brigaglia, Andrea. “Central Sudanic Arabic Scripts (Part 1): The Popularization of the Kanawi Script”. Islamic Africa, vol. 2, no. 2, 2011, pp. 51-85, https://www.academia.edu/9668701/Central_Sudanic_Arabic_Scripts_Part_1_The_Popularization_of_the_Kanawi_Script?email_work_card=view-paper.
- Brigaglia, Andrea and Mauro Nobili. “Central Sudanic Arabic Scripts (Part 2): The Barnawi”. Islamic Africa, vol. 2, no. 2, 2013, pp. 195-223, https://www.academia.edu/9668724/Central_Sudanic_Arabic_Scripts_Part_2_The_Barnawi.
- Campbell, George L. Concise Compendium of the World's Languages. Routledge, 2011.
- Clement, Victoria. “Emblems of independence: script choice in post-Soviet Turkmenistan”. International Journal of the Sociology of Language, vol. 2008, no. 192, 2008, pp. 171-185, http://dx.doi.org/10.1515/IJSL.2008.042.
- Donnelly, Kevin. “Writing and transliterating Swahili in Arabic script with Andika!” Grapholinguistics in the 21st Century, 2020-06-17, http://www.fluxus-editions.fr/grafematik2020-files/donnelly-document.pdf.
- Haralambous, Yannis. “The traditional Arabic typecase extended to the Unicode set of glyphs”. Electronic Publishing, vol. 8, no. 2&3, 1995, pp. 111-123, https://www.academia.edu/733363/The_traditional_Arabic_typecase_extended_to_the_Unicode_set_of_glyphs.
- Hatcher, Lynley. “Script change in Azerbaijan: acts of identity”. International Journal of the Sociology of Language, vol. 2008, no. 192, 2008, pp. 105-116, http://dx.doi.org/10.1515/IJSL.2008.038.
- Meletis, Dimitrios and Christa Dürscheid. Writing Systems and Their Use. De Gruyter Mouton, 2022, https://www.degruyter.com/document/doi/10.1515/9783110757835/html#contents.
- Mirdehghan, Mahinnaz. “Persian, Urdu, and Pashto: A comparative orthographic analysis”. Writing Systems Research, vol. 2, no. 1, 2010, pp. 9-23, http://www.tandfonline.com/doi/abs/10.1093/wsr/wsq005.
- Mumin, Meikal and Kees Versteegh, eds. The Arabic Script in Africa (Studies in the Use of a Writing System). Studies in Semitic Languages and Linguistics (Book 71), Brill, 2013.
- Roberts, David. “Review of Meikal Mumin and Kees Versteegh (eds.), The Arabic script in Africa: Studies in the use of a writing system (Studies in Semitic languages and linguistics, 71). Leiden: Brill, 2014. pp. 400.” Written Language & Literacy, vol. 17, no. 2, 2014, pp. 308-314, http://dx.doi.org/10.1075/wll.17.2.06rob.
- Savage, Andrew. “Writing Tuareg — the three script options”. International Journal of the Sociology of Language, vol. 2008, no. 192, 2008, pp. 5-13, http://dx.doi.org/10.1515/IJSL.2008.031.
- Zawaydeh, Bushra. “A Profile of Arabic Script Languages”. Basis Technology Corporation Government Users Conference, 2007-06-07, http://www.scribd.com/doc/8591687/Arabic-Alphabet-Calligraphy#scribd.