Skip to content

Other Input Methods

Although a keyboard (either a physical keyboard or a touch screen keyboard) is likely the most common means of data input, other possibilities exist. Among these are:

  • Web-based pickers
  • Character pickers
  • USV entry
  • Voice recognition
  • Handwriting recognition
  • Optical character recognition

With these, the text is entered in a text box within the picker, running directly in a browser. The text can then be copied into any document/application that supports Unicode. These can be used with any operating system provided the corresponding fonts are available (and smart rendering is available for certain advanced scripts).

  • Unicode character pickers - The Unicode character apps allow you to produce, analyse and manipulate runs of text for a given language or script. Character apps are especially useful for people who don’t know a script well, as characters are displayed in ways that aid identification and input (unlike a regular character map). The apps are written in HTML and JavaScript, so no need to download anything – just use any browser. You can easily cut and paste text between the apps and your own document. The apps are just as useful for analysing text copied from other locations as they are for creating text.
  • Uniview - On his website, Richard Ishida has published a number of useful Unicode-related tools and resources. Relevant to this topic, Richard’s UniView page is a very capable and useful character browser. By default UniView displays server-generated graphics for each character, meaning you don’t have to have the fonts on your local machine. There is an option to display characters as text rather than graphics, in which case you would need to configure the desired fonts into your browser settings. In addition to browsing the Unicode character database, UniView can serve as a character picker and as a clipboard viewer, allowing one to see exactly what Unicode characters are in the clipboard text.
  • Branah.com - Online keyboards for quite a few languages.
  • TypeIt - About a dozen online keyboards.
  • FileFormat.info - One of the strengths of this site is that it shows how the character would be expressed in various Unicode Encoding Forms and programming languages. Another useful aspect of this site is that you can put the character you want directly in the URL for example: https://www.fileformat.info/info/unicode/char/067b/index.htm. Thus you could set up quick-searches in your browser to go directly to the character.
  • Tools for indigenous languages on the web- This site includes basic tools and prototypes for some indigenous languages. Some of these pickers provide a search feature or allow an existing string of text to be analysed.
  • Unicode’s collation charts: Unicode lists all the characters in collation order. This makes it easy to find the character you are looking for, especially for Latin, if you are familiar with how the script is collated. These characters can be copy and pasted into your application.

Character pickers (either hosted on a website or on the user’s device) present an array of characters and allow the user to select the desired ones. This is generally a slower method for data input than a keyboard, but may be useful for entering small amounts of text.

Some examples are listed below for different operating systems.

Windows:

  • Unibook - The Unicode Consortium itself provides a useful character browser called Unibook. The original (and continuing) purpose of this program was to print the code charts used to publish the Unicode standard, but it has evolved into a general purpose and useful Unicode character browser. In the Unibook browser window you can click on any character cell and a popup will give additional details (user configurable) about that character. You can also copy any character to the clipboard, thereby providing character-picker functionality. Unibook’s sophisticated search facilities can find and highlight characters based on up to four criteria including Unicode property values, character names, font coverage, and even user-supplied property files. Other advanced capabilities include Bidi and Line Break algorithm demos, legacy character set views, and flexible font selections. One of Unibook’s main strengths is that it runs locally and does not require an internet connection. Unfortunately it has two weaknesses: it is Windows only, and can display glyphs only for those characters for which you have fonts installed on your system. Nonetheless it has been a mainstay of Unicode developers for years.
  • Babelmap - BabelMap is a free character map application for Windows that allows you to browse through the entire Unicode character repertoire, or search for a particular character by name or by code point. Characters can then be copied to the clipboard for use in any Unicode-aware application.

Desktop Linux:

  • GNOME Character Map - A GNOME application that provides a simple interface that shows some of the Unicode character properties of the selected character, and it can be used as a character picker. Also like Unibook, it requires fonts to be installed in your system.
  • KCharSelect - This is a tool to select special characters from all installed fonts and copy them into the clipboard.
  • Characters - This is a simple utility application to find and insert unusual characters. It allows you to quickly find the character you are looking for by searching for keywords.

macOS X:

  • UnicodeChecker - This is a Mac application to explore Unicode. Analyse and convert Unicode strings for HTML or programming. Supports UniHan, Services, AppleScript and more.
  • Font Book

Android:

Cross-platform applications such as Libreoffice.org also provide glyph palettes (or “Insert special characters”) to find, copy and paste the characters you need.

The computer operating system or a specific software application may offer the possibility of entering a character by typing its Unicode Scalar Value (USV), that is the four-, five-, or six-digit hexadecimal value introduced by “U+”. This is a slow input method, only suitable for entering occasional characters.

Some operating systems also provide a way to type in a character directly using its Unicode Scalar Value (USV).

  • On many Windows applications this can be acheived by typing the USV, selecting the USV and typing ALT+X. Another method can be used if you are willing to edit the Windows registry. This is achieved by holding down the Alt key and entering ”+” followed by the USV number using the numeric keypad in a text area. In some applications (such as Word). This method is described in Unicode Hex Input Windows Registry setting.
  • On Ubuntu this is achieved by holding down Shift and Ctrl, and entering ‘U’ followed by the USV number in a text area.

This method of finding characters is sometimes referred to as “using Alt Codes”. The drawback is that you need to know the numerical value corresponding to the character you are looking for. Making use of a character picker when the existing collection of characters is searchable is usually a more useful solution.

With voice recognition (also known as speech-to-text), a person speaks into the microphone of a computer or phone. The device then analyzes the recorded sounds and matches them to information about a particular language, attempting to produce the corresponding text. Because this method may require large amounts of written text and corresponding recordings, it may not be feasible for a language that is relatively new in the digital space. Text produced in this manner needs to be verified for accuracy.

See Speech recognition for more details.

With hand writing recognition, a person draws characters, often with a stylus on a touch-sensitive screen, and the device (phone, tablet or computer) translates these drawings into characters.

See Handwriting recognition for more details.

Optical character recognition (or OCR) involves taking a visual copy of text (usually printed text, but possibly handwritten text) and attempting to detect what characters are there. Text produced in this manner needs to be verified for accuracy.

See Optical character recognition for more details.