Comprehensive List of Characters (Alphabets, Scripts)

Starting from this topic you can find the complete list of characters available for use in Duxbury DBT. The list has become remarkably long, and so this topic provides links to separate tables of those characters, grouped by their Unicode range. When you link to the individual tables of characters, each character has two identification numbers, its Unicode number and its DUSCI number - the DUSCI number is the internal code for a character in DBT.

Prior to presenting the Table of Unicode Ranges, we briefly discuss several script systems that have special characteristics to be aware of if you need to work with them, starting with the Chinese Han characters.

Note: If you find you need help on ways of accessing special characters or alternative scripts, see the topic, Keyboarding into Word and DBT.

Scripts with Special Characteristics for Braille
Chinese - and Other Han - Characters

DBT does not display traditional Chinese ideogram characters. Instead DBT displays an appropriate substitute alphabet based on the language: Mandarin, Cantonese, Japanese, or Korean - the "Han" group of language scripts. Normally, the language is selected automatically just by selecting the DBT template for importing the file. At need, the automatic selection can be overridden by forcing the language choice in the Global: Import Options dialog, but in almost all cases, letting the template select the correct language is what you want.

These are the choices for how Han characters are imported into DBT:

Chinese - Language and DBT Characters
Language DBT Characters
Mandarin (mainland) Pinyin Romanization with accent
marks for the tones
Mandarin (Taiwan) Zhuyin Romanization
Cantonese Romanization with superscript
numbers for the tones
Japanese Unicode U+30xx characters
Korean Unicode U+11xx characters
Korean Hangul Characters

Hangul often compacts 2 or 3 characters into a single symbol. When DBT imports a file, the process is reversed, DBT breaks a single Hangul character into its component parts. In technical terms, all Hangul characters in the range from U+AC00 through U+D7AF are redirected into Hangul Jamo characters in the U+11xx range. DBT uses a mono-spaced font to display those characters. The result can be difficult to read and is certainly jarring to those who are accustomed to reading conventional inkprint Hangul. In this case, it is best to do all editing in Microsoft Word, using DBT as the translation engine and for output.

Arabic and Hebrew Characters

Both Arabic and Hebrew inkprint are written from right to left. DBT displays the inkprint Arabic and Hebrew text from right to left as well. However, the DBT editing cursor does not (as yet) accommodate the right to left flow within a line.

For these scripts, it is best to do all editing in Microsoft Word, using DBT solely as the translation engine and output manager. In this case, if you need to, you can clipboard whole lines from Word into DBT as a way of making changes within a line.

Unicode Ranges of Characters Supported within DBT

As can be seen in the further tables, DBT can import a vast number of characters not found in this table.

The table in this section lists the Unicode ranges that are supported inside of DBT. The first column identifies the character set, the second column gives the start of the Unicode range for that set, and the third column indicates the use of that character set. The possible uses are: for a language, for mathematics, for symbols, or for the International Phonetic Alphabet. The entries in the second column are hyperlinks to the individual tables, each one showing the full Unicode range for that character set and the DUSCI equivalent for each Unicode character.

Table of Unicode Ranges supported within DBT
Script Unicode and Link Type
Latin U+00xx Language
Latin Extended U+01xx Language
Latin Extended U+02xx Language
Greek U+03xx Language
Cyrillic U+04xx Language
Armenian and Hebrew U+05xx Language
Arabic U+06xx Language
Syriac and Thaana U+07xx Language
Hindi and Bengali U+09xx Language
Gurmukhi and Gujarati U+0Axx Language
Oriya and Tamil U+0Bxx Language
Telugu and Kannada U+0Cxx Language
Malayalam and Sinhala U+0Dxx Language
Thai and Lao U+0Exx Language
Tibetan U+0Fxx Language
Myanmar and Georgian U+10xx Language
Korean U+11xx Language
Ethiopic U+12xx Language
Ethiopic U+13xx Language
Khmer (Cambodian) U+17xx Language
IPA 1 U+1Dxx IPA
IPA 2 U+1Exx IPA
Misc Symbols U+20xx Symbols
Arrows, etc. U+21xx Math
Math Operators U+22xx Math
Misc Technical U+23xx Math
Box Drawing U+25xx Math
Dingbats U+27xx Symbols
Math Arrows U+29xx Math
Math Operators U+2Axx Math
Japanese U+30xx Language
Table of Unicode Ranges U+0000-U+FFFF Supported by DBT Import

DBT 12.7 sr1 can read thousands of characters beyond U+FFFF. Please read this external web page for the details.

This second, very extensive, table lists all of the 4-digit Unicode ranges, including those not supported in DBT.

The first column indicates the start and end of each range. The second column identifies the character set or other use covered by that range. You will see that most of these entries are hyperlinks.

Note, however, that these are all external hyperlinks to web pages, not links to pages inside DBT Help.

If the last column is blank, then there is no support for these characters in Duxbury DBT. If the last column is "Word import", it means that these characters are supported by conversion into other Duxbury supported characters during import. If the last column is "DUSCI supported" then that Unicode range is in the previous table (in the section above).

Table of All Unicode Ranges
Unicode Range Name and Wikipedia Link Support Level
U+0000-007F Basic Latin DUSCI supported
U+0080-00FF Latin-1 Supplement DUSCI supported
U+0100-017F Latin Extended-A DUSCI supported
U+0180-024F Latin Extended-B DUSCI supported
U+0250-02AF IPA Extensions DUSCI supported
U+02B0-02FF Spacing Modifier Letters DUSCI supported
U+0300-036F Combining Diacritical Marks DUSCI supported
U+0370-03FF Greek and Coptic DUSCI supported
U+0400-04FF Cyrillic DUSCI supported
U+0500-052F Cyrillic Supplement DUSCI supported
U+0530-058F Armenian DUSCI supported
U+0590-05FF Hebrew DUSCI supported
U+0600-06FF Arabic DUSCI supported
U+0700-074F Syriac DUSCI supported (use Biblical languages)
U+0750-077F Arabic Supplement DUSCI supported (use West African languages)
U+0780-07BF Thaana DUSCI supported (use Dhivehi)
U+07C0-07FF N'Ko DUSCI supported (use West African languages)
U+0800-083FSamaritan 
U+0840-085FMandaicWord import
U+0860-086FSyriac supplement (Malayalam) 
U+0870-089FArabic extended-B 
U+08A0-08AFArabic extended-A 
U+0900-097F Devanagari DUSCI supported
U+0980-09FF Bengali DUSCI supported
U+0A00-0A7F Gurmukhi DUSCI supported
U+0A80-0AFF Gujarati DUSCI supported
U+0B00-0B7F Oriya DUSCI supported
U+0B80-0BFF Tamil DUSCI supported
U+0C00-0C7F Telugu DUSCI supported
U+0C80-0CFF Kannada DUSCI supported
U+0D00-0D7F Malayalam DUSCI supported
U+0D80-0DFF Sinhala DUSCI supported
U+0E00-0E7F Thai DUSCI supported
U+0E80-0EFF Lao DUSCI supported
U+0F00-0FFF Tibetan DUSCI supported
U+1000-109F Myanmar DUSCI supported
U+10A0-10FF Georgian DUSCI supported
U+1100-11FF Hangul Jamo DUSCI supported (redirected Korean)
U+1200-127F Ethiopic (Ge'ez) DUSCI supported
U+1380-139F Ethiopic Supplement DUSCI supported
U+13A0-13FF Cherokee experimental, contact languages@duxsys.com
U+1400-167F Unified Canadian Aboriginal Syllabics Word import (use English UEB)
U+1680-169F Ogham  
U+16A0-16FF Runic  
U+1700-171F Tagalog (Baybayin) Word import
U+1720-173F Hanunoo Word import
U+1740-175F Buhid Word import
U+1760-177F Tagbanwa Word import
U+1780-17FF Khmer DUSCI supported
U+1800-18AF Mongolian  
U+18B0-18FF Extended Canadian Aboriginal syllabics Word import
U+1900-194F Limbu Word import (use Hindi)
U+1950-197F Tai Le Word import
U+1980-19DF New Tai Lue Word import
U+19E0-19FF Khmer Symbols  
U+1A00-1A1F Buginese (Lontara) Word import
U+1A20-1AAF Tai Tham  
U+1AB0-1AFFCombining Diacritical Marks 
U+1B00-1B7F Balinese  
U+1B80-1BBF Sundanese Word import
U+1BC0-1BFFBatakWord import
U+1C00-1C4F Lepcha Word import
U+1C50-1C7F Ol Chiki  
U+1CC0-1CCFSundanese supplement 
U+1CD0-1CFF Vedic Extentions  
U+1D00-1D7F Phonetic Extensions Word import
U+1D80-1DBF More Phonetic Extensions Word import
U+1DC0-1DFFCombining Diacritical Marks Supplement 
U+1E00-1EFF Latin Extended Additional DUSCI supported
U+1F00-1FFF Greek Extended Word import
U+2000-206F General Punctuation DUSCI supported
U+2070-209F Superscripts and Subscripts DUSCI supported
U+20A0-20CF Currency Symbols DUSCI supported
U+20D0-20FF Combining Diacritical Marks for Symbols  
U+2100-214F Letterlike Symbols DUSCI supported
U+2150-218F Number Forms DUSCI supported
U+2190-21FF Arrows DUSCI supported
U+2200-22FF Mathematical Operators DUSCI supported
U+2300-23FF Miscellaneous Technical DUSCI supported
U+2400-243F Control Pictures Word import
U+2440-245F Optical Character Recognition Word import
U+2460-24FF Enclosed Alphanumerics Word import
U+2500-257F Box Drawing DUSCI supported
U+2580-259F Block Elements DUSCI supported
U+25A0-25FF Geometric Shapes DUSCI supported
U+2600-26FF Miscellaneous Symbols Word import
U+2700-27BF Dingbats DUSCI supported
U+27C0-27EF Miscellaneous Mathematical Symbols-A DUSCI supported
U+27F0-27FF Supplemental Arrows-A DUSCI supported
U+2800-28FF Braille Patterns Word import
U+2900-297F Supplemental Arrows-B DUSCI supported
U+2980-29FF Miscellaneous Mathematical Symbols-B DUSCI supported
U+2A00-2AFF Supplemental Mathematical Operators DUSCI supported
U+2B00-2BFF Miscellaneous Symbols and Arrows  
U+2C00-2C5F Glagolitic  
U+2C60-2C5F Latin Extended-C  
U+2C80-2CFF Coptic Word Import (use Biblical languages)
U+2D00-2D2F Georgian Supplement  
U+2D30-2D7F Tifinagh Word import (use West African languages)
U+2D80-2DDF Ethiopic Extended  
U+2DE0-2DFF Cyrillic Extended  
U+2E00-2E7F Supplemental Punctuation  
U+2E80-2EFF CJK Radicals Supplement  
U+2F00-2FDF Kangxi Radicals  
U+2FF0-2FFF Ideographic Description Characters  
U+3000-303F CJK Symbols and Punctuation DUSCI supported (use Japanese)
U+3040-309F Hiragana DUSCI supported (use Japanese)
U+30A0-30FF Katakana DUSCI supported (use Japanese)
U+3100-312F Bopomofo DUSCI supported
U+3130-318F Hangul Compatibility Jamo Word import
U+3190-319F Kanbun  
U+31A0-31BF Bopomofo Extended  
U+31C0-31EF CJK Strokes  
U+31F0-31FF Katakana Phonetic Extensions  
U+3200-32FF Enclosed CJK Letters and Months  
U+3300-33FF CJK Compatibility  
U+3400-4DBF CJK Unified Ideographs Extension A Word import (see chart above on Chinese)
U+4DC0-4DFF Yijing Hexagram Symbols Word import (see chart above on Chinese)
U+4E00-9FFF CJK Unified Ideographs Word import (see chart above on Chinese)
U+A000-A48F Yi Syllables  
U+A490-A4CF Yi Radicals  
U+A4D0-A4FF Lisu (Fraser alphabet) Word import
U+A500-A59F Vai DUSCI supported (use West African languages)
U+A640-649FCyrillic Extended 
U+A6A0-A6FF Bamum  
U+A700-A71F Modifier Tone Letters  
U+A720-A7FF Latin Extended-D  
U+A800-A82F Syloti Nagri Word import (use Bengali)
U+A830-A83F Common Indic Number Forms  
U+A840-A87F Phags-pa Word import
U+A880-A8DF Saurashtra Word import (use Hindi)
U+A8E0-A8FF Devanagari Extended  
U+A900-A92F Kayah Li  
U+A930-A95F Rejang Word import
U+A960-A97F Hangul Extended  
U+A980-A9DF Javanese  
U+A9E0-A9FFMyanmar Extended-B 
U+AA00-AA5F Cham Word import
U+AA60-AA7F Myanmar Extended  
U+AA80-AADF Tai Viet  
U+AB00-AB2FEthiopic Extended-A 
U+AB30-AB6FLatin Extended-E 
U+AB70-ABBFCherokee supplementWord import
U+ABC0-ABFF Meitei Mayek  
U+AC00-D7AF Hangul Syllables Word import (Korean)
U+D800-DB7F High Surrogates  
U+DB80-DBFF High Private Use Surrogates  
U+DC00-DFFF Low Surrogates  
U+E000-F8FF Private Use Area  
U+F900-FAFF CJK Compatibility Ideographs  
U+FB00-FB4F Alphabetic Presentation Forms Word import (use Hebrew)
U+FB50-FDFF Arabic Presentation Forms-A Word import
U+FE00-FE0F Variation Selectors  
U+FE10-FE1F Vertical Forms  
U+FE20-FE2F Combining Half Marks  
U+FE30-FE4F CJK Compatibility Forms  
U+FE50-FE6F Small Form Variants Word import (use Arabic)
U+FE70-FEFF Arabic Presentation Forms-B Word import
U+FF00-FFEF Halfwidth and Fullwidth Forms Word import (use Japanese)
U+FFF0-FFFF Specials  
Unicode Ranges Beyond U+FFFF

DBT 12.7 sr1 can read thousands of characters beyond U+FFFF. Please read this external web page for the details.