Latin script in Unicode
Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages and the Vietnamese alphabet. Latin Extended-C contains additions for Uighur and the Claudian letters. Latin Extended-D comprises characters that are mostly of interest to medievalists. Latin Extended-E mostly comprises characters used for German dialectology. Latin Extended-F and -G contain characters for phonetic transcription.
Blocks
As of version of the Unicode Standard, 1,492 characters in the following 19 blocks are classified as belonging to the Latin script.- Basic Latin, 0000–007F. This block corresponds to ASCII.
- Latin-1 Supplement, 0080–00FF. This block and the ASCII part collectively corresponds to IANA Latin-1.
- Latin Extended-A, 0100–017F
- Latin Extended-B, 0180–024F
- IPA Extensions, 0250–02AF
- Spacing Modifier Letters, 02B0–02FF
- Phonetic Extensions, 1D00–1D7F
- Phonetic Extensions Supplement, 1D80–1DBF
- Latin Extended Additional, 1E00–1EFF
- Superscripts and Subscripts, 2070–209F
- Letterlike Symbols, 2100–214F
- Number Forms, 2150–218F
- Latin Extended-C, 2C60–2C7F
- Latin Extended-D, A720–A7FF
- Latin Extended-E, AB30–AB6F
- Alphabetic Presentation Forms FB00–FB4F
- Halfwidth and Fullwidth Forms, FF00–FFEF
- Latin Extended-F, 10780–107BF
- Latin Extended-G, 1DF00–1DFFF