VSCII
VSCII, also known as TCVN 5712, ISO-IR-180, .VN, ABC or simply the TCVN encodings, is a set of three closely related Vietnamese national standard character encodings for using the Vietnamese language with computers, developed by the TCVN Technical Committee on Information Technology and first adopted in 1993.
It should not be confused with the similarly-named unofficial VISCII encoding, which was sometimes used by overseas Vietnamese speakers. VISCII was also intended to stand for Vietnamese Standard Code for Information Interchange, but is not related to VSCII.
VSCII was used extensively in the north of Vietnam, while VNI was popular in the south. Unicode and the Windows-1258 code page are now used for virtually all Vietnamese computer data, but legacy files or archived messages may need conversion.
Encodings
All three forms of VSCII keep the 95 printable characters of ASCII unmodified.VSCII-3, also known as TCVN 5712-3, VN3 or simply TCVN3, includes the fewest assignments. It is an extended ASCII, because it keeps all 128 codes of ASCII unmodified. It does not reassign any of the C0 and C1 control codes. Compared to ASCII, it adds 75 characters:
- 67 lowercase characters, allowing full lowercase support.
- 7 uppercase characters, allowing uppercase support for the 29 base letters without tone marks.
- The non-breaking space.
VSCII-2, also known as TCVN 5712-2 and VN2, is a superset of VSCII-3. It is an extended ASCII, because it keeps all 128 codes of ASCII unmodified. It does not reassign any of the C0 and C1 control codes, making it conformant with ISO 2022 as a 96-set. Compared to VSCII-3, it adds :
- 16 more uppercase characters with pre-composed tone marks
- 5 combining diacritics for tone marks, allowing other combinations of uppercase letters and tone marks to be represented. Combining marks follow the base letter as in VNI.
- Adds 44 more pre-composed uppercase letters, bringing them to the same count as the lowercase
- Does this by replacing 12 ASCII control characters and allocating 32 graphical characters to the C1 control area, breaking ISO 2022 compatibility
Conversion from VSCII-1 to VSCII-2 or VSCII-3 and conversion from VSCII-2 to VSCII-3 require expansion of some pre-composed characters.
Character set
| legend||VSCII-3legend|#FFD|Additions for VSCII-2legend|#FEE|Additions for VSCII-1 |