Southern Porch

ASCII Table

Back to Tech Stuff

DecimalOctalHexBinaryValue
00000000000000000NUL (Null char.)
00100100100000001SOH (Start of Header)
00200200200000010STX (Start of Text)
00300300300000011ETX (End of Text)
00400400400000100EOT (End of Transmission)
00500500500000101ENQ (Enquiry)
00600600600000110ACK (Acknowledgment)
00700700700000111BEL (Bell)
00801000800001000BS (Backspace)
00901100900001001HT (Horizontal Tab)
01001200A00001010LF (Line Feed)
01101300B00001011VT (Vertical Tab)
01201400C00001100FF (Form Feed)
01301500D00001101CR (Carriage Return)
01401600E00001110SO (Shift Out)
01501700F00001111SI (Shift In)
01602001000010000DLE (Data Link Escape)
01702101100010001DC1 (XON) (Device Control 1)
01802201200010010DC2 (Device Control 2)
01902301300010011DC3 (XOFF)(Device Control 3)
02002401400010100DC4 (Device Control 4)
02102501500010101NAK (Negative Acknowledgement)
02202601600010110SYN (Synchronous Idle)
02302701700010111ETB (End of Trans. Block)
02403001800011000CAN (Cancel)
02503101900011001EM (End of Medium)
02603201A00011010SUB (Substitute)
02703301B00011011ESC (Escape)
02803401C00011100FS (File Separator)
02903501D00011101GS (Group Separator)
03003601E00011110RS (Request to Send)(Record Separator)
03103701F00011111US (Unit Separator)
03204002000100000SP (Space)
03304102100100001! (exclamation mark)
03404202200100010" (double quote)
03504302300100011# (number sign)
03604402400100100$ (dollar sign)
03704502500100101% (percent)
03804602600100110& (ampersand)
03904702700100111' (single quote)
04005002800101000( (left/opening parenthesis)
04105102900101001) (right/closing parenthesis)
04205202A00101010* (asterisk)
04305302B00101011+ (plus)
04405402C00101100, (comma)
04505502D00101101- (minus or dash)
04605602E00101110. (dot)
04705702F00101111/ (forward slash)
048060030001100000
049061031001100011
050062032001100102
051063033001100113
052064034001101004
053065035001101015
054066036001101106
055067037001101117
056070038001110008
057071039001110019
05807203A00111010: (colon)
05907303B00111011; (semi-colon)
06007403C00111100< (less than)
06107503D00111101= (equal sign)
06207603E00111110> (greater than)
06307703F00111111? (question mark)
06410004001000000@ (AT symbol)
06510104101000001A
06610204201000010B
06710304301000011C
06810404401000100D
06910504501000101E
07010604601000110F
07110704701000111G
07211004801001000H
07311104901001001I
07411204A01001010J
07511304B01001011K
07611404C01001100L
07711504D01001101M
07811604E01001110N
07911704F01001111O
08012005001010000P
08112105101010001Q
08212205201010010R
08312305301010011S
08412405401010100T
08512505501010101U
08612605601010110V
08712705701010111W
08813005801011000X
08913105901011001Y
09013205A01011010Z
09113305B01011011[ (left/opening bracket)
09213405C01011100\ (back slash)
09313505D01011101] (right/closing bracket)
09413605E01011110^ (caret/circumflex)
09513705F01011111 _ (underscore)
09614006001100000`
09714106101100001a
09814206201100010b
09914306301100011c
10014406401100100d
10114506501100101e
10214606601100110f
10314706701100111g
10415006801101000h
10515106901101001i
10615206A01101010j
10715306B01101011k
10815406C01101100l
10915506D01101101m
11015606E01101110n
11115706F01101111o
11216007001110000p
11316107101110001q
11416207201110010r
11516307301110011s
11616407401110100t
11716507501110101u
11816607601110110v
11916707701110111w
12017007801111000x
12117107901111001y
12217207A01111010z
12317307B01111011{ (left/opening brace)
12417407C01111100| (vertical bar)
12517507D01111101} (right/closing brace)
12617607E01111110~ (tilde)
12717707F01111111DEL (delete)

The original character set of the Internet is ASCII (American Standard Code for Information Interchange), the characters you can find on the keyboard of a standard American computer or typewriter. Even today, most information is transmitted over the Internet in the form of ASCII characters, with each character being encoded with a number in the range from 1 to 127. In HTML, the language of the Web, this encoding is represented as &#xxx; with the x's standing for the character's code number. Thus, the letter "a" can be encoded as &#97;, the letter "A" as &#65;, and the figure bracket "{" as &#123;.

It was not difficult to extend this notation from ASCII to the characters used by most Western European languages, resulting in the "Latin-1" character set (also known as the ISO-8859-1 or, somewhat erroneously, as the "extended ASCII" character set). Whereas there are 127 possible 7-bit ASCII characters, the 8-bit Latin-1 encoding allows 255 characters to be represented. Latin-1 includes, in addition to the regular ASCII characters, non-English letters such as the "é" (&#233;) and "ñ" (&#241;), some special symbols, such as the bullet (•) encoded as &#8226;, the section sign (§), encoded as &#167; and others. Most of the "extended ASCII" characters also have a named form, which is easier to remember than the numeric form. For example, "é" can be used in a Web page as either &#233; or &eacute;, "ñ" as &#241; or &ntilde;, and "§" as &#167; or &sect;. All the major Web browsers support both forms of encoding.

Most languages of the world, however, are not restricted to the ISO-8859-1 character set. Hungarian, for example, has the characters ő, Ő, and Ű; Czech has ů and ř, Romanian has ţ and other characters, which are not part of Latin-1. Then there are characters of alphabets other than our Roman alphabet, such as Cyrillic (Кириллица), Hebrew, Arabic and others. Over the years, several forms of encoding have been devised for these characters either by replacing some of the 255 Latin-1 characters by others or by using a completely different 8-bit or 16-bit encoding. Some examples of these character sets are given in the table below.

iso-8859-1 or CP-1252 or Latin-1Western European
iso-8859-2 or CP-1250 or Latin-2Eastern European
iso-8859-3Esperanto, Galician, Maltese, Turkish
iso-8859-4Scandinavian, Baltic
iso-8859-5Cyrillic
iso-8859-6Arabic
iso-8859-7or CP-1253Greek
iso-8859-8Hebrew
GB2312Simplified Chinese
Big5Traditional Chinese
Shift_JIS, EUC-JPJapanese
KOI8-RRussian
ISO-2022-KR, EUC-KRKorean

The Web author has two options to make sure the characters of the page will be displayed as intended:

If the latter option is selected, a meta tag containing the "charset" (character set) attribute is inserted in the header of the Web page.