HTML | Charsets

  • Difficulty Level : Easy
  • Last Updated : 23 Dec, 2021
A web browser can display letters, digits, and other symbols correctly only when it knows which character set the page uses. A character set, or character encoding, is a standard that assigns a number to each character so that text can be stored and sent over the internet. Several such standards exist.

ASCII: The American Standard Code for Information Interchange (ASCII) is a 7-bit character encoding that defines 128 characters: control characters, the digits (0-9), the English letters (A-Z) and (a-z), and special symbols such as + - * / ( ) @. It is commonly used in C/C++ programming.
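The mapping between characters and their ASCII numbers can be checked with a short Python sketch. It relies only on the built-in `ord`, `chr`, and `str.encode`; the helper name `is_ascii` and the sample strings are illustrative assumptions, not part of any standard.

```python
# Look up the number assigned to a character, and the reverse.
print(ord("A"))   # 65
print(chr(97))    # a

def is_ascii(text):
    """Return True if every character of text can be encoded as ASCII."""
    try:
        text.encode("ascii")
        return True
    except UnicodeEncodeError:
        return False

# ASCII uses 7 bits, so every code number is below 128.
sample = "Hello (World) @ 2021"
codes = [ord(ch) for ch in sample]
print(all(code < 128 for code in codes))

# An accented letter lies outside the ASCII range.
print(is_ascii("cafe"))
print(is_ascii("caf\u00e9"))
```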

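ANSI (Windows-1252): Windows-1252 is an 8-bit character encoding with 256 code positions. It is commonly called "ANSI" after the American National Standards Institute (ANSI), although it is a Microsoft code page rather than an ANSI standard. It was used as the default character set in Microsoft Windows for Western European languages.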

ISO-8859-1: It is the default character set of HTML4 and also supports 256 characters. The International Organization for Standardization (ISO) defines standard character sets for different alphabets and languages. ISO-8859-1 contains digits, uppercase and lowercase English letters, and some special characters.

UTF-8: The UTF-8 and UTF-16 encodings belong to the Unicode standard, which was developed by the Unicode Consortium because the ISO-8859 character sets are limited in size and not suited to a multilingual environment. Unicode covers the characters and punctuation symbols of almost all writing systems.
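The difference in coverage between ISO-8859-1 and UTF-8 can be illustrated with a minimal Python sketch. The helper names `utf8_size` and `fits_latin1` and the sample characters are assumptions chosen for illustration; the sketch shows that UTF-8 spends more bytes on characters with higher code numbers, while ISO-8859-1 cannot represent them at all.

```python
def utf8_size(text):
    """Return the number of bytes needed to store text in UTF-8."""
    return len(text.encode("utf-8"))

def fits_latin1(text):
    """Return True if every character of text exists in ISO-8859-1."""
    try:
        text.encode("iso-8859-1")
        return True
    except UnicodeEncodeError:
        return False

# Sample characters written as escapes so the source stays unambiguous.
samples = {
    "A": "Latin capital letter A",
    "\u00e9": "e with acute accent",
    "\u20ac": "euro sign",
    "\u0939": "Devanagari letter ha",
}

for char, name in samples.items():
    print(name, "->", utf8_size(char), "byte(s) in UTF-8,",
          "available in ISO-8859-1:", fits_latin1(char))
```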

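Attribute: The web browser must know which character encoding standard an HTML page uses. The encoding is declared with a <meta> element in the <head> of the page. The first declaration below uses the HTML4 syntax and the second uses the HTML5 syntax.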



<meta http-equiv="Content-Type" content="text/html;charset=ISO-8859-1">


<meta charset="UTF-8">
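To see how a program might read these two declarations, here is a minimal Python sketch built on the standard `html.parser` module. The class `CharsetFinder` and the function `declared_charset` are hypothetical helpers written for this example; real browsers follow a more involved detection procedure (HTTP headers, byte order marks, and so on), which this sketch does not attempt to reproduce.

```python
from html.parser import HTMLParser

class CharsetFinder(HTMLParser):
    """Remembers the first charset declared in a <meta> tag (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.charset = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.charset is not None:
            return
        attrs = dict(attrs)
        if attrs.get("charset"):
            # HTML5 form: <meta charset="UTF-8">
            self.charset = attrs["charset"].strip()
        elif (attrs.get("http-equiv") or "").lower() == "content-type":
            # HTML4 form: charset is a parameter inside the content attribute.
            content = attrs.get("content") or ""
            for part in content.split(";"):
                name, _, value = part.strip().partition("=")
                if name.strip().lower() == "charset" and value:
                    self.charset = value.strip()

def declared_charset(html):
    """Return the charset named in the page's <meta> tags, or None."""
    finder = CharsetFinder()
    finder.feed(html)
    return finder.charset

html4 = '<meta http-equiv="Content-Type" content="text/html;charset=ISO-8859-1">'
html5 = '<meta charset="UTF-8">'
print(declared_charset(html4))
print(declared_charset(html5))
```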

Character sets of the different character encoding standards: The following tables list characters together with the number codes assigned to them in the different character encoding standards.

Table 1: This table lists characters that are assigned the same number in the different character encoding standards.

Number  Character  Description
33      !          Exclamation Mark
34      "          Quotation Mark
35      #          Hash Sign
36      $          Dollar Sign
37      %          Percent Sign
38      &          Ampersand Sign
39      '          Apostrophe Sign
40      (          Opening Parenthesis
41      )          Closing Parenthesis
42      *          Asterisk Sign
43      +          Plus Sign
45      -          Hyphen/Minus Sign
47      /          Slash/Divide Sign
48      0          Number Zero
49      1          Number One
50      2          Number Two
51      3          Number Three
52      4          Number Four
53      5          Number Five
54      6          Number Six
55      7          Number Seven
56      8          Number Eight
57      9          Number Nine
60      <          Less-than Sign
61      =          Equals Sign
62      >          Greater-than Sign
63      ?          Question Mark
64      @          At Sign
65      A          Letter A
66      B          Letter B
67      C          Letter C
68      D          Letter D
69      E          Letter E
70      F          Letter F
71      G          Letter G
72      H          Letter H
73      I          Letter I
74      J          Letter J
75      K          Letter K
76      L          Letter L
77      M          Letter M
78      N          Letter N
79      O          Letter O
80      P          Letter P
81      Q          Letter Q
82      R          Letter R
83      S          Letter S
84      T          Letter T
85      U          Letter U
86      V          Letter V
87      W          Letter W
88      X          Letter X
89      Y          Letter Y
90      Z          Letter Z
91      [          Opening Square Bracket
93      ]          Closing Square Bracket
94      ^          Circumflex Accent
95      _          Low Line
96      `          Grave Accent
97      a          Letter a
98      b          Letter b
99      c          Letter c
100     d          Letter d
101     e          Letter e
102     f          Letter f
103     g          Letter g
104     h          Letter h
105     i          Letter i
106     j          Letter j
107     k          Letter k
108     l          Letter l
109     m          Letter m
110     n          Letter n
111     o          Letter o
112     p          Letter p
113     q          Letter q
114     r          Letter r
115     s          Letter s
116     t          Letter t
117     u          Letter u
118     v          Letter v
119     w          Letter w
120     x          Letter x
121     y          Letter y
122     z          Letter z
123     {          Opening Curly Bracket
124     |          Vertical Line
125     }          Closing Curly Bracket
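The claim behind Table 1 can be checked with a small Python sketch: for each number, the character is encoded with all four standards and the resulting bytes are compared. The codec names are the ones Python's standard library uses (`cp1252` for Windows-1252); the function name `encodings_agree` is an assumption made for this example.

```python
ENCODINGS = ["ascii", "cp1252", "iso-8859-1", "utf-8"]

def encodings_agree(number):
    """Return True if chr(number) is stored as the single byte `number`
    in every encoding listed in ENCODINGS."""
    char = chr(number)
    try:
        results = {char.encode(enc) for enc in ENCODINGS}
    except UnicodeEncodeError:
        # At least one encoding has no slot for this character.
        return False
    return results == {bytes([number])}

# Spot-check a few rows of the table.
for number in (33, 65, 97, 125):
    print(number, chr(number), encodings_agree(number))

# Check the whole range covered by the table, then one value outside ASCII.
print(all(encodings_agree(n) for n in range(33, 126)))
print(encodings_agree(233))
```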

Table 2: This table lists numbers whose characters differ between the character encoding standards.

Number  ANSI (Windows-1252) Character  Description
127     DEL                            Not Used
131     ƒ
136     ˆ
138     Š
140     Œ
141                                    Not Used
142     Ž
143                                    Not Used
144                                    Not Used
152     ˜
154     š
156     œ
157                                    Not Used
158     ž
159     Ÿ
160                                    No-break Space
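The differences in this range can be observed by decoding the same byte with two codecs. The following Python sketch assumes Python's standard `cp1252` and `iso-8859-1` codecs; the function name `describe` is illustrative. Positions that the `cp1252` codec leaves undefined raise `UnicodeDecodeError`, which the sketch reports as `None`.

```python
def describe(number):
    """Decode the single byte `number` as Windows-1252 and as ISO-8859-1.

    Returns a pair (ansi, latin1); ansi is None when the position
    is not used in Windows-1252.
    """
    raw = bytes([number])
    try:
        ansi = raw.decode("cp1252")
    except UnicodeDecodeError:
        ansi = None
    latin1 = raw.decode("iso-8859-1")
    return ansi, latin1

# repr() keeps invisible control characters readable in the output.
for number in (127, 131, 138, 141, 159, 160):
    ansi, latin1 = describe(number)
    print(number, repr(ansi), repr(latin1))
```

In ISO-8859-1 the numbers 128 to 159 decode to non-printing control characters, which is why the same byte shows a visible symbol only under Windows-1252.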

