Character encodings
Overview of Unicode allocation
and common latin code pages.
Compare alternate charsets:
ISO ·
Windows ·
DOS ·
Apple ·
EBCDIC ·
legacy ·
symbols •
West ·
Central ·
North European ·
Turkish ·
Greek ·
Cyrillic ·
Hebrew.
Unicode BMP| ↱ | 00 | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | A0 | B0 | C0 | D0 | E0 | F0
 | 
|---|
| 0
 | control
 | comn
 | basic latin
 | control
 | comn
 | latin1
 | 
|---|
| 1
 | latin extended-A
 | latin extended-B
 | 
|---|
| 2
 | "
 | IPA
 | spacing modifier
 | 
|---|
| 3
 | diacritics
 | greek
 | 
|---|
| 4
 | cyrillic
 | 
|---|
| 5
 | cyrillic+
 | armenian
 | hebrew
 | 
|---|
| 6
 | arabic
 | 
|---|
| 7
 | syriac
 | arabic+
 | thaana
 | n'ko
 | 
|---|
| 8
 | samaritan
 | manda
 | syr
 | reserved
 | arabic ext-A
 | 
|---|
| 9
 | devanāgarī
 | bengali
 | 
|---|
| A
 | gurmukhi
 | gujarati
 | 
|---|
| B
 | oriya
 | tamil
 | 
|---|
| C
 | telugu
 | kannada
 | 
|---|
| D
 | malayālam
 | sinhala
 | 
|---|
| E
 | thai
 | lao
 | 
|---|
| F
 | tibetan
 | 
|---|
| 10
 | myanmar
 | georgian
 | 
|---|
| 11
 | hangeul jamo
 | 
|---|
| 12
 | ethiopic
 | 
|---|
| 13
 | 
 | eth+
 | cherokee
 | 
|---|
| 14
 | unified canadian aboriginal syllabics
 | 
|---|
| 15
 | 
|---|
| 16
 | 
 | ogham
 | runic
 | 
|---|
| 17
 | tagalog
 | hanun
 | buhid
 | tagb
 | khmer
 | 
|---|
| 18
 | mongolian
 | canadian+
 | 
|---|
| 19
 | limbu
 | tai le
 | new tai lü
 | khmer
 | 
|---|
| 1A
 | lontara
 | tai tham
 | diacritics+
 | 
|---|
| 1B
 | balinese
 | sundanese
 | batak
 | 
|---|
| 1C
 | lepcha
 | ol chiki
 | cyr
 | georg+
 | sn
 | vedic
 | 
|---|
| 1D
 | phonetic
 | phonetic+
 | diacritics+
 | 
|---|
| 1E
 | latin extended additional
 | 
|---|
| 1F
 | greek+
 | 
|---|
| 20
 | general punctuation
 | suþscript
 | currency
 | overlay
 | 
|---|
| 21
 | letterlike
 | number
 | arrows
 | 
|---|
| 22
 | mathematical symbols
 | 
|---|
| 23
 | miscellaneous technical
 | 
|---|
| 24
 | control
 | OCR
 | enclosed alphanumerics
 | 
|---|
| 25
 | box drawing
 | blocks
 | geometric shapes
 | 
|---|
| 26
 | miscellaneous symbols
 | 
|---|
| 27
 | dingbats
 | maths-A
 | arr
 | 
|---|
 
UTF-8| ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| 0
 | single byte ASCII
 | 
|---|
| 1
 | 
|---|
| 2
 | 
|---|
| 3
 | 
|---|
| 4
 | 
|---|
| 5
 | 
|---|
| 6
 | 
|---|
| 7
 | 
|---|
| 8
 | multi-byte continuation
 | 
|---|
| 9
 | 
|---|
| A
 | 
|---|
| B
 | 
|---|
| C
 | (overl.)
 | 2-byte sequence start
 | 
|---|
| D
 | 
 | 
|---|
| E
 | 3-byte sequence start
 | 
|---|
| F
 | 4-byte sequence
 | (overflow)
 | 5-byte
 | 6-byte
 | invalid
 | 
|---|
 
iso-8859-1| ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| 0
 | ␀
 | ␁
 | ␂
 | ␃
 | ␄
 | ␅
 | ␆
 | ␇
 | ␈
 | ␉
 | ␊
 | ␋
 | ␌
 | ␍
 | ␎
 | ␏
 | 
|---|
| 1
 | ␐
 | ␑
 | ␒
 | ␓
 | ␔
 | ␕
 | ␖
 | ␗
 | ␘
 | ␙
 | ␚
 | ␛
 | ␜
 | ␝
 | ␞
 | ␟
 | 
|---|
| 2
 |  
 | !
 | "
 | #
 | $
 | %
 | &
 | '
 | (
 | )
 | *
 | +
 | ,
 | -
 | .
 | /
 | 
|---|
| 3
 | 0
 | 1
 | 2
 | 3
 | 4
 | 5
 | 6
 | 7
 | 8
 | 9
 | :
 | ;
 | <
 | =
 | >
 | ?
 | 
|---|
| 4
 | @
 | A
 | B
 | C
 | D
 | E
 | F
 | G
 | H
 | I
 | J
 | K
 | L
 | M
 | N
 | O
 | 
|---|
| 5
 | P
 | Q
 | R
 | S
 | T
 | U
 | V
 | W
 | X
 | Y
 | Z
 | [
 | \
 | ]
 | ^
 | _
 | 
|---|
| 6
 | `
 | a
 | b
 | c
 | d
 | e
 | f
 | g
 | h
 | i
 | j
 | k
 | l
 | m
 | n
 | o
 | 
|---|
| 7
 | p
 | q
 | r
 | s
 | t
 | u
 | v
 | w
 | x
 | y
 | z
 | {
 | |
 | }
 | ~
 | ␡
 | 
|---|
| 8
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | 
|---|
| 9
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | �
 | 
|---|
| A
 |  
 | ¡
 | ¢
 | £
 | ¤
 | ¥
 | ¦
 | §
 | ¨
 | ©
 | ª
 | «
 | ¬
 | -
 | ®
 | ¯
 | 
|---|
| B
 | °
 | ±
 | ²
 | ³
 | ´
 | µ
 | ¶
 | ·
 | ¸
 | ¹
 | º
 | »
 | ¼
 | ½
 | ¾
 | ¿
 | 
|---|
| C
 | À
 | Á
 | Â
 | Ã
 | Ä
 | Å
 | Æ
 | Ç
 | È
 | É
 | Ê
 | Ë
 | Ì
 | Í
 | Î
 | Ï
 | 
|---|
| D
 | Ð
 | Ñ
 | Ò
 | Ó
 | Ô
 | Õ
 | Ö
 | ×
 | Ø
 | Ù
 | Ú
 | Û
 | Ü
 | Ý
 | Þ
 | ß
 | 
|---|
| E
 | à
 | á
 | â
 | ã
 | ä
 | å
 | æ
 | ç
 | è
 | é
 | ê
 | ë
 | ì
 | í
 | î
 | ï
 | 
|---|
| F
 | ð
 | ñ
 | ò
 | ó
 | ô
 | õ
 | ö
 | ÷
 | ø
 | ù
 | ú
 | û
 | ü
 | ý
 | þ
 | ÿ
 | 
|---|
 
iso-8859-15 | ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| A
 |  
 | ¡
 | ¢
 | £
 | €
 | ¥
 | Š
 | §
 | š
 | ©
 | ª
 | «
 | ¬
 | -
 | ®
 | ¯
 | 
|---|
| B
 | °
 | ±
 | ²
 | ³
 | Ž
 | µ
 | ¶
 | ·
 | ž
 | ¹
 | º
 | »
 | Œ
 | œ
 | Ÿ
 | ¿
 | 
|---|
 
cp1252 | ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| 8
 | €
 | 
 | ‚
 | ƒ
 | „
 | …
 | †
 | ‡
 | ˆ
 | ‰
 | Š
 | ‹
 | Œ
 | 
 | Ž
 | 
 | 
|---|
| 9
 | 
 | ‘
 | ’
 | “
 | ”
 | •
 | –
 | —
 | ˜
 | ™
 | š
 | ›
 | œ
 | 
 | ž
 | Ÿ
 | 
|---|
 
cp437 | ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| 0
 |  
 | ☺
 | ☻
 | ♥
 | ♦
 | ♣
 | ♠
 | •
 | ◘
 | ○
 | ◙
 | ♂
 | ♀
 | ♪
 | ♫
 | ☼
 | 
|---|
| 1
 | ►
 | ◄
 | ↕
 | ‼
 | ¶
 | §
 | ▬
 | ↨
 | ↑
 | ↓
 | →
 | ←
 | ∟
 | ↔
 | ▲
 | ▼
 | 
|---|
| 8
 | Ç
 | ü
 | é
 | â
 | ä
 | à
 | å
 | ç
 | ê
 | ë
 | è
 | ï
 | î
 | ì
 | Ä
 | Å
 | 
|---|
| 9
 | É
 | æ
 | Æ
 | ô
 | ö
 | ò
 | û
 | ù
 | ÿ
 | Ö
 | Ü
 | ¢
 | £
 | ¥
 | ₧
 | ƒ
 | 
|---|
| A
 | á
 | í
 | ó
 | ú
 | ñ
 | Ñ
 | ª
 | º
 | ¿
 | ⌐
 | ¬
 | ½
 | ¼
 | ¡
 | «
 | »
 | 
|---|
| B
 | ░
 | ▒
 | ▓
 | │
 | ┤
 | ╡
 | ╢
 | ╖
 | ╕
 | ╣
 | ║
 | ╗
 | ╝
 | ╜
 | ╛
 | ┐
 | 
|---|
| C
 | └
 | ┴
 | ┬
 | ├
 | ─
 | ┼
 | ╞
 | ╟
 | ╚
 | ╔
 | ╩
 | ╦
 | ╠
 | ═
 | ╬
 | ╧
 | 
|---|
| D
 | ╨
 | ╤
 | ╥
 | ╙
 | ╘
 | ╒
 | ╓
 | ╫
 | ╪
 | ┘
 | ┌
 | █
 | ▄
 | ▌
 | ▐
 | ▀
 | 
|---|
| E
 | α
 | ß
 | Γ
 | π
 | Σ
 | σ
 | µ
 | τ
 | Φ
 | Θ
 | Ω
 | δ
 | ∞
 | ϕ
 | ε
 | ∩
 | 
|---|
| F
 | ≡
 | ±
 | ≥
 | ≤
 | ⌠
 | ⌡
 | ÷
 | ≈
 | °
 | ∙
 | ·
 | √
 | ⁿ
 | ²
 | ■
 |  
 | 
|---|
 
cp850 | ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
 | 
|---|
| 9
 | É
 | æ
 | Æ
 | ô
 | ö
 | ò
 | û
 | ù
 | ÿ
 | Ö
 | Ü
 | ø
 | £
 | Ø
 | ×
 | ƒ
 | 
|---|
| A
 | á
 | í
 | ó
 | ú
 | ñ
 | Ñ
 | ª
 | º
 | ¿
 | ®
 | ¬
 | ½
 | ¼
 | ¡
 | «
 | »
 | 
|---|
| B
 | ░
 | ▒
 | ▓
 | │
 | ┤
 | Á
 | Â
 | À
 | ©
 | ╣
 | ║
 | ╗
 | ╝
 | ¢
 | ¥
 | ┐
 | 
|---|
| C
 | └
 | ┴
 | ┬
 | ├
 | ─
 | ┼
 | ã
 | Ã
 | ╚
 | ╔
 | ╩
 | ╦
 | ╠
 | ═
 | ╬
 | ¤
 | 
|---|
| D
 | ð
 | Ð
 | Ê
 | Ë
 | È
 | ı
 | Í
 | Î
 | Ï
 | ┘
 | ┌
 | █
 | ▄
 | ¦
 | Ì
 | ▀
 | 
|---|
| E
 | Ó
 | ß
 | Ô
 | Ò
 | õ
 | Õ
 | µ
 | þ
 | Þ
 | Ú
 | Û
 | Ù
 | ý
 | Ý
 | ¯
 | ´
 | 
|---|
| F
 | -
 | ±
 | ‗
 | ¾
 | ¶
 | §
 | ÷
 | ¸
 | °
 | ¨
 | ·
 | ¹
 | ³
 | ²
 | ■
 |  
 | 
|---|
 
	
	| control
	 | whitespace
	 | diacritic
	 | punctuation
	 | symbol
	 | numeric
	 | greek
	 | aramaic
	 | syllabic
		| african
		 | japanese
		 | cjk
		 | chinese
		 |   
	 | alphabetic
	 | 
	
	| unicode 10.0
	 | proposed
	 | deprecated
	 | unassigned
	 | invalid
	 |