Character encodings
Overview of Unicode allocation
and common latin code pages.
Compare alternate charsets:
ISO ·
Windows ·
DOS ·
Apple ·
EBCDIC ·
legacy ·
symbols •
West ·
Central ·
North European ·
Turkish ·
Greek ·
Cyrillic ·
Hebrew.
Unicode BMP↱ | 00 | 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | A0 | B0 | C0 | D0 | E0 | F0
|
---|
0
| control
| comn
| basic latin
| control
| comn
| latin1
|
---|
1
| latin extended-A
| latin extended-B
|
---|
2
| "
| IPA
| spacing modifier
|
---|
3
| diacritics
| greek
|
---|
4
| cyrillic
|
---|
5
| cyrillic+
| armenian
| hebrew
|
---|
6
| arabic
|
---|
7
| syriac
| arabic+
| thaana
| n'ko
|
---|
8
| samaritan
| manda
| syr
| reserved
| arabic ext-A
|
---|
9
| devanāgarī
| bengali
|
---|
A
| gurmukhi
| gujarati
|
---|
B
| oriya
| tamil
|
---|
C
| telugu
| kannada
|
---|
D
| malayālam
| sinhala
|
---|
E
| thai
| lao
|
---|
F
| tibetan
|
---|
10
| myanmar
| georgian
|
---|
11
| hangeul jamo
|
---|
12
| ethiopic
|
---|
13
|
| eth+
| cherokee
|
---|
14
| unified canadian aboriginal syllabics
|
---|
15
|
---|
16
|
| ogham
| runic
|
---|
17
| tagalog
| hanun
| buhid
| tagb
| khmer
|
---|
18
| mongolian
| canadian+
|
---|
19
| limbu
| tai le
| new tai lü
| khmer
|
---|
1A
| lontara
| tai tham
| diacritics+
|
---|
1B
| balinese
| sundanese
| batak
|
---|
1C
| lepcha
| ol chiki
| cyr
| georg+
| sn
| vedic
|
---|
1D
| phonetic
| phonetic+
| diacritics+
|
---|
1E
| latin extended additional
|
---|
1F
| greek+
|
---|
20
| general punctuation
| suþscript
| currency
| overlay
|
---|
21
| letterlike
| number
| arrows
|
---|
22
| mathematical symbols
|
---|
23
| miscellaneous technical
|
---|
24
| control
| OCR
| enclosed alphanumerics
|
---|
25
| box drawing
| blocks
| geometric shapes
|
---|
26
| miscellaneous symbols
|
---|
27
| dingbats
| maths-A
| arr
|
---|
UTF-8↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
0
| single byte ASCII
|
---|
1
|
---|
2
|
---|
3
|
---|
4
|
---|
5
|
---|
6
|
---|
7
|
---|
8
| multi-byte continuation
|
---|
9
|
---|
A
|
---|
B
|
---|
C
| (overl.)
| 2-byte sequence start
|
---|
D
|
|
---|
E
| 3-byte sequence start
|
---|
F
| 4-byte sequence
| (overflow)
| 5-byte
| 6-byte
| invalid
|
---|
iso-8859-1↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
0
| ␀
| ␁
| ␂
| ␃
| ␄
| ␅
| ␆
| ␇
| ␈
| ␉
| ␊
| ␋
| ␌
| ␍
| ␎
| ␏
|
---|
1
| ␐
| ␑
| ␒
| ␓
| ␔
| ␕
| ␖
| ␗
| ␘
| ␙
| ␚
| ␛
| ␜
| ␝
| ␞
| ␟
|
---|
2
|
| !
| "
| #
| $
| %
| &
| '
| (
| )
| *
| +
| ,
| -
| .
| /
|
---|
3
| 0
| 1
| 2
| 3
| 4
| 5
| 6
| 7
| 8
| 9
| :
| ;
| <
| =
| >
| ?
|
---|
4
| @
| A
| B
| C
| D
| E
| F
| G
| H
| I
| J
| K
| L
| M
| N
| O
|
---|
5
| P
| Q
| R
| S
| T
| U
| V
| W
| X
| Y
| Z
| [
| \
| ]
| ^
| _
|
---|
6
| `
| a
| b
| c
| d
| e
| f
| g
| h
| i
| j
| k
| l
| m
| n
| o
|
---|
7
| p
| q
| r
| s
| t
| u
| v
| w
| x
| y
| z
| {
| |
| }
| ~
| ␡
|
---|
8
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
|
---|
9
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
| �
|
---|
A
|
| ¡
| ¢
| £
| ¤
| ¥
| ¦
| §
| ¨
| ©
| ª
| «
| ¬
| -
| ®
| ¯
|
---|
B
| °
| ±
| ²
| ³
| ´
| µ
| ¶
| ·
| ¸
| ¹
| º
| »
| ¼
| ½
| ¾
| ¿
|
---|
C
| À
| Á
| Â
| Ã
| Ä
| Å
| Æ
| Ç
| È
| É
| Ê
| Ë
| Ì
| Í
| Î
| Ï
|
---|
D
| Ð
| Ñ
| Ò
| Ó
| Ô
| Õ
| Ö
| ×
| Ø
| Ù
| Ú
| Û
| Ü
| Ý
| Þ
| ß
|
---|
E
| à
| á
| â
| ã
| ä
| å
| æ
| ç
| è
| é
| ê
| ë
| ì
| í
| î
| ï
|
---|
F
| ð
| ñ
| ò
| ó
| ô
| õ
| ö
| ÷
| ø
| ù
| ú
| û
| ü
| ý
| þ
| ÿ
|
---|
iso-8859-15 ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
A
|
| ¡
| ¢
| £
| €
| ¥
| Š
| §
| š
| ©
| ª
| «
| ¬
| -
| ®
| ¯
|
---|
B
| °
| ±
| ²
| ³
| Ž
| µ
| ¶
| ·
| ž
| ¹
| º
| »
| Œ
| œ
| Ÿ
| ¿
|
---|
cp1252 ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
8
| €
|
| ‚
| ƒ
| „
| …
| †
| ‡
| ˆ
| ‰
| Š
| ‹
| Œ
|
| Ž
|
|
---|
9
|
| ‘
| ’
| “
| ”
| •
| –
| —
| ˜
| ™
| š
| ›
| œ
|
| ž
| Ÿ
|
---|
cp437 ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
0
|
| ☺
| ☻
| ♥
| ♦
| ♣
| ♠
| •
| ◘
| ○
| ◙
| ♂
| ♀
| ♪
| ♫
| ☼
|
---|
1
| ►
| ◄
| ↕
| ‼
| ¶
| §
| ▬
| ↨
| ↑
| ↓
| →
| ←
| ∟
| ↔
| ▲
| ▼
|
---|
8
| Ç
| ü
| é
| â
| ä
| à
| å
| ç
| ê
| ë
| è
| ï
| î
| ì
| Ä
| Å
|
---|
9
| É
| æ
| Æ
| ô
| ö
| ò
| û
| ù
| ÿ
| Ö
| Ü
| ¢
| £
| ¥
| ₧
| ƒ
|
---|
A
| á
| í
| ó
| ú
| ñ
| Ñ
| ª
| º
| ¿
| ⌐
| ¬
| ½
| ¼
| ¡
| «
| »
|
---|
B
| ░
| ▒
| ▓
| │
| ┤
| ╡
| ╢
| ╖
| ╕
| ╣
| ║
| ╗
| ╝
| ╜
| ╛
| ┐
|
---|
C
| └
| ┴
| ┬
| ├
| ─
| ┼
| ╞
| ╟
| ╚
| ╔
| ╩
| ╦
| ╠
| ═
| ╬
| ╧
|
---|
D
| ╨
| ╤
| ╥
| ╙
| ╘
| ╒
| ╓
| ╫
| ╪
| ┘
| ┌
| █
| ▄
| ▌
| ▐
| ▀
|
---|
E
| α
| ß
| Γ
| π
| Σ
| σ
| µ
| τ
| Φ
| Θ
| Ω
| δ
| ∞
| ϕ
| ε
| ∩
|
---|
F
| ≡
| ±
| ≥
| ≤
| ⌠
| ⌡
| ÷
| ≈
| °
| ∙
| ·
| √
| ⁿ
| ²
| ■
|
|
---|
cp850 ↱ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F
|
---|
9
| É
| æ
| Æ
| ô
| ö
| ò
| û
| ù
| ÿ
| Ö
| Ü
| ø
| £
| Ø
| ×
| ƒ
|
---|
A
| á
| í
| ó
| ú
| ñ
| Ñ
| ª
| º
| ¿
| ®
| ¬
| ½
| ¼
| ¡
| «
| »
|
---|
B
| ░
| ▒
| ▓
| │
| ┤
| Á
| Â
| À
| ©
| ╣
| ║
| ╗
| ╝
| ¢
| ¥
| ┐
|
---|
C
| └
| ┴
| ┬
| ├
| ─
| ┼
| ã
| Ã
| ╚
| ╔
| ╩
| ╦
| ╠
| ═
| ╬
| ¤
|
---|
D
| ð
| Ð
| Ê
| Ë
| È
| ı
| Í
| Î
| Ï
| ┘
| ┌
| █
| ▄
| ¦
| Ì
| ▀
|
---|
E
| Ó
| ß
| Ô
| Ò
| õ
| Õ
| µ
| þ
| Þ
| Ú
| Û
| Ù
| ý
| Ý
| ¯
| ´
|
---|
F
| -
| ±
| ‗
| ¾
| ¶
| §
| ÷
| ¸
| °
| ¨
| ·
| ¹
| ³
| ²
| ■
|
|
---|
control
| whitespace
| diacritic
| punctuation
| symbol
| numeric
| greek
| aramaic
| syllabic
african
| japanese
| cjk
| chinese
|
| alphabetic
|
unicode 10.0
| proposed
| deprecated
| unassigned
| invalid
|