Latin1的所有字符编码
ISO-8859-1 (ISO Latin 1) Character Encoding
Contents
- The characters at a glance
- Character codes and names
- Notes for html documents
- Other notes
- Additional references
The characters at a glance
Here are all the printable characters, in collating order:
! " # $ % & ' ( ) * + , - . /
0 1 2 3 4 5 6 7 8 9 : ; < = > ? @
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
[ \ ] ^ _ `
a b c d e f g h i j k l m n o p q r s t u v w x y z
{ | } ~ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö
×
Ø Ù Ú Û Ü Ý Þ
ß
à á â ã ä å æ ç è é ê ë ì í î ï ð ñ ò ó ô õ ö
÷
ø ù ú û ü ý þ
ÿ
The first six rows are the ASCII character set.
Note the ordinary ASCII space (before `!') and the ISO Latin-1 non-breaking space (before `¡')
Character codes and names
The columns show, in order:
- HTML: the HTML notation (decimal);
- OCTL: the C/Modula-3 octal notation;
- HEX: the hexadecimal code, as used e.g. in MIME quoted-printable encoding;
- CMP: the Sun/X11 "Compose" key combinations;
- CHR: the charater itself, when HTML-printable;
- MEANING: the ISO-8859-1 (and ASCII) definition.
HTML OCTL HEX CMP CHR MEANING
------ + ---- + --- + --- + --- + ------------------------------
� | \000 | =00 | | | Invalid (ASCII NUL)
 | \001 | =01 | | | Unused (ASCII SOH)
 | \002 | =02 | | | Unused (ASCII STX)
 | \003 | =03 | | | Unused (ASCII ETX)
 | \004 | =04 | | | Unused (ASCII EOT)
 | \005 | =05 | | | Unused (ASCII ENQ)
 | \006 | =06 | | | Unused (ASCII ACK)
 | \007 | =07 | | | Unused (ASCII BEL, audible bell)
 | \010 | =08 | | | Unused (ASCII BS, backspace)
| \011 | =09 | | | Horizontal tab (ASCII HT)
| \012 | =0A | | | Line feed (ASCII NL, newline)
 | \013 | =0B | | | Unused (ASCII VT, vertical tab)
 | \014 | =0C | | | Unused (ASCII NP, new page)
| \015 | =0D | | | Carriage Return (ASCII CR)
 | \016 | =0E | | | Unused (ASCII SO)
 | \017 | =0F | | | Unused (ASCII SI)
 | \020 | =10 | | | Unused (ASCII DLE)
 | \021 | =11 | | | Unused (ASCII DC1)
 | \022 | =12 | | | Unused (ASCII DC2)
 | \023 | =13 | | | Unused (ASCII DC3)
 | \024 | =14 | | | Unused (ASCII DC4)
 | \025 | =15 | | | Unused (ASCII NAK)
 | \026 | =16 | | | Unused (ASCII SYN)
 | \027 | =17 | | | Unused (ASCII ETB)
 | \030 | =18 | | | Unused (ASCII CAN)
 | \031 | =19 | | | Unused (ASCII EM)
 | \032 | =1A | | | Unused (ASCII SUB)
 | \033 | =1B | | | Unused (ASCII ESC, escape)
 | \034 | =1C | | | Unused (ASCII FS)
 | \035 | =1D | | | Unused (ASCII GS)
 | \036 | =1E | | | Unused (ASCII RS)
 | \037 | =1F | | | Unused (ASCII US)
| \040 | =20 | | ( ) | Space (ASCII SP)
! | \041 | =21 | | (!) | Exclamation mark
" | \042 | =22 | | (") | Quotation mark (")
# | \043 | =23 | | (#) | Number sign
$ | \044 | =24 | | ($) | Dollar sign
% | \045 | =25 | | (%) | Percent sign
& | \046 | =26 | | (&) | Ampersand (&)
' | \047 | =27 | | (') | Apostrophe (right single quote)
( | \050 | =28 | | (() | Left parenthesis
) | \051 | =29 | | ()) | Right parenthesis
* | \052 | =2A | | (*) | Asterisk
+ | \053 | =2B | | (+) | Plus sign
, | \054 | =2C | | (,) | Comma
- | \055 | =2D | | (-) | Hyphen
. | \056 | =2E | | (.) | Period (fullstop)
/ | \057 | =2F | | (/) | Solidus (slash)
0 | \060 | =30 | | (0) | Digit 0
. . .
9 | \071 | =39 | | (9) | Digit 9
: | \072 | =3A | | (:) | Colon
; | \073 | =3B | | (;) | Semi-colon
< | \074 | =3C | | (<) | Less than (<)
= | \075 | =3D | | (=) | Equals sign
> | \076 | =3E | | (>) | Greater than (>)
? | \077 | =3F | | (?) | Question mark
@ | \100 | =40 | | (@) | Commercial at-sign
A | \101 | =41 | | (A) | Uppercase letter A
. . .
Z | \132 | =5A | | (Z) | Uppercase letter Z
[ | \133 | =5B | | ([) | Left square bracket
\ | \134 | =5C | | (\) | Reverse solidus (backslash)
] | \135 | =5D | | (]) | Right square bracket
^ | \136 | =5E | | (^) | Caret
_ | \137 | =5F | | (_) | Horizontal bar (underscore)
` | \140 | =60 | | (`) | Reverse apostrophe (left single quote)
a | \141 | =61 | | (a) | Lowercase letter a
. . .
z | \172 | =7A | | (z) | Lowercase letter z
{ | \173 | =7B | | ({) | Left curly brace
| | \174 | =7C | | (|) | Vertical bar
} | \175 | =7D | | (}) | Right curly brace
~ | \176 | =7E | | (~) | Tilde
| \177 | =7F | | | Unused (ASCII DEL)
| \200 | =80 | | | Unused
| \201 | =81 | | | Unused
| \202 | =82 | | | Unused
| \203 | =83 | | | Unused
| \204 | =84 | | | Unused
| \205 | =85 | | | Unused
| \206 | =86 | | | Unused
| \207 | =87 | | | Unused
| \210 | =88 | | | Unused
| \211 | =89 | | | Unused
| \212 | =8A | | | Unused
| \213 | =8B | | | Unused
| \214 | =8C | | | Unused
| \215 | =8D | | | Unused
| \216 | =8E | | | Unused
| \217 | =8F | | | Unused
| \220 | =90 | | | Unused
| \221 | =91 | | | Unused
| \222 | =92 | | | Unused
| \223 | =93 | | | Unused
| \224 | =94 | | | Unused
| \225 | =95 | | | Unused
| \226 | =96 | | | Unused
| \227 | =97 | | | Unused
| \230 | =98 | | | Unused
| \231 | =99 | | | Unused
| \232 | =9A | | | Unused
| \233 | =9B | | | Unused
| \234 | =9C | | | Unused
| \235 | =9D | | | Unused
| \236 | =9E | | | Unused
| \237 | =9F | | | Unused
| \240 | =A0 | | ( ) | Non-breaking space ( )
¡ | \241 | =A1 | ! ! | (¡) | Inverted exclamation
¢ | \242 | =A2 | c / | (¢) | Cent sign
£ | \243 | =A3 | l - | (£) | Pound sterling
¤ | \244 | =A4 | o x | (¤) | General currency sign
¥ | \245 | =A5 | y - | (¥) | Yen sign
¦ | \246 | =A6 | | | | (¦) | Broken vertical bar
§ | \247 | =A7 | s o | (§) | Section sign
¨ | \250 | =A8 | " " | (¨) | Umlaut (dieresis)
© | \251 | =A9 | c o | (©) | Copyright
ª | \252 | =AA | - a | (ª) | Feminine ordinal
« | \253 | =AB | < < | («) | Left angle quote, guillemotleft
¬ | \254 | =AC | - , | (¬) | Not sign
| \255 | =AD | - - | () | Soft hyphen
® | \256 | =AE | r o | (®) | Registered trademark
¯ | \257 | =AF | ^ - | (¯) | Macron accent
° | \260 | =B0 | ^ * | (°) | Degree sign
± | \261 | =B1 | + - | (±) | Plus or minus
² | \262 | =B2 | ^ 2 | (²) | Superscript two
³ | \263 | =B3 | ^ 3 | (³) | Superscript three
´ | \264 | =B4 | \ \ | (´) | Acute accent
µ | \265 | =B5 | / u | (µ) | Micro sign
¶ | \266 | =B6 | P ! | (¶) | Paragraph sign
· | \267 | =B7 | ^ . | (·) | Middle dot
¸ | \270 | =B8 | , , | (¸) | Cedilla
¹ | \271 | =B9 | ^ 1 | (¹) | Superscript one
º | \272 | =BA | _ o | (º) | Masculine ordinal
» | \273 | =BB | > > | (») | Right angle quote, guillemotright
¼ | \274 | =BC | 1 4 | (¼) | Fraction one-fourth
½ | \275 | =BD | 1 2 | (½) | Fraction one-half
¾ | \276 | =BE | 3 4 | (¾) | Fraction three-fourths
¿ | \277 | =BF | ? ? | (¿) | Inverted question mark
À | \300 | =C0 | A ` | (À) | Capital A, grave accent
Á | \301 | =C1 | A ' | (Á) | Capital A, acute accent
 | \302 | =C2 | A ^ | (Â) | Capital A, circumflex accent
à | \303 | =C3 | A ~ | (Ã) | Capital A, tilde
Ä | \304 | =C4 | A " | (Ä) | Capital A, dieresis or umlaut mark
Å | \305 | =C5 | A * | (Å) | Capital A, ring
Æ | \306 | =C6 | A E | (Æ) | Capital AE dipthong (ligature)
Ç | \307 | =C7 | C , | (Ç) | Capital C, cedilla
È | \310 | =C8 | E ` | (È) | Capital E, grave accent
É | \311 | =C9 | E ' | (É) | Capital E, acute accent
Ê | \312 | =CA | E ^ | (Ê) | Capital E, circumflex accent
Ë | \313 | =CB | E " | (Ë) | Capital E, dieresis or umlaut mark
Ì | \314 | =CC | I ` | (Ì) | Capital I, grave accent
Í | \315 | =CD | I ' | (Í) | Capital I, acute accent
Î | \316 | =CE | I ^ | (Î) | Capital I, circumflex accent
Ï | \317 | =CF | I " | (Ï) | Capital I, dieresis or umlaut mark
Ð | \320 | =D0 | D - | (Ð) | Capital Eth, Icelandic
Ñ | \321 | =D1 | N ~ | (Ñ) | Capital N, tilde
Ò | \322 | =D2 | O ` | (Ò) | Capital O, grave accent
Ó | \323 | =D3 | O ' | (Ó) | Capital O, acute accent
Ô | \324 | =D4 | O ^ | (Ô) | Capital O, circumflex accent
Õ | \325 | =D5 | O ~ | (Õ) | Capital O, tilde
Ö | \326 | =D6 | O " | (Ö) | Capital O, dieresis or umlaut mark
× | \327 | =D7 | x x | (×) | Multiply sign
Ø | \330 | =D8 | O / | (Ø) | Capital O, slash
Ù | \331 | =D9 | U ` | (Ù) | Capital U, grave accent
Ú | \332 | =DA | U ' | (Ú) | Capital U, acute accent
Û | \333 | =DB | U ^ | (Û) | Capital U, circumflex accent
Ü | \334 | =DC | U " | (Ü) | Capital U, dieresis or umlaut mark
Ý | \335 | =DD | Y ' | (Ý) | Capital Y, acute accent
Þ | \336 | =DE | P | | (Þ) | Capital THORN, Icelandic
ß | \337 | =DF | s s | (ß) | Small sharp s, German (sz ligature)
à | \340 | =E0 | a ` | (à) | Small a, grave accent
á | \341 | =E1 | a ' | (á) | Small a, acute accent
â | \342 | =E2 | a ^ | (â) | Small a, circumflex accent
ã | \343 | =E3 | a ~ | (ã) | Small a, tilde
ä | \344 | =E4 | a " | (ä) | Small a, dieresis or umlaut mark
å | \345 | =E5 | a * | (å) | Small a, ring
æ | \346 | =E6 | a e | (æ) | Small ae dipthong (ligature)
ç | \347 | =E7 | c , | (ç) | Small c, cedilla
è | \350 | =E8 | e ` | (è) | Small e, grave accent
é | \351 | =E9 | e ' | (é) | Small e, acute accent
ê | \352 | =EA | e ^ | (ê) | Small e, circumflex accent
ë | \353 | =EB | e " | (ë) | Small e, dieresis or umlaut mark
ì | \354 | =EC | i ` | (ì) | Small i, grave accent
í | \355 | =ED | i ' | (í) | Small i, acute accent
î | \356 | =EE | i ^ | (î) | Small i, circumflex accent
ï | \357 | =EF | i " | (ï) | Small i, dieresis or umlaut mark
ð | \360 | =F0 | d - | (ð) | Small eth, Icelandic
ñ | \361 | =F1 | n ~ | (ñ) | Small n, tilde
ò | \362 | =F2 | o ` | (ò) | Small o, grave accent
ó | \363 | =F3 | o ' | (ó) | Small o, acute accent
ô | \364 | =F4 | o ^ | (ô) | Small o, circumflex accent
õ | \365 | =F5 | o ~ | (õ) | Small o, tilde
ö | \366 | =F6 | o " | (ö) | Small o, dieresis or umlaut mark
÷ | \367 | =F7 | - : | (÷) | Division sign
ø | \370 | =F8 | o / | (ø) | Small o, slash
ù | \371 | =F9 | u ` | (ù) | Small u, grave accent
ú | \372 | =FA | u ' | (ú) | Small u, acute accent
û | \373 | =FB | u ^ | (û) | Small u, circumflex accent
ü | \374 | =FC | u " | (ü) | Small u, dieresis or umlaut mark
ý | \375 | =FD | y ' | (ý) | Small y, acute accent
þ | \376 | =FE | p | | (þ) | Small thorn, Icelandic
ÿ | \377 | =FF | y " | (ÿ) | Small y, dieresis or umlaut mark
Notes for HTML documents
HTML entity names are given in the "MEANING" column only for ampersand, quote, less than, and greater than, which are significant in HTML syntax; and for the non-breaking space, which may be confused with ordinary space. HTML entity names exist for many other characters, but they are superfluous: the ISO-8859-1 eight-bit codes will work, by definition, on any browser.
The characters carriage return (ASCII CR) and line feed (ASCII NL, newline) are equivalent; they are treated as whitespace, except in <pre> contexts, where they force a line break. (However, a line feed is ignored if it immediately follows a carriage return.)
The horizontal tab character (ASCII HT) skips to the next tabbing column in <pre> contexts, and is treated as whitespace elsewhere.
The non-breaking space ( ) is honored even in non-<pre> contexts, and can be used to insert extra space between words, images, etc., like this: | |.
Other notes
Alternative Sun/X11 "Compose" sequences for the Icelandic "thorn" are "t h" (þ, lowercase) and "T H" (Þ, uppercase).
Note that the Sun/X11 "Compose" sequence for masculine ordinal (º) uses an underscore, while the feminine ordinal (ª) uses a minus sign. It takes a lot of imagination to come up with such ideas...
Additional references
- Martin Ramsch's iso8859-1 table.
- The HTML 2.0 Standard [Character Entity Sets] [HTML Coded Character Set]
- The HTML 3.0 specification [Latin-1 Character Entities]
- The HTML+ Discussion Document [Appendix II]
- An exhaustive entity table including HTML 2.0, HTML3.0, HTML+, with Postscript equivalents.
Composed by J. Stolfi from several sources found throughout the net.
http://www.ic.unicamp.br/~stolfi/EXPORT/www/ISO-8859-1-Encoding.html
Latin1的所有字符编码的更多相关文章
- mysql命令行修改字符编码
1.修改数据库字符编码 mysql> alter database mydb character set utf8 ; 2.创建数据库时,指定数据库的字符编码 mysql> create ...
- mysql修改数据库编码(数据库字符集)和表的字符编码的方法
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程和将表的字符编码转换成utf-8的方法,需要的朋友可以参考下. mysql将表的字符编码转换成utf-8 ...
- mysql字符编码集(乱码)问题解决
1.创建数据库 CREATE DATABASE `test` CHARACTER SET 'utf8' COLLATE 'utf8_general_ci'; 创建表 CREATE TABLE tp_w ...
- 彻底解决mysql中文乱码的办法,修改mysql解压缩版(免安装版或zip版)字符编码
MySQL会出现中文乱码的原因不外乎下列几点:1.server本身设定问题,例如server字符编码还停留在latin12.table的语系设定问题(包含character与collation)3.客 ...
- 彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, Utf-8
彻底搞清楚字符编码: ASCII, ISO_8859, GB2312,UCS, Unicode, U 1.ASCII: 0-127(128-255未使用),美国标准 2.IS0-8859-1(lati ...
- 修改数据库mysql字符编码为UTF8
Mysql数据库是一个开源的数据库,应用非常广泛.以下是修改mysql数据库的字符编码的操作过程. 步骤1:查看当前的字符编码方法 mysql> show variables like'char ...
- Linux 下查看文件字符编码和转换编码
Linux 下查看文件字符编码和转换编码 如果你需要在Linux中操作windows下的文件,那么你可能会经常遇到文件编码转换的问题.Windows中默认的文件格式是GBK(gb2312),而Linu ...
- vim字符编码设置
vim 编码方式的设置 和所有的流行文本编辑器一样,Vim 可以很好的编辑各种字符编码的文件,这当然包括UCS-2.UTF-8 等流行的 Unicode 编码方式.然而不幸的是,和很多来自 Linux ...
- linux mysql字符编码问题
发布:thatboy 来源:脚本学堂 [大 中 小] 本文介绍下,linux环境中mysql字符编码问题的解决办法,有遇到mysql编码问题的朋友,可以参考下本文的介绍,希望对你有一定的帮 ...
随机推荐
- 转:git windows中文 乱码问题解决汇总
it的Windows版本Msysgit对中文的支持不够好 .当使用时,会出现以下三种情况的中文乱码: 下面的几个文件都在git安装目录下文件夹etc内.1.ls不能显示中文目录 解决办法:在git/g ...
- 微信或手机浏览器在线显示office文件(已測试ios、android)
近期开发微信企业号,发现微信andriod版内置浏览器在打开文件方面有问题,可是ios版没有问题.原因是ios版使用的是safari浏览器 支持文档直接打开.可是andriod版使用的是腾讯浏览器x5 ...
- ListView 使用方法(Asp.Net)
您将须要用到的独有数据绑定控件. Fritz Onion 代码下载位置: ExtremeASPNET2008_03.exe (192 KB) Browse the Code Online 文件夹 L ...
- Fragment保持状态切换,fragment状态切换
在使用Activity管理多个Fragment时,每次切换Fragment使用的是replace,结果导致出现xxx is not currently in the FragmentManager异常 ...
- 浅谈IM软件业务知识——非对称加密,RSA算法,数字签名,公钥,私钥
概述 首先了解一下相关概念:RSA算法:1977年由Ron Rivest.Adi Shamirh和LenAdleman发明的.RSA就是取自他们三个人的名字. 算法基于一个数论:将两个大素数相乘很ea ...
- iOS依据字符串计算UITextView高度
iOS计算字符串高度,有须要的朋友能够參考下. 方法一:ios7.0之前适用 /** @method 获取指定宽度width,字体大小fontSize,字符串value的高度 @param value ...
- oc 可变參数传递
- (id)initWithFrame:(CGRect)frame delegate:(id<SGFocusImageFrameDelegate>)delegate focusImageI ...
- HDU1257 最小拦截系统 【贪婪】
最小拦截系统 Time Limit: 2000/1000 MS (Java/Others) Memory Limit: 65536/32768 K (Java/Others) Total Sub ...
- (升级版)Spark从入门到精通(Scala编程、案例实战、高级特性、Spark内核源码剖析、Hadoop高端)
本课程主要讲解目前大数据领域最热门.最火爆.最有前景的技术——Spark.在本课程中,会从浅入深,基于大量案例实战,深度剖析和讲解Spark,并且会包含完全从企业真实复杂业务需求中抽取出的案例实战.课 ...
- Linux 安装ibus极点五笔输入法备忘录
Linux 安装 ibus 五笔输入法备忘录 useful?: https://github.com/definite/ibus-table-chinese 一. yum install ibus* ...