Unicode Code Charts and a Scheme for Filtering Some Invisible Characters
Unicode code chart / 0000-0FFF
[Unicode block list]
0000-007F: C0 Controls and Basic Latin
0080-00FF: C1 Controls and Latin-1 Supplement
0100-017F: Latin Extended-A
0180-024F: Latin Extended-B
0250-02AF: IPA Extensions
02B0-02FF: Spacing Modifier Letters
0300-036F: Combining Diacritical Marks
0370-03FF: Greek and Coptic
0400-04FF: Cyrillic
0500-052F: Cyrillic Supplement
0530-058F: Armenian
0590-05FF: Hebrew
0600-06FF: Arabic
0700-074F: Syriac
0750-077F: Arabic Supplement
0780-07BF: Thaana
07C0-07FF: N'Ko
0800-085F: Avestan and Pahlavi
0860-087F: Mandaic
0880-08AF: Samaritan
0900-097F: Devanagari
0980-09FF: Bengali
0A00-0A7F: Gurmukhi
0A80-0AFF: Gujarati
0B00-0B7F: Oriya
0B80-0BFF: Tamil
0C00-0C7F: Telugu
0C80-0CFF: Kannada
0D00-0D7F: Malayalam
0D80-0DFF: Sinhala
0E00-0E7F: Thai
0E80-0EFF: Lao
0F00-0FFF: Tibetan
1000-109F: Myanmar
10A0-10FF: Georgian
1100-11FF: Hangul Jamo
1200-137F: Ethiopic
1380-139F: Ethiopic Supplement
13A0-13FF: Cherokee
1400-167F: Unified Canadian Aboriginal Syllabics
1680-169F: Ogham
16A0-16FF: Runic
1700-171F: Tagalog
1720-173F: Hanunóo
1740-175F: Buhid
1760-177F: Tagbanwa
1780-17FF: Khmer
1800-18AF: Mongolian
18B0-18FF: Cham
1900-194F: Limbu
1950-197F: Tai Le
1980-19DF: New Tai Lue
19E0-19FF: Khmer Symbols
1A00-1A1F: Buginese
1A20-1A5F: Batak
1A80-1AEF: Lanna
1B00-1B7F: Balinese
1B80-1BB0: Sundanese
1BC0-1BFF: Pahawh Hmong
1C00-1C4F: Lepcha
1C50-1C7F: Ol Chiki
1C80-1CDF: Meithei/Manipuri
1D00-1D7F: Phonetic Extensions
1D80-1DBF: Phonetic Extensions Supplement
1DC0-1DFF: Combining Diacritical Marks Supplement
1E00-1EFF: Latin Extended Additional
1F00-1FFF: Greek Extended
2000-206F: General Punctuation
2070-209F: Superscripts and Subscripts
20A0-20CF: Currency Symbols
20D0-20FF: Combining Diacritical Marks for Symbols
2100-214F: Letterlike Symbols
2150-218F: Number Forms
2190-21FF: Arrows
2200-22FF: Mathematical Operators
2300-23FF: Miscellaneous Technical
2400-243F: Control Pictures
2440-245F: Optical Character Recognition
2460-24FF: Enclosed Alphanumerics
2500-257F: Box Drawing
2580-259F: Block Elements
25A0-25FF: Geometric Shapes
2600-26FF: Miscellaneous Symbols
2700-27BF: Dingbats
27C0-27EF: Miscellaneous Mathematical Symbols-A
27F0-27FF: Supplemental Arrows-A
2800-28FF: Braille Patterns
2900-297F: Supplemental Arrows-B
2980-29FF: Miscellaneous Mathematical Symbols-B
2A00-2AFF: Supplemental Mathematical Operators
2B00-2BFF: Miscellaneous Symbols and Arrows
2C00-2C5F: Glagolitic
2C60-2C7F: Latin Extended-C
2C80-2CFF: Coptic
2D00-2D2F: Georgian Supplement
2D30-2D7F: Tifinagh
2D80-2DDF: Ethiopic Extended
2E00-2E7F: Supplemental Punctuation
2E80-2EFF: CJK Radicals Supplement
2F00-2FDF: Kangxi Radicals
2FF0-2FFF: Ideographic Description Characters
3000-303F: CJK Symbols and Punctuation
3040-309F: Hiragana
30A0-30FF: Katakana
3100-312F: Bopomofo
3130-318F: Hangul Compatibility Jamo
3190-319F: Kanbun
31A0-31BF: Bopomofo Extended
31C0-31EF: CJK Strokes
31F0-31FF: Katakana Phonetic Extensions
3200-32FF: Enclosed CJK Letters and Months
3300-33FF: CJK Compatibility
3400-4DBF: CJK Unified Ideographs Extension A
4DC0-4DFF: Yijing Hexagram Symbols
4E00-9FBF: CJK Unified Ideographs
A000-A48F: Yi Syllables
A490-A4CF: Yi Radicals
A500-A61F: Vai
A660-A6FF: Unified Canadian Aboriginal Syllabics Supplement
A700-A71F: Modifier Tone Letters
A720-A7FF: Latin Extended-D
A800-A82F: Syloti Nagri
A840-A87F: Phags-pa
A880-A8DF: Saurashtra
A900-A97F: Javanese
A980-A9DF: Chakma
AA00-AA3F: Varang Kshiti
AA40-AA6F: Sorang Sompeng
AA80-AADF: Newari
AB00-AB5F: Việt Thái (Tai Viet)
AB80-ABA0: Kayah Li
AC00-D7AF: Hangul Syllables
D800-DBFF: High-half zone of UTF-16 (high surrogates)
DC00-DFFF: Low-half zone of UTF-16 (low surrogates)
E000-F8FF: Private Use Area
F900-FAFF: CJK Compatibility Ideographs
FB00-FB4F: Alphabetic Presentation Forms
FB50-FDFF: Arabic Presentation Forms-A
FE00-FE0F: Variation Selectors
FE10-FE1F: Vertical Forms
FE20-FE2F: Combining Half Marks
FE30-FE4F: CJK Compatibility Forms
FE50-FE6F: Small Form Variants
FE70-FEFF: Arabic Presentation Forms-B
FF00-FFEF: Halfwidth and Fullwidth Forms
FFF0-FFFF: Specials
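The block list above can also be queried at runtime. The following is a minimal sketch, assuming a standard JDK: `Character.UnicodeBlock.of` is the JDK's own lookup, while the class name `BlockLookup` and the helper `blockOf` are illustrative names that do not appear in the original article. The block boundaries it reports follow the Unicode version bundled with the JVM, so they may differ from the ranges listed above.

```java
final class BlockLookup {

    /**
     * Returns the JDK's name for the Unicode block that contains the code point,
     * or "NO_BLOCK" when the code point lies outside every block the JVM knows.
     */
    static String blockOf(int codePoint) {
        Character.UnicodeBlock block = Character.UnicodeBlock.of(codePoint);
        return block == null ? "NO_BLOCK" : block.toString();
    }

    public static void main(String[] args) {
        // Sample code points: Latin letter, accented Latin letter, Cyrillic, CJK, Hiragana.
        int[] samples = {0x0041, 0x00E9, 0x0416, 0x4E2D, 0x3042};
        for (int cp : samples) {
            System.out.printf("U+%04X -> %s%n", cp, blockOf(cp));
        }
    }
}
```

Asking the JDK avoids maintaining a hand-written range table, at the cost of depending on whichever Unicode version the runtime ships.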
U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0000 | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
0010 | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
0020 | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
0030 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
0040 | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
0050 | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
0060 | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
0070 | p | q | r | s | t | u | v | w | x | y | z | { | \| | } | ~ | DEL |
0080 | PAD | HOP | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
0090 | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | SGCI | SCI | CSI | ST | OSC | PM | APC |
00A0 | NBSP | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | SHY | ® | ¯ |
00B0 | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
00C0 | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
00D0 | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
00E0 | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
00F0 | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0100 | Ā | ā | Ă | ă | Ą | ą | Ć | ć | Ĉ | ĉ | Ċ | ċ | Č | č | Ď | ď |
0110 | Đ | đ | Ē | ē | Ĕ | ĕ | Ė | ė | Ę | ę | Ě | ě | Ĝ | ĝ | Ğ | ğ |
0120 | Ġ | ġ | Ģ | ģ | Ĥ | ĥ | Ħ | ħ | Ĩ | ĩ | Ī | ī | Ĭ | ĭ | Į | į |
0130 | İ | ı | Ĳ | ĳ | Ĵ | ĵ | Ķ | ķ | ĸ | Ĺ | ĺ | Ļ | ļ | Ľ | ľ | Ŀ |
0140 | ŀ | Ł | ł | Ń | ń | Ņ | ņ | Ň | ň | ŉ | Ŋ | ŋ | Ō | ō | Ŏ | ŏ |
0150 | Ő | ő | Œ | œ | Ŕ | ŕ | Ŗ | ŗ | Ř | ř | Ś | ś | Ŝ | ŝ | Ş | ş |
0160 | Š | š | Ţ | ţ | Ť | ť | Ŧ | ŧ | Ũ | ũ | Ū | ū | Ŭ | ŭ | Ů | ů |
0170 | Ű | ű | Ų | ų | Ŵ | ŵ | Ŷ | ŷ | Ÿ | Ź | ź | Ż | ż | Ž | ž | ſ |
0180 | ƀ | Ɓ | Ƃ | ƃ | Ƅ | ƅ | Ɔ | Ƈ | ƈ | Ɖ | Ɗ | Ƌ | ƌ | ƍ | Ǝ | Ə |
0190 | Ɛ | Ƒ | ƒ | Ɠ | Ɣ | ƕ | Ɩ | Ɨ | Ƙ | ƙ | ƚ | ƛ | Ɯ | Ɲ | ƞ | Ɵ |
01A0 | Ơ | ơ | Ƣ | ƣ | Ƥ | ƥ | Ʀ | Ƨ | ƨ | Ʃ | ƪ | ƫ | Ƭ | ƭ | Ʈ | Ư |
01B0 | ư | Ʊ | Ʋ | Ƴ | ƴ | Ƶ | ƶ | Ʒ | Ƹ | ƹ | ƺ | ƻ | Ƽ | ƽ | ƾ | ƿ |
01C0 | ǀ | ǁ | ǂ | ǃ | Ǆ | ǅ | ǆ | Ǉ | ǈ | ǉ | Ǌ | ǋ | ǌ | Ǎ | ǎ | Ǐ |
01D0 | ǐ | Ǒ | ǒ | Ǔ | ǔ | Ǖ | ǖ | Ǘ | ǘ | Ǚ | ǚ | Ǜ | ǜ | ǝ | Ǟ | ǟ |
01E0 | Ǡ | ǡ | Ǣ | ǣ | Ǥ | ǥ | Ǧ | ǧ | Ǩ | ǩ | Ǫ | ǫ | Ǭ | ǭ | Ǯ | ǯ |
01F0 | ǰ | Ǳ | ǲ | ǳ | Ǵ | ǵ | Ƕ | Ƿ | Ǹ | ǹ | Ǻ | ǻ | Ǽ | ǽ | Ǿ | ǿ |

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0200 | Ȁ | ȁ | Ȃ | ȃ | Ȅ | ȅ | Ȇ | ȇ | Ȉ | ȉ | Ȋ | ȋ | Ȍ | ȍ | Ȏ | ȏ |
0210 | Ȑ | ȑ | Ȓ | ȓ | Ȕ | ȕ | Ȗ | ȗ | Ș | ș | Ț | ț | Ȝ | ȝ | Ȟ | ȟ |
0220 | Ƞ | ȡ | Ȣ | ȣ | Ȥ | ȥ | Ȧ | ȧ | Ȩ | ȩ | Ȫ | ȫ | Ȭ | ȭ | Ȯ | ȯ |
0230 | Ȱ | ȱ | Ȳ | ȳ | ȴ | ȵ | ȶ | ȷ | ȸ | ȹ | Ⱥ | Ȼ | ȼ | Ƚ | Ⱦ | ȿ |
0240 | ɀ | Ɂ | ||||||||||||||
0250 | ɐ | ɑ | ɒ | ɓ | ɔ | ɕ | ɖ | ɗ | ɘ | ə | ɚ | ɛ | ɜ | ɝ | ɞ | ɟ |
0260 | ɠ | ɡ | ɢ | ɣ | ɤ | ɥ | ɦ | ɧ | ɨ | ɩ | ɪ | ɫ | ɬ | ɭ | ɮ | ɯ |
0270 | ɰ | ɱ | ɲ | ɳ | ɴ | ɵ | ɶ | ɷ | ɸ | ɹ | ɺ | ɻ | ɼ | ɽ | ɾ | ɿ |
0280 | ʀ | ʁ | ʂ | ʃ | ʄ | ʅ | ʆ | ʇ | ʈ | ʉ | ʊ | ʋ | ʌ | ʍ | ʎ | ʏ |
0290 | ʐ | ʑ | ʒ | ʓ | ʔ | ʕ | ʖ | ʗ | ʘ | ʙ | ʚ | ʛ | ʜ | ʝ | ʞ | ʟ |
02A0 | ʠ | ʡ | ʢ | ʣ | ʤ | ʥ | ʦ | ʧ | ʨ | ʩ | ʪ | ʫ | ʬ | ʭ | ʮ | ʯ |
02B0 | ʰ | ʱ | ʲ | ʳ | ʴ | ʵ | ʶ | ʷ | ʸ | ʹ | ʺ | ʻ | ʼ | ʽ | ʾ | ʿ |
02C0 | ˀ | ˁ | ˂ | ˃ | ˄ | ˅ | ˆ | ˇ | ˈ | ˉ | ˊ | ˋ | ˌ | ˍ | ˎ | ˏ |
02D0 | ː | ˑ | ˒ | ˓ | ˔ | ˕ | ˖ | ˗ | ˘ | ˙ | ˚ | ˛ | ˜ | ˝ | ˞ | ˟ |
02E0 | ˠ | ˡ | ˢ | ˣ | ˤ | ˥ | ˦ | ˧ | ˨ | ˩ | ˪ | ˫ | ˬ | ˭ | ˮ | ˯ |
02F0 | ˰ | ˱ | ˲ | ˳ | ˴ | ˵ | ˶ | ˷ | ˸ | ˹ | ˺ | ˻ | ˼ | ˽ | ˾ | ˿ |

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0300 | ̀ | ́ | ̂ | ̃ | ̄ | ̅ | ̆ | ̇ | ̈ | ̉ | ̊ | ̋ | ̌ | ̍ | ̎ | ̏ |
0310 | ̐ | ̑ | ̒ | ̓ | ̔ | ̕ | ̖ | ̗ | ̘ | ̙ | ̚ | ̛ | ̜ | ̝ | ̞ | ̟ |
0320 | ̠ | ̡ | ̢ | ̣ | ̤ | ̥ | ̦ | ̧ | ̨ | ̩ | ̪ | ̫ | ̬ | ̭ | ̮ | ̯ |
0330 | ̰ | ̱ | ̲ | ̳ | ̴ | ̵ | ̶ | ̷ | ̸ | ̹ | ̺ | ̻ | ̼ | ̽ | ̾ | ̿ |
0340 | ̀ | ́ | ͂ | ̓ | ̈́ | ͅ | ͆ | ͇ | ͈ | ͉ | ͊ | ͋ | ͌ | ͍ | ͎ | CGJ |
0350 | ͐ | ͑ | ͒ | ͓ | ͔ | ͕ | ͖ | ͗ | ͘ | ͙ | ͚ | ͛ | ͜ | ͝ | ͞ | ͟ |
0360 | ͠ | ͡ | ͢ | ͣ | ͤ | ͥ | ͦ | ͧ | ͨ | ͩ | ͪ | ͫ | ͬ | ͭ | ͮ | ͯ |
0370 | ʹ | ͵ | ͺ | ; | ||||||||||||
0380 | ΄ | ΅ | Ά | · | Έ | Ή | Ί | Ό | Ύ | Ώ | ||||||
0390 | ΐ | Α | Β | Γ | Δ | Ε | Ζ | Η | Θ | Ι | Κ | Λ | Μ | Ν | Ξ | Ο |
03A0 | Π | Ρ | Σ | Τ | Υ | Φ | Χ | Ψ | Ω | Ϊ | Ϋ | ά | έ | ή | ί | |
03B0 | ΰ | α | β | γ | δ | ε | ζ | η | θ | ι | κ | λ | μ | ν | ξ | ο |
03C0 | π | ρ | ς | σ | τ | υ | φ | χ | ψ | ω | ϊ | ϋ | ό | ύ | ώ | |
03D0 | ϐ | ϑ | ϒ | ϓ | ϔ | ϕ | ϖ | ϗ | Ϙ | ϙ | Ϛ | ϛ | Ϝ | ϝ | Ϟ | ϟ |
03E0 | Ϡ | ϡ | Ϣ | ϣ | Ϥ | ϥ | Ϧ | ϧ | Ϩ | ϩ | Ϫ | ϫ | Ϭ | ϭ | Ϯ | ϯ |
03F0 | ϰ | ϱ | ϲ | ϳ | ϴ | ϵ | ϶ | Ϸ | ϸ | Ϲ | Ϻ | ϻ | ϼ | Ͻ | Ͼ | Ͽ |

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0400 | Ѐ | Ё | Ђ | Ѓ | Є | Ѕ | І | Ї | Ј | Љ | Њ | Ћ | Ќ | Ѝ | Ў | Џ |
0410 | А | Б | В | Г | Д | Е | Ж | З | И | Й | К | Л | М | Н | О | П |
0420 | Р | С | Т | У | Ф | Х | Ц | Ч | Ш | Щ | Ъ | Ы | Ь | Э | Ю | Я |
0430 | а | б | в | г | д | е | ж | з | и | й | к | л | м | н | о | п |
0440 | р | с | т | у | ф | х | ц | ч | ш | щ | ъ | ы | ь | э | ю | я |
0450 | ѐ | ё | ђ | ѓ | є | ѕ | і | ї | ј | љ | њ | ћ | ќ | ѝ | ў | џ |
0460 | Ѡ | ѡ | Ѣ | ѣ | Ѥ | ѥ | Ѧ | ѧ | Ѩ | ѩ | Ѫ | ѫ | Ѭ | ѭ | Ѯ | ѯ |
0470 | Ѱ | ѱ | Ѳ | ѳ | Ѵ | ѵ | Ѷ | ѷ | Ѹ | ѹ | Ѻ | ѻ | Ѽ | ѽ | Ѿ | ѿ |
0480 | Ҁ | ҁ | ҂ | ҃ | ҄ | ҅ | ҆ | ҈ | ҉ | Ҋ | ҋ | Ҍ | ҍ | Ҏ | ҏ | |
0490 | Ґ | ґ | Ғ | ғ | Ҕ | ҕ | Җ | җ | Ҙ | ҙ | Қ | қ | Ҝ | ҝ | Ҟ | ҟ |
04A0 | Ҡ | ҡ | Ң | ң | Ҥ | ҥ | Ҧ | ҧ | Ҩ | ҩ | Ҫ | ҫ | Ҭ | ҭ | Ү | ү |
04B0 | Ұ | ұ | Ҳ | ҳ | Ҵ | ҵ | Ҷ | ҷ | Ҹ | ҹ | Һ | һ | Ҽ | ҽ | Ҿ | ҿ |
04C0 | Ӏ | Ӂ | ӂ | Ӄ | ӄ | Ӆ | ӆ | Ӈ | ӈ | Ӊ | ӊ | Ӌ | ӌ | Ӎ | ӎ | |
04D0 | Ӑ | ӑ | Ӓ | ӓ | Ӕ | ӕ | Ӗ | ӗ | Ә | ә | Ӛ | ӛ | Ӝ | ӝ | Ӟ | ӟ |
04E0 | Ӡ | ӡ | Ӣ | ӣ | Ӥ | ӥ | Ӧ | ӧ | Ө | ө | Ӫ | ӫ | Ӭ | ӭ | Ӯ | ӯ |
04F0 | Ӱ | ӱ | Ӳ | ӳ | Ӵ | ӵ | Ӷ | ӷ | Ӹ | ӹ | ||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0500 | Ԁ | ԁ | Ԃ | ԃ | Ԅ | ԅ | Ԇ | ԇ | Ԉ | ԉ | Ԋ | ԋ | Ԍ | ԍ | Ԏ | ԏ |
0510 | ||||||||||||||||
0520 | ||||||||||||||||
0530 | Ա | Բ | Գ | Դ | Ե | Զ | Է | Ը | Թ | Ժ | Ի | Լ | Խ | Ծ | Կ | |
0540 | Հ | Ձ | Ղ | Ճ | Մ | Յ | Ն | Շ | Ո | Չ | Պ | Ջ | Ռ | Ս | Վ | Տ |
0550 | Ր | Ց | Ւ | Փ | Ք | Օ | Ֆ | ՙ | ՚ | ՛ | ՜ | ՝ | ՞ | ՟ | ||
0560 | ա | բ | գ | դ | ե | զ | է | ը | թ | ժ | ի | լ | խ | ծ | կ | |
0570 | հ | ձ | ղ | ճ | մ | յ | ն | շ | ո | չ | պ | ջ | ռ | ս | վ | տ |
0580 | ր | ց | ւ | փ | ք | օ | ֆ | և | ։ | ֊ | ||||||
0590 | ֑ | ֒ | ֓ | ֔ | ֕ | ֖ | ֗ | ֘ | ֙ | ֚ | ֛ | ֜ | ֝ | ֞ | ֟ | |
05A0 | ֠ | ֡ | ֢ | ֣ | ֤ | ֥ | ֦ | ֧ | ֨ | ֩ | ֪ | ֫ | ֬ | ֭ | ֮ | ֯ |
05B0 | ְ | ֱ | ֲ | ֳ | ִ | ֵ | ֶ | ַ | ָ | ֹ | ֻ | ּ | ֽ | ־ | ֿ | |
05C0 | ׀ | ׁ | ׂ | ׃ | ׄ | ׅ | ׆ | ׇ | ||||||||
05D0 | א | ב | ג | ד | ה | ו | ז | ח | ט | י | ך | כ | ל | ם | מ | ן |
05E0 | נ | ס | ע | ף | פ | ץ | צ | ק | ר | ש | ת | |||||
05F0 | װ | ױ | ײ | ׳ | ״ | |||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0600 | ؋ | ، | ؍ | ؎ | ؏ | |||||||||||
0610 | ؐ | ؑ | ؒ | ؓ | ؔ | ؕ | ؛ | ؞ | ؟ | |||||||
0620 | ء | آ | أ | ؤ | إ | ئ | ا | ب | ة | ت | ث | ج | ح | خ | د | |
0630 | ذ | ر | ز | س | ش | ص | ض | ط | ظ | ع | غ | |||||
0640 | ـ | ف | ق | ك | ل | م | ن | ه | و | ى | ي | ً | ٌ | ٍ | َ | ُ |
0650 | ِ | ّ | ْ | ٓ | ٔ | ٕ | ٖ | ٗ | ٘ | ٙ | ٚ | ٛ | ٜ | ٝ | ٞ | |
0660 | ٠ | ١ | ٢ | ٣ | ٤ | ٥ | ٦ | ٧ | ٨ | ٩ | ٪ | ٫ | ٬ | ٭ | ٮ | ٯ |
0670 | ٰ | ٱ | ٲ | ٳ | ٴ | ٵ | ٶ | ٷ | ٸ | ٹ | ٺ | ٻ | ټ | ٽ | پ | ٿ |
0680 | ڀ | ځ | ڂ | ڃ | ڄ | څ | چ | ڇ | ڈ | ډ | ڊ | ڋ | ڌ | ڍ | ڎ | ڏ |
0690 | ڐ | ڑ | ڒ | ړ | ڔ | ڕ | ږ | ڗ | ژ | ڙ | ښ | ڛ | ڜ | ڝ | ڞ | ڟ |
06A0 | ڠ | ڡ | ڢ | ڣ | ڤ | ڥ | ڦ | ڧ | ڨ | ک | ڪ | ګ | ڬ | ڭ | ڮ | گ |
06B0 | ڰ | ڱ | ڲ | ڳ | ڴ | ڵ | ڶ | ڷ | ڸ | ڹ | ں | ڻ | ڼ | ڽ | ھ | ڿ |
06C0 | ۀ | ہ | ۂ | ۃ | ۄ | ۅ | ۆ | ۇ | ۈ | ۉ | ۊ | ۋ | ی | ۍ | ێ | ۏ |
06D0 | ې | ۑ | ے | ۓ | ۔ | ە | ۖ | ۗ | ۘ | ۙ | ۚ | ۛ | ۜ | | ۞ | ۟ |
06E0 | ۠ | ۡ | ۢ | ۣ | ۤ | ۥ | ۦ | ۧ | ۨ | ۩ | ۪ | ۫ | ۬ | ۭ | ۮ | ۯ |
06F0 | ۰ | ۱ | ۲ | ۳ | ۴ | ۵ | ۶ | ۷ | ۸ | ۹ | ۺ | ۻ | ۼ | ۽ | ۾ | ۿ |

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0700 | ܀ | ܁ | ܂ | ܃ | ܄ | ܅ | ܆ | ܇ | ܈ | ܉ | ܊ | ܋ | ܌ | ܍ | ||
0710 | ܐ | ܑ | ܒ | ܓ | ܔ | ܕ | ܖ | ܗ | ܘ | ܙ | ܚ | ܛ | ܜ | ܝ | ܞ | ܟ |
0720 | ܠ | ܡ | ܢ | ܣ | ܤ | ܥ | ܦ | ܧ | ܨ | ܩ | ܪ | ܫ | ܬ | ܭ | ܮ | ܯ |
0730 | ܰ | ܱ | ܲ | ܳ | ܴ | ܵ | ܶ | ܷ | ܸ | ܹ | ܺ | ܻ | ܼ | ܽ | ܾ | ܿ |
0740 | ݀ | ݁ | ݂ | ݃ | ݄ | ݅ | ݆ | ݇ | ݈ | ݉ | ݊ | ݍ | ݎ | ݏ | ||
0750 | ݐ | ݑ | ݒ | ݓ | ݔ | ݕ | ݖ | ݗ | ݘ | ݙ | ݚ | ݛ | ݜ | ݝ | ݞ | ݟ |
0760 | ݠ | ݡ | ݢ | ݣ | ݤ | ݥ | ݦ | ݧ | ݨ | ݩ | ݪ | ݫ | ݬ | ݭ | ||
0770 | ||||||||||||||||
0780 | ހ | ށ | ނ | ރ | ބ | ޅ | ކ | އ | ވ | މ | ފ | ދ | ތ | ލ | ގ | ޏ |
0790 | ސ | ޑ | ޒ | ޓ | ޔ | ޕ | ޖ | ޗ | ޘ | ޙ | ޚ | ޛ | ޜ | ޝ | ޞ | ޟ |
07A0 | ޠ | ޡ | ޢ | ޣ | ޤ | ޥ | ަ | ާ | ި | ީ | ު | ޫ | ެ | ޭ | ޮ | ޯ |
07B0 | ް | ޱ | ||||||||||||||
07C0 | ||||||||||||||||
07D0 | ||||||||||||||||
07E0 | ||||||||||||||||
07F0 | ||||||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0800 | ||||||||||||||||
0810 | ||||||||||||||||
0820 | ||||||||||||||||
0830 | ||||||||||||||||
0840 | ||||||||||||||||
0850 | ||||||||||||||||
0860 | ||||||||||||||||
0870 | ||||||||||||||||
0880 | ||||||||||||||||
0890 | ||||||||||||||||
08A0 | ||||||||||||||||
08B0 | ||||||||||||||||
08C0 | ||||||||||||||||
08D0 | ||||||||||||||||
08E0 | ||||||||||||||||
08F0 | ||||||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0900 | ँ | ं | ः | ऄ | अ | आ | इ | ई | उ | ऊ | ऋ | ऌ | ऍ | ऎ | ए | |
0910 | ऐ | ऑ | ऒ | ओ | औ | क | ख | ग | घ | ङ | च | छ | ज | झ | ञ | ट |
0920 | ठ | ड | ढ | ण | त | थ | द | ध | न | ऩ | प | फ | ब | भ | म | य |
0930 | र | ऱ | ल | ळ | ऴ | व | श | ष | स | ह | ़ | ऽ | ा | ि | ||
0940 | ी | ु | ू | ृ | ॄ | ॅ | ॆ | े | ै | ॉ | ॊ | ो | ौ | ् | ||
0950 | ॐ | ॑ | ॒ | ॓ | ॔ | क़ | ख़ | ग़ | ज़ | ड़ | ढ़ | फ़ | य़ | |||
0960 | ॠ | ॡ | ॢ | ॣ | । | ॥ | ० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ |
0970 | ॰ | ॽ | ||||||||||||||
0980 | ঁ | ং | ঃ | অ | আ | ই | ঈ | উ | ঊ | ঋ | ঌ | এ | ||||
0990 | ঐ | ও | ঔ | ক | খ | গ | ঘ | ঙ | চ | ছ | জ | ঝ | ঞ | ট | ||
09A0 | ঠ | ড | ঢ | ণ | ত | থ | দ | ধ | ন | প | ফ | ব | ভ | ম | য | |
09B0 | র | ল | শ | ষ | স | হ | ় | ঽ | া | ি | ||||||
09C0 | ী | ু | ূ | ৃ | ৄ | ে | ৈ | ো | ৌ | ্ | ৎ | |||||
09D0 | ৗ | ড় | ঢ় | য় | ||||||||||||
09E0 | ৠ | ৡ | ৢ | ৣ | ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ | ||
09F0 | ৰ | ৱ | ৲ | ৳ | ৴ | ৵ | ৶ | ৷ | ৸ | ৹ | ৺ | |||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0A00 | ਁ | ਂ | ਃ | ਅ | ਆ | ਇ | ਈ | ਉ | ਊ | ਏ | ||||||
0A10 | ਐ | ਓ | ਔ | ਕ | ਖ | ਗ | ਘ | ਙ | ਚ | ਛ | ਜ | ਝ | ਞ | ਟ | ||
0A20 | ਠ | ਡ | ਢ | ਣ | ਤ | ਥ | ਦ | ਧ | ਨ | ਪ | ਫ | ਬ | ਭ | ਮ | ਯ | |
0A30 | ਰ | ਲ | ਲ਼ | ਵ | ਸ਼ | ਸ | ਹ | ਼ | ਾ | ਿ | ||||||
0A40 | ੀ | ੁ | ੂ | ੇ | ੈ | ੋ | ੌ | ੍ | ||||||||
0A50 | ਖ਼ | ਗ਼ | ਜ਼ | ੜ | ਫ਼ | |||||||||||
0A60 | ੦ | ੧ | ੨ | ੩ | ੪ | ੫ | ੬ | ੭ | ੮ | ੯ | ||||||
0A70 | ੰ | ੱ | ੲ | ੳ | ੴ | |||||||||||
0A80 | ઁ | ં | ઃ | અ | આ | ઇ | ઈ | ઉ | ઊ | ઋ | ઌ | ઍ | એ | |||
0A90 | ઐ | ઑ | ઓ | ઔ | ક | ખ | ગ | ઘ | ઙ | ચ | છ | જ | ઝ | ઞ | ટ | |
0AA0 | ઠ | ડ | ઢ | ણ | ત | થ | દ | ધ | ન | પ | ફ | બ | ભ | મ | ય | |
0AB0 | ર | લ | ળ | વ | શ | ષ | સ | હ | ઼ | ઽ | ા | િ | ||||
0AC0 | ી | ુ | ૂ | ૃ | ૄ | ૅ | ે | ૈ | ૉ | ો | ૌ | ્ | ||||
0AD0 | ૐ | |||||||||||||||
0AE0 | ૠ | ૡ | ૢ | ૣ | ૦ | ૧ | ૨ | ૩ | ૪ | ૫ | ૬ | ૭ | ૮ | ૯ | ||
0AF0 | ૱ | |||||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0B00 | ଁ | ଂ | ଃ | ଅ | ଆ | ଇ | ଈ | ଉ | ଊ | ଋ | ଌ | ଏ | ||||
0B10 | ଐ | ଓ | ଔ | କ | ଖ | ଗ | ଘ | ଙ | ଚ | ଛ | ଜ | ଝ | ଞ | ଟ | ||
0B20 | ଠ | ଡ | ଢ | ଣ | ତ | ଥ | ଦ | ଧ | ନ | ପ | ଫ | ବ | ଭ | ମ | ଯ | |
0B30 | ର | ଲ | ଳ | ଵ | ଶ | ଷ | ସ | ହ | ଼ | ଽ | ା | ି | ||||
0B40 | ୀ | ୁ | ୂ | ୃ | େ | ୈ | ୋ | ୌ | ୍ | |||||||
0B50 | ୖ | ୗ | ଡ଼ | ଢ଼ | ୟ | |||||||||||
0B60 | ୠ | ୡ | ୦ | ୧ | ୨ | ୩ | ୪ | ୫ | ୬ | ୭ | ୮ | ୯ | ||||
0B70 | ୰ | ୱ | ||||||||||||||
0B80 | ஂ | ஃ | அ | ஆ | இ | ஈ | உ | ஊ | எ | ஏ | ||||||
0B90 | ஐ | ஒ | ஓ | ஔ | க | ங | ச | ஜ | ஞ | ட | ||||||
0BA0 | ண | த | ந | ன | ப | ம | ய | |||||||||
0BB0 | ர | ற | ல | ள | ழ | வ | ஶ | ஷ | ஸ | ஹ | ா | ி | ||||
0BC0 | ீ | ு | ூ | ெ | ே | ை | ொ | ோ | ௌ | ் | ||||||
0BD0 | ௗ | |||||||||||||||
0BE0 | ௦ | ௧ | ௨ | ௩ | ௪ | ௫ | ௬ | ௭ | ௮ | ௯ | ||||||
0BF0 | ௰ | ௱ | ௲ | ௳ | ௴ | ௵ | ௶ | ௷ | ௸ | ௹ | ௺ | |||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0C00 | ఁ | ం | ః | అ | ఆ | ఇ | ఈ | ఉ | ఊ | ఋ | ఌ | ఎ | ఏ | |||
0C10 | ఐ | ఒ | ఓ | ఔ | క | ఖ | గ | ఘ | ఙ | చ | ఛ | జ | ఝ | ఞ | ట | |
0C20 | ఠ | డ | ఢ | ణ | త | థ | ద | ధ | న | ప | ఫ | బ | భ | మ | య | |
0C30 | ర | ఱ | ల | ళ | వ | శ | ష | స | హ | ా | ి | |||||
0C40 | ీ | ు | ూ | ృ | ౄ | ె | ే | ై | ొ | ో | ౌ | ్ | ||||
0C50 | ౕ | ౖ | ||||||||||||||
0C60 | ౠ | ౡ | ౦ | ౧ | ౨ | ౩ | ౪ | ౫ | ౬ | ౭ | ౮ | ౯ | ||||
0C70 | ||||||||||||||||
0C80 | ಂ | ಃ | ಅ | ಆ | ಇ | ಈ | ಉ | ಊ | ಋ | ಌ | ಎ | ಏ | ||||
0C90 | ಐ | ಒ | ಓ | ಔ | ಕ | ಖ | ಗ | ಘ | ಙ | ಚ | ಛ | ಜ | ಝ | ಞ | ಟ | |
0CA0 | ಠ | ಡ | ಢ | ಣ | ತ | ಥ | ದ | ಧ | ನ | ಪ | ಫ | ಬ | ಭ | ಮ | ಯ | |
0CB0 | ರ | ಱ | ಲ | ಳ | ವ | ಶ | ಷ | ಸ | ಹ | ಼ | ಽ | ಾ | ಿ | |||
0CC0 | ೀ | ು | ೂ | ೃ | ೄ | ೆ | ೇ | ೈ | ೊ | ೋ | ೌ | ್ | ||||
0CD0 | ೕ | ೖ | ೞ | |||||||||||||
0CE0 | ೠ | ೡ | ೦ | ೧ | ೨ | ೩ | ೪ | ೫ | ೬ | ೭ | ೮ | ೯ | ||||
0CF0 | ||||||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0D00 | ം | ഃ | അ | ആ | ഇ | ഈ | ഉ | ഊ | ഋ | ഌ | എ | ഏ | ||||
0D10 | ഐ | ഒ | ഓ | ഔ | ക | ഖ | ഗ | ഘ | ങ | ച | ഛ | ജ | ഝ | ഞ | ട | |
0D20 | ഠ | ഡ | ഢ | ണ | ത | ഥ | ദ | ധ | ന | പ | ഫ | ബ | ഭ | മ | യ | |
0D30 | ര | റ | ല | ള | ഴ | വ | ശ | ഷ | സ | ഹ | ാ | ി | ||||
0D40 | ീ | ു | ൂ | ൃ | െ | േ | ൈ | ൊ | ോ | ൌ | ് | |||||
0D50 | ൗ | |||||||||||||||
0D60 | ൠ | ൡ | ൦ | ൧ | ൨ | ൩ | ൪ | ൫ | ൬ | ൭ | ൮ | ൯ | ||||
0D70 | ||||||||||||||||
0D80 | ං | ඃ | අ | ආ | ඇ | ඈ | ඉ | ඊ | උ | ඌ | ඍ | ඎ | ඏ | |||
0D90 | ඐ | එ | ඒ | ඓ | ඔ | ඕ | ඖ | ක | ඛ | ග | ඝ | ඞ | ඟ | |||
0DA0 | ච | ඡ | ජ | ඣ | ඤ | ඥ | ඦ | ට | ඨ | ඩ | ඪ | ණ | ඬ | ත | ථ | ද |
0DB0 | ධ | න | ඳ | ප | ඵ | බ | භ | ම | ඹ | ය | ර | ල | ||||
0DC0 | ව | ශ | ෂ | ස | හ | ළ | ෆ | ් | ා | |||||||
0DD0 | ැ | ෑ | ි | ී | ු | ූ | ෘ | ෙ | ේ | ෛ | ො | ෝ | ෞ | ෟ | ||
0DE0 | ||||||||||||||||
0DF0 | ෲ | ෳ | ෴ | |||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0E00 | ก | ข | ฃ | ค | ฅ | ฆ | ง | จ | ฉ | ช | ซ | ฌ | ญ | ฎ | ฏ | |
0E10 | ฐ | ฑ | ฒ | ณ | ด | ต | ถ | ท | ธ | น | บ | ป | ผ | ฝ | พ | ฟ |
0E20 | ภ | ม | ย | ร | ฤ | ล | ฦ | ว | ศ | ษ | ส | ห | ฬ | อ | ฮ | ฯ |
0E30 | ะ | ั | า | ำ | ิ | ี | ึ | ื | ุ | ู | ฺ | ฿ | ||||
0E40 | เ | แ | โ | ใ | ไ | ๅ | ๆ | ็ | ่ | ้ | ๊ | ๋ | ์ | ํ | ๎ | ๏ |
0E50 | ๐ | ๑ | ๒ | ๓ | ๔ | ๕ | ๖ | ๗ | ๘ | ๙ | ๚ | ๛ | ||||
0E60 | ||||||||||||||||
0E70 | ||||||||||||||||
0E80 | ກ | ຂ | ຄ | ງ | ຈ | ຊ | ຍ | |||||||||
0E90 | ດ | ຕ | ຖ | ທ | ນ | ບ | ປ | ຜ | ຝ | ພ | ຟ | |||||
0EA0 | ມ | ຢ | ຣ | ລ | ວ | ສ | ຫ | ອ | ຮ | ຯ | ||||||
0EB0 | ະ | ັ | າ | ຳ | ິ | ີ | ຶ | ື | ຸ | ູ | ົ | ຼ | ຽ | |||
0EC0 | ເ | ແ | ໂ | ໃ | ໄ | ໆ | ່ | ້ | ໊ | ໋ | ໌ | ໍ | ||||
0ED0 | ໐ | ໑ | ໒ | ໓ | ໔ | ໕ | ໖ | ໗ | ໘ | ໙ | ໜ | ໝ | ||||
0EE0 | ||||||||||||||||
0EF0 | ||||||||||||||||

U+ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0F00 | ༀ | ༁ | ༂ | ༃ | ༄ | ༅ | ༆ | ༇ | ༈ | ༉ | ༊ | ་ | ༌ | ། | ༎ | ༏ |
0F10 | ༐ | ༑ | ༒ | ༓ | ༔ | ༕ | ༖ | ༗ | ༘ | ༙ | ༚ | ༛ | ༜ | ༝ | ༞ | ༟ |
0F20 | ༠ | ༡ | ༢ | ༣ | ༤ | ༥ | ༦ | ༧ | ༨ | ༩ | ༪ | ༫ | ༬ | ༭ | ༮ | ༯ |
0F30 | ༰ | ༱ | ༲ | ༳ | ༴ | ༵ | ༶ | ༷ | ༸ | ༹ | ༺ | ༻ | ༼ | ༽ | ༾ | ༿ |
0F40 | ཀ | ཁ | ག | གྷ | ང | ཅ | ཆ | ཇ | ཉ | ཊ | ཋ | ཌ | ཌྷ | ཎ | ཏ | |
0F50 | ཐ | ད | དྷ | ན | པ | ཕ | བ | བྷ | མ | ཙ | ཚ | ཛ | ཛྷ | ཝ | ཞ | ཟ |
0F60 | འ | ཡ | ར | ལ | ཤ | ཥ | ས | ཧ | ཨ | ཀྵ | ཪ | |||||
0F70 | ཱ | ི | ཱི | ུ | ཱུ | ྲྀ | ཷ | ླྀ | ཹ | ེ | ཻ | ོ | ཽ | ཾ | ཿ | |
0F80 | ྀ | ཱྀ | ྂ | ྃ | ྄ | ྅ | ྆ | ྇ | ྈ | ྉ | ྊ | ྋ | ||||
0F90 | ྐ | ྑ | ྒ | ྒྷ | ྔ | ྕ | ྖ | ྗ | ྙ | ྚ | ྛ | ྜ | ྜྷ | ྞ | ྟ | |
0FA0 | ྠ | ྡ | ྡྷ | ྣ | ྤ | ྥ | ྦ | ྦྷ | ྨ | ྩ | ྪ | ྫ | ྫྷ | ྭ | ྮ | ྯ |
0FB0 | ྰ | ྱ | ྲ | ླ | ྴ | ྵ | ྶ | ྷ | ྸ | ྐྵ | ྺ | ྻ | ྼ | ྾ | ྿ | |
0FC0 | ࿀ | ࿁ | ࿂ | ࿃ | ࿄ | ࿅ | ࿆ | ࿇ | ࿈ | ࿉ | ࿊ | ࿋ | ࿌ | ࿏ | ||
0FD0 | ࿐ | ࿑ | ||||||||||||||
0FE0 | ||||||||||||||||
0FF0 |
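Each chart row above holds 16 consecutive code points: the cell in column `c` of row `base` is the code point `base + c`. The sketch below regenerates one row in the same pipe-separated layout. The class name `ChartRow`, the method `row`, and the `SP`/`CTRL` labels are illustrative choices rather than anything from the original article, and which cells come out empty depends on the Unicode version of the running JVM.

```java
final class ChartRow {

    /** Formats the 16 code points starting at base as one pipe-separated chart row. */
    static String row(int base) {
        StringBuilder sb = new StringBuilder(String.format("%04X |", base));
        for (int column = 0; column < 16; column++) {
            sb.append(' ').append(cell(base + column)).append(" |");
        }
        return sb.toString();
    }

    /** Returns the text shown in a single cell. */
    private static String cell(int codePoint) {
        if (!Character.isDefined(codePoint)) {
            return "";          // unassigned in this JVM's Unicode version
        }
        if (Character.isISOControl(codePoint)) {
            return "CTRL";      // C0/C1 control characters and DEL have no glyph
        }
        if (codePoint == ' ') {
            return "SP";        // make the space character visible
        }
        if (codePoint == '|') {
            return "\\|";       // escape the table delimiter
        }
        return new String(Character.toChars(codePoint));
    }

    public static void main(String[] args) {
        System.out.println(row(0x0040));
        System.out.println(row(0x0400));
    }
}
```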
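import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Removes C0 control characters, DEL, C1 control characters, and NBSP from the input. */
public static String replaceUnicode(String sourceStr)
{
    String regEx = "[" +
            "\u0000-\u001F" + // C0 control characters (this also removes tab, LF, and CR)
            "\u007F-\u00A0" + // DEL, C1 control characters, and NBSP (U+00A0)
            "]";
    Pattern pattern = Pattern.compile(regEx);
    Matcher matcher = pattern.matcher(sourceStr);
    return matcher.replaceAll("");
}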
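The method above removes a fixed pair of code point ranges, which means tabs and line breaks are stripped as well, while invisible characters outside those ranges (for example the zero-width space U+200B) pass through. As a possible alternative, and not the original author's method, the following sketch filters by Unicode general category instead. The names `InvisibleCharFilter`, `strip`, and `isKept` are hypothetical; `Character.getType`, `Character.CONTROL`, and `Character.FORMAT` are standard JDK members.

```java
final class InvisibleCharFilter {

    /** Returns the input without control (Cc) and format (Cf) characters, keeping tab, LF, and CR. */
    static String strip(String input) {
        StringBuilder sb = new StringBuilder(input.length());
        input.codePoints()
             .filter(InvisibleCharFilter::isKept)
             .forEach(sb::appendCodePoint);
        return sb.toString();
    }

    /** Decides whether a single code point survives the filter. */
    private static boolean isKept(int codePoint) {
        // Ordinary whitespace controls are usually wanted in text, so keep them.
        if (codePoint == '\t' || codePoint == '\n' || codePoint == '\r') {
            return true;
        }
        int type = Character.getType(codePoint);
        // CONTROL covers U+0000-U+001F and U+007F-U+009F; FORMAT covers characters
        // such as U+200B (zero-width space), U+00AD (soft hyphen), and U+FEFF (BOM).
        return type != Character.CONTROL && type != Character.FORMAT;
    }
}
```

Unlike the range-based version, this sketch does not remove NBSP (U+00A0), which is a space separator rather than a control character; add an explicit check if that character should go too.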
To replace characters from every block instead, change the regular expression as follows:
String regEx = "[" +
    "\u0000-\u007F" + // C0 Controls and Basic Latin
    "\u0080-\u00FF" + // C1 Controls and Latin-1 Supplement
    "\u0100-\u017F" + // Latin Extended-A
    "\u0180-\u024F" + // Latin Extended-B
    "\u0250-\u02AF" + // IPA Extensions
    "\u02B0-\u02FF" + // Spacing Modifier Letters
    "\u0300-\u036F" + // Combining Diacritical Marks
    "\u0370-\u03FF" + // Greek and Coptic
    "\u0400-\u04FF" + // Cyrillic
    "\u0500-\u052F" + // Cyrillic Supplement
    "\u0530-\u058F" + // Armenian
    "\u0590-\u05FF" + // Hebrew
    "\u0600-\u06FF" + // Arabic
    "\u0700-\u074F" + // Syriac
    "\u0750-\u077F" + // Arabic Supplement
    "\u0780-\u07BF" + // Thaana
    "\u07C0-\u07FF" + // N'Ko
    "\u0800-\u085F" + // Avestan and Pahlavi
    "\u0860-\u087F" + // Mandaic
    "\u0880-\u08AF" + // Samaritan
    "\u0900-\u097F" + // Devanagari
    "\u0980-\u09FF" + // Bengali
    "\u0A00-\u0A7F" + // Gurmukhi
    "\u0A80-\u0AFF" + // Gujarati
    "\u0B00-\u0B7F" + // Oriya
    "\u0B80-\u0BFF" + // Tamil
    "\u0C00-\u0C7F" + // Telugu
    "\u0C80-\u0CFF" + // Kannada
    "\u0D00-\u0D7F" + // Malayalam
    "\u0D80-\u0DFF" + // Sinhala
    "\u0E00-\u0E7F" + // Thai
    "\u0E80-\u0EFF" + // Lao
    "\u0F00-\u0FFF" + // Tibetan
    "\u1000-\u109F" + // Myanmar
    "\u10A0-\u10FF" + // Georgian
    "\u1100-\u11FF" + // Hangul Jamo
    "\u1200-\u137F" + // Ethiopic
    "\u1380-\u139F" + // Ethiopic Supplement
    "\u13A0-\u13FF" + // Cherokee
    "\u1400-\u167F" + // Unified Canadian Aboriginal Syllabics
    "\u1680-\u169F" + // Ogham
    "\u16A0-\u16FF" + // Runic
    "\u1700-\u171F" + // Tagalog
    "\u1720-\u173F" + // Hanunóo
    "\u1740-\u175F" + // Buhid
    "\u1760-\u177F" + // Tagbanwa
    "\u1780-\u17FF" + // Khmer
    "\u1800-\u18AF" + // Mongolian
    "\u18B0-\u18FF" + // Cham
    "\u1900-\u194F" + // Limbu
    "\u1950-\u197F" + // Tai Le
    "\u1980-\u19DF" + // New Tai Lue
    "\u19E0-\u19FF" + // Khmer Symbols
    "\u1A00-\u1A1F" + // Buginese
    "\u1A20-\u1A5F" + // Batak
    "\u1A80-\u1AEF" + // Lanna
    "\u1B00-\u1B7F" + // Balinese
    "\u1B80-\u1BB0" + // Sundanese
    "\u1BC0-\u1BFF" + // Pahawh Hmong
    "\u1C00-\u1C4F" + // Lepcha
    "\u1C50-\u1C7F" + // Ol Chiki
    "\u1C80-\u1CDF" + // Meithei/Manipuri
    "\u1D00-\u1D7F" + // Phonetic Extensions
    "\u1D80-\u1DBF" + // Phonetic Extensions Supplement
    "\u1DC0-\u1DFF" + // Combining Diacritical Marks Supplement
    "\u1E00-\u1EFF" + // Latin Extended Additional
    "\u1F00-\u1FFF" + // Greek Extended
    "\u2000-\u206F" + // General Punctuation
    "\u2070-\u209F" + // Superscripts and Subscripts
    "\u20A0-\u20CF" + // Currency Symbols
    "\u20D0-\u20FF" + // Combining Diacritical Marks for Symbols
    "\u2100-\u214F" + // Letterlike Symbols
    "\u2150-\u218F" + // Number Forms
    "\u2190-\u21FF" + // Arrows
    "\u2200-\u22FF" + // Mathematical Operators
    "\u2300-\u23FF" + // Miscellaneous Technical
    "\u2400-\u243F" + // Control Pictures
    "\u2440-\u245F" + // Optical Character Recognition
    "\u2460-\u24FF" + // Enclosed Alphanumerics
    "\u2500-\u257F" + // Box Drawing
    "\u2580-\u259F" + // Block Elements
    "\u25A0-\u25FF" + // Geometric Shapes
    "\u2600-\u26FF" + // Miscellaneous Symbols
    "\u2700-\u27BF" + // Dingbats
    "\u27C0-\u27EF" + // Miscellaneous Mathematical Symbols-A
    "\u27F0-\u27FF" + // Supplemental Arrows-A
    "\u2800-\u28FF" + // Braille Patterns
    "\u2900-\u297F" + // Supplemental Arrows-B
    "\u2980-\u29FF" + // Miscellaneous Mathematical Symbols-B
    "\u2A00-\u2AFF" + // Supplemental Mathematical Operators
    "\u2B00-\u2BFF" + // Miscellaneous Symbols and Arrows
    "\u2C00-\u2C5F" + // Glagolitic
    "\u2C60-\u2C7F" + // Latin Extended-C
    "\u2C80-\u2CFF" + // Coptic
    "\u2D00-\u2D2F" + // Georgian Supplement
    "\u2D30-\u2D7F" + // Tifinagh
    "\u2D80-\u2DDF" + // Ethiopic Extended
    "\u2E00-\u2E7F" + // Supplemental Punctuation
    "\u2E80-\u2EFF" + // CJK Radicals Supplement
    "\u2F00-\u2FDF" + // Kangxi Radicals
    "\u2FF0-\u2FFF" + // Ideographic Description Characters
    "\u3000-\u303F" + // CJK Symbols and Punctuation
    "\u3040-\u309F" + // Hiragana
    "\u30A0-\u30FF" + // Katakana
    "\u3100-\u312F" + // Bopomofo
    "\u3130-\u318F" + // Hangul Compatibility Jamo
    "\u3190-\u319F" + // Kanbun
    "\u31A0-\u31BF" + // Bopomofo Extended
    "\u31C0-\u31EF" + // CJK Strokes
    "\u31F0-\u31FF" + // Katakana Phonetic Extensions
    "\u3200-\u32FF" + // Enclosed CJK Letters and Months
    "\u3300-\u33FF" + // CJK Compatibility
    "\u3400-\u4DBF" + // CJK Unified Ideographs Extension A
    "\u4DC0-\u4DFF" + // Yijing Hexagram Symbols
    "\u4E00-\u9FBF" + // CJK Unified Ideographs
    "\uA000-\uA48F" + // Yi Syllables
    "\uA490-\uA4CF" + // Yi Radicals
    "\uA500-\uA61F" + // Vai
    "\uA660-\uA6FF" + // Unified Canadian Aboriginal Syllabics Supplement
    "\uA700-\uA71F" + // Modifier Tone Letters
    "\uA720-\uA7FF" + // Latin Extended-D
    "\uA800-\uA82F" + // Syloti Nagri
    "\uA840-\uA87F" + // Phags-pa
    "\uA880-\uA8DF" + // Saurashtra
    "\uA900-\uA97F" + // Javanese
    "\uA980-\uA9DF" + // Chakma
    "\uAA00-\uAA3F" + // Varang Kshiti
    "\uAA40-\uAA6F" + // Sorang Sompeng
    "\uAA80-\uAADF" + // Newari
    "\uAB00-\uAB5F" + // Việt Thái (Tai Viet)
    "\uAB80-\uABA0" + // Kayah Li
    "\uAC00-\uD7AF" + // Hangul Syllables
    // "\uD800-\uDBFF" + // High-half zone of UTF-16 (high surrogates), left out
    // "\uDC00-\uDFFF" + // Low-half zone of UTF-16 (low surrogates), left out
    "\uE000-\uF8FF" + // Private Use Area
    "\uF900-\uFAFF" + // CJK Compatibility Ideographs
    "\uFB00-\uFB4F" + // Alphabetic Presentation Forms
    "\uFB50-\uFDFF" + // Arabic Presentation Forms-A
    "\uFE00-\uFE0F" + // Variation Selectors
    "\uFE10-\uFE1F" + // Vertical Forms
    "\uFE20-\uFE2F" + // Combining Half Marks
    "\uFE30-\uFE4F" + // CJK Compatibility Forms
    "\uFE50-\uFE6F" + // Small Form Variants
    "\uFE70-\uFEFF" + // Arabic Presentation Forms-B
    "\uFF00-\uFFEF" + // Halfwidth and Fullwidth Forms
    "\uFFF0-\uFFFF]"; // Specials
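When only a few blocks need to be removed, the hand-written ranges can be swapped for the named block classes that `java.util.regex.Pattern` supports through the `\p{In...}` syntax. The following is a minimal sketch under that assumption. The class name `BlockFilter`, the method `removeCjkAndKana`, and the choice of the three blocks are purely illustrative and are not taken from the original article.

```java
import java.util.regex.Pattern;

final class BlockFilter {

    // Named Unicode blocks replace hand-typed hex ranges; the pattern is compiled once.
    private static final Pattern CJK_AND_KANA = Pattern.compile(
            "[\\p{InCJKUnifiedIdeographs}\\p{InHiragana}\\p{InKatakana}]");

    /** Removes every character that belongs to one of the three named blocks. */
    static String removeCjkAndKana(String input) {
        return CJK_AND_KANA.matcher(input).replaceAll("");
    }

    public static void main(String[] args) {
        // The escapes spell two CJK ideographs, one Hiragana and one Katakana letter.
        System.out.println(removeCjkAndKana("abc\u4E2D\u6587\u3042\u30A2!"));
    }
}
```

Named blocks cannot contain a mistyped or reversed range, and they track the block boundaries of the JVM's Unicode version rather than a fixed table.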