KingbaseES错误分析 -- “requested character too large”

一、适用于：

本文档使用于KingbaseES所有版本。

二、问题现象：

使用从其他数据库迁移到KingbaseES数据库的自定义函数、存储过程、Package包..出现以下错误信息：

错误：所请求的字符太大

ERROR: requested character too large

三、问题原因：

1.报错环境的KingbaseES数据库字符集为GBK，目前不支持使用ASCII取中文值。

2.数据库自定义函数、存储过程、Package包里面有使用ASCII函数。

3.在GBK环境使用自定义函数、存储过程、Package包传入参数包含中文或者全角字符。

四、解决方法：

1.对传入参数的字符串进行过滤，只取数字或者英文字符：

参考以下函数进行改写处理：

CREATE OR REPLACE FUNCTION FUNC_GETLASTNUMBER2(STR TEXT) RETURNS TEXT AS $$

 DECLARE I INT=0;

 LENSTR INT=LENGTH(STR);

 STR1 TEXT ;

 STRRE TEXT ='';

 STRREOR INTEGER =0;

BEGIN

 WHILE I<=LENSTR LOOP

  --STR1=SUBSTRING(STR,I,1);  --原始函数使用substring，改为substrb就可以过滤掉中文或者全角字符

  STR1=SUBSTRB(STR,I,1);

  IF(ASCII(STR1)>=48 AND ASCII(STR1)<=57) THEN

	IF(STRREOR=1) THEN

		STRRE='';

		STRREOR=0;

	END IF;

	STRRE=STRRE || STR1;

  ELSE

  STRREOR=1;---进入了非数字检索

  -- SET STRRE=''

  END IF;

   I=I+1;

  END LOOP;

 RETURN(STRRE);

END;

$$LANGUAGE PLPGSQL;

SELECT * FROM func_getlastnumber2('00000000收拾．获取；?');

 func_getlastnumber2

---------------------

 000000000

(1 row)

注意：substr、substrb、substring函数的区别

以KingbaseES数据库UTF8编码为例（1个汉字占3个字节，GBK编码1个汉字占2个字节）：

cmc=# select substr('KingbaseES数据库',11,3),substrb('KingbaseES数据库',11,3),substring('KingbaseES数据库',11,3);

 substr | substrb | substring

--------+---------+-----------

 数据库 | 数      | 数据库

 #使用 substrb 截取长度为3的字符串时，只能截取到一个字符，而使用substr、substring可以截取到三个字符。

差异：

substr 按字符截取

substrb 按字节截取

substring 按字符截取

2.使用ASCII函数对传入的中文字符串全角字符转半角字符：

参考以下函数进行改写处理：

CREATE FUNCTION hex_to_dec(in_hex TEXT)

RETURNS INT

IMMUTABLE STRICT LANGUAGE sql AS

$body$

  SELECT CAST(CAST(('x' || CAST($1 AS text)) AS bit(8)) AS INT);

$body$;

create or replace Function Trimall(v_Str Varchar2) Return String Is

    --将全角字符转换为半角字符，统一输出格式

    v_Retval Varchar2(4000) := '';

    c        Varchar2(4) := ''; --当前字节

    Cc       Varchar2(10) := ''; --当前双字节

    c_Code   Number(8); --当前字节ascii码（十进制）

    Code1    Number;

    Code2    Number;

    n_Loop   Number(4);

    n_Len    Number(8);

    Begin

      n_Loop := 1;

      n_Len  := Lengthb(v_Str);

      While n_Loop <= n_Len loop

      	--c      := Substrb(v_Str, n_Loop, 1);

		c      := sys.Substrb(v_Str, n_Loop, 1);     --在有kdb_orafce插件的时候，指定sys.Substrb进行处理。

      	--c      := encode(Substrb(v_Str, n_Loop, 1)::bytea,'hex');  --在有kdb_orafce插件的时候，可以用此函数处理。

        Cc     := Substrb(v_Str, n_Loop, 2);

        --c := ' ';

        c_Code := Ascii(c); --获取当前字节ascii码（十进制）

        Code1  := hex_to_dec(Substr(replace(utl_raw.cast_to_raw(Cc),'\x',''), 1, 2));

        Code2  := hex_to_dec(Substr(replace(utl_raw.cast_to_raw(Cc),'\x',''), 3, 2));

        If c_Code = Code1 Then

          --单字节字符

          v_Retval := v_Retval || c;

          n_Loop   := n_Loop + 1;

        Elsif Code1 = 163 Then

          --双字节字符（常用全角字符）

          v_Retval := v_Retval || Chr(Code2 - 128);

          n_Loop   := n_Loop + 2;

        Elsif Code1 > 163 Then

          --汉字

          v_Retval := v_Retval || Cc;

          n_Loop   := n_Loop + 2;

        Elsif Code1 = 161 And Code2 = 161 Then

          --全角空格

          v_Retval := v_Retval || ' ';

          n_Loop   := n_Loop + 2;

        Else

          --其他双字节字符

          v_Retval := v_Retval || Cc;

          n_Loop   := n_Loop + 2;

        End If;

      End Loop;

    Return v_Retval;

    Exception

      When Others Then

        raise notice 'exception----%',sqlerrm;

    	return;

  End;

select Trimall('住＇宅类档案，中．获取；?') from dual;

        trimall

-----------------------

 住'宅类档案,中.获取;?

(1 row)

注意：在数据库有kdb_orafce扩展插件时

删除插件 drop extension kdb_orafce。

不能删除 kdb_orafce扩展插件时通过指定sys.substrb函数解决。

不能删除 kdb_orafce扩展插件时通过encode函数可以避免，但是使用此函数会导致ascii值不正确。

结合使用utl_raw程序包处理。需要安装create extension kdb_raw扩展插件。

五、关于ASCII码：

ASCII简介：

ASCII 码使用指定的 7 位或 8 位二进制数组合来表示 128 或 256 种可能的字符。标准 ASCII 码也叫基础ASCII码，使用 7 位二进制数来表示所有的大写和小写字母，数字 0 到 9、标点符号，以及在美式英语中使用的特殊控制字符。

其中：

0～31及127(共33个)是控制字符或通信专用字符（其余为可显示字符）如控制符：LF(换行)、CR(回车)、FF(换页)、DEL(删除)、BS(退格)、BEL(响铃)等；通信专用字符：SOH（文头）、EOT（文尾）、ACK（确认）等；

ASCII值为8、9、10 和13 分别转换为退格、制表、换行和回车字符。它们并没有特定的图形显示，但会依不同的应用程序，而对文本显示有不同的影响。

32～126(共95个)是字符(32是空格)，其中48～57为0到9十个阿拉伯数字。

65～90为26个大写英文字母，97～122号为26个小写英文字母，其余为一些标点符号、运算符号等。

同时还要注意，在标准ASCII中，其最高位(b7)用作奇偶校验位。所谓奇偶校验，是指在代码传送过程中用来检验是否出现错误的一种方法，一般分奇校验和偶校验两种。奇校验规定：正确的代码一个字节中1的个数必须是奇数，若非奇数，则在最高位b7添1；偶校验规定：正确的代码一个字节中1的个数必须是偶数，若非偶数，则在最高位b7添1。

后128个称为扩展ASCII码。许多基于x86的系统都支持使用扩展（或“高”）ASCII。扩展ASCII 码允许将每个字符的第8 位用于确定附加的128 个特殊符号字符、外来语字母和图形符号。

ASCII产生

在计算机中，所有的数据在存储和运算时都要使用二进制数表示（因为计算机用高电平和低电平分别表示1和0），例如，像a、b、c、d这样的52个字母（包括大写）、以及0、1等数字还有一些常用的符号（例如*、#、@等）在计算机中存储时也要使用二进制数来表示，而具体用哪些二进制数字表示哪个符号，当然每个人都可以约定自己的一套（这就叫编码），而大家如果要想互相通信而不造成混乱，那么大家就必须使用相同的编码规则，于是美国有关的标准化组织就出台了所谓的ASCII编码，统一规定了上述常用符号用哪些二进制数来表示。

美国标准信息交换代码是由美国国家标准学会(American National Standard Institute , ANSI )制定的，标准的单字节字符编码方案，用于基于文本的数据。起始于50年代后期，在1967年定案。它最初是美国国家标准，供不同计算机在相互通信时用作共同遵守的西文字符编码标准，它已被国际标准化组织（International Organization for Standardization, ISO）定为国际标准，称为ISO 646标准。适用于所有拉丁文字字母。

ASCII码算法：

A在ASCII中定义为01000001，也就是十进制65，有了这个标准后，当我们输入A时，计算机就可以通过ASCII码知道输入的字符的二进制编码是01000001。而没有这样的标准，我们就必须自己想办法告诉计算机我们输入了一个A；没有这样的标准，我们在别的机器上就需要重新编码以告诉计算机我们要输入A。ASCII码指的不是十进制，是二进制。只是用十进制表示习惯一点罢了，比如在ascii码中，A的二进制编码为01000001，如果用十进制表示是65，用十六进制表示就是41H。

在ASCII码表中，只包括了一些字符、数字、标点符号的信息表示，这主要是因为计算机是美国发明的，在英文下面，我们使用ASCII表示就足够了！但是在汉字输入下面，用ASCII码就不能表示了，而汉字只是中国的通用表示，所以如果我们要在计算机中输入汉字，就必须有一个像ascii码的标准来表示每一个汉字，这就是中国的汉字国标码，它定义了汉字在计算机中的一个表示标准。通过这个标准，但我们输入汉字的时候，我们的输入码就转换为区位码，通过唯一的区位码得到这个汉字的字形码并显示出来。

汉字编码：

0-127 是 7位ASCII 码的范围，是国际标准。

至于汉字，不同的字符集用的ascii 码的范围也不一样，常用的汉字字符集有GB2312-80,GBK,Big5,unicode 等。下面我重点说一说最常用的GB_2312 的字符集。

GB_2312 字符集是目前最常用的汉字编码标准，windows 95/98/2000 中使用的 GBK字符集就包含了GB2312，或者说和GB2312 兼容，GB_2312 字符集包含了 6763个的简体汉字，和682 个标准中文符号。在这个标准中，每个汉字用2个字节来表示，每个字节的ascii码为 161-254 (16 进制A1 - FE)，第一个字节对应于区码的1-94 区，第二个字节对应于位码的1-94 位。

161-254 其实很好记忆，在英文字符中，可打印的字符范围为33-126。将这对数加上128（或者说最高位置1），就可以得到汉字使用的字符的范围。

ASCII码大致可以分三部分：

1.ASCII不可打印控制字符：

ASCII表上的数字0–31分配给了控制字符，用于控制像打印机等一些外围设备。例如，12代表换页/新页功能。此命令指示打印机跳到下一页的开头。（参详ASCII码表中0-31、127）

ASCII code (Decimal)	ASCII code (Binary)	ASCII code (Octal)	ASCII code (Hex)	Char	Description
00	0	0	0	NULL	Null character
01	1	1	1	SOH	Start of Header
02	10	2	2	STX	Start of Text
03	11	3	3	ETX	End of Text, hearts card suit
04	100	4	4	EOT	End of Transmission, diamonds card suit
05	101	5	5	ENQ	Enquiry, clubs card suit
06	110	6	6	ACK	Acknowledgement, spade card suit
07	111	7	7	BEL	Bell
08	1000	10	8	BS	Backspace
09	1001	11	9	HT	Horizontal Tab
10	1010	12	a	LF	Line feed
11	1011	13	b	VT	Vertical Tab, male symbol, symbol for Mars
12	1100	14	c	FF	Form feed, female symbol, symbol for Venus
13	1101	15	d	CR	Carriage return
14	1110	16	e	SO	Shift Out
15	1111	17	f	SI	Shift In
16	10000	20	10	DLE	Data link escape
17	10001	21	11	DC1	Device control 1
18	10010	22	12	DC2	Device control 2
19	10011	23	13	DC3	Device control 3
20	10100	24	14	DC4	Device control 4
21	10101	25	15	NAK	NAK Negative-acknowledge
22	10110	26	16	SYN	Synchronous idle
23	10111	27	17	ETB	End of trans. block
24	11000	30	18	CAN	Cancel
25	11001	31	19	EM	End of medium
26	11010	32	1a	SUB	Substitute
27	11011	33	1b	ESC	Escape
28	11100	34	1c	FS	File separator
29	11101	35	1d	GS	Group separator
30	11110	36	1e	RS	Record separator
31	11111	37	1f	US	Unit separator
127	1111111	177	7f	DEL	Delete

2.ASCII可打印字符：

数字 32–126 分配给了能在键盘上找到的字符，当您查看或打印文档时就会出现。数字127代表 DELETE 命令。

ASCII code (Decimal)	ASCII code (Binary)	ASCII code (Octal)	ASCII code (Hex)	Char	Description
32	100000	40	20	space	Space
33	100001	41	21	!	Exclamation mark
34	100010	42	22	"	Double quotes ; Quotation mark ; speech marks
35	100011	43	23	#	Number sign
36	100100	44	24	$	Dollar sign
37	100101	45	25	%	Percent sign
38	100110	46	26	&	Ampersand
39	100111	47	27	'	Single quote or Apostrophe
40	101000	50	28	(	round brackets or parentheses, opening round bracket
41	101001	51	29	)	parentheses or round brackets, closing parentheses
42	101010	52	2a	*	Asterisk
43	101011	53	2b	+	Plus sign
44	101100	54	2c	,	Comma
45	101101	55	2d	-	Hyphen, minus sign
46	101110	56	2e	.	Dot, full stop
47	101111	57	2f	/	Slash, forward slash, fraction bar, division slash
48	110000	60	30	0	number zero
49	110001	61	31	1	number one
50	110010	62	32	2	number two
51	110011	63	33	3	number three
52	110100	64	34	4	number four
53	110101	65	35	5	number five
54	110110	66	36	6	number six
55	110111	67	37	7	number seven
56	111000	70	38	8	number eight
57	111001	71	39	9	number nine
58	111010	72	3a	:	Colon
59	111011	73	3b	;	Semicolon
60	111100	74	3c	<	Less-than sign
61	111101	75	3d	=	Equals sign
62	111110	76	3e	>	Greater-than sign ; Inequality
63	111111	77	3f	?	Question mark
64	1000000	100	40	@	At sign
65	1000001	101	41	A	Capital letter A
66	1000010	102	42	B	Capital letter B
67	1000011	103	43	C	Capital letter C
68	1000100	104	44	D	Capital letter D
69	1000101	105	45	E	Capital letter E
70	1000110	106	46	F	Capital letter F
71	1000111	107	47	G	Capital letter G
72	1001000	110	48	H	Capital letter H
73	1001001	111	49	I	Capital letter I
74	1001010	112	4a	J	Capital letter J
75	1001011	113	4b	K	Capital letter K
76	1001100	114	4c	L	Capital letter L
77	1001101	115	4d	M	Capital letter M
78	1001110	116	4e	N	Capital letter N
79	1001111	117	4f	O	Capital letter O
80	1010000	120	50	P	Capital letter P
81	1010001	121	51	Q	Capital letter Q
82	1010010	122	52	R	Capital letter R
83	1010011	123	53	S	Capital letter S
84	1010100	124	54	T	Capital letter T
85	1010101	125	55	U	Capital letter U
86	1010110	126	56	V	Capital letter V
87	1010111	127	57	W	Capital letter W
88	1011000	130	58	X	Capital letter X
89	1011001	131	59	Y	Capital letter Y
90	1011010	132	5a	Z	Capital letter Z
91	1011011	133	5b	[	square brackets or box brackets, opening bracket
92	1011100	134	5c	\	Backslash, reverse slash
93	1011101	135	5d	]	box brackets or square brackets, closing bracket
94	1011110	136	5e	^	Circumflex accent or Caret
95	1011111	137	5f	_	underscore, understrike, underbar or low line
96	1100000	140	60	`	Grave accent
97	1100001	141	61	a	Lowercase letter a, minuscule a
98	1100010	142	62	b	Lowercase letter b, minuscule b
99	1100011	143	63	c	Lowercase letter c, minuscule c
100	1100100	144	64	d	Lowercase letter d, minuscule d
101	1100101	145	65	e	Lowercase letter e, minuscule e
102	1100110	146	66	f	Lowercase letter f, minuscule f
103	1100111	147	67	g	Lowercase letter g, minuscule g
104	1101000	150	68	h	Lowercase letter h, minuscule h
105	1101001	151	69	i	Lowercase letter i, minuscule i
106	1101010	152	6a	j	Lowercase letter j, minuscule j
107	1101011	153	6b	k	Lowercase letter k, minuscule k
108	1101100	154	6c	l	Lowercase letter l, minuscule l
109	1101101	155	6d	m	Lowercase letter m, minuscule m
110	1101110	156	6e	n	Lowercase letter n, minuscule n
111	1101111	157	6f	o	Lowercase letter o, minuscule o
112	1110000	160	70	p	Lowercase letter p, minuscule p
113	1110001	161	71	q	Lowercase letter q, minuscule q
114	1110010	162	72	r	Lowercase letter r, minuscule r
115	1110011	163	73	s	Lowercase letter s, minuscule s
116	1110100	164	74	t	Lowercase letter t, minuscule t
117	1110101	165	75	u	Lowercase letter u, minuscule u
118	1110110	166	76	v	Lowercase letter v, minuscule v
119	1110111	167	77	w	Lowercase letter w, minuscule w
120	1111000	170	78	x	Lowercase letter x, minuscule x
121	1111001	171	79	y	Lowercase letter y, minuscule y
122	1111010	172	7a	z	Lowercase letter z, minuscule z
123	1111011	173	7b	{	braces or curly brackets, opening braces
124	1111100	174	7c	\|	vertical-bar, vbar, vertical line or vertical slash
125	1111101	175	7d	}	curly brackets or braces, closing curly brackets
126	1111110	176	7e	~	Tilde ; swung dash

3.扩展ASCII打印字符：

扩展的ASCII字符满足了对更多字符的需求。扩展的ASCII包含ASCII中已有的128个字符，又增加了128个字符，总共是256个。即使有了这些更多的字符，许多语言还是包含无法压缩到256个字符中的符号。因此出现了一些ASCII的变体来囊括地区性字符和符号。例如，许多软件程序把ASCII表（又称作ISO8859-1）用于北美、西欧、澳大利亚和非洲的语言。

ASCII code (Decimal)	ASCII code (Binary)	ASCII code (Octal)	ASCII code (Hex)	Char	Description
128	10000000	200	80	Ç	Majuscule C-cedilla
129	10000001	201	81	ü	letter u with umlaut or diaeresis, u-umlaut
130	10000010	202	82	é	letter e with acute accent or e-acute
131	10000011	203	83	â	letter a with circumflex accent or a-circumflex
132	10000100	204	84	ä	letter a with umlaut or diaeresis, a-umlaut
133	10000101	205	85	à	letter a with grave accent
134	10000110	206	86	å	letter a with a ring
135	10000111	207	87	ç	Minuscule c-cedilla
136	10001000	210	88	ê	letter e with circumflex accent or e-circumflex
137	10001001	211	89	ë	letter e with umlaut or diaeresis ; e-umlauts
138	10001010	212	8a	è	letter e with grave accent
139	10001011	213	8b	ï	letter i with umlaut or diaeresis ; i-umlaut
140	10001100	214	8c	î	letter i with circumflex accent or i-circumflex
141	10001101	215	8d	ì	letter i with grave accent
142	10001110	216	8e	Ä	letter A with umlaut or diaeresis ; A-umlaut
143	10001111	217	8f	Å	Capital letter A with a ring
144	10010000	220	90	É	Capital letter E with acute accent or E-acute
145	10010001	221	91	æ	Latin diphthong ae in lowercase
146	10010010	222	92	Æ	Latin diphthong AE in uppercase
147	10010011	223	93	ô	letter o with circumflex accent or o-circumflex
148	10010100	224	94	ö	letter o with umlaut or diaeresis ; o-umlaut
149	10010101	225	95	ò	letter o with grave accent
150	10010110	226	96	û	letter u with circumflex accent or u-circumflex
151	10010111	227	97	ù	letter u with grave accent
152	10011000	230	98	ÿ	Lowercase letter y with diaeresis
153	10011001	231	99	Ö	Letter O with umlaut or diaeresis ; O-umlaut
154	10011010	232	9a	Ü	Letter U with umlaut or diaeresis ; U-umlaut
155	10011011	233	9b	ø	Lowercase slashed zero or empty set
156	10011100	234	9c	£	Pound sign ; symbol for the pound sterling
157	10011101	235	9d	Ø	Uppercase slashed zero or empty set
158	10011110	236	9e	×	Multiplication sign
159	10011111	237	9f	ƒ	Function sign ; f with hook sign ; florin sign
160	10100000	240	a0	á	Lowercase letter a with acute accent or a-acute
161	10100001	241	a1	í	Lowercase letter i with acute accent or i-acute
162	10100010	242	a2	ó	Lowercase letter o with acute accent or o-acute
163	10100011	243	a3	ú	Lowercase letter u with acute accent or u-acute
164	10100100	244	a4	ñ	eñe, enie, spanish letter enye, lowercase n with tilde
165	10100101	245	a5	Ñ	Spanish letter enye, uppercase N with tilde, EÑE, enie
166	10100110	246	a6	ª	feminine ordinal indicator
167	10100111	247	a7	º	masculine ordinal indicator
168	10101000	250	a8	¿	Inverted question marks
169	10101001	251	a9		Registered trademark symbol
170	10101010	252	aa	¬	Logical negation symbol
171	10101011	253	ab	½	One half
172	10101100	254	ac	¼	Quarter, one fourth
173	10101101	255	ad	¡	Inverted exclamation marks
174	10101110	256	ae	«	Angle quotes, guillemets, right-pointing quotation mark
175	10101111	257	af	»	Guillemets, angle quotes, left-pointing quotation marks
176	10110000	260	b0	░	Graphic character, low density dotted
177	10110001	261	b1	▒	Graphic character, medium density dotted
178	10110010	262	b2	▓	Graphic character, high density dotted
179	10110011	263	b3	│	Box drawing character single vertical line
180	10110100	264	b4	┤	Box drawing character single vertical and left line
181	10110101	265	b5	Á	Capital letter A with acute accent or A-acute
182	10110110	266	b6	Â	Letter A with circumflex accent or A-circumflex
183	10110111	267	b7	À	Letter A with grave accent
184	10111000	270	b8		Copyright symbol
185	10111001	271	b9	╣	Box drawing character double line vertical and left
186	10111010	272	ba	║	Box drawing character double vertical line
187	10111011	273	bb	╗	Box drawing character double line upper right corner
188	10111100	274	bc	╝	Box drawing character double line lower right corner
189	10111101	275	bd	¢	Cent symbol
190	10111110	276	be	¥	YEN and YUAN sign
191	10111111	277	bf	┐	Box drawing character single line upper right corner
192	11000000	300	c0	└	Box drawing character single line lower left corner
193	11000001	301	c1	┴	Box drawing character single line horizontal and up
194	11000010	302	c2	┬	Box drawing character single line horizontal down
195	11000011	303	c3	├	Box drawing character single line vertical and right
196	11000100	304	c4	─	Box drawing character single horizontal line
197	11000101	305	c5	┼	Box drawing character single line horizontal vertical
198	11000110	306	c6	ã	Lowercase letter a with tilde or a-tilde
199	11000111	307	c7	Ã	Capital letter A with tilde or A-tilde
200	11001000	310	c8	╚	Box drawing character double line lower left corner
201	11001001	311	c9	╔	Box drawing character double line upper left corner
202	11001010	312	ca	╩	Box drawing character double line horizontal and up
203	11001011	313	cb	╦	Box drawing character double line horizontal down
204	11001100	314	cc	╠	Box drawing character double line vertical and right
205	11001101	315	cd	═	Box drawing character double horizontal line
206	11001110	316	ce	╬	Box drawing character double line horizontal vertical
207	11001111	317	cf	¤	Generic currency sign
208	11010000	320	d0	ð	Lowercase letter eth
209	11010001	321	d1	Ð	Capital letter Eth
210	11010010	322	d2	Ê	Letter E with circumflex accent or E-circumflex
211	11010011	323	d3	Ë	Letter E with umlaut or diaeresis, E-umlaut
212	11010100	324	d4	È	Capital letter E with grave accent
213	11010101	325	d5	ı	Lowercase dot less i
214	11010110	326	d6	Í	Capital letter I with acute accent or I-acute
215	11010111	327	d7	Î	Letter I with circumflex accent or I-circumflex
216	11011000	330	d8	Ï	Letter I with umlaut or diaeresis ; I-umlaut
217	11011001	331	d9	┘	Box drawing character single line lower right corner
218	11011010	332	da	┌	Box drawing character single line upper left corner
219	11011011	333	db	█	Block, graphic character
220	11011100	334	dc	▄	Bottom half block
221	11011101	335	dd	¦	Vertical broken bar
222	11011110	336	de	Ì	Capital letter I with grave accent
223	11011111	337	df	▀	Top half block
224	11100000	340	e0	Ó	Capital letter O with acute accent or O-acute
225	11100001	341	e1	ß	Letter Eszett ; scharfes S or sharp S
226	11100010	342	e2	Ô	Letter O with circumflex accent or O-circumflex
227	11100011	343	e3	Ò	Capital letter O with grave accent
228	11100100	344	e4	õ	Lowercase letter o with tilde or o-tilde
229	11100101	345	e5	Õ	Capital letter O with tilde or O-tilde
230	11100110	346	e6	µ	Lowercase letter Mu ; micro sign or micron
231	11100111	347	e7	þ	Lowercase letter Thorn
232	11101000	350	e8	Þ	Capital letter Thorn
233	11101001	351	e9	Ú	Capital letter U with acute accent or U-acute
234	11101010	352	ea	Û	Letter U with circumflex accent or U-circumflex
235	11101011	353	eb	Ù	Capital letter U with grave accent
236	11101100	354	ec	ý	Lowercase letter y with acute accent
237	11101101	355	ed	Ý	Capital letter Y with acute accent
238	11101110	356	ee	¯	Macron symbol
239	11101111	357	ef	´	Acute accent
240	11110000	360	f0	≡	Congruence relation symbol
241	11110001	361	f1	±	Plus-minus sign
242	11110010	362	f2	‗	underline or underscore
243	11110011	363	f3	¾	three quarters, three-fourths
244	11110100	364	f4	¶	Paragraph sign or pilcrow ; end paragraph mark
245	11110101	365	f5	§	Section sign
246	11110110	366	f6	÷	The division sign ; Obelus
247	11110111	367	f7	¸	cedilla
248	11111000	370	f8	°	Degree symbol
249	11111001	371	f9	¨	Diaresis
250	11111010	372	fa	·	Interpunct or space dot
251	11111011	373	fb	¹	Superscript one, exponent 1, first power
252	11111100	374	fc	³	Superscript three, exponent 3, cube, third power
253	11111101	375	fd	²	Superscript two, exponent 2, square, second power
254	11111110	376	fe	■	black square
255	11111111	377	ff	nbsp	Non-breaking space or no-break space