1
- postgresql 6.5 multi-byte (MB) support README Jan 26 1999
1
+ postgresql 6.5 multi-byte (MB) support README Mar 23 1999
2
2
3
3
Tatsuo Ishii
4
4
t-ishii@sra.co.jp
@@ -9,11 +9,12 @@ postgresql 6.5 multi-byte (MB) support README Jan 26 1999
9
9
The MB support is intended for allowing PostgreSQL to handle
10
10
multi-byte character sets such as EUC(Extended Unix Code), Unicode and
11
11
Mule internal code. With the MB enabled you can use multi-byte
12
- character sets in regexp ,LIKE and some functions. The encoding system
13
- chosen is determined when initializing your PostgreSQL installation
14
- using initdb(1). Note that this can be overridden when creating a
15
- database using createdb(1) or create database SQL command. So you
16
- could have multiple databases with different encoding system.
12
+ character sets in regexp ,LIKE and some functions. The default
13
+ encoding system chosen is determined while initializing your
14
+ PostgreSQL installation using initdb(1). Note that this can be
15
+ overridden when you create a database using createdb(1) or create
16
+ database SQL command. So you could have multiple databases with
17
+ different encoding systems.
17
18
18
19
MB also fixes some problems concerning with 8-bit single byte
19
20
character sets including ISO8859. (I would not say all of problems
@@ -41,6 +42,9 @@ where encoding_system is one of:
41
42
LATIN3 ISO 8859-3 English and some European languages
42
43
LATIN4 ISO 8859-4 English and some European languages
43
44
LATIN5 ISO 8859-5 English and some European languages
45
+ KOI8 KOI8-R
46
+ WIN CP1251
47
+ ALT CP866
44
48
45
49
Example:
46
50
@@ -113,17 +117,20 @@ Supported encodings for PGCLIENTENCODING are:
113
117
EUC_CN Chinese EUC
114
118
EUC_KR Korean EUC
115
119
EUC_TW Taiwan EUC
116
- BIG5 Traditional chinese
120
+ BIG5 Traditional Chinese
117
121
MULE_INTERNAL Mule internal
118
122
LATIN1 ISO 8859-1 English and some European languages
119
123
LATIN2 ISO 8859-2 English and some European languages
120
124
LATIN3 ISO 8859-3 English and some European languages
121
125
LATIN4 ISO 8859-4 English and some European languages
122
126
LATIN5 ISO 8859-5 English and some European languages
127
+ KOI8 KOI8-R
128
+ WIN CP1251
129
+ ALT CP866
123
130
124
131
Note that UNICODE is not supported(yet). Also note that the
125
132
translation is not always possible. Suppose you choose EUC_JP for the
126
- backend, LATIN1 for the frotend , then some Japanese characters cannot
133
+ backend, LATIN1 for the frontend , then some Japanese characters cannot
127
134
be translated into latin. In this case, a letter cannot be represented
128
135
in the Latin character set, would be transformed as:
129
136
@@ -151,7 +158,7 @@ To return to the default encoding:
151
158
RESET CLIENT_ENCODING;
152
159
153
160
This would reset the frontend encoding to same as the backend
154
- encoding, thus no endoing translation would be performed.
161
+ encoding, thus no encoding translation would be performed.
155
162
156
163
4. References
157
164
@@ -170,8 +177,13 @@ Unicode: http://www.unicode.org/
170
177
171
178
5. History
172
179
180
+ Mar 23, 1999
181
+ * Add support for KOI8(KOI8-R), WIN(CP1251), ALT(CP866)
182
+ (thanks Oleg Broytmann for testing)
183
+ * Fix problem with MB and locale
184
+
173
185
Jan 26, 1999
174
- * Add support Big5 for fronend encoding
186
+ * Add support for Big5 for fronend encoding
175
187
(you need to create a database with EUC_TW to use Big5)
176
188
* Add regression test case for EUC_TW
177
189
(contributed by Jonah Kuo <jonahkuo@mail.ttn.com.tw>)
0 commit comments