Unicode Support in Oracle9i Database

Content: Existing Unicode Support in 8i New Unicode Features in 9i Character Semantics Support in 9i Unicode reliable data type as NCHAR in 9i VARCHAR2 vs. NVARCHAR2 for Unicode UTF-8 or UTF-16 for NCHAR Unicode Access Interface Unicode Migration and Compatibility

Uploaded by

rp.anbu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

268 views

Unicode Support in Oracle9i Database

Uploaded by

rp.anbu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 19

Unicode Support in Oracle9i Database

®
Topics

• Customer New Requirement

• Existing Unicode Support in 8i
• New Unicode Features in 9i
• Character Semantics Support in 9i
• Unicode reliable data type as NCHAR in 9i
• VARCHAR2 vs. NVARCHAR2 for Unicode
• UTF-8 or UTF-16 for NCHAR
• Unicode Access Interface
• Unicode Migration and Compatibility
• Conclusion
®
Requirements

• Consistent character length semantics

– Column definition close to visual length
– SQL standard recommends varchar2(10) to hold
10 characters
• Reliable Unicode data type
– To be independent of database character set
– To enable gradually migration to Unicode
– To enable third vendor/component for Unicode
• Easy programming for other environments
– Java, XML, Window NT
• High information density for performance
– Storage efficiency for Asian or European data
®
Existing Unicode Support in 8i

• UTF-8 as database character set

– UTF8 for ASCII based platform
– UTFE for EBCDIC based platform
• UTF-8 as client character set
• UTF-16 for OCI bind/define buffer
• UCHAR/UVARCHAR as UTF-16 in ProC
• UTF-16 for ODBC and OLEDB
• UTF-16 for JDBC
• Unicode binary sort

®
New Unicode features in Oracle9i

• Character semantics support for text column

• Reliable Unicode datatype as NCHAR
• UTF-16 support for Oracle Call Interface(OCI)
• Complete Unicode support for
ODBC/OLEDB /JDBC
• Unicode and ISO14651 based multilingual sort
• Unicode enabled Oracle utilities such as
SQL*Loader
• Unicode based locale builder for locale
customization

®
Length Semantics Support in Oracle9i
• A new semantics
– CHAR ( size [BYTE | CHAR] )
– VARCHAR2 ( size [BYTE | CHAR] )
• It meets Ansi SQL standard
– the size is defined in character in the standard, but
most vender implemented in byte
• It fulfills Customer’s requirement
– Portable database schema
– Character set independent
– Same data size across server, client, and third
middle tier
– Easy migration to Unicode support

®
Character Semantics Support in 9i - Cont.

• Character semantics column

– Explicit with quantifier: varchar2(30 char)
– Implicit with NLS_LENGTH_SEMANTICS setting to
‘char’ for varchar2(30)
• Same semantics support for PL/SQL variable
• Character length constraint checking
• SQL functions for different flavor of length
semantics in Unicode
– like/like2/like4/likec
– lengthb/length/length2/length4/lengthc
– substrb/substr/substr2/substr4/substrc
– compose/decompose

®
Character Semantics Support in 9i - Cont.

• UTF-16 semantics
– UTF8 encodes surrogate by a pair of three bytes
– It has the same semantics as UTF-16 and has the match
between varchar2(10 char) and wchar(10)
– It has the same binary sorting order as UTF-16
• UTF-32 semantics
– AL32UTF8 follows UTF-8 standard by encoding
surrogate in 4 bytes
– It has the same semantics as UTF-32 in coding point
and the same binary order
• Conversion between UTF8 and AL32UTF8
– AL32UTF8 can be used at client for the UTF-8
compliance

®
Reliable Unicode Data Type Support

• NCHAR, NVARCHAR2, NCLOB

- 8i NCHAR: any fixed width character set
- Defined in SQL standard
• Unicode Character set encoding
- UTF-8, UTF-16
- Independent on DB character set
• Character Length Semantics Only
- Avoid migration issues in the future
• Support Unicode in non-Unicode database

®
Inter-operability With Other
Data Types
• Explicit Conversion Functions
- TO_NCHAR()
- TO_CHAR()
- ROWIDTONCHAR()
- CHARTOROWID()
- TO_CLOB()
- TO_NCLOB()
- TO_NUMBER()
- TO_DATE()
- TO_TIMESTAMP()
- TO_TIMESTAMP_TZ()
- TO_YMINTERVAL()
…...

®
Inter-operability - cont.
• Implicit Conversion
- Between NCHAR and CHAR types
- Between NCHAR and NUMBER, DATE, ROWID, RAW,
CLOBs etc.
• Conversion Direction:
- Insert/select into/update/assignment operations:
convert to target
- Comparison, concatenation: SQL CHAR to SQL
NCHAR avoid any data loss
- SQL function: convert to first string parameter
• Makes migration to SQL NCHAR much
easier
®
Data Loss Exception Handling
• NLS Parameter:
- NLS_NCHAR_CONV_EXCP
- Dynamically changed in each session
- Effective for both explicit and implicit conversions
• Smoothness of operation vs. accuracy of
operation

®
SQL Unicode String
Processing
• Same level of support as CHAR
- Can use NCHAR same way as CHAR.
• SQL functions support for NCHAR
- SUBSTR, LENGTH, INSTR, LIKE, CONCAT,
LPAD/RPAD, LTRIM, RTRIM, NLS_SORT,
NLS_UPPER, NLS_LOWER etc.

- UNISTR, ASCIISTR
• Mixed type arguments
- CONCAT(nchar,char) - result type is based on first string
parameter
• Easy programming
®
Unicode Database vs. Unicode Data Type

• Codepoint semantics for UTF8 will make

Oracle database a virtual UTF-16 database
– There is no need to use NVARCHAR2 unless it is
for the storage compression for Asian data
– The migration effort is minimum as there is no
need to convert VARCHAR2 into NVARCHAR2
– It is recommended to use one-step migration for a
new system
• NCHAR/NVARCHAR2
– It allows incremental migration to Unicode
– It is always a Unicode column
– It can use UTF-16 encoding natively

®
NCHAR Choice between
UTF8 and AL16UTF16
• UTF-8
- ASCII compatible
- Internet friendly: HTML, XML etc.
- More space efficient for western languages
• UTF-16
- More space efficient for Asian languages
- Faster in string processing
- Supported by JAVA, WINDOWS etc.

®
Programming Interfaces
• OCI Unicode Support
- Support UTF-16 bind/define buffers
- Unicode meta data, SQL_TEXT, error
messages through mode parameter
- Unicode interface support independent on
server or client character set
- Character length semantics
• PL/SQL
• Pro*C/C++: Unicode support through UCHAR,
UVARCHAR
• JDBC
• ODBC/OLEDB
®
Migration, Conversion and
Compatibility
• Old NCHAR to 9i NCHAR migration
• Migration to Unicode Columns
ALTER TABLE tname MODIFY col (NCHAR(n))

• Convert whole database to Unicode

database

®
Migration, Conversion and
Compatibility
• Character length semantics
- Database schema
ALTER TABLE tname MODIFY col (CHAR(n CHAR))
- Modify application to be in sync with database length
semantics

Example: PL/SQL migration

1. Set NLS_LENGTH_SEMANTICS to CHAR
2. Apply %ROWTYPE, or explicit CHAR quantifier
3. Change substrb, lengthb and instrb to substr,
length and
instr

®
Summary

• A flexible and complete Unicode support

– Character semantics on UTF-8 or Unicode data type
– All major access interfaces support Unicode
• High performance by high information density
– UTF-8 for Western scripts
– UTF-16 for Asian scripts
• Easy programming
– Same length semantics between database and
other components
• Easy migration
– One step migration or gradual migration

SQL Notes (201) .
100% (2)
SQL Notes (201) .
119 pages
LogicalReasoningTest4 Solutions PDF
No ratings yet
LogicalReasoningTest4 Solutions PDF
14 pages
Injection Pump Calibration Data: 1. Test Conditions
86% (14)
Injection Pump Calibration Data: 1. Test Conditions
2 pages
Serial Port Complete: COM Ports, USB Virtual COM Ports, and Ports for Embedded Systems
From Everand
Serial Port Complete: COM Ports, USB Virtual COM Ports, and Ports for Embedded Systems
Jan Axelson
3.5/5 (9)
Handout 6 - Advanced Data Types, DateTime Functions, Views
No ratings yet
Handout 6 - Advanced Data Types, DateTime Functions, Views
11 pages
SQL
100% (1)
SQL
181 pages
From ASCII To UTF-8-RolandSchock
No ratings yet
From ASCII To UTF-8-RolandSchock
52 pages
SQL Commands
No ratings yet
SQL Commands
24 pages
Changing The Database Character Set Character Set)
No ratings yet
Changing The Database Character Set Character Set)
5 pages
The Inevitable Unicode Project: Tikkana Akurati, Upgrade & Unicode Specialist
No ratings yet
The Inevitable Unicode Project: Tikkana Akurati, Upgrade & Unicode Specialist
11 pages
DTC Unicode Programming
No ratings yet
DTC Unicode Programming
14 pages
Data Types (Transact-SQL) - Microsoft Docs
No ratings yet
Data Types (Transact-SQL) - Microsoft Docs
124 pages
Working With Data Types
No ratings yet
Working With Data Types
31 pages
Unit 2 Advanced SQL
No ratings yet
Unit 2 Advanced SQL
21 pages
Chapter 1: Creating
No ratings yet
Chapter 1: Creating
1 page
db2 Unicode-Dbcs
No ratings yet
db2 Unicode-Dbcs
30 pages
Data Types Inn SQL Server
No ratings yet
Data Types Inn SQL Server
2 pages
Rdbms 2
No ratings yet
Rdbms 2
21 pages
A String Is A String: BINARY, CHARACTER, LONG: Chapter 1: Creating
No ratings yet
A String Is A String: BINARY, CHARACTER, LONG: Chapter 1: Creating
1 page
How Strings Are Stored
No ratings yet
How Strings Are Stored
18 pages
Ott-03-0035 Unicode and C Business Functions
No ratings yet
Ott-03-0035 Unicode and C Business Functions
11 pages
SQL String Functions
No ratings yet
SQL String Functions
13 pages
SQL Lecture 2
No ratings yet
SQL Lecture 2
41 pages
33-International Considerations in SQL Server
No ratings yet
33-International Considerations in SQL Server
10 pages
AU14C04-Codepages and DB2
No ratings yet
AU14C04-Codepages and DB2
33 pages
Structure of PL/SQL: Baktagul Imasheva, Senior-Lecturer B.imasheva@iitu - KZ
No ratings yet
Structure of PL/SQL: Baktagul Imasheva, Senior-Lecturer B.imasheva@iitu - KZ
49 pages
Structure of PL/SQL: Baktagul Imasheva, Senior-Lecturer B.imasheva@iitu - KZ
No ratings yet
Structure of PL/SQL: Baktagul Imasheva, Senior-Lecturer B.imasheva@iitu - KZ
49 pages
DB Charset Migration Best Practices
No ratings yet
DB Charset Migration Best Practices
18 pages
Datatypes
No ratings yet
Datatypes
5 pages
Working With SQL Server 2014 Data Types
No ratings yet
Working With SQL Server 2014 Data Types
21 pages
Structured Query Language SQL: Htet Mon Win Banking Division ACE Data Systems
No ratings yet
Structured Query Language SQL: Htet Mon Win Banking Division ACE Data Systems
37 pages
Oracle String Functions
No ratings yet
Oracle String Functions
30 pages
ABAP Language: New Features With Relases 6.10 and 6.20: Andreas Blumenthal, SAP AG
No ratings yet
ABAP Language: New Features With Relases 6.10 and 6.20: Andreas Blumenthal, SAP AG
153 pages
01 - SQL - Oracle SQL Training Manual
No ratings yet
01 - SQL - Oracle SQL Training Manual
91 pages
DataBase Day Two
No ratings yet
DataBase Day Two
27 pages
Practical File SQL Queries DBMS
0% (1)
Practical File SQL Queries DBMS
30 pages
(IT) 08 Physical DM Dan Implementasi DB - DDL - DML
No ratings yet
(IT) 08 Physical DM Dan Implementasi DB - DDL - DML
68 pages
SQL - String Functions: Name Desc Ription
No ratings yet
SQL - String Functions: Name Desc Ription
13 pages
Introduction To SQL
No ratings yet
Introduction To SQL
13 pages
SQL Questions
No ratings yet
SQL Questions
3 pages
Oracle Datatypes: Data Types For Oracle 8 To Oracle 11g
No ratings yet
Oracle Datatypes: Data Types For Oracle 8 To Oracle 11g
9 pages
Laboratory Manual On MYSQL For Year IV Database Students (Power and Control Engineering Stream Students)
No ratings yet
Laboratory Manual On MYSQL For Year IV Database Students (Power and Control Engineering Stream Students)
20 pages
Changing The NLS
No ratings yet
Changing The NLS
17 pages
Chapter 5
No ratings yet
Chapter 5
58 pages
Sybase Blogs Questions
100% (1)
Sybase Blogs Questions
43 pages
What Is Difference Between CHAR and Varchar2
No ratings yet
What Is Difference Between CHAR and Varchar2
3 pages
Datatypes in Oracle
No ratings yet
Datatypes in Oracle
11 pages
Character Set Change in DB
No ratings yet
Character Set Change in DB
8 pages
SQL (Structured Query Language) : What Is A Table?
No ratings yet
SQL (Structured Query Language) : What Is A Table?
57 pages
Cheat Sheet Data Type Oracle
No ratings yet
Cheat Sheet Data Type Oracle
1 page
Data Type
100% (2)
Data Type
1 page
DBA Chapter 5
No ratings yet
DBA Chapter 5
21 pages
1) Introduction To SQL
No ratings yet
1) Introduction To SQL
20 pages
Structured Query Language
No ratings yet
Structured Query Language
37 pages
PL SQL Functions
No ratings yet
PL SQL Functions
14 pages
PL - SQL Data Type
No ratings yet
PL - SQL Data Type
7 pages
TAW11 1 Abap Unicode
No ratings yet
TAW11 1 Abap Unicode
16 pages
Lecture 2
No ratings yet
Lecture 2
16 pages
SQL Server Data Types
No ratings yet
SQL Server Data Types
3 pages
Oracle Database Basics
No ratings yet
Oracle Database Basics
18 pages
Unit - IV (Database Handling in PHP& Mysql)
No ratings yet
Unit - IV (Database Handling in PHP& Mysql)
6 pages
Routing in Wireless Mesh Networks
From Everand
Routing in Wireless Mesh Networks
Raghav Kumar
No ratings yet
Successfully Tested Types of Banknote Handling Machine - Customer-Operated Machines
No ratings yet
Successfully Tested Types of Banknote Handling Machine - Customer-Operated Machines
35 pages
Amt 4203 Finals - Module 5 Practice Problem 1
No ratings yet
Amt 4203 Finals - Module 5 Practice Problem 1
3 pages
Question Bank PP Sem 2
100% (3)
Question Bank PP Sem 2
4 pages
Breathing at Depth Physiologic and Clinical Aspects of Diving While Breathing Compressed Gas.
No ratings yet
Breathing at Depth Physiologic and Clinical Aspects of Diving While Breathing Compressed Gas.
26 pages
Strand7 R3 Quick Start Guide For R24 Users
No ratings yet
Strand7 R3 Quick Start Guide For R24 Users
36 pages
Encyclopedia of Computer Science and Technology, Second Edition Volume II Laplante All Chapters Instant Download
100% (1)
Encyclopedia of Computer Science and Technology, Second Edition Volume II Laplante All Chapters Instant Download
65 pages
Becoming AI Engineer Learning Path
No ratings yet
Becoming AI Engineer Learning Path
4 pages
Manual Catia v5 Abgam
No ratings yet
Manual Catia v5 Abgam
321 pages
Solution-F2024_MATH110_FinalExam (2)
No ratings yet
Solution-F2024_MATH110_FinalExam (2)
24 pages
Biological Microscopes 2005 - 2
No ratings yet
Biological Microscopes 2005 - 2
24 pages
Instructions For Parts Books: Note About Country Codes
100% (1)
Instructions For Parts Books: Note About Country Codes
247 pages
S21 - DLD Lab MidTerm
No ratings yet
S21 - DLD Lab MidTerm
1 page
Weight Calculation
No ratings yet
Weight Calculation
14 pages
Oop in Java Diamond
No ratings yet
Oop in Java Diamond
9 pages
Grade 7 (Secondary 1) Sasmo: Δabc 21 cm 20 cm Δabc 11 cm 8 cm 6 cm 12 cm
No ratings yet
Grade 7 (Secondary 1) Sasmo: Δabc 21 cm 20 cm Δabc 11 cm 8 cm 6 cm 12 cm
2 pages
Atomic Theory Science Presentation Colorful 3D Style - 20240609 - 160039 - 0000
No ratings yet
Atomic Theory Science Presentation Colorful 3D Style - 20240609 - 160039 - 0000
25 pages
Ecosystem
No ratings yet
Ecosystem
8 pages
Compressor Efficiency
100% (1)
Compressor Efficiency
15 pages
memorijaKHX1600C9D3B1K2 8GX
No ratings yet
memorijaKHX1600C9D3B1K2 8GX
2 pages
da ds notes
No ratings yet
da ds notes
27 pages
ME2610 Exam Jan-2021
No ratings yet
ME2610 Exam Jan-2021
7 pages
Product Keys 2019
No ratings yet
Product Keys 2019
3 pages
Math Sci 2022 Mechanics of The Game Activities and Criteria of The Contest Proper
No ratings yet
Math Sci 2022 Mechanics of The Game Activities and Criteria of The Contest Proper
14 pages
Practical Medium Data Analytics With Python: Pydata Nyc 2013
No ratings yet
Practical Medium Data Analytics With Python: Pydata Nyc 2013
48 pages
268 Codigo Activo
No ratings yet
268 Codigo Activo
7 pages
A. I. SABRA - Theories of Light - Descartes.
100% (1)
A. I. SABRA - Theories of Light - Descartes.
64 pages
Confined Space Hazard Evaluation Survey Form
100% (1)
Confined Space Hazard Evaluation Survey Form
2 pages
Cloud Resource Virtualization
100% (1)
Cloud Resource Virtualization
39 pages