0% found this document useful (0 votes)
50 views

UC Berkeley: Proposals From The Script Encoding Initiative

Uploaded by

Desa Sumbung
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
50 views

UC Berkeley: Proposals From The Script Encoding Initiative

Uploaded by

Desa Sumbung
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 18

UC Berkeley

Proposals from the Script Encoding Initiative

Title
Proposal for encoding additional Sundanese characters for Old Sundanese in the UCS

Permalink
https://escholarship.org/uc/item/6ss5j9zj

Author
Everson, Michael

Publication Date
2009-09-05

Peer reviewed

eScholarship.org Powered by the California Digital Library


University of California
ISO/IEC JTC1/SC2/WG2 N3666R
L2/09-251R
2009-09-05
Universal Multiple-Octet Coded Character Set
International Organization for Standardization
Organisation Internationale de Normalisation
Международная организация по стандартизации
Doc Type: Working Group Document
Title: Proposal for encoding additional Sundanese characters for Old Sundanese in the UCS
Source: UC Berkeley Script Encoding Initiative (Universal Scripts Project)
Author: Michael Everson
Status: Liaison Contribution
Action: For consideration by JTC1/SC2/WG2 and UTC
Replaces: N3648
Date: 2009-09-05

Sundanese has been written in a number of scripts. Pallawa or Pra-Nagari was first used in West Java to
write Sanskrit from the fifth to eighth centuries CE, and from Pallawa was derived Sunda Kuna or Old
Sundanese which was used in the Sunda Kingdom from the 14th to 18th centuries. Both Javanese and
Arabic script were used from the 17th to 19th centuries and the 17th to the mid-20th centuries
respectively. Latin script has had currency since the 20th century. The modern Sundanese script, called
Sunda Baku or Official Sundanese, was made official in 1996. The modern script itself was derived from
Old Sundanese, the earliest example of which is the Prasasti Kawali stone (see Figure 1).

The Sundanese script was originally more similar to other scripts of the Brahmic type, in that it made use
of conjunct characters. Consonant conjuncts are not formed productively in the modern script, which uses
the explicit SUNDANESE SIGN PAMAAEH character to show indicate the absence of the inherent vowel;
PAMAAEH does not cause Brahmic conjunct formation. (Some consonant clusters are represented in the
modern script with the encoded medial signs -ya, -ra, and -la, but these are not conjuncts.) In order to
support older orthography, the “Myanmar model” as opposed to the “Devanagari model” is used. An
explicit SUNDANESE SIGN VIRAMA is proposed here to cause true Brahmic consonant clustering. Since
PAMAAEH does not cause conjunct formation, and is always visible, and since modern users do not want
conjuncts to be formed, the Myanmar model, which has an explicit ASAT alongside a conjunct-forming
VIRAMA, is the model used here. Analogous is the modern Meetei Mayek script, which has its explicit
KILLER, alongside the older Meetei Mayek orthography, which uses a conjunct-forming VIRAMA.

This proposal also supports some additional characters used in Sanskrit.

᮫ 1BAB SUNDANESE SIGN VIRAMA


This character forms conjuncts in older orthography. This causes no interference with the modern
orthography, though care should be taken because three explictly encoded consonant signs, -ya, -ra,
and -la have been encoded for modern orthography. If virama were to precede ya, ra, or la, a con-
junct is formed, but it would look different from the modern subjoined consonants. This is a matter
of orthography. Note that in modern orthography, the explicit killer U+1BAA °᮪ SUNDANESE SIGN
PAMAAEH is regularly used; it does not cause conjunct formation. Some conjuncts have an archaic
glyph style and a modern glyph style but neither the VIRAMA nor any of the conjuncts are used in
modern standard orthography.

ᮏᮡ jya (ja ᮏ + -ya °ᮡ)  jya (ja ᮏ + virama ᮫ + ya ᮚ)


ᮠᮢ hra (ha ᮠ + -ra °ᮢ)  hra (ha ᮠ + virama ᮫ + ra ᮛ)
ᮊᮣ kla (ka ᮊ + -la °ᮣ.) ᮊᮜ kla (ka ᮊ + virama ᮫ + la ᮜ)

1
The following conjuncts (all formed with VIRAMA) are known to occur:

 ska  lka  lga  kṅa  nca


 ñja  nja  kta  pta  nta
 nda  hda  tna  sna  mpa
 mba  dma  jya  hra  dwa
 ksa  lsa  tha  mha

Note the difference between ᮮ SUNDANESE LETTER KHA and  the conjunct of ka and ha, as well
as ᮯ SUNDANESE LETTER SYA and  the conjunct of sa and ya. The archaic shapes in  kta
(modern style ) and  mpa (modern style ) are a matter of font choice. (Such a choice would
be made only for “historical” fonts; modern readers do not know or expect the archaic shapes.)

ˇᮬ 1BAC SUNDANESE CONSONANT SIGN PASANGAN MA


Like the explictly encoded consonant signs, -ya, -ra, and -la, a subjoined spiral-shaped ma is found
in historical texts. This contrasts with the conjunct form of ma as formed regularly by VIRAMA:
compare ᮕᮓᮬ padma and ᮕ padma. A similar explicit subjoined ma is found in Burmese (U+105F
ˇၟ MYANMAR CONSONANT SIGN MON MEDIAL MA). In the modern script the shape of the explicit
consonant sign ˇᮬ is unknown.

ˇᮭ 1BAD SUNDANESE CONSONANT SIGN PASANGAN WA


This contrasts with the conjunct form of wa as formed regularly by VIRAMA: compare ᮘᮭ ᮔ bwana
and ᮘᮝᮔ bwana. In the modern script the shape of the explicit consonant sign ˇᮭ is unknown.

ᮺ 1BBA SUNDANESE AVAGRAHA


Historically the avagraha deleted an initial vowel, but in older Sundanese orthography it came to
have a different function: it kills the vowel of a preceding consonant but introduces a hiatus before
an initial a-. An example is the word ᮃᮜᮥᮔᮺᮌᮥᮀ alun-agung ‘tidal wave’; this could also be written as
two words ᮃᮜᮥᮔ᮪ ᮃᮌᮥᮀ alun agung; ᮃᮜᮥᮔ᮪ᮌᮥᮀ reads alungung.

ᮻ 1BBB SUNDANESE LETTER REU


The letter ᮻ is the historic descendent of the Brahmi letter  (Devanagari ऋ) and is found on
ancient inscriptions. Although the modern orthography writes this syllable ᮛ ra + °ᮩ eu, the modern
spelling is historically a character sequence which differs from the Brahmic antecedent. For its
ã ß
form, compare the related U+1B0B BALINESE LETTER RA REPA with U+1B27 BALINESE LETTER
PA, as well as U+1B95 ᮕ SUNDANESE LETTER PA.

ᮼ 1BBC SUNDANESE LETTER LEU


The letter ᮼ is the historic descendent of the Brahmi letter  (Devanagari ऌ) and is found on
ancient inscriptions. Although the modern orthography writes this syllable ᮜ la + °ᮩ eu, the modern
spelling is historically a character sequence which differs from the Brahmic antecedent.

ᮽ 1BBD SUNDANESE LETTER BHA


Found on the Prasasta Kawali.

ᮾ 1BBE SUNDANESE LETTER FINAL K


Now written with the sequence ᮊ᮪ SUNDANESE LETTER FINAL K is found in palm leaf manuscripts.
Note that the explicit killer U+1BAA °᮪, can be used to produce a phonetic final -k, as in ᮊ᮪. The
letter ᮾ, however, is not a presentation form of ᮊ KA + °᮪ PAMAAEH. Although the archaic glyph
2
shape of conjunct -ka (modern  kta, archaic  kta) is similar to this, there is no corresponding
modern form of the character FINAL K.

ᮿ 1BBF SUNDANESE LETTER FINAL M


The letter ᮿ is found in palm leaf manuscripts. Note that the explicit killer U+1BAA °᮪ SUNDANESE
SIGN PAMAAEH, can be used to produce a phonetic final -m, as in ᮙ᮪. The letter ᮿ, however, is not a
presentation form of ᮙ MA + °᮪ PAMAAEH. Although the archaic glyph shape of conjunct -ma
(modern  mpa, archaic  mpa) is similar to this, there is no corresponding modern form of the
character FINAL M.

ꧠ 1CC0 SUNDANESE PUNCTUATION BINDU SURYA


‘Sun dot’; when used in the sequence ꧦꧠꧦ, this punctuation character helps to indicate that the
content of a manuscript is religious. When 1C80 ꧠ BINDU SURYA is used as a closing bindu (final
punctuation), 1C82 ꧢ BINDU PURNAMA is used as a comma.

ꧡ 1CC1 SUNDANESE PUNCTUATION BINDU PANGLONG


‘Half moon dot’; when used in the sequence ꧦꧡ this punctuation character helps to indicate that
the content of a manuscript is religious. See Figure 7.

ꧢ 1CC2 SUNDANESE PUNCTUATION BINDU PURNAMA


‘Full moon dot’; when used in the sequence ꧥꧢꧥ, this punctuation character helps to indicate that
the content of a manuscript is historical. When 1C80 ꧠ BINDU SURYA is used as a closing bindu
(final punctuation), 1C82 ꧢ BINDU PURNAMA is used as a comma.

ꧣ 1CC3 SUNDANESE PUNCTUATION BINDU CAKRA


‘Wheel dot’; used generally as a punctuation comma.

ꧤ1CC4 SUNDANESE PUNCTUATION BINDU LEU SATANGA


The semantic domain indicated by this punctuation character is unclear, but it occurs in manuscripts
as do the following three. It is in origin a “decorated” version of U+1BBC ᮼ SUNDANESE LETTER
LEU but it is not a leu, in the same way that ® is not an R and © is not a C.

ꧥ 1CC5 SUNDANESE PUNCTUATION BINDU KA SATANGA


When used in the sequence ꧥꧢꧥ, this punctuation character helps to indicate that the content of a
manuscript is historical. It is in origin a “decorated” version of “half” a U+1B8A ᮊ SUNDANESE
LETTER KA.

ꧦ 1CC6 SUNDANESE PUNCTUATION BINDU DA SATANGA


When used in the sequence ꧦꧠꧦ, this punctuation character helps to indicate that the content of
a manuscript is religious. It is in origin a “decorated” version of U+1B93 ᮓ SUNDANESE LETTER DA.

ꧧ1CC7 SUNDANESE PUNCTUATION BINDU BA SATANGA


When used in the sequence ꧧꧧ, this punctuation character helps to indicate that the content of
a manuscript is religious. It is in origin a “decorated” version of U+1B98 ᮘ SUNDANESE LETTER BA.
See Figure 8 for an example of contrastive use.

3
Linebreaking
The letters, avagraha, and virama behave as in Devanagari. The bindu symbols can occur in initial
position in a paragraph, or in final position at the end of a clause, sentence, or paragraph. Since half of
the bindus are derived from letters, and since sequences like ꧦꧠꧦ should not be broken, linebreaking
properties for all of these should be the same as those for letters. A space should be used if a user wishes
to cause a linebreak before or after any of the bindu symbols.

Ordering
The arrangement of characters in the code table here follows the Brahmic ordering, for transparency to
implementors of ISO/IEC 10646 and Unicode. The ha-na-ca-ra-ka order found in Javanese and Balinese
does not seem to have currency in Sundanese. Two Brahmic orderings do have some currency, however.
One of these inserts modern additions to the character set into the order according to place of articulation
and lettershape. The principle followed here is the same principle that was applied in the ordering of
Balinese; it also follows the way the characters are taught (as in the children’s primer Ngalagena, 2002).

ᮃ a >> ᮺ a > ᮄ i > ᮅ u > ᮆ ae > ᮇ o > ᮈ e > ᮉ eu >


ᮊ ka >> ᮾ k > ᮮ kha > ᮌ ga > ᮍ nga > ᮎ ca > ᮏ ja > ᮑ nya >
ᮒ ta > ᮓ da > ᮔ na > ᮕ pa > ᮘ ba > ᮽ bha > ᮙ ma >> ᮿ m >
ᮚ ya > ᮛ ra > ᮻ reu > ᮜ la > ᮼ leu > ᮝ wa > ᮞ sa > ᮠ ha >
ᮋ qa > ᮖ fa > ᮗ va > ᮟ xa > ᮯ sya > ᮐ za >
Font
The Sundanese community has developed a new, modern font which is used in the code chart below,
which they released on CD with their publication on Unicode implementation for Sundanese (Baidillah
2008). They have requested that this font should replace the font currently used in the standard. A
comparison of the glyphs used in each; the letters are given above, the matras and digits is given below.
The new font is used in top row, the old font in the bottom row:

a i u ae o e eu ka qa ga nga ca ja za nya ta da na pa fa va ba ma ya ra la wa sa xa ha kha sya


ᮃ ᮄ ᮅ ᮆ ᮇ ᮈ ᮉ ᮊ ᮋ ᮌ ᮍ ᮎ ᮏ ᮐ ᮑ ᮒ ᮓ ᮔ ᮕ ᮖ ᮗ ᮘ ᮙ ᮚ ᮛ ᮜ ᮝ ᮞ ᮟ ᮠ ᮮ ᮯ
ᮃᮄᮅᮆᮇᮈᮉᮊᮋᮌᮍᮎᮏᮐᮑᮒᮓᮔᮕᮖᮗᮘᮙᮚᮛᮜᮝᮞᮟᮠᮮᮯ
kaṁ kar kaḥ kya kra kla ki ku kae ko ke keu k 0 1 2 3 4 5 6 7 8 9
ᮊᮀ ᮊᮁ ᮊᮂ ᮊᮡ ᮊᮢ ᮊᮣ ᮊᮤ ᮊᮥ  ᮊᮧ ᮊᮨ ᮊᮩ ᮊ᮪ ᮰ ᮱ ᮲ ᮳ ᮴ ᮵ ᮶ ᮷ ᮸ ᮹
ᮊᮀ ᮊᮁ ᮊᮂ ᮊᮡ ᮊᮢ ᮊᮣ ᮊᮤ ᮊᮥ ᮦᮊ ᮊᮧ ᮊᮨ ᮊᮩ ᮊ᮪ ᮰ ᮱ ᮲ ᮳ ᮴ ᮵ ᮶ ᮷ ᮸ ᮹

Unicode Character Properties


1BAB;SUNDANESE SIGN VIRAMA;Mn;9;NSM;;;;;N;;;;;
1BAC;SUNDANESE CONSONANT SIGN PASANGAN MA;Mc;0;L;;;;;N;;;;;
1BAD;SUNDANESE CONSONANT SIGN PASANGAN WA;Mc;0;L;;;;;N;;;;;
1BBA;SUNDANESE AVAGRAHA;Lo;0;L;;;;;N;;;;;
1BBB;SUNDANESE LETTER REU;Lo;0;L;;;;;N;;;;;
1BBC;SUNDANESE LETTER LEU;Lo;0;L;;;;;N;;;;;
1BBD;SUNDANESE LETTER BHA;Lo;0;L;;;;;N;;;;;
1BBE;SUNDANESE LETTER FINAL K;Lo;0;L;;;;;N;;;;;
1BBF;SUNDANESE LETTER FINAL M;Lo;0;L;;;;;N;;;;;
1CC0;SUNDANESE PUNCTUATION BINDU SURYA;Po;0;L;;;;;N;;;;;
1CC1;SUNDANESE PUNCTUATION BINDU PANGLONG;Po;0;L;;;;;N;;;;;
1CC2;SUNDANESE PUNCTUATION BINDU PURNAMA;Po;0;L;;;;;N;;;;;
1CC3;SUNDANESE PUNCTUATION BINDU CAKRA;Po;0;L;;;;;N;;;;;
1CC4;SUNDANESE PUNCTUATION BINDU LEU SATANGA;Po;0;L;;;;;N;;;;;
1CC5;SUNDANESE PUNCTUATION BINDU KA SATANGA;Po;0;L;;;;;N;;;;;
1CC6;SUNDANESE PUNCTUATION BINDU DA SATANGA;Po;0;L;;;;;N;;;;;
1CC7;SUNDANESE PUNCTUATION BINDU BA SATANGA;Po;0;L;;;;;N;;;;;

4
Bibliography
Baidillah, Idin, et al. 2008. Direktori Aksara Sunda untuk Unicode. Bandung: Pemerintah Provinsi Jawa
Barat, Dinas Pendidikan Provinsi Jawa Barat.
de Casperis, J. G. 1975. Indonesian palaeography: a history of writing in Indonesia from the beginnings
to c. A . D . 1500. (Handbuch der Orientalistik: Dritte Abteilung: Indonesian, Malaysia und die
Philippinen: unter Einschluss der Kap-Malaien in Südafrika). Leiden & Köln: E. J. Brill.
Suryani NS, Elis. 2008. Merumat warisan karuhun rang Sunda yang terpendam dalam naskah dan
prasasti. Sumedang: Alqapring & Dinas Pendidikan Kota Tasikmalaya. ISBN 979-9462-82-7

Acknowledgements.
This project was made possible in part by a grant from the U.S. National Endowment for the Humanities,
which funded the Universal Scripts Project (part of the Script Encoding Initiative at UC Berkeley) in
respect of the Sundanese encoding. Any views, findings, conclusions or recommendations expressed in
this publication do not necessarily reflect those of the National Endowment of the Humanities.

5
1B80 Sundanese 1BBF

1B8 1B9 1BA 1BB

0 $ᮀ ᮐ ᮠ ᮰
1B80 1B90 1BA0 1BB0

1 $ᮁ ᮑ $ᮡ ᮱
1B81 1B91 1BA1 1BB1

2 $ᮂ ᮒ $ᮢ ᮲
1B82 1B92 1BA2 1BB2

3 ᮃ ᮓ $ᮣ ᮳
1B83 1B93 1BA3 1BB3

4 ᮄ ᮔ $ᮤ ᮴
1B84 1B94 1BA4 1BB4

5 ᮅ ᮕ $ᮥ ᮵
1B85 1B95 1BA5 1BB5

6 ᮆ ᮖ $ᮦ ᮶
1B86 1B96 1BA6 1BB6

7 ᮇ ᮗ $ᮧ ᮷
1B87 1B97 1BA7 1BB7

8 ᮈ ᮘ $ᮨ ᮸
1B88 1B98 1BA8 1BB8

9 ᮉ ᮙ $ᮩ ᮹
1B89 1B99 1BA9 1BB9

A ᮊ ᮚ $᮪ ᮺ
1B8A 1B9A 1BAA 1BBA

B ᮋ ᮛ ᮫ ᮻ
1B8B 1B9B 1BAB 1BBB

C ᮌ ᮜ $ᮬ ᮼ
1B8C 1B9C 1BAC 1BBC

D ᮍ ᮝ $ᮭ ᮽ
1B8D 1B9D 1BAD 1BBD

E ᮎ ᮞ ᮮᮾ
1B8E 1B9E 1BAE 1BBE

F ᮏ ᮟ ᮯ ᮿ
1B8F 1B9F 1BAF 1BBF

6 Date: 2009-09-05 Printed using UniBook™


(http://www.unicode.org/unibook/)
1B80 Sundanese 1BBF

Various signs 1BA8 $ᮨ SUNDANESE VOWEL SIGN PAMEPET


=e
1B80 $ᮀ SUNDANESE SIGN PANYECEK
1BA9 $ᮩ SUNDANESE VOWEL SIGN PANEULEUNG
= anusvara
= eu
1B81 $ᮁ SUNDANESE SIGN PANGLAYAR
= repha Virama
1B82 $ᮂ SUNDANESE SIGN PANGWISAD 1BAA $᮪ SUNDANESE SIGN PAMAAEH
= visarga
= virama
Vowels • does not form conjuncts
1BAB ᮫ SUNDANESE SIGN VIRAMA
1B83 ᮃ SUNDANESE LETTER A
• forms conjuncts in older orthography
1B84 ᮄ SUNDANESE LETTER I
1B85 ᮅ SUNDANESE LETTER U Consonant signs
1B86 ᮆ SUNDANESE LETTER AE 1BAC $ᮬ SUNDANESE CONSONANT SIGN
1B87 ᮇ SUNDANESE LETTER O PASANGAN MA
1B88 ᮈ SUNDANESE LETTER E = subjoined ma
1B89 ᮉ SUNDANESE LETTER EU 1BAD $ᮭ SUNDANESE CONSONANT SIGN
PASANGAN WA
Consonants = subjoined wa
1B8A ᮊ SUNDANESE LETTER KA
1B8B ᮋ SUNDANESE LETTER QA Additional consonants
1B8C ᮌ SUNDANESE LETTER GA 1BAE ᮮ SUNDANESE LETTER KHA
1B8D ᮍ SUNDANESE LETTER NGA 1BAF ᮯ SUNDANESE LETTER SYA
1B8E ᮎ SUNDANESE LETTER CA
1B8F ᮏ SUNDANESE LETTER JA
Digits
1B90 ᮐ SUNDANESE LETTER ZA 1BB0 ᮰ SUNDANESE DIGIT ZERO
1B91 ᮑ SUNDANESE LETTER NYA 1BB1 ᮱ SUNDANESE DIGIT ONE
1B92 ᮒ SUNDANESE LETTER TA 1BB2 ᮲ SUNDANESE DIGIT TWO
1B93 ᮓ SUNDANESE LETTER DA 1BB3 ᮳ SUNDANESE DIGIT THREE
1B94 ᮔ SUNDANESE LETTER NA 1BB4 ᮴ SUNDANESE DIGIT FOUR
1B95 ᮕ SUNDANESE LETTER PA 1BB5 ᮵ SUNDANESE DIGIT FIVE
1B96 ᮖ SUNDANESE LETTER FA 1BB6 ᮶ SUNDANESE DIGIT SIX
1B97 ᮗ SUNDANESE LETTER VA 1BB7 ᮷ SUNDANESE DIGIT SEVEN
1B98 ᮘ SUNDANESE LETTER BA 1BB8 ᮸ SUNDANESE DIGIT EIGHT
1B99 ᮙ SUNDANESE LETTER MA 1BB9 ᮹ SUNDANESE DIGIT NINE
1B9A ᮚ SUNDANESE LETTER YA
Sign
1B9B ᮛ SUNDANESE LETTER RA
1B9C ᮜ SUNDANESE LETTER LA 1BBA ᮺ SUNDANESE AVAGRAHA
1B9D ᮝ SUNDANESE LETTER WA Historic letters
1B9E ᮞ SUNDANESE LETTER SA
1BBB ᮻ SUNDANESE LETTER REU
1B9F ᮟ SUNDANESE LETTER XA
• vocalic r
1BA0 ᮠ SUNDANESE LETTER HA
1BBC ᮼ SUNDANESE LETTER LEU
Consonant signs • vocalic l
1BBD ᮽ SUNDANESE LETTER BHA
1BA1 $ᮡ SUNDANESE CONSONANT SIGN
PAMINGKAL 1BBE ᮾ SUNDANESE LETTER FINAL K
= subjoined ya 1BBF ᮿ SUNDANESE LETTER FINAL M
1BA2 $ᮢ SUNDANESE CONSONANT SIGN
PANYAKRA
= subjoined ra
1BA3 $ᮣ SUNDANESE CONSONANT SIGN PANYIKU
= subjoined la

Vowel signs
1BA4 $ᮤ SUNDANESE VOWEL SIGN PANGHULU
=i
1BA5 $ᮥ SUNDANESE VOWEL SIGN PANYUKU
=u
1BA6 $ᮦ SUNDANESE VOWEL SIGN PANAELAENG
= ae
1BA7 $ᮧ SUNDANESE VOWEL SIGN PANOLONG
=o

Printed using UniBook™ Date: 2009-09-05 7


(http://www.unicode.org/unibook/)
1CC0 Sundanese Supplement 1CCF

1CC Punctuation
1CC0 ᳀ SUNDANESE PUNCTUATION BINDU

0 ᳀ SURYA
• sun
1CC0 1CC1 ᳁ SUNDANESE PUNCTUATION BINDU
PANGLONG

1 ᳁ • half moon
1CC2 ᳂ SUNDANESE PUNCTUATION BINDU
1CC1 PURNAMA
• full moon
2 ᳂ 1CC3 ᳃ SUNDANESE PUNCTUATION BINDU
CAKRA
1CC2 • wheel
1CC4 ᳄ SUNDANESE PUNCTUATION BINDU LEU
3 ᳃ SATANGA
1CC5 ᳅ SUNDANESE PUNCTUATION BINDU KA
1CC3 SATANGA
1CC6 ᳆ SUNDANESE PUNCTUATION BINDU DA
4 ᳄ SATANGA
1CC7 ᳇ SUNDANESE PUNCTUATION BINDU BA
1CC4 SATANGA

5 ᳅
1CC5

6 ᳆
1CC6

7 ᳇
1CC7

8 Date: 2009-09-05 Printed using UniBook™


(http://www.unicode.org/unibook/)
Figures

Figure 1. The Prasasti Kawali 1 stone.

Figure 2a. The alphabet in a modern font from Suryani 2008.

Figure 2b. The digits in a modern font from Suryani 2008.

9
Figure 3a. Black and white photo of the Prasasti Kawali 1 stone.

Figure 3b. Inverse black and white photo of the Prasasti Kawali 1 stone.
Circled are ᮽ LETTER BHA and ᮻ LETTER REU.

10
Figure 4. The Old Sundanese Prasasti Kawali 3 inscription showing ᳀ PUNCTUATION BINDU SURYA

Figure 5. Old Sundanese manuscript showing ᳆ PUNCTUATION BINDU DA SATANGA


and ᳀ PUNCTUATION BINDU SURYA

Figure 6. The same Old Sundanese manuscript showing ᳆ PUNCTUATION BINDU DA SATANGA
and ᳀ PUNCTUATION BINDU PURNAMA

11
Figure 7. Old Sundanese manuscript showing ᳄ PUNCTUATION BINDU LEU SATANGA

Figure 8. Old Sundanese manuscript showing ᳁ PUNCTUATION BINDU PANGLONG

Figure 9. Old Sundanese manuscript showing ᳇ PUNCTUATION BINDU BA SATANGA looking like a furry
caterpillar. ᮘ LETTER BA is seen in the top line.

Figure 10. Old Sundanese manuscript showing a number of examples of ᳃ PUNCTUATION BINDU CAKRA

12
ͷȌ ° α patén (pamaéh)„‡”ˆ—‰•‹‡‰Š‹Žƒ‰ƒ
„—›‹˜‘ƒŽ’ƒ†ƒƒ•ƒ”ƒ†ƒ•ƒ”›ƒ‰‡†ƒŠ—Ž—‹›ƒǤ
= ka ÆαkǤ

‘–‘ŠǣαawakǮ„ƒ†ƒǯǤ

‹•ƒ’‹‰ ‹–—ǡ ƒ†ƒ ʹ …ƒ”ƒ ’‡ƒ†ƒ patén Š—•—•ǡ ›ƒ‹–—


ȋƒȌ„‡”ˆ—‰•‹—–—‡ƒ„ƒŠ„—›‹‘•‘ƒȀkȀ
’ƒ†ƒƒŠ‹”ƒ•ƒ”ƒ›ƒ‰†‹†ƒŠ—Ž—‹›ƒǡ…‘–‘Šǣ =
anak ‘ƒƒǯ; †ƒȋ„Ȍ„‡”ˆ—‰•‹—–—‡ƒ„ƒŠ„—›‹
‘•‘ƒȀmȀ’ƒ†ƒƒŠ‹”ƒ•ƒ”ƒ›ƒ‰†‹†ƒŠ—Ž—‹›ƒǡ…‘–‘Šǣ
= banem ǮŠ—–ƒǯǤ
͸Ȍ α ’‡‹•ƒŠ„—›‹‘•‘ƒ†ƒ˜‘ƒŽ†ƒŽƒ•ƒ–—•‹Žƒ„‹•Ǥ
‘–‘Šǣαalun-agungǮ‰‡Ž‘„ƒ‰’ƒ•ƒ‰ǯǤ

Figure 11. Discussion of FINAL K, FINAL M, and AVAGRAHA.

Figure 12. Discussion of archaic horizontal and vertical conjuncts,


and of CONSONANT SIGN PASANGAN MA and CONSONANT SIGN PASANGAN WA,
alongside the already-encoded CONSONANT SIGN PASANGAN RA and CONSONANT SIGN PASANGAN YA.

•ƒ α s ”ƒ•ƒ α

’ƒŽ•— α

Figure 13. Examples of two words written with a conjunct and written with the explicit killer.

13
Figure 14. Sample text from Suryani 2008.

Figure 15. Discussion of diacritic placement in Baidillah et al. 2008

14
Figure 16. Discussion of font design and example of sans-serif and serif fonts in the modern style.

15
A. Administrative
1. Title
Proposal for encoding additional Sundanese characters for Old Sundanese in the UCS.
2. Requester’s name
UC Berkeley Script Encoding Initiative (Universal Scripts Project)
3. Requester type (Member body/Liaison/Individual contribution)
Liaison contribution.
4. Submission date
2009-09-05
5. Requester’s reference (if applicable)
6. Choose one of the following:
6a. This is a complete proposal
Yes.
6b. More information will be provided later
No.

B. Technical – General
1. Choose one of the following:
1a. This proposal is for a new script (set of characters)
No.
Proposed name of script
1b. The proposal is for addition of character(s) to an existing block
Yes.
1c. Name of the existing block
Sundanese, Sundanese Supplement.
2. Number of characters in proposal
17.
3. Proposed category (A-Contemporary; B.1-Specialized (small collection); B.2-Specialized (large collection); C-Major extinct; D-Attested
extinct; E-Minor extinct; F-Archaic Hieroglyphic or Ideographic; G-Obscure or questionable usage symbols)
Category A.
4a. Is a repertoire including character names provided?
Yes.
4b. If YES, are the names in accordance with the “character naming guidelines” in Annex L of P&P document?
Yes.
4c. Are the character shapes attached in a legible form suitable for review?
Yes.
5a. Who will provide the appropriate computerized font (ordered preference: True Type, or PostScript format) for publishing the standard?
Michael Everson and “Tim Unicode Aksara Sunda”.
5b. If available now, identify source(s) for the font (include address, e-mail, ftp-site, etc.) and indicate the tools used:
Michael Everson, Fontographer.
6a. Are references (to other character sets, dictionaries, descriptive texts etc.) provided?
No.
6b. Are published examples of use (such as samples from newspapers, magazines, or other sources) of proposed characters attached?
Yes.
7. Special encoding issues: Does the proposal address other aspects of character data processing (if applicable) such as input, presentation,
sorting, searching, indexing, transliteration etc. (if yes please enclose information)?
Yes.
8. Additional Information: Submitters are invited to provide any additional information about Properties of the proposed Character(s) or
Script that will assist in correct understanding of and correct linguistic processing of the proposed character(s) or script.
See above.

C. Technical – Justification
1. Has this proposal for addition of character(s) been submitted before? If YES, explain.
No.
2a. Has contact been made to members of the user community (for example: National Body, user groups of the script or characters, other
experts, etc.)?
Yes.
2b. If YES, with whom?
Sai Zin Di Di Zone, Khwaan Tai, Sai Murngzuen Hengtai.
2c. If YES, available relevant documents
3. Information on the user community for the proposed characters (for example: size, demographics, information technology use, or
publishing use) is included?
People in West Java, Indonesia.
4a. The context of use for the proposed characters (type of use; common or rare)
Rare.
4b. Reference

16
5a. Are the proposed characters in current use by the user community?
Yes.
5b. If YES, where?
In Indonesia.
6a. After giving due considerations to the principles in the P&P document must the proposed characters be entirely in the BMP?
Yes.
6b. If YES, is a rationale provided?
Yes.
6c. If YES, reference
Keep with other Sundanese characters.
7. Should the proposed characters be kept together in a contiguous range (rather than being scattered)?
No.
8a. Can any of the proposed characters be considered a presentation form of an existing character or character sequence?
No.
8b. If YES, is a rationale for its inclusion provided?
8c. If YES, reference
9a. Can any of the proposed characters be encoded using a composed character sequence of either existing characters or other proposed
characters?
No.
9b. If YES, is a rationale for its inclusion provided?
9c. If YES, reference
10a. Can any of the proposed character(s) be considered to be similar (in appearance or function) to an existing character?
No.
10b. If YES, is a rationale for its inclusion provided?
10c. If YES, reference
11a. Does the proposal include use of combining characters and/or use of composite sequences?
Yes.
11b. If YES, is a rationale for such use provided?
Yes.
11c. If YES, reference
Brahmic vowel and consonant signs.
11d. Is a list of composite sequences and their corresponding glyph images (graphic symbols) provided?
No.
11e. If YES, reference
12a. Does the proposal contain characters with any special properties such as control function or similar semantics?
No.
12b. If YES, describe in detail (include attachment if necessary)
13a. Does the proposal contain any Ideographic compatibility character(s)?
No.
13b. If YES, is the equivalent corresponding unified ideographic character(s) identified?

17

You might also like