U2000 - General Punctuation


General Punctuation

Range: 2000–206F
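The range above can be walked programmatically. The following is a minimal Python sketch, not part of the chart, that lists the assigned code points of the block using the standard `unicodedata` module. The helper name `general_punctuation_block` is hypothetical, and the result assumes an interpreter whose Unicode data is version 8.0 or later.

```python
import unicodedata

BLOCK_START, BLOCK_END = 0x2000, 0x206F  # range stated in the chart

def general_punctuation_block():
    """Return (code point, general category, name) for each assigned code point."""
    rows = []
    for cp in range(BLOCK_START, BLOCK_END + 1):
        ch = chr(cp)
        category = unicodedata.category(ch)
        if category == "Cn":  # unassigned in this interpreter's Unicode data
            continue
        rows.append((cp, category, unicodedata.name(ch, "<no name>")))
    return rows

# Show the first few entries of the block.
for cp, category, name in general_punctuation_block()[:3]:
    print(f"{cp:04X} {category} {name}")
```

Filtering on the `Cn` category rather than on a missing name keeps the sketch independent of how names are reported for format characters.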

This file contains an excerpt from the character code tables and list of character names for
The Unicode Standard, Version 8.0

This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard.
See http://www.unicode.org/errata/ for an up-to-date list of errata.

See http://www.unicode.org/charts/ for access to a complete list of the latest character code charts.
See http://www.unicode.org/charts/PDF/Unicode-8.0/ for charts showing only the characters added in Unicode 8.0.
See http://www.unicode.org/Public/8.0.0/charts/ for a complete archived file of character code charts for Unicode 8.0.

Disclaimer
These charts are provided as the online reference to the character contents of the Unicode Standard, Version 8.0 but do
not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete
understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode
Standard, Version 8.0, online at http://www.unicode.org/versions/Unicode8.0.0/, as well as Unicode Standard Annexes #9,
#11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, and #45, the other Unicode Technical Reports and Standards, and the
Unicode Character Database, which are available online.

See http://www.unicode.org/ucd/ and http://www.unicode.org/reports/

A thorough understanding of the information contained in these additional sources is required for a successful
implementation.

Fonts
The shapes of the reference glyphs used in these code charts are not prescriptive. Considerable variation is to be
expected in actual fonts. The particular fonts used in these charts were provided to the Unicode Consortium by a number
of different font designers, who own the rights to the fonts.

See http://www.unicode.org/charts/fonts.html for a list.

Terms of Use
You may freely use these code charts for personal or internal business uses only. You may not incorporate them either
wholly or in part into any product or publication, or otherwise distribute them without express written permission from
the Unicode Consortium. However, you may provide links to these charts.

The fonts and font data used in production of these code charts may NOT be extracted, or used in any other way in any
product or publication, without permission or license granted by the typeface owner(s).

The Unicode Consortium is not liable for errors or omissions in this file or the standard itself. Information on characters
added to the Unicode Standard since the publication of the most recent version of the Unicode Standard, as well as on
characters currently being considered for addition to the Unicode Standard can be found on the Unicode web site.

See http://www.unicode.org/pending/pending.html and http://www.unicode.org/alloc/Pipeline.html.

Copyright © 1991-2015 Unicode, Inc. All rights reserved.


2000 General Punctuation 206F

200 201 202 203 204 205 206

0  ‐ † ‰ ⁀ ⁐ 
2000 2010 2020 2030 2040 2050 2060

1  ‡ ‱ ⁁ ⁑ 
2001 2011 2021 2031 2041 2051 2061

2  ‒ • ′ ⁂ ⁒ 
2002 2012 2022 2032 2042 2052 2062

3  – ‣ ″ ⁃ ⁓ 
2003 2013 2023 2033 2043 2053 2063

4  — ․ ‴ ⁄ ⁔
2004 2014 2024 2034 2044 2054 2064

5 ― ‥ ‵ ⁅ ⁕
2005 2015 2025 2035 2045 2055

6  ‖ … ‶ ⁆ ⁖ 
2006 2016 2026 2036 2046 2056 2066

7  ‗ ‧ ‷ ⁇ ⁗ 
2007 2017 2027 2037 2047 2057 2067

8  ‘  ‸ ⁈ ⁘ 
2008 2018 2028 2038 2048 2058 2068

9  ’  ‹ ⁉ ⁙ 
2009 2019 2029 2039 2049 2059 2069

A  ‚  › ⁊ ⁚ 
200A 201A 202A 203A 204A 205A 206A

B  ‛  ※ ⁋ ⁛ 
200B 201B 202B 203B 204B 205B 206B

C  “  ‼ ⁌ ⁜ 
200C 201C 202C 203C 204C 205C 206C

D  ”  ‽ ⁍ ⁝ 
200D 201D 202D 203D 204D 205D 206D

E  „  ‾ ⁎ ⁞ 
200E 201E 202E 203E 204E 205E 206E

F  ‟  ‿ ⁏ 
200F 201F 202F 203F 204F 205F 206F

The Unicode Standard 8.0, Copyright © 1991-2015 Unicode, Inc. All rights reserved.

For additional general punctuation characters see also Basic Latin, Latin-1, Supplemental Punctuation and CJK Symbols and Punctuation.

Spaces
2000 EN QUAD
≡ 2002 en space
2001 EM QUAD
= mutton quad
≡ 2003 em space
2002 EN SPACE
= nut
• half an em
≈ 0020 space
2003 EM SPACE
= mutton
• nominally, a space equal to the type size in points
• may scale by the condensation factor of a font
≈ 0020 space
2004 THREE-PER-EM SPACE
= thick space
≈ 0020 space
2005 FOUR-PER-EM SPACE
= mid space
≈ 0020 space
2006 SIX-PER-EM SPACE
• in computer typography sometimes equated to thin space
≈ 0020 space
2007 FIGURE SPACE
• space equal to tabular width of a font
• this is equivalent to the digit width of fonts with fixed-width digits
≈ <noBreak> 0020
2008 PUNCTUATION SPACE
• space equal to narrow punctuation of a font
≈ 0020 space
2009 THIN SPACE
• a fifth of an em (or sometimes a sixth)
→ 202F narrow no-break space
≈ 0020 space
200A HAIR SPACE
• thinner than a thin space
• in traditional typography, the thinnest space available
≈ 0020 space

Format characters
200B ZERO WIDTH SPACE
• commonly abbreviated ZWSP
• this character is intended for invisible word separation and for line break control; it has no width, but its presence between two characters does not prevent increased letter spacing in justification
200C ZERO WIDTH NON-JOINER
• commonly abbreviated ZWNJ
200D ZERO WIDTH JOINER
• commonly abbreviated ZWJ
200E LEFT-TO-RIGHT MARK
• commonly abbreviated LRM
200F RIGHT-TO-LEFT MARK
• commonly abbreviated RLM
→ 061C arabic letter mark

Dashes
2010 ‐ HYPHEN
→ 002D - hyphen-minus
→ 00AD soft hyphen
2011 NON-BREAKING HYPHEN
→ 002D - hyphen-minus
→ 00AD soft hyphen
≈ <noBreak> 2010 ‐
2012 ‒ FIGURE DASH
2013 – EN DASH
2014 — EM DASH
• may be used in pairs to offset parenthetical text
→ 2E3A ⸺ two-em dash
→ 30FC ー katakana-hiragana prolonged sound mark
2015 ― HORIZONTAL BAR
= quotation dash
• long dash introducing quoted text

General punctuation
2016 ‖ DOUBLE VERTICAL LINE
• used in pairs to indicate norm of a matrix
→ 20E6 ◌⃦ combining double vertical stroke overlay
→ 2225 ∥ parallel to
→ 23F8 ⏸ double vertical bar
2017 ‗ DOUBLE LOW LINE
• this is a spacing character
→ 005F _ low line
→ 0333 ◌̳ combining double low line
≈ 0020 0333 ◌̳

Quotation marks and apostrophe
Use of quotation marks differs by language. The character names cannot reflect actual usage for all languages.
2018 ‘ LEFT SINGLE QUOTATION MARK
= single turned comma quotation mark
• this is the preferred character (as opposed to 201B ‛ )
→ 0027 ' apostrophe
→ 02BB ʻ modifier letter turned comma
→ 275B ❛ heavy single turned comma quotation mark ornament
2019 ’ RIGHT SINGLE QUOTATION MARK
= single comma quotation mark
• this is the preferred character to use for apostrophe
→ 0027 ' apostrophe
→ 02BC ʼ modifier letter apostrophe
→ 275C ❜ heavy single comma quotation mark ornament
201A ‚ SINGLE LOW-9 QUOTATION MARK
= low single comma quotation mark
• used as opening single quotation mark in some languages
201B ‛ SINGLE HIGH-REVERSED-9 QUOTATION MARK
= single reversed comma quotation mark
• has same semantic as 2018 ‘ , but differs in appearance
→ 02BD ʽ modifier letter reversed comma
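The space and format-character entries can be explored with a short Python sketch. This is an illustration under stated assumptions, not part of the chart: the helper names `fold_spaces` and `strip_zero_width` are hypothetical, and the behavior relies on the standard `unicodedata` module. Removing ZWNJ and ZWJ can change how some scripts render, so the second helper is only appropriate where that loss is acceptable.

```python
import unicodedata

def fold_spaces(text):
    """Replace every space separator (general category Zs) with U+0020."""
    return "".join(" " if unicodedata.category(ch) == "Zs" else ch for ch in text)

def strip_zero_width(text):
    """Drop the format characters 200B..200F (ZWSP, ZWNJ, ZWJ, LRM, RLM)."""
    return "".join(ch for ch in text if not 0x200B <= ord(ch) <= 0x200F)

sample = "em\u2003space,\u2009thin\u200bzero"
print(repr(fold_spaces(sample)))        # typographic spaces become U+0020
print(repr(strip_zero_width(sample)))   # the ZWSP is removed, spaces are kept

# The mappings marked with ≈ and ≡ in the list are exposed as decomposition data.
print(unicodedata.decomposition("\u2007"))  # FIGURE SPACE
print(unicodedata.decomposition("\u2001"))  # EM QUAD
```

The zero-width characters have general category `Cf`, not `Zs`, which is why the two helpers are kept separate.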


201C “ LEFT DOUBLE QUOTATION MARK
= double turned comma quotation mark
• this is the preferred character (as opposed to 201F ‟ )
→ 0022 " quotation mark
→ 275D ❝ heavy double turned comma quotation mark ornament
→ 301D 〝 reversed double prime quotation mark
201D ” RIGHT DOUBLE QUOTATION MARK
= double comma quotation mark
→ 0022 " quotation mark
→ 2033 ″ double prime
→ 275E ❞ heavy double comma quotation mark ornament
→ 301E 〞 double prime quotation mark
201E „ DOUBLE LOW-9 QUOTATION MARK
= low double comma quotation mark
• used as opening double quotation mark in some languages
→ 2E42 ⹂ double low-reversed-9 quotation mark
→ 301F 〟 low double prime quotation mark
201F ‟ DOUBLE HIGH-REVERSED-9 QUOTATION MARK
= double reversed comma quotation mark
• has same semantic as 201C “ , but differs in appearance

General punctuation
2020 † DAGGER
= obelisk, long cross, oblong cross
→ 2E38 ⸸ turned dagger
2021 ‡ DOUBLE DAGGER
= diesis, double obelisk
2022 • BULLET
= black small circle
→ 00B7 · middle dot
→ 2024 ․ one dot leader
→ 2219 ∙ bullet operator
→ 25D8 ◘ inverse bullet
→ 25E6 ◦ white bullet
2023 ‣ TRIANGULAR BULLET
→ 220E ∎ end of proof
→ 25B8 ▸ black right-pointing small triangle
2024 ․ ONE DOT LEADER
• also used as an Armenian semicolon (mijaket)
→ 00B7 · middle dot
→ 2022 • bullet
→ 2219 ∙ bullet operator
≈ 002E . full stop
2025 ‥ TWO DOT LEADER
≈ 002E . 002E .
2026 … HORIZONTAL ELLIPSIS
= three dot leader
→ 22EE ⋮ vertical ellipsis
→ FE19 presentation form for vertical horizontal ellipsis
≈ 002E . 002E . 002E .
2027 ‧ HYPHENATION POINT
• visible symbol used to indicate correct positions for word breaking, as in dic·tion·ar·ies

Format characters
2028 LINE SEPARATOR
• may be used to represent this semantic unambiguously
2029 PARAGRAPH SEPARATOR
• may be used to represent this semantic unambiguously
202A LEFT-TO-RIGHT EMBEDDING
• commonly abbreviated LRE
202B RIGHT-TO-LEFT EMBEDDING
• commonly abbreviated RLE
202C POP DIRECTIONAL FORMATTING
• commonly abbreviated PDF
202D LEFT-TO-RIGHT OVERRIDE
• commonly abbreviated LRO
202E RIGHT-TO-LEFT OVERRIDE
• commonly abbreviated RLO
202F NARROW NO-BREAK SPACE
• commonly abbreviated NNBSP
• a narrow form of a no-break space, typically the width of a thin space or a mid space
→ 00A0 no-break space
→ 2005 four-per-em space
→ 2009 thin space
≈ <noBreak> 0020

General punctuation
2030 ‰ PER MILLE SIGN
= permille, per thousand
• used, for example, in measures of blood alcohol content, salinity, etc.
→ 0025 % percent sign
→ 0609 arabic-indic per mille sign
2031 ‱ PER TEN THOUSAND SIGN
= permyriad
• percent of a percent, rarely used
→ 0025 % percent sign
→ 060A arabic-indic per ten thousand sign
2032 ′ PRIME
= minutes, feet
→ 0027 ' apostrophe
→ 00B4 ´ acute accent
→ 02B9 ʹ modifier letter prime
2033 ″ DOUBLE PRIME
= seconds, inches
→ 0022 " quotation mark
→ 02BA ʺ modifier letter double prime
→ 201D ” right double quotation mark
→ 3003 〃 ditto mark
→ 301E 〞 double prime quotation mark
≈ 2032 ′ 2032 ′
2034 ‴ TRIPLE PRIME
= lines (old measure, 1/12 of an inch)
≈ 2032 ′ 2032 ′ 2032 ′
2035 ‵ REVERSED PRIME
→ 0060 ` grave accent
2036 ‶ REVERSED DOUBLE PRIME
→ 301D 〝 reversed double prime quotation mark
≈ 2035 ‵ 2035 ‵
2037 ‷ REVERSED TRIPLE PRIME
≈ 2035 ‵ 2035 ‵ 2035 ‵
2038 ‸ CARET
→ 2303 ⌃ up arrowhead
→ A788 ꞈ modifier letter low circumflex accent
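The notes on LINE SEPARATOR and PARAGRAPH SEPARATOR can be illustrated with a small Python sketch. It is an assumption-laden example rather than anything prescribed by the chart: the helper name `split_paragraphs` is hypothetical, and the sample text is invented for the demonstration.

```python
import unicodedata

LINE_SEPARATOR = "\u2028"       # general category Zl
PARAGRAPH_SEPARATOR = "\u2029"  # general category Zp

def split_paragraphs(text):
    """Split on U+2029 first, then split each paragraph into lines on U+2028."""
    return [para.split(LINE_SEPARATOR) for para in text.split(PARAGRAPH_SEPARATOR)]

text = "line one\u2028line two\u2029second paragraph"
print(split_paragraphs(text))

# Python's built-in str.splitlines() also treats both characters as line
# boundaries, but it does not distinguish lines from paragraphs.
print(text.splitlines())

# The embedding and override controls report their abbreviations as bidi classes.
print([unicodedata.bidirectional(chr(cp)) for cp in range(0x202A, 0x202F)])
```

Splitting in two passes keeps the line versus paragraph distinction that a single newline character cannot express.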


Quotation marks
2039 ‹ SINGLE LEFT-POINTING ANGLE QUOTATION MARK
= left pointing single guillemet
• usually opening, sometimes closing
→ 003C < less-than sign
→ 2329 〈 left-pointing angle bracket
→ 3008 〈 left angle bracket
203A › SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
= right pointing single guillemet
• usually closing, sometimes opening
→ 003E > greater-than sign
→ 232A 〉 right-pointing angle bracket
→ 3009 〉 right angle bracket

General punctuation
203B ※ REFERENCE MARK
= Japanese kome
= Urdu paragraph separator
→ 0FBF ྿ tibetan ku ru kha bzhi mig can
→ 200AD 𠂭 cjk unified ideograph-200AD

Double punctuation for vertical text
203C ‼ DOUBLE EXCLAMATION MARK
→ 0021 ! exclamation mark
⁓ 203C FE0E text style
⁓ 203C FE0F emoji style
≈ 0021 ! 0021 !

General punctuation
203D ‽ INTERROBANG
→ 0021 ! exclamation mark
→ 003F ? question mark
→ 2E18 ⸘ inverted interrobang
→ 1F679 🙹 heavy interrobang ornament
203E ‾ OVERLINE
= spacing overscore
≈ 0020 0305 ◌̅
203F ‿ UNDERTIE
= Greek enotikon
→ 2323 ⌣ smile
2040 ⁀ CHARACTER TIE
= z notation sequence concatenation
→ 2322 ⌢ frown
2041 ⁁ CARET INSERTION POINT
• proofreader’s mark: insert here
→ 22CC ⋌ right semidirect product
2042 ⁂ ASTERISM
2043 ⁃ HYPHEN BULLET
→ 002D - hyphen-minus
2044 ⁄ FRACTION SLASH
= solidus (in typography)
• for composing arbitrary fractions
→ 002F / solidus
→ 2215 ∕ division slash
2045 ⁅ LEFT SQUARE BRACKET WITH QUILL
2046 ⁆ RIGHT SQUARE BRACKET WITH QUILL

Double punctuation for vertical text
2047 ⁇ DOUBLE QUESTION MARK
≈ 003F ? 003F ?
2048 ⁈ QUESTION EXCLAMATION MARK
≈ 003F ? 0021 !
2049 ⁉ EXCLAMATION QUESTION MARK
⁓ 2049 FE0E text style
⁓ 2049 FE0F emoji style
≈ 0021 ! 003F ?

General punctuation
204A ⁊ TIRONIAN SIGN ET
• Irish Gaelic, Old English, ...
→ 0026 & ampersand
→ 1F670 🙰 script ligature et ornament
204B ⁋ REVERSED PILCROW SIGN
→ 00B6 ¶ pilcrow sign
204C ⁌ BLACK LEFTWARDS BULLET
204D ⁍ BLACK RIGHTWARDS BULLET
204E ⁎ LOW ASTERISK
→ 002A * asterisk
→ 0359 ◌͙ combining asterisk below
204F ⁏ REVERSED SEMICOLON
• also used in Sindhi
→ 003B ; semicolon
→ 061B arabic semicolon
2050 ⁐ CLOSE UP
• editing mark
→ AB5B ꭛ modifier breve with inverted breve
2051 ⁑ TWO ASTERISKS ALIGNED VERTICALLY
2052 ⁒ COMMERCIAL MINUS SIGN
= abzüglich (German), med avdrag av (Swedish), piska (Swedish, "whip")
• a common glyph variant and fallback representation looks like ./.
• may also be used as a dingbat to indicate correctness
• used in Finno-Ugric Phonetic Alphabet to indicate a related borrowed form with different sound
→ 0025 % percent sign
→ 066A arabic percent sign
→ 00F7 ÷ division sign
2053 ⁓ SWUNG DASH
→ 007E ~ tilde
2054 ⁔ INVERTED UNDERTIE
2055 ⁕ FLOWER PUNCTUATION MARK
= phul, puspika
• used as a punctuation mark with Syloti Nagri, Bengali and other Indic scripts
→ 274B ❋ heavy eight teardrop-spoked propeller asterisk

Archaic punctuation
2056 ⁖ THREE DOT PUNCTUATION

General punctuation
2057 ⁗ QUADRUPLE PRIME
≈ 2032 ′ 2032 ′ 2032 ′ 2032 ′

Archaic punctuation
2058 ⁘ FOUR DOT PUNCTUATION
2059 ⁙ FIVE DOT PUNCTUATION
= Greek pentonkion
= quincunx
→ 2684 ⚄ die face-5
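The ≈ lines for the double punctuation characters correspond to compatibility decomposition mappings, which the following Python sketch reads back from the standard `unicodedata` module. The helper name `compat_expansion` is hypothetical and the sketch is only an illustration of how those mappings can be inspected.

```python
import unicodedata

def compat_expansion(ch):
    """Return the characters in ch's decomposition mapping, or None if it has none."""
    fields = unicodedata.decomposition(ch).split()
    if not fields:
        return None
    if fields[0].startswith("<"):
        fields = fields[1:]  # drop a formatting tag such as <compat>
    return "".join(chr(int(field, 16)) for field in fields)

for ch in "\u203c\u2047\u2048\u2049":
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {compat_expansion(ch)!r}")

# FRACTION SLASH has no decomposition; it is placed between digit runs instead.
three_quarters = "3" + "\u2044" + "4"
print(compat_expansion("\u2044"), three_quarters)
```

Whether a fraction built this way is drawn as a stacked or slanted fraction depends on the font and rendering system, which this sketch does not control.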


205A ⁚ TWO DOT PUNCTUATION
• historically used to indicate the end of a sentence or change of speaker
• extends from baseline to cap height
→ FE30 ︰ presentation form for vertical two dot leader
→ 1015B 𐅛 greek acrophonic epidaurean two
205B ⁛ FOUR DOT MARK
• used by scribes in the margin as highlighter mark
• this is centered on the line, but extends beyond top and bottom of the line
205C ⁜ DOTTED CROSS
• used by scribes in the margin as highlighter mark
205D ⁝ TRICOLON
= Epidaurean acrophonic symbol three
→ 22EE ⋮ vertical ellipsis
→ 2AF6 ⫶ triple colon operator
→ FE19 presentation form for vertical horizontal ellipsis
205E ⁞ VERTICAL FOUR DOTS
• used in dictionaries to indicate legal but undesirable word break
• glyph extends the whole height of the line
→ 2E3D ⸽ vertical six dots

Space
205F MEDIUM MATHEMATICAL SPACE
• abbreviated MMSP
• four-eighteenths of an em
≈ 0020 space

Format character
2060 WORD JOINER
• commonly abbreviated WJ
• a zero width non-breaking space (only)
• intended for disambiguation of functions for byte order mark
→ FEFF zero width no-break space

Invisible operators
2061 FUNCTION APPLICATION
• contiguity operator indicating application of a function
2062 INVISIBLE TIMES
• contiguity operator indicating multiplication
2063 INVISIBLE SEPARATOR
= invisible comma
• contiguity operator indicating that adjacent mathematical symbols form a list, e.g. when no visible comma is used between multiple indices
2064 INVISIBLE PLUS
• contiguity operator indicating addition

Format characters
2066 LEFT-TO-RIGHT ISOLATE
2067 RIGHT-TO-LEFT ISOLATE
2068 FIRST STRONG ISOLATE
2069 POP DIRECTIONAL ISOLATE

Deprecated
Use of these characters is strongly discouraged.
206A INHIBIT SYMMETRIC SWAPPING
206B ACTIVATE SYMMETRIC SWAPPING
206C INHIBIT ARABIC FORM SHAPING
206D ACTIVATE ARABIC FORM SHAPING
206E NATIONAL DIGIT SHAPES
206F NOMINAL DIGIT SHAPES
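Because the format characters and invisible operators in this block have no visible glyph, it can help to locate them in text explicitly. The following Python sketch is one possible approach, not a method defined by the chart: the helper name `format_controls` and the `DEPRECATED` constant are hypothetical, and the scan is limited to the range 2000..206F.

```python
import unicodedata

DEPRECATED = range(0x206A, 0x2070)  # 206A..206F, listed as deprecated in the chart

def format_controls(text):
    """Report each format character (category Cf) from 2000..206F found in text.

    Each entry is (index, code point label, name, bidi class, is_deprecated).
    """
    found = []
    for index, ch in enumerate(text):
        cp = ord(ch)
        if 0x2000 <= cp <= 0x206F and unicodedata.category(ch) == "Cf":
            found.append((
                index,
                f"U+{cp:04X}",
                unicodedata.name(ch),
                unicodedata.bidirectional(ch),
                cp in DEPRECATED,
            ))
    return found

sample = "a\u2066b\u2069\u2060c\u206a"
for entry in format_controls(sample):
    print(entry)
```

A caller could use the final flag to warn about the deprecated characters while leaving isolates and the word joiner in place.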

