Content deleted Content added
Undid revision 1243073071 by Hellomtfk (talk) rm uncited |
Billjones94 (talk | contribs) No edit summary Tags: Mobile edit Mobile web edit Advanced mobile edit |
||
(13 intermediate revisions by 7 users not shown) | |||
Line 21:
* accents (so called because the acute, grave, and circumflex were originally used to indicate different types of [[pitch accent]]s in the [[polytonic transcription]] of [[Greek language|Greek]])
<!-- This list uses <span style="font-family: serif"> because of rendering limitation in Android (as of v13), that its default sans font fails to render "dotted circle + diacritic", so visitors just get a meaningless (to most) [X] mark. Please retain at least until the issue is resolved because this is a very large proportion of visitors. -->
** <span style="font-family: serif">{{char|◌́}}</span> – [[acute accent|acute]] ({{
** <span style="font-family: serif">{{char|◌̀}}</span> – [[grave accent|grave]]; for example {{char|ò}}
** <span style="font-family: serif">{{char|◌̂}}</span> – [[circumflex accent|circumflex]]; for example {{char|ô}}
Line 60:
** <span style="font-family: serif">{{char|◌̒}}</span> – [[inverted apostrophe]]
** <span style="font-family: serif">{{char|◌̔}}</span> – [[reversed apostrophe]]
** <span style="font-family: serif">{{char|◌̉}}</span> – [[hook above]] ({{
** <span style="font-family: serif">{{char|◌̛}}</span> – [[horn (diacritic)|horn]] ({{
* subscript curls
** <span style="font-family: serif">{{char|◌̦}}</span> – [[comma#Diacritical usage|undercomma]]; for example {{char|ș}}
Line 114:
These diacritics are used in addition to the acute, grave, and circumflex accents and the diaeresis:
* <span style="font-family: serif">{{char|◌ͺ}}</span> – [[iota subscript]] ({{lang|grc|ᾳ, εͅ, ῃ, ιͅ, οͅ, υͅ, ῳ}})
* <span style="font-family: serif">{{char|῾◌}}</span> – [[rough breathing]] ({{
* <span style="font-family: serif">{{char|᾿◌}}</span> – [[smooth breathing|smooth (or soft) breathing]] ({{
===Hebrew===
Line 121:
[[File:Example of biblical Hebrew trope.svg|thumb|upright=1.6|right|'''Genesis 1:9 "And God said, Let the waters be collected".'''<br>Letters in black, <span style="color:#CC0000;">[[niqqud]] in red</span>, <span style="color:#0000CC;">[[Hebrew cantillation|cantillation]] in blue</span>]]
* [[Niqqud]]
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
** {{
* Other
** {{
** {{
===Korean===
Line 186 ⟶ 187:
==Generation with computers==
{{main|Unicode input}}
[[File:Germanic umlaut on keyboard.jpg|thumb|German keyboard with umlaut letters]]
Modern computer technology was developed mostly in countries that speak Western European languages (particularly English), and many early binary encodings were developed with a bias favoring English{{mdash}}a language written without diacritical marks. With [[computer memory]] and [[computer storage]] at premium, early [[character set]]s were limited to the Latin alphabet, the ten digits and a few punctuation marks and conventional symbols. The American Standard Code for Information Interchange ([[ASCII]]), first published in 1963, encoded just 95 printable characters. It included just four free-standing diacritics{{mdash}}acute, grave, circumflex and tilde{{mdash}}which were to be used by backspacing and overprinting the base letter. The [[ISO/IEC 646]] standard (1967) defined national variations that replace some American graphemes with [[precomposed character]]s (such as {{angbr|é}}, {{angbr|è}} and {{angbr|ë}}), according to language{{mdash}}but remained limited to 95 printable characters.
Depending on the [[keyboard layout]], which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys; some are created by first pressing the key with the [[Combining character|diacritic mark]] followed by the letter to place it on. Such a key is sometimes referred to as a [[dead key]], as it produces no output of its own but modifies the output of the key pressed after it.▼
▲Depending on the [[keyboard layout]]
==Languages with letters containing diacritics==
Line 207:
:* Following spelling reforms since the 1970s, [[Scottish Gaelic]] uses graves only, which can be used on any vowel ({{angbr|[[à]]}}, {{angbr|[[è]]}}, {{angbr|[[ì]]}}, {{angbr|[[ò]]}}, {{angbr|[[ù]]}}). Formerly acute accents could be used on {{angbr|á}}, {{angbr|ó}} and {{angbr|é}}, which were used to indicate a specific vowel quality. With the elimination of these accents, the new orthography relies on the reader having prior knowledge of pronunciation of a given word.
:* [[Manx language|Manx]] uses the cedilla diacritic {{angbr|[[ç]]}} combined with h to give the digraph {{angle bracket|çh}} (pronounced {{IPA|/tʃ/}}) to mark the distinction between it and the digraph {{angle bracket|ch}} (pronounced {{IPA|/h/}} or {{IPA|/x/}}). Other diacritics used in Manx included the circumflex and diaeresis, as in {{angbr|â}}, {{angbr|ê}}, {{angbr|ï}}, etc. to mark the distinction between two similarly spelled words but with slightly differing pronunciation.
:* [[Irish language|Irish]] uses only acute accents to mark long vowels, following the 1948 spelling reform. [[Lenition]] is indicated using an [[overdot]] in [[Gaelic type]] ({{angbr|[[ċ]]}},{{angbr|ḋ}},{{angbr|ḟ}}, {{angbr|[[ġ]]}}, {{angbr|ṁ}}, {{angbr|ṗ}}, {{angbr|[[ṡ]]}}, {{angbr|ṫ}}); in [[Roman type]], a suffixed {{angbr|h}} is used. Thus, <span style="font-family:Duibhlinn, Ceanannas, Corcaigh, sans-serif">{{lang|gv|a ṁáṫair}}</span> is equivalent to <span style="font-family:Times New Roman, serif">{{lang|gv|a mháthair}}</span>.
:* [[Breton orthography|Breton]] does not have a single orthography (spelling system), but uses diacritics for a number of purposes. The diaeresis is used to mark that two vowels are pronounced separately and not as a diphthong/digraph. The circumflex is used to mark long vowels, but usually only when the vowel length is not predictable by phonology. Nasalization of vowels may be marked with a tilde, or following the vowel with the letter {{angbr|ñ}}. The plural suffix -où is used as a unified spelling to represent a suffix with a number of pronunciations in different dialects, and to distinguish this suffix from the digraph {{angbr|ou}} which is pronounced as {{IPA|/u:/}}. An apostrophe is used to distinguish {{angbr|c'h}}, pronounced {{IPA|/x/}} as the digraph {{angbr|ch}} is used in other Celtic languages, from the French-influenced digraph ch, pronounced {{IPA|/ʃ/}}.
Line 218:
====Germanic====
:* [[German orthography|German]] uses the [[two dots (diacritic)|two-dots diacritic]] ({{langx|de|[[Umlaut (diacritic)|umlaut]]}}): letters {{Angbr|[[ä]]}}, {{angbr|[[ö]]}}, {{angbr|[[ü]]}}, used to indicate the [[fronting (phonology)|fronting]] of back vowels (see [[umlaut (linguistics)]]).
:* [[Dutch orthography|Dutch]] uses
:* [[Afrikaans alphabet|Afrikaans]] uses 16 additional
:* [[Faroese alphabet|Faroese]] uses
:* [[Icelandic orthography|Icelandic]] uses acutes and other
:* [[Danish alphabet|Danish]] and [[Norwegian language|Norwegian]] use additional characters like the o-slash
:* [[Swedish alphabet|Swedish]] uses a-diaeresis (
====Romance====
:* In [[Asturian language|Asturian]], [[Galician language|Galician]] and [[Spanish alphabet|Spanish]], the character
:* [[Asturian language|Asturian]] uses an underdot: {{angbr|[[Ḷ]]}} ([[lower case]],
:* [[Catalan language|Catalan]] uses the acute accent
::* In [[Valencian language|Valencian]], the circumflex {{angbr|â}}, {{angbr|ê}}, {{angbr|î}}, {{angbr|ô}}, {{angbr|û}} may also be used.
:* [[Corsican language|Corsican]] uses the following in [[Corsican alphabet|its alphabet]]: {{angbr|À}}/{{angbr|à}}, {{angbr|È}}/{{angbr|è}}, {{angbr|Ì}}/{{angbr|ì}}, {{angbr|Ò}}/{{angbr|ò}}, {{angbr|Ù}}/{{angbr|ù}}.
:* [[
:* [[
:* [[
:* [[Portuguese language|Portuguese]] uses a tilde with the vowels {{angbr|a}} and {{angbr|o}} and a cedilla with c.
:* [[Romanian alphabet|Romanian]] uses a [[breve]] on the letter ''a'' ( :* [[Spanish language|Spanish]] uses acute accents (
====Slavic====
:* [[Gaj's Latin alphabet
:* The [[Czech alphabet]] uses the acute (á é í ó ú ý), caron ([[č]] [[ď]] [[ě]] [[ň]] [[ř]] [[š]] [[ť]] [[ž]]), and for one letter ([[ů]]) the ring. (In ď and ť the caron is modified to look rather like an apostrophe.) Letter with caron are considered separate letters, whereas vowels are considered only as longer variants of the unaccented letters. Acute does not affect alphabetical order, letters with caron are ordered after original counterparts.
:* [[Polish alphabet|Polish]] has the following letters: [[ą]] [[ć]] [[ę]] [[ł]] [[ń]] [[ó]] [[ś]] [[ź]] [[ż]]. These are considered to be separate letters: each of them is placed in the alphabet immediately after its Latin counterpart (e.g. {{angbr|ą}} between {{angbr|a}} and {{angbr|b}}), {{angbr|ź}} and {{angbr|ż}} are placed after {{angbr|z}} in that order.
:* The [[Serbian Cyrillic alphabet|Serbian Cyrillic]] alphabet has no diacritics, instead it has a grapheme ([[glyph]]) for every letter of [[Gaj's Latin alphabet|its Latin
:* The [[Slovak alphabet]] uses the acute (á é í ó ú ý [[ĺ]] [[ŕ]]), caron (č ď ľ ň š ť ž dž), umlaut (ä) and circumflex accent (ô). All of those are considered separate letters and are placed directly after the original counterpart in the [[Slovak alphabet|alphabet]].<ref name="PSP2000">http://www.juls.savba.sk/ediela/psp2000/psp.pdf page 12, section I.2</ref>
:* The basic [[Slovenian alphabet]] has the symbols {{angbr|[[č]]}}, {{angbr|[[š]]}}, and {{angbr|[[ž]]}}, which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. Letters with a [[caron]] are placed right after the letters as written without the diacritic. The letter {{angbr|đ}} ('d with bar') may be used in non-transliterated foreign words, particularly names, and is placed after {{angbr|č}} and before {{angbr|d}}.
Line 267 ⟶ 268:
===Cyrillic letters===
{{
:*[[Belarusian alphabet|Belarusian]] and [[Uzbek alphabet#Correspondence chart|Uzbek Cyrillic]] have a letter {{angbr|[[Short U (Cyrillic)|ў]]}}.
:* Belarusian, [[Bulgarian language#Alphabet|Bulgarian]], Russian and Ukrainian have the letter {{angbr|[[Short I|й]]}}.
Line 281 ⟶ 282:
===English===
{{main article|English terms with diacritical marks}}
[[English alphabet|English]] is one of the few European languages that does not have many words that contain diacritical marks. Instead, digraphs are the main way the Modern English alphabet adapts the Latin to its phonemes. Exceptions are unassimilated foreign loanwords, including borrowings from [[French language|French]] (and, increasingly, [[Spanish language|Spanish]], like ''jalapeño'' and ''piñata''); however, the diacritic is also sometimes omitted from such words. Loanwords that frequently appear with the diacritic in English include ''café'', ''résumé'' or ''resumé'' (a usage that helps distinguish it from the verb ''resume''), ''soufflé'', and ''naïveté'' (see ''[[English terms with diacritical marks]]''). In older practice (and even among some orthographically
English speakers and writers once used the diaeresis more often than now in words such as ''coöperation'' (from Fr. ''coopération''), ''zoölogy'' (from Grk. ''zoologia''), and ''seeër'' (now more commonly ''see-er ''or simply'' seer'') as a way of indicating that adjacent vowels belonged to separate syllables, but this practice has become far less common. ''[[The New Yorker]]'' magazine is a major publication that continues to use the diaeresis in place of a hyphen for clarity and economy of space.<ref>{{cite magazine|last=Norris|first=Mary|title=The Curse of the Diaeresis|url=http://www.newyorker.com/online/blogs/culture/2012/04/the-curse-of-the-diaeresis.html|magazine=The New Yorker|date=26 April 2012|access-date=18 April 2014}}</ref>
Line 301 ⟶ 302:
* [[Filipino alphabet|Filipino]] has the following composite characters: ''á, à, â, é, è, ê, í, ì, î, ó, ò, ô, ú, ù, û''. Everyday use of diacritics for Filipino is, however, uncommon, and meant only to distinguish between [[homonym]]s between a word with the usual [[penult]]imate stress and one with a different stress placement. This aids both comprehension and pronunciation if both are relatively adjacent in a text, or if a word is itself ambiguous in meaning. The letter ''ñ'' ("''eñe''") is not a ''n'' with a diacritic, but rather collated as a separate letter, one of eight borrowed from Spanish. Diacritics appear in [[Spanish language in the Philippines|Spanish]] [[List of loanwords in Tagalog#Spanish|loanwords]] and [[Filipino name|names]] observing Spanish orthography rules.
* [[Finnish alphabet|Finnish]]. Carons in ''š'' and ''ž'' appear only in foreign proper names and [[loanword]]s, but may be substituted with ''sh'' or ''zh'' if and only if it is technically impossible to produce accented letters in the medium. Contrary to Estonian, ''š'' and ''ž'' are not considered distinct letters in Finnish.
* [[French alphabet|French]] uses five diacritics. The grave (''accent grave'') marks the sound {{IPA|/ɛ/}} when over an e, as in ''père'' ("father") or is used to distinguish words that are otherwise homographs such as ''a''/''à'' ("has"/"to") or ''ou''/''où'' ("or"/"where"). The [[acute accent|acute]] (''accent aigu'') is only used in "é", modifying the "e" to make the sound {{IPA|/e/}}, as in ''étoile'' ("star"). The [[circumflex]] (''accent circonflexe'') generally denotes that an S once followed the vowel in Old French or Latin, as in ''fête'' ("party"), the Old French being ''feste'' and the Latin being ''festum''. Whether the circumflex modifies the vowel's pronunciation depends on the dialect and the vowel. The [[cedilla]] (''cédille'') indicates that a normally hard "c" (before the vowels "a", "o", and "u") is to be pronounced {{IPA|/s/}}, as in ''ça'' ("that"). The diaeresis diacritic ({{
* [[Galician language|Galician]] vowels can bear an acute (''á, é, í, ó, ú'') to indicate stress or difference between two otherwise same written words (''é'', 'is' vs. ''e'', 'and'), but the diaeresis is only used with ''ï'' and ''ü'' to show two separate vowel sounds in pronunciation. Only in foreign words may Galician use other diacritics such as ''ç'' (common during the Middle Ages), ''ê'', or ''à''.
* [[German alphabet|German]] uses the three umlauted characters ''ä'', ''ö'' and ''ü''. These diacritics indicate vowel changes. For instance, the word ''Ofen'' {{IPA|de|ˈoːfən|}} "oven" has the plural ''Öfen'' {{IPA|[ˈøːfən]}}. The mark originated as a superscript ''e''; a handwritten blackletter ''e'' resembles two parallel vertical lines, like a diaeresis. Due to this history, "ä", "ö" and "ü" can be written as "ae", "oe" and "ue" respectively, if the umlaut letters are not available.
Line 348 ⟶ 349:
{{blockquote|
{{
{{vpad|1=2em}}
}}
Line 620 ⟶ 621:
==External links==
* [http://urtd.net/projects/cod/ Context of Diacritics
* [http://diacritics.typo.cz/ Diacritics Project]
* [https://www.unicode.org/ Unicode]
|