Closes #579 - update howto/unicode.po to reflect recent changes

flowdas · flowdas · commit 00fbac205290 · 2020-01-06T09:42:51.000+09:00
diff --git a/howto/unicode.po b/howto/unicode.po
@@ -31,12 +31,13 @@ msgid "1.12"
 msgstr "1.12"
 
 #: ../Doc/howto/unicode.rst:9
-#, fuzzy
 msgid ""
 "This HOWTO discusses Python's support for the Unicode specification for "
 "representing textual data, and explains various problems that people "
 "commonly encounter when trying to work with Unicode."
-msgstr "이 HOWTO는 유니코드를 지원하는 파이썬에 대한 설명과 유니코드로 작업할 때 일반적으로 마주하는 다양한 문제들에 관해 설명합니다."
+msgstr ""
+"이 HOWTO는 텍스트 데이터를 나타내기 위한 유니코드 명세를 지원하는 파이썬에 대한 설명과 유니코드로 작업할 때 일반적으로 "
+"마주하는 다양한 문제들에 관해 설명합니다."
 
 #: ../Doc/howto/unicode.rst:15
 msgid "Introduction to Unicode"
@@ -57,6 +58,10 @@ msgid ""
 "Unicode Standard for representing characters, which lets Python programs "
 "work with all these different possible characters."
 msgstr ""
+"오늘날의 프로그램은 다양한 문자를 처리할 수 있어야 합니다. 응용 프로그램은 종종 국제화되어 다양한 사용자 선택 가능한 언어로 "
+"메시지를 표시하고 출력합니다; 같은 프로그램이 영어, 프랑스어, 일본어, 히브리어 또는 러시아어로 에러 메시지를 출력해야 할 수 "
+"있습니다. 웹 콘텐츠는 이러한 언어로 작성될 수 있으며 다양한 이모티콘 기호를 포함할 수도 있습니다. 파이썬의 문자열형은 문자를 "
+"표현하기 위해 유니코드 표준을 사용하므로, 파이썬 프로그램은 가능한 모든 다른 문자로 작업 할 수 있습니다."
 
 #: ../Doc/howto/unicode.rst:30
 msgid ""
@@ -65,6 +70,8 @@ msgid ""
 "unique code.  The Unicode specifications are continually revised and "
 "updated to add new languages and symbols."
 msgstr ""
+"유니코드(https://www.unicode.org/)는 인간 언어에서 사용하는 모든 문자를 나열하고 각 문자에 고유한 코드를 "
+"부여하고자 하는 명세입니다. 새로운 언어와 기호를 추가하기 위해 유니코드 명세가 계속 개정되고 갱신됩니다."
 
 #: ../Doc/howto/unicode.rst:35
 msgid ""
@@ -75,6 +82,10 @@ msgid ""
 "separate from the uppercase letter 'I'.  They'll usually look the same, "
 "but these are two different characters that have different meanings."
 msgstr ""
+"**문자**\\는 텍스트의 가능한 최소 구성요소입니다. 'A', 'B', 'C' 등은 전부 다른 문자입니다. 'È'\\와 'Í' "
+"역시 그렇습니다. 문자는 언어나 문맥에 따라 다릅니다. 예를 들어 대문자 'I'와는 별개로, \"로마 숫자 하나(Roman "
+"Numeral One)\"를 위한 문자 'Ⅰ'가 있습니다. 이들은 보통 똑같아 보이지만, 서로 다른 의미를 가진 두 개의 다른 "
+"문자입니다."
 
 #: ../Doc/howto/unicode.rst:42
 msgid ""
@@ -85,12 +96,15 @@ msgid ""
 "notation ``U+265E`` to mean the character with value ``0x265e`` (9,822 in"
 " decimal)."
 msgstr ""
+"유니코드 표준은 문자를 어떻게 **코드 포인트**\\로 표현하는지 서술하고 있습니다. 코드 포인트 값은 0에서 0x10FFFF "
+"사이의 정수입니다 (약 110만 개의 값입니다, 지금까지 약 11만 개가 할당되었습니다). 표준과 이 문서에서 코드 포인트는 "
+"``U+265E`` 표기법을 사용하며, 이는 값 ``0x265e``\\(10진수로는 9,822)인 문자를 의미합니다."
 
 #: ../Doc/howto/unicode.rst:49
 msgid ""
 "The Unicode standard contains a lot of tables listing characters and "
 "their corresponding code points:"
-msgstr ""
+msgstr "유니코드 표준은 문자와 그에 상응하는 코드 포인트를 나열한 수많은 표를 포함하고 있습니다:"
 
 #: ../Doc/howto/unicode.rst:70
 msgid ""
@@ -123,7 +137,6 @@ msgid "Encodings"
 msgstr "인코딩"
 
 #: ../Doc/howto/unicode.rst:88
-#, fuzzy
 msgid ""
 "To summarize the previous section: a Unicode string is a sequence of code"
 " points, which are numbers from 0 through ``0x10FFFF`` (1,114,111 "
@@ -133,16 +146,18 @@ msgid ""
 "bytes are called a **character encoding**, or just an **encoding**."
 msgstr ""
 "이전 섹션 요약: 유니코드 문자열은 코드 포인트의 시퀀스이며, 0부터 ``0x10FFFF`` (십진수 1,114,111)의 범위를 "
-"갖는 수입니다. 이 시퀀스는 메모리에서 바이트의 집합(0에서 255까지의 값)으로 표현되어야 합니다. 유니코드 문자열을 바이트 "
-"시퀀스로 변환하는 규칙을 **인코딩**\\이라고 부릅니다."
+"갖는 수입니다. 이 코드 포인트의 시퀀스는 메모리에서 **코드 단위(code units)**\\의 집합으로 표현될 필요가 있고, "
+"그런 다음 **코드 단위**\\는 8비트 바이트로 매핑됩니다. 유니코드 문자열을 바이트 시퀀스로 변환하는 규칙을 **문자 "
+"인코딩(character encoding)**, 또는 단지 **인코딩(encoding)**\\이라고 부릅니다."
 
 #: ../Doc/howto/unicode.rst:96
-#, fuzzy
 msgid ""
 "The first encoding you might think of is using 32-bit integers as the "
 "code unit, and then using the CPU's representation of 32-bit integers. In"
 " this representation, the string \"Python\" might look like this:"
-msgstr "생각할 수 있는 첫 번째 인코딩은 32비트 정수 배열입니다. 이 표현에서 문자열 \"Python\"은 다음과 같습니다:"
+msgstr ""
+"생각할 수 있는 첫 번째 인코딩은 코드 단위로 32비트 정수를 사용한 다음 32비트 정수의 CPU 표현을 사용하는 것입니다. 이 "
+"표현에서 문자열 \"Python\"은 다음과 같습니다:"
 
 #: ../Doc/howto/unicode.rst:106
 msgid ""
@@ -179,20 +194,19 @@ msgstr "``strlen()``\\과 같은 기존의 C 함수와 호환이 안 되기 때
 msgid ""
 "Therefore this encoding isn't used very much, and people instead choose "
 "other encodings that are more efficient and convenient, such as UTF-8."
-msgstr ""
+msgstr "따라서 이 인코딩은 많이 사용되지 않으며, 사람들은 UTF-8과 같은 더 효율적이고 편리한 다른 인코딩을 선택합니다."
 
 #: ../Doc/howto/unicode.rst:125
-#, fuzzy
 msgid ""
 "UTF-8 is one of the most commonly used encodings, and Python often "
 "defaults to using it.  UTF stands for \"Unicode Transformation Format\", "
 "and the '8' means that 8-bit values are used in the encoding.  (There are"
 " also UTF-16 and UTF-32 encodings, but they are less frequently used than"
 " UTF-8.)  UTF-8 uses the following rules:"
 msgstr ""
-"UTF-8은 일반적으로 가장 많이 사용되는 인코딩 중 하나입니다. UTF는 \"Unicode Transformation "
-"Format\"의 약자이며 '8'은 8비트 숫자가 인코딩에 사용됨을 뜻합니다. (UTF-16과 UTF-32 인코딩도 있지만, "
-"UTF-8보다 낮은 빈도로 사용됩니다.) UTF-8은 다음의 규칙을 따릅니다:"
+"UTF-8은 일반적으로 가장 많이 사용되는 인코딩 중 하나이고, 파이썬은 종종 기본적으로 이것을 사용합니다. UTF는 "
+"\"Unicode Transformation Format\"의 약자이며 '8'은 8비트 값이 인코딩에 사용됨을 뜻합니다. "
+"(UTF-16과 UTF-32 인코딩도 있지만, UTF-8보다 낮은 빈도로 사용됩니다.) UTF-8은 다음의 규칙을 따릅니다:"
 
 #: ../Doc/howto/unicode.rst:131
 msgid ""
@@ -217,16 +231,16 @@ msgid "It can handle any Unicode code point."
 msgstr "모든 유니코드 코드 포인트를 처리 할 수 있습니다."
 
 #: ../Doc/howto/unicode.rst:138
-#, fuzzy
 msgid ""
 "A Unicode string is turned into a sequence of bytes that contains "
 "embedded zero bytes only where they represent the null character "
 "(U+0000). This means that UTF-8 strings can be processed by C functions "
 "such as ``strcpy()`` and sent through protocols that can't handle zero "
 "bytes for anything other than end-of-string markers."
 msgstr ""
-"유니코드 문자열은 0 바이트를 갖지 않는 바이트 시퀀스로 변환됩니다. 이는 바이트 순서 문제를 피할 수 있으며, UTF-8 문자열을"
-" ``strcpy()``\\와 같은 C 함수로 처리하고 0바이트를 처리하지 못하는 프로토콜을 통해 전송할 수 있음을 의미합니다."
+"유니코드 문자열은 널 문자를 표현하는 곳에만 내장된 0바이트를 포함하는 바이트 시퀀스로 변환됩니다. 이는 UTF-8 문자열을 "
+"``strcpy()``\\와 같은 C 함수로 처리하고 문자열 끝 표시 이외의 0바이트를 처리하지 못하는 프로토콜을 통해 전송할 수 "
+"있음을 의미합니다."
 
 #: ../Doc/howto/unicode.rst:143
 msgid "A string of ASCII text is also valid UTF-8 text."
@@ -255,6 +269,9 @@ msgid ""
 " oriented encodings, like UTF-16 and UTF-32, where the sequence of bytes "
 "varies depending on the hardware on which the string was encoded."
 msgstr ""
+"UTF-8은 바이트 지향 인코딩입니다. 인코딩은 각 문자가 하나 이상의 바이트의 특정 시퀀스로 표시되도록 지정합니다. 이렇게 하면 "
+"UTF-16과 UTF-32와 같이 바이트의 시퀀스가 문자열이 인코딩된 하드웨어에 따라 달라지는 정수와 워드(word) 지향 "
+"인코딩에서 발생할 수 있는 바이트 순서 문제를 피할 수 있습니다."
 
 #: ../Doc/howto/unicode.rst:157 ../Doc/howto/unicode.rst:513
 #: ../Doc/howto/unicode.rst:734
@@ -279,6 +296,8 @@ msgid ""
 "history of Unicode and UTF-8 "
 "<https://www.youtube.com/watch?v=MijmeoH9LT4>`_ (9 minutes 36 seconds)."
 msgstr ""
+"Computerphile 유튜브 채널에서, Tom Scott가 간략하게 `유니코드와 UTF-8의 역사를 논의합니다 "
+"<https://www.youtube.com/watch?v=MijmeoH9LT4>`_ (9분 36초)."
 
 #: ../Doc/howto/unicode.rst:168
 msgid ""
@@ -328,16 +347,15 @@ msgid "The String Type"
 msgstr "문자열 형"
 
 #: ../Doc/howto/unicode.rst:191
-#, fuzzy
 msgid ""
 "Since Python 3.0, the language's :class:`str` type contains Unicode "
 "characters, meaning any string created using ``\"unicode rocks!\"``, "
 "``'unicode rocks!'``, or the triple-quoted string syntax is stored as "
 "Unicode."
 msgstr ""
-"파이썬 3.0부터는 유니코드 문자를 포함하고 있는 :class:`str` 타입을 특징으로 하며 이는 어떤 문자열이든 "
-"``\"unicode rocks!\"``, ``'unicode rocks!'`` 또는 삼중 따옴표로 묶인 문자열 문법을 사용한다면 "
-"유니코드로 저장됨을 뜻합니다."
+"파이썬 3.0부터는 언어의 :class:`str` 형은 유니코드 문자를 포함하고, 이는 어떤 문자열이든 ``\"unicode "
+"rocks!\"``, ``'unicode rocks!'`` 또는 삼중 따옴표로 묶인 문자열 문법을 사용한다면 유니코드로 저장됨을 "
+"뜻합니다."
 
 #: ../Doc/howto/unicode.rst:195
 msgid ""
@@ -531,7 +549,6 @@ msgid "Unicode Properties"
 msgstr "유니코드 속성"
 
 #: ../Doc/howto/unicode.rst:357
-#, fuzzy
 msgid ""
 "The Unicode specification includes a database of information about code "
 "points.  For each defined code point, the information includes the "
@@ -542,8 +559,8 @@ msgid ""
 "bidirectional text."
 msgstr ""
 "유니코드 사양은 코드 포인트에 대한 정보 데이터베이스를 포함합니다. 각각 정의한 코드 포인트에 대해서, 정보는 문자의 이름, "
-"카테고리, 적용 가능한 숫자 값(유니코드는 로마 숫자와 3분의 1 및 4분의 5와 같은 분수를 표현하는 문자를 가집니다)을 "
-"포함합니다. 양방향 텍스트에서 사용하는 코드 포인트와 관련된 속성과 기타 디스플레이 관련 속성도 있습니다."
+"카테고리, 적용 가능한 숫자 값(로마 숫자와 같은 숫자 개념을 나타내는 문자, 3분의 1이나 5분의 4와 같은 분수를 표현하는 문자"
+" 등)을 포함합니다. 양방향 텍스트에서 코드 포인트를 사용하는 방법과 같은 디스플레이 관련 속성도 있습니다."
 
 #: ../Doc/howto/unicode.rst:365
 msgid ""
@@ -577,7 +594,7 @@ msgstr ""
 
 #: ../Doc/howto/unicode.rst:401
 msgid "Comparing Strings"
-msgstr ""
+msgstr "문자열 비교"
 
 #: ../Doc/howto/unicode.rst:403
 msgid ""
@@ -589,6 +606,10 @@ msgid ""
 "produce the same output when printed, but one is a string of length 1 and"
 " the other is of length 2."
 msgstr ""
+"유니코드는 문자열 비교를 약간 복잡하게 만듭니다, 같은 문자 집합이 다른 코드 포인트 시퀀스로 표시될 수 있기 때문입니다. 예를 "
+"들어, 'ê'와 같은 문자는 단일 코드 포인트 U+00EA로 표시되거나, 'e'의 코드 포인트 다음에 'COMBINING "
+"CIRCUMFLEX ACCENT'의 코드 포인트가 오는 U+0065 U+0302로 표시될 수 있습니다. 인쇄될 때 같은 출력을 "
+"생성하지만, 하나는 길이 1의 문자열이고 다른 하나는 길이 2입니다."
 
 #: ../Doc/howto/unicode.rst:411
 msgid ""
@@ -598,6 +619,9 @@ msgid ""
 "special handling for characters such as the German letter 'ß' (code point"
 " U+00DF), which becomes the pair of lowercase letters 'ss'."
 msgstr ""
+"대소 문자를 구분하지 않는 비교를 위한 한 가지 도구는 문자열을 유니코드 표준에 설명된 알고리즘에 따라 대소 문자를 구분하지 않는 "
+"형식으로 변환하는 :meth:`~str.casefold` 문자열 메서드입니다. 이 알고리즘은 독일어 문자 'ß'(코드 포인트 "
+"U+00DF)를 소문자 'ss' 쌍이 되도록 하는 것과 같이 문자를 특수하게 처리합니다."
 
 #: ../Doc/howto/unicode.rst:424
 msgid ""
@@ -608,6 +632,10 @@ msgid ""
 "perform string comparisons that won't falsely report inequality if two "
 "strings use combining characters differently:"
 msgstr ""
+"두 번째 도구는 :mod:`unicodedata` 모듈의 :func:`~unicodedata.normalize` 함수인데, 문자열을"
+" 여러 정규 형식 중 하나로 변환합니다. 여기서 뒤에 결합 문자가 오는 문자는 단일 문자로 바뀝니다. "
+":func:`normalize`\\를 사용하면 두 문자열이 문자 결합을 다르게 사용할 때 같지 않다고 잘못 보고하지 않는 문자열 "
+"비교를 수행할 수 있습니다:"
 
 #: ../Doc/howto/unicode.rst:447
 msgid "When run, this outputs:"
@@ -619,10 +647,12 @@ msgid ""
 "string giving the desired normalization form, which can be one of 'NFC', "
 "'NFKC', 'NFD', and 'NFKD'."
 msgstr ""
+":func:`~unicodedata.normalize` 함수의 첫 번째 인자는 원하는 정규화 형식을 제공하는 문자열입니다. "
+"'NFC', 'NFKC', 'NFD' 및 'NFKD' 중 하나일 수 있습니다."
 
 #: ../Doc/howto/unicode.rst:460
 msgid "The Unicode Standard also specifies how to do caseless comparisons::"
-msgstr ""
+msgstr "유니코드 표준은 또한 대소 문자를 구별하지 않고 비교하지 않는 방법을 지정합니다::"
 
 #: ../Doc/howto/unicode.rst:476
 msgid ""
@@ -631,6 +661,9 @@ msgid ""
 "normalized string, so the result needs to be normalized again. See "
 "section 3.13 of the Unicode Standard for a discussion and an example.)"
 msgstr ""
+"이것은 ``True``\\를 인쇄합니다. (왜 :func:`NFD`\\가 두 번 호출될까요? :meth:`casefold`\\가 "
+"정규화되지 않은 문자열을 반환하도록 하는 문자가 몇 개 있기 때문입니다, 그래서 결과를 다시 정규화해야 합니다. 토론과 예제는 "
+"유니코드 표준의 3.13 절을 참조하십시오.)"
 
 #: ../Doc/howto/unicode.rst:483
 msgid "Unicode Regular Expressions"
@@ -855,6 +888,11 @@ msgid ""
 "environment variables; if you haven't, the default encoding is again "
 "UTF-8."
 msgstr ""
+"오늘날 자주 쓰이는 대부분의 운영체제는 임의의 유니코드 문자를 갖는 파일 이름을 지원합니다. 이는 보통 유니코드 문자열을 시스템에 "
+"의존적인 인코딩으로 변환하여 구현합니다. 오늘날의 파이썬은 UTF-8을 사용하는 것으로 수렴하고 있습니다: 맥 OS X의 파이썬은 "
+"여러 버전에서 UTF-8을 사용했으며, 파이썬 3.6은 윈도우에서도 UTF-8을 사용하도록 전환했습니다. 유닉스 시스템에서 "
+"``LANG`` 또는 ``LC_CTYPE`` 환경변수를 설정했다면, 파일 시스템 인코딩만을 사용합니다; 그렇지 않은 경우, 기본 "
+"인코딩은 다시 UTF-8입니다."
 
 #: ../Doc/howto/unicode.rst:615
 msgid ""
@@ -887,6 +925,11 @@ msgid ""
 " bytes.  For example, assuming the default filesystem encoding is UTF-8, "
 "running the following program::"
 msgstr ""
+":func:`os.listdir` 함수는 파일 이름을 반환하는데, 문제를 일으킵니다: 이 함수가 파일 이름의 유니코드 버전을 "
+"반환해야 할까요? 아니면 인코딩 버전을 포함한 바이트열을 반환해야 할까요? :func:`os.listdir`\\은 디렉터리 경로를 "
+"바이트열이나 유니코드 문자열로 제공했는지에 따라 둘 모두를 수행할 수 있습니다. 경로를 유니코드 문자열로 넘겨주었을 때 파일 이름은"
+" 파일 시스템의 인코딩으로 디코딩되고 유니코드 문자열의 목록이 반환되는 반면, 바이트를 넘겨주었을 때 파일 이름을 바이트열로 "
+"반환합니다. 예를 들어, 기본 파일 시스템 인코딩이 UTF-8이라 가정할 때, 다음의 프로그램을 실행할 시::"
 
 #: ../Doc/howto/unicode.rst:646
 msgid "will produce the following output:"
@@ -905,6 +948,8 @@ msgid ""
 "undecodable file names can be present; that's pretty much only Unix "
 "systems now."
 msgstr ""
+"대부분의 경우, 이러한 API로 유니코드만 사용할 수 있음에 유의하십시오. 바이트열 API는 디코딩할 수 없는 파일 이름이 존재하는"
+" 시스템에서만 사용해야 합니다; 지금은 유닉스 시스템에 불과합니다."
 
 #: ../Doc/howto/unicode.rst:664
 msgid "Tips for Writing Unicode-aware Programs"
@@ -996,17 +1041,16 @@ msgstr ""
 "있습니다::"
 
 #: ../Doc/howto/unicode.rst:726
-#, fuzzy
 msgid ""
 "The ``surrogateescape`` error handler will decode any non-ASCII bytes as "
 "code points in a special range running from U+DC80 to U+DCFF.  These code"
 " points will then turn back into the same bytes when the "
 "``surrogateescape`` error handler is used to encode the data and write it"
 " back out."
 msgstr ""
-"U+DC80부터 U+DCFF까지의 범위를 가지는 유니코드 개인 사용 영역에서 ``surrogateescape`` 에러 핸들러는 비 "
-"ASCII 바이트들을 코드 포인트로써 디코딩합니다. 이러한 개인 코드 포인트는 ``surrogateescape`` 에러 핸들러가 "
-"데이터를 인코딩하고 다시 쓰는 경우에 사용될 때 같은 바이트열로 다시 되돌려집니다."
+"``surrogateescape`` 에러 핸들러는 비 ASCII 바이트들을 U+DC80부터 U+DCFF까지의 특수한 범위에 있는 "
+"코드 포인트로 디코딩합니다. 이러한 코드 포인트는 ``surrogateescape`` 에러 핸들러가 데이터를 인코딩하고 다시 쓰는 "
+"경우에 사용될 때 같은 바이트열로 다시 되돌려집니다."
 
 #: ../Doc/howto/unicode.rst:736
 msgid ""
@@ -1064,4 +1108,8 @@ msgid ""
 "von Löwis, Terry J. Reedy, Serhiy Storchaka, Eryk Sun, Chad Whitacre, "
 "Graham Wideman."
 msgstr ""
+"이 문서에 오류를 알려주거나 제안을 해주신 아래의 사람들에게 감사를 전합니다: Éric Araujo, Nicholas Bastin,"
+" Nick Coghlan, Marius Gedminas, Kent Johnson, Ken Krugler, Marc-André "
+"Lemburg, Martin von Löwis, Terry J. Reedy, Serhiy Storchaka, Eryk Sun, "
+"Chad Whitacre, Graham Wideman."