bpo-37476: Adding a unit test of unicode in test_unicode.py #14531

shihai1991 · 2019-07-01T18:32:20Z

https://bugs.python.org/issue37476

mangrisano · 2019-07-01T19:55:45Z

/cc @vstinner @ezio-melotti @malemburg @benjaminp

Modules/_testcapimodule.c

zhangyangyu · 2019-07-02T01:55:05Z

Lib/test/test_unicode.py

+
+        self.assertEqual(unicode_asutf8('abc'), 'abc')
+        self.assertEqual(unicode_asutf8('abc\0'), 'abc')
+        self.assertEqual(unicode_asutf8('abc\0abc'), 'abc')


It's more like you are testing decodeUTF8. I'd suggest add more encode samples, BMP, non-BMP, just take from other tests.

np, xiang core.

vstinner

If you add tests, please test also PyUnicode_AsUTF8AndSize() which would allow embedded NUL characters/bytes.

vstinner · 2019-07-02T11:27:52Z

Lib/test/test_unicode.py

+    def test_asutf8(self):
+        from _testcapi import unicode_asutf8
+
+        self.assertEqual(unicode_asutf8('abc'), 'abc')


The function encodes to UTF-8, so the result should be a bytes string, not Unicode. Use PyString_FromString() rather than PyUnicode_FromString() in _testcapi.

The others similar C API tests also return unicodes other than bytes which in some sense also tests the counter part decoding function, though I don't very understand why designed like this.

Either fix other tests, or leave them unchanged. But I would prefer to not add new buggy tests :-)

So returning the decoded bytes string is right way? I am not sure I understand it clearly.

Return bytes, not unicodes. Test decode functionality separately.

shihai1991 · 2019-07-02T17:07:50Z

If you add tests, please test also PyUnicode_AsUTF8AndSize() which would allow embedded NUL characters/bytes.

np, I would continue to add test.

tiran · 2019-07-02T18:43:01Z

FYI, I cancelled the Travis CI run to free some resources for upcoming 3.8b2 and 3.7.4rc2 releases.

Modules/_testcapimodule.c

Lib/test/test_unicode.py

shihai1991 · 2019-07-10T04:29:23Z

@vstinner hi, victor. Pls help me review this patch again, thanks ;)

Adding a unit test of unicode in test_unicode.py

5c79dd5

the-knights-who-say-ni added the CLA signed label Jul 1, 2019

bedevere-bot added the awaiting review label Jul 1, 2019

zhangyangyu reviewed Jul 2, 2019

View reviewed changes

Modules/_testcapimodule.c Outdated Show resolved Hide resolved

zhangyangyu reviewed Jul 2, 2019

View reviewed changes

zhangyangyu added the skip news label Jul 2, 2019

vstinner reviewed Jul 2, 2019

View reviewed changes

add curly brackets

1018743

ZackerySpytz reviewed Jul 3, 2019

View reviewed changes

Modules/_testcapimodule.c Show resolved Hide resolved

Update the test of unicode

2d1a309

shihai1991 changed the title ~~bpo-37476: Adding a unit test of unicode in test_unicode.py~~ [WIP]bpo-37476: Adding a unit test of unicode in test_unicode.py Jul 3, 2019

add test of unicode_asutf8andsize

42612a5

shihai1991 changed the title ~~[WIP]bpo-37476: Adding a unit test of unicode in test_unicode.py~~ bpo-37476: Adding a unit test of unicode in test_unicode.py Jul 4, 2019

zhangyangyu reviewed Jul 7, 2019

View reviewed changes

Lib/test/test_unicode.py Outdated Show resolved Hide resolved

update test desc

8ab018b

zhangyangyu approved these changes Jul 8, 2019

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting review labels Jul 8, 2019

zhangyangyu added the tests Tests in the Lib/test dir label Jul 8, 2019

zhangyangyu merged commit 5623ac8 into python:master Jul 20, 2019

bedevere-bot removed the awaiting merge label Jul 20, 2019

lisroach pushed a commit to lisroach/cpython that referenced this pull request Sep 10, 2019

bpo-37476: Adding tests for asutf8 and asutf8andsize (pythonGH-14531)

59f60f5

DinoV pushed a commit to DinoV/cpython that referenced this pull request Jan 14, 2020

bpo-37476: Adding tests for asutf8 and asutf8andsize (pythonGH-14531)

33ca201

websurfer5 pushed a commit to websurfer5/cpython that referenced this pull request Jul 20, 2020

bpo-37476: Adding tests for asutf8 and asutf8andsize (pythonGH-14531)

d6cea3a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-37476: Adding a unit test of unicode in test_unicode.py #14531

bpo-37476: Adding a unit test of unicode in test_unicode.py #14531

Uh oh!

shihai1991 commented Jul 1, 2019 •

edited by bedevere-bot

Loading

Uh oh!

mangrisano commented Jul 1, 2019

Uh oh!

Uh oh!

zhangyangyu Jul 2, 2019

Uh oh!

shihai1991 Jul 2, 2019

Uh oh!

vstinner left a comment

Uh oh!

vstinner Jul 2, 2019

Uh oh!

zhangyangyu Jul 2, 2019

Uh oh!

vstinner Jul 2, 2019

Uh oh!

shihai1991 Jul 2, 2019

Uh oh!

zhangyangyu Jul 3, 2019

Uh oh!

shihai1991 commented Jul 2, 2019

Uh oh!

tiran commented Jul 2, 2019

Uh oh!

Uh oh!

Uh oh!

shihai1991 commented Jul 10, 2019

Uh oh!

Uh oh!

Uh oh!

bpo-37476: Adding a unit test of unicode in test_unicode.py #14531

bpo-37476: Adding a unit test of unicode in test_unicode.py #14531

Uh oh!

Conversation

shihai1991 commented Jul 1, 2019 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mangrisano commented Jul 1, 2019

Uh oh!

Uh oh!

zhangyangyu Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

shihai1991 Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

vstinner left a comment

Choose a reason for hiding this comment

Uh oh!

vstinner Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

zhangyangyu Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

vstinner Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

shihai1991 Jul 2, 2019

Choose a reason for hiding this comment

Uh oh!

zhangyangyu Jul 3, 2019

Choose a reason for hiding this comment

Uh oh!

shihai1991 commented Jul 2, 2019

Uh oh!

tiran commented Jul 2, 2019

Uh oh!

Uh oh!

Uh oh!

shihai1991 commented Jul 10, 2019

Uh oh!

Uh oh!

shihai1991 commented Jul 1, 2019 •

edited by bedevere-bot

Loading