Implement __reduce__ and __reduce_ex__ for array #3064

qingshi163 · 2021-09-15T09:14:16Z

No description provided.

vm/src/stdlib/array.rs

qingshi163 · 2021-09-15T09:17:10Z

vm/src/stdlib/array.rs

+            vm.ctx.new_str(
+                char::from_u32(self.0 as u32)
+                    .unwrap_or_default()
+                    .to_string(),
+            )


Is this conversion safe?

WideChar can contain code points that are invalid in UTF-8, such as surrogates, so I don't think so. To be safe, it must be decoded.

Are we assuming the data in the array is already decoded?

I am not sure. I guess this corresponds to the WChar unicode object in CPython, which is deprecated, just like this array type.
Even if it is regarded as a decoded string, because this is an array it can still contain a surrogate character, which is not valid in a UTF-8 string.
So my last comment was wrong. Regardless of whether it is decoded or not, it can contain invalid UTF-8 characters.
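To make the surrogate concern concrete, here is a minimal Python sketch (not code from this PR, standard library only). It shows that a lone surrogate such as U+D83C can exist as a code point in a Python str but cannot be encoded as strict UTF-8. On the Rust side, `char::from_u32` returns `None` for such a value, so the `unwrap_or_default()` in the reviewed snippet would silently substitute U+0000. The helper name `is_utf8_encodable` is hypothetical.

```python
def is_utf8_encodable(ch: str) -> bool:
    """Return True if ch can be encoded as strict UTF-8 (hypothetical helper)."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# A high surrogate on its own: legal as a Python str code point.
lone_surrogate = chr(0xD83C)

print(is_utf8_encodable("a"))             # ordinary ASCII character
print(is_utf8_encodable(lone_surrogate))  # lone surrogate

# 'surrogatepass' is the stdlib error handler that lets surrogates through.
raw = lone_surrogate.encode("utf-8", "surrogatepass")
print(raw.hex())
```

The strict encode fails only for the surrogate, which is the case an array of wide characters has to account for.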

vm/src/builtins/float.rs

youknowone · 2021-09-15T15:33:39Z

vm/src/stdlib/array.rs

+            vm.ctx.new_str(
+                char::from_u32(self.0 as u32)
+                    .unwrap_or_default()
+                    .to_string(),
+            )


WideChar can contain code points that are invalid in UTF-8, such as surrogates, so I don't think so. To be safe, it must be decoded.

qingshi163 · 2021-09-16T14:47:34Z

@youknowone Can you check what went wrong in the Windows tests and fix it?

youknowone

I am really sorry. I don't have access to a Windows machine right now, and won't until early October.

Can anyone help with this?

youknowone · 2021-09-16T15:24:31Z

extra_tests/snippets/stdlib_array.py

+u = array('u', test_str)
+assert u.__reduce_ex__(1)[1][1] == list(test_str)
+assert str(loads(dumps(u, 1))) == f"array('u', '{test_str}')"


Suggested change

-assert str(loads(dumps(u, 1))) == f"array('u', '{test_str}')"
+assert str(loads(dumps(u, 1))) == f"array('u', '{test_str}')", str(loads(dumps(u, 1)))

Then it will show the value on assertion failure.
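As a small illustration of the suggestion above, the sketch below uses made-up values (they are not from the test suite) and a hypothetical `check` helper to show how the expression after the comma in an `assert` statement becomes the `AssertionError` message.

```python
def check(actual: str, expected: str) -> None:
    # The expression after the comma is evaluated only if the assertion fails,
    # and it becomes the argument of the raised AssertionError.
    assert actual == expected, actual


message = None
try:
    # Deliberately mismatched, made-up values to trigger the failure path.
    check("array('u', 'abd')", "array('u', 'abc')")
except AssertionError as exc:
    message = str(exc)

print(message)
```

With the message attached, a CI log shows the offending value directly rather than only the failing line.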

(jedi mind trick wave) That was not the line you're looking for…

fanninpm · 2021-09-16T18:03:58Z

@coolreader18 might be able to diagnose the Windows problem

youknowone · 2021-10-02T08:14:02Z

@qingshi163 could you rebase this PR? I will look into the Windows problem this week.

youknowone · 2021-10-04T18:31:25Z

stdlib/src/array.rs

+                let s = Self::_wchar_bytes_to_string(array.get_bytes(), array.itemsize(), vm)?;
+                s.chars().map(|x| x.into_pyobject(vm)).collect()


On Windows, I tried this code with CPython:

test_str = '🌉abc🌐def🌉🌐'
u = array('u', test_str)
print(u.__reduce_ex__(1))

Then:

File ".\extra_tests\snippets\stdlib_array.py", line 104, in <module>
    assert u.__reduce_ex__(1)[1][1] == list(test_str), (u.__reduce_ex__(1)[1][1], list(test_str))
AssertionError: (['\ud83c', '\udf09', 'a', 'b', 'c', '\ud83c', '\udf10', 'd', 'e', 'f', '\ud83c', '\udf09', '\ud83c', '\udf10'], ['🌉', 'a', 'b', 'c', '🌐', 'd', 'e', 'f', '🌉', '🌐'])

The values are wchar-encoded in CPython, but that doesn't seem to be the case here.

Do you mean CPython does not treat it the same way on Linux and macOS?

Yes, it seems so. I don't think it is exactly a platform-specific variant; rather, sizeof(wchar_t) differs between platforms. That looks very fragile from a compatibility point of view, so no wonder it is deprecated.

I don't mind if you want to just skip the test in environments with 16-bit characters.
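The difference described above can be emulated on any platform. The following is a rough sketch, not CPython's or RustPython's actual implementation: the hypothetical helper `wchar_units` splits a string into one element per code unit, assuming a 2-byte `wchar_t` is UTF-16 and a 4-byte one is UTF-32. For the emoji in the traceback, the 16-bit case yields the same surrogate pair reported by the AssertionError.

```python
import ctypes
import struct


def wchar_units(text: str, itemsize: int) -> list:
    """Split text into one str per wchar_t-sized code unit (hypothetical helper)."""
    if itemsize == 2:
        data = text.encode("utf-16-le")          # assumed 16-bit wchar_t layout
        fmt = "<%dH" % (len(data) // 2)
    elif itemsize == 4:
        data = text.encode("utf-32-le")          # assumed 32-bit wchar_t layout
        fmt = "<%dI" % (len(data) // 4)
    else:
        raise ValueError("unsupported wchar_t size")
    return [chr(unit) for unit in struct.unpack(fmt, data)]


test_str = "\U0001F309abc"  # U+1F309 (the first emoji in the test string) plus ASCII

units16 = wchar_units(test_str, 2)
units32 = wchar_units(test_str, 4)

# Print code points as hex to avoid writing lone surrogates to stdout.
print([hex(ord(u)) for u in units16])
print([hex(ord(u)) for u in units32])
print("native wchar_t size:", ctypes.sizeof(ctypes.c_wchar))
```

A test could compare against `list(test_str)` only when the native `wchar_t` size is 4, which is one way to implement the skip mentioned above.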

youknowone

Thank you for your long-running effort!


youknowone requested changes Sep 15, 2021

qingshi163 force-pushed the array-pickle branch 2 times, most recently from f6dd539 to c289397 Compare September 16, 2021 13:37

youknowone reviewed Sep 16, 2021

qingshi163 force-pushed the array-pickle branch from c289397 to a0db9f7 Compare October 2, 2021 16:04

youknowone reviewed Oct 4, 2021

qingshi163 force-pushed the array-pickle branch 2 times, most recently from ee6fa13 to be2bf61 Compare October 7, 2021 13:41

youknowone approved these changes Oct 7, 2021

qingshi163 added 4 commits October 10, 2021 14:43

Impl __reduce__ for array

ff0adc1

Impl __reduce_ex__ with _array_reconstructor

de47cd0

Fix unicode array pickling

cbd7c59

remove test that fail on CPython

ea69dc5

qingshi163 force-pushed the array-pickle branch from 9af4998 to ea69dc5 Compare October 10, 2021 12:48

qingshi163 requested a review from youknowone October 10, 2021 19:47

youknowone approved these changes Oct 11, 2021

youknowone merged commit b986e6b into RustPython:main Oct 11, 2021

fanninpm mentioned this pull request Jul 15, 2022

replace array.__reduce__ to array.__reduce_ex__ #3876

Open

qingshi163 deleted the array-pickle branch August 16, 2022 07:26
