Validation step fails when using shared memory with multiprocessing.managers.BaseManager
#28899
Describe the bug
Original issue: kedro-org/kedro#3674
Relates to #28781
We use multiprocessing managers to work with shared memory for pipeline parallelisation. After this validation step was added, we are experiencing a `ValueError: cannot set WRITEABLE flag to True of this array` error when objects are retrieved from shared memory and passed to scikit-learn functions that include this validation step, for example `fit`.
The only solution that works for us so far is making a deep copy of the objects before passing them to those methods, which is not the desired solution.
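To illustrate the error message and the deep-copy workaround, here is a minimal NumPy-only sketch. It is not the original reproducer and does not involve `BaseManager` or scikit-learn: it assumes that the objects coming back from shared memory behave like read-only array views that do not own their memory, and uses a `bytes` buffer as a stand-in for that memory.

```python
import copy

import numpy as np

# Assumption: a bytes-backed buffer stands in for data retrieved from shared
# memory. NumPy exposes it as a read-only view that does not own its memory.
buffer = bytes(8 * 4)  # room for four float64 values
shared_view = np.frombuffer(buffer, dtype=np.float64)

print(shared_view.flags.writeable)  # False

error_message = None
try:
    # NumPy refuses to re-enable writes on a view of read-only memory.
    shared_view.setflags(write=True)
except ValueError as exc:
    error_message = str(exc)
    print(error_message)

# The workaround described above: a deep copy owns fresh, writeable memory,
# at the cost of duplicating the data.
private_copy = copy.deepcopy(shared_view)
private_copy.setflags(write=True)  # succeeds on the copy
private_copy[0] = 1.0
```

The copy avoids the error because the new array owns its buffer, but it also gives up the memory savings that sharing was meant to provide, which is why it is not a satisfying fix here.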
Steps/Code to Reproduce
Some findings:
- The error depends on `n_samples`. When `n_samples` is relatively small (~100), the error does not occur. So it may be related to "ColumnTransformer throws error with n_jobs > 1 input dataframes and joblib auto-memmapping (regression in 1.4.1.post1)" #28781 (comment).
- Replacing `pd.Series` with `pd.DataFrame` solves the issue, but we have no idea why.
Expected Results
No error is thrown.
Actual Results
Versions