Skip to content

Opencl cpu batch matmul fix #2774

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 4 commits into from
Feb 21, 2020

Conversation

umar456
Copy link
Member

@umar456 umar456 commented Feb 21, 2020

Fixes a couple of issues with the CPU OpenCL blas functions. The batching functionality was not working correctly for righ hand side batches. The matmul function was throwing errors in case the same input was passed in. This was caused only on the Intel GPU OpenCL implementation because mapping the same buffer as a READ and WRITE mapped pointer caused an error. These issues have been fixed and the getMappedPtr is refactored to return unique_ptr instead of shared_ptr.

Fixes #1711

@umar456 umar456 force-pushed the opencl_cpu_batch_matmul_fix branch 3 times, most recently from d94382d to a4eb27a Compare February 21, 2020 08:13
The Intel OpenCL mapping the same buffer for write access caused an
error. This caused the matmul operation to fail when the same
array was passed in. To fix this only the READ flag is passed into
the map function instead of the READ and WRITE flags
The new getMappedPtr is now (correctly) returning a unique_ptr.
This commit removes implicit conversions from unique_ptr to
shared ptr.
@umar456 umar456 force-pushed the opencl_cpu_batch_matmul_fix branch from a4eb27a to 2356ec9 Compare February 21, 2020 08:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

matmulTN fails when the two parameters are the same array (Intel/OpenCL)
2 participants