Skip to content

Use CL fill buffer instead of host allocation in csrmm kernel #3127

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 5, 2021

Conversation

9prady9
Copy link
Member

@9prady9 9prady9 commented May 4, 2021

Description

Use OpenCL enqueuFillBuffer API instead of host allocation in csrmm kernel

Fixes: #3012

Changes to Users

None

Checklist

  • Rebased on latest master
  • Code compiles
  • Tests pass
  • [ ] Functions added to unified API
  • [ ] Functions documented

@9prady9 9prady9 added this to the 3.8.1 milestone May 4, 2021
@9prady9 9prady9 requested a review from umar456 May 4, 2021 10:05
@9prady9 9prady9 merged commit 25178df into arrayfire:master May 5, 2021
@9prady9 9prady9 deleted the fix_csrmm_counter_init branch May 5, 2021 13:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] use_greedy in csrmm_nt is failing, cause performance issues (Proposed solution)
2 participants