f64 kernels #421

Merged: 31 commits into main from num-traits-float on Feb 13, 2023

coreylowman (Owner) commented on Jan 30, 2023

  • Adds the num-traits dependency
  • Uses num_traits::Float in all CPU kernels
  • Implements CUDA kernels for both f32 and f64 for now
  • Adds a test-f64 feature flag for dev
  • Adds dev.build_module back to make specifying the dtype easier
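
As a rough illustration of the second bullet, here is a minimal sketch of a CPU kernel written once against a float trait and instantiated for both f32 and f64. It is not code from this PR. To stay dependency-free, it defines a small local `Float` trait as a hypothetical stand-in for `num_traits::Float`; with the real crate you would import `num_traits::Float` instead. The function names `relu` and `dot` are also assumptions chosen for the example.

```rust
use std::ops::{Add, Mul};

/// Hypothetical stand-in for `num_traits::Float`, reduced to what this
/// sketch needs. The real trait offers far more (exp, sqrt, etc.).
trait Float: Copy + PartialOrd + Add<Output = Self> + Mul<Output = Self> {
    fn zero() -> Self;
}

impl Float for f32 {
    fn zero() -> Self {
        0.0
    }
}

impl Float for f64 {
    fn zero() -> Self {
        0.0
    }
}

/// Element-wise ReLU, written once for any float type.
fn relu<F: Float>(inp: &[F], out: &mut [F]) {
    for (o, &x) in out.iter_mut().zip(inp) {
        *o = if x > F::zero() { x } else { F::zero() };
    }
}

/// Dot product, accumulated in the element type itself.
fn dot<F: Float>(a: &[F], b: &[F]) -> F {
    a.iter().zip(b).fold(F::zero(), |acc, (&x, &y)| acc + x * y)
}

fn main() {
    let x32 = [-1.0f32, 0.5, 2.0];
    let x64 = [-1.0f64, 0.5, 2.0];
    let mut y32 = [0.0f32; 3];
    let mut y64 = [0.0f64; 3];

    // The same generic kernel serves both precisions.
    relu(&x32, &mut y32);
    relu(&x64, &mut y64);

    println!("relu f32: {:?}, f64: {:?}", y32, y64);
    println!("dot  f32: {}, f64: {}", dot(&x32, &x32), dot(&x64, &x64));
}
```

The appeal of this pattern is that each kernel body exists once, and adding a precision means adding a trait implementation rather than duplicating the kernel.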

Resolves #224

coreylowman changed the title from "[Draft] f64 tensor ops" to "[WIP] f64 kernels" on Jan 30, 2023
coreylowman marked this pull request as draft on January 30, 2023, 21:08
coreylowman changed the title from "[WIP] f64 kernels" to "f64 kernels" on Feb 12, 2023
coreylowman marked this pull request as ready for review on February 12, 2023, 17:22
coreylowman merged commit 0a5a016 into main on Feb 13, 2023
coreylowman deleted the num-traits-float branch on February 13, 2023, 15:08
Development

Successfully merging this pull request may close these issues.

Double precision tensors
3 participants