+
Skip to content

Conversation

coreylowman
Copy link
Owner

@coreylowman coreylowman commented Jan 30, 2023

  • Adds num-trait dependency
  • Uses num_traits::Float in all Cpu kernels
  • impls Cuda kernels for both f32 & f64 for now.
  • Adds test-f64 feature flag for dev
  • Adds dev.build_module back to make specifying dtype easier

Resolves #224

@coreylowman coreylowman changed the title [Draft] f64 tensor ops [WIP] f64 kernels Jan 30, 2023
@coreylowman coreylowman marked this pull request as draft January 30, 2023 21:08
@coreylowman coreylowman changed the title [WIP] f64 kernels f64 kernels Feb 12, 2023
@coreylowman coreylowman marked this pull request as ready for review February 12, 2023 17:22
@coreylowman coreylowman merged commit 0a5a016 into main Feb 13, 2023
@coreylowman coreylowman deleted the num-traits-float branch February 13, 2023 15:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Double precision tensors

3 participants

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载