+
Skip to content

Tags: gorgonia/cu

Tags

v0.9.6

Toggle v0.9.6's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: include v12.0 files (#71)

v0.9.5

Toggle v0.9.5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Cuda 12 (#69)

* Update jit.go - comment out old JIT Target compute 20, 21

* Update params.go - move to the _func trick

Support the new cuda 12 structure (v2)

* Update params.go - define the function prototype

hides a warning

* Update api.go - hide deprecated api warnings

v0.9.4

Toggle v0.9.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
CUDA11

* CUDA11 initial work. First, we generate the new enums

* Added generateEnums, which generates the Go version of the CUresult type

* Updated tests such that they no longer fail.
Added a Signal() method to BatchedContext, to force the BatchedContext to DoWork

* Updated benchmarking of batched vs no batched context. It would appear that for now Batching no longer confers a benefit

* Attempt #4 at getting CUDA11. Previous attempts were working based off a faulty copy of `cuda.h`

- Updated Device to support UUID
- Updated README
- Updated genlib to do more things more carefully

* More work on CUDA11
- Added more mappings into mappings.go to generate stufff
- Changed the definition of Context, by adding one additional method to clear L2Cache
- Added stubs for LaunchCooperativeKernel
- Added Graph types.

TODO next: add all the basic Graph data structure and then autogenerate all the things!

* Fixed mappings to also include @egonelbre's change in 2e25e65
Fixed a bug where Fix() wasn't called, leading to weird generations

* Added some graph stuff, fixed some mappings stuff for genAPI. It seems that the graph functions will have to be manually written for now

* Updated graph.go from ages ago

* Updated more of CUDA11 Graph API into the library.
Slowly getting there.

* Added the body of CopyParams

* Added AddMemsetNode method for Graph.

* Fixed a bunch of things

* Switched to modernc.org/cc instead of using the older github.com/cznic/cc

* cuDNN updated their website. So parse.py also has to change.
As a result moredecls.go also changed

* Sorted the data in mappings.go. This will allow for better diffing

* Updated the generatethis pipeline

* Initial mappings generation.

* Mapped the old commented out mappings to new commented out mappings (see mappings.ods)

* Generated enums.

* Updated enums and enum strings

* Added more generated data structures

* Added methods

* Generated stubs. 7 TODOs

* Added more incompletes report

* Manually fixed the TODO of SpatialTransformer

* Manually fixed generated_rnndata.go

* Manually fixed generated_seqdata.go

* Manually fixed generated_backend.go

* Manually fixed generated_tensortransform.go

* Fixed the missing getters

* fixed all the .C()s of the generated types

* Generated a new API

* Fixed random C int issues. Now to handle the rest

* Updated INCOMPLETES_REPORTS

* fixed variable collition in _BackendAttributeTypeNames

* gencudnn enum generation syntax fixes added

* Updated INCOMPLETES

* variable renaming added as per the review

* AlgorithmDescriptor syntax fixes added

* AlgorithmPerformance syntax fixes added

* Activation cudnnActivationDescriptor_t return method name change added

* syntax fixes added on FusedOpVariantParams

* FusedOpConsts syntax fixes added

* C type retrieve function added for cudnnStatus

* tensor file syntax fixes added
tensor file unreachable code removed

* method receiver renaming added

* optensor syntax fixes added

* generated_api syntax fixes added

* code review changes added

* go modules updated
algorithmdescriptor Algorithm type changes added

* review changes added
GetRNNLinLayerBiasParams & GetRNNLinLayerMatrixParams methods moved to manually written API.go file

* Fixed a bug in parse.py where when parsing the documentation for CUDA11, the function names have `()`

* Removed deprecated functions from being generated

* More deprecated stuff no longer generated

* Fixed up algorithmdescriptor.go

* fixed some auto generated issues

* Manually fixed the fused ops generation

* Fixed even more autogenerated errors

* Fixed up more of the auto generated issues

* Renamed API to todo, because eh, I'll figure it out later

Co-authored-by: Aruna Prabhashwara <wg.aruna.p@gmail.com>

v0.9.3

Toggle v0.9.3's commit message
Added some more documentation, and support for cuda 10.2

v0.9.2

Toggle v0.9.2's commit message
fixed the convolution.c import

use cuda 10.1

v0.9.1

Toggle v0.9.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Merge pull request #43 from gorgonia/go1.13CI

Added gomod

v0.9.0-beta

Toggle v0.9.0-beta's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Merge pull request #36 from gorgonia/v0.9.0-working

Added tests for illustrating the failing case of #35

v0.8.0

Toggle v0.8.0's commit message
Merge remote-tracking branch 'origin/master'

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载