Highlights
- Pro
-
asmjit Public
Forked from asmjit/asmjitComplete x86/x64 JIT and Remote Assembler for C++
C++ zlib License UpdatedSep 3, 2025 -
xbyak Public
Forked from herumi/xbyakA JIT assembler for x86/x64 architectures supporting FPU, MMX, SSE (1-4), AVX (1-2, 512), APX, and AVX10.2
C++ BSD 3-Clause "New" or "Revised" License UpdatedSep 2, 2025 -
PyTurboJPEG Public
PyTurboJPEG is a highly optimized Python wrapper of libjpeg-turbo (TurboJPEG API) which supports x86 and ARM architecture.
-
good-kernels Public
Forked from ScalingIntelligence/good-kernelsSamples of good AI generated CUDA kernels
Python UpdatedMay 30, 2025 -
effective_transpose Public
Forked from simveit/effective_transposeEffective transpose on Hopper GPU
Cuda UpdatedMay 2, 2025 -
-
-
rmm Public
Forked from rapidsai/rmmRAPIDS Memory Manager
C++ Apache License 2.0 UpdatedJul 16, 2024 -
cuspatial Public
Forked from rapidsai/cuspatialCUDA-accelerated GIS and spatiotemporal algorithms
Jupyter Notebook Apache License 2.0 UpdatedMay 17, 2024 -
opencv Public
Forked from opencv/opencvOpen Source Computer Vision Library
-
-
-
Clipper2 Public
Forked from AngusJohnson/Clipper2Polygon Clipping and Offsetting - C++, C# and Delphi
Pascal Boost Software License 1.0 UpdatedAug 14, 2023 -
rocPRIM Public
Forked from ROCm/rocPRIMROCm Parallel Primitives
C++ MIT License UpdatedJul 25, 2023 -
cub Public
Forked from NVIDIA/cubCooperative primitives for CUDA C++.
Cuda BSD 3-Clause "New" or "Revised" License UpdatedJul 18, 2023 -
darknet Public
Forked from pjreddie/darknetConvolutional Neural Networks
-
thrust Public
Forked from NVIDIA/thrustThe C++ parallel algorithms library.
C++ Other UpdatedJun 21, 2023 -
tensor-cores-numerical-behavior Public
Forked from north-numerical-computing/tensor-cores-numerical-behaviorTest suite for probing the numerical behavior of NVIDIA tensor cores
Cuda GNU General Public License v2.0 UpdatedMar 4, 2023 -
numba-pr-7621 Public
Forked from gmarkall/numba-pr-7621For reproducing issues with https://github.com/numba/numba/pull/7621
Cuda UpdatedApr 20, 2022 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ BSD 3-Clause "New" or "Revised" License UpdatedApr 8, 2022 -
jemalloc Public
Forked from aerospike/jemallocAerospike Fork of the JEMalloc Memory Allocator
C Other UpdatedMar 23, 2022 -
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamplesCUDA Library Samples
Cuda Other UpdatedMar 15, 2022 -
vivict Public
Forked from vivictorg/vivictAn easy to use in-browser tool for subjective comparison of the visual quality of different encodings of the same video source.
JavaScript MIT License UpdatedMar 11, 2022 -
yolov4-tiny-custom-training_LOCAL Public
Forked from techzizou/yolov4-tiny-custom-training_LOCALPython UpdatedMar 5, 2022 -
oneAPI-samples Public
Forked from oneapi-src/oneAPI-samplesSamples for Intel oneAPI toolkits
HTML MIT License UpdatedMar 1, 2022 -
-
code-samples Public
Forked from NVIDIA-developer-blog/code-samplesSource code examples from the Parallel Forall Blog
HTML BSD 3-Clause "New" or "Revised" License UpdatedJan 20, 2022 -
moderngpu Public
Forked from moderngpu/moderngpuPatterns and behaviors for GPU computing
C++ Other UpdatedDec 26, 2021 -
SYCL-For-CUDA-Examples Public
Forked from codeplaysoftware/SYCL-For-CUDA-ExamplesExamples for using SYCL on CUDA
C++ Apache License 2.0 UpdatedJun 30, 2021 -