-
AMD
- Edinburgh
- https://www.linkedin.com/in/jatincj/
-
rocm-systems Public
Forked from ROCm/rocm-systemssuper repo for rocm systems projects
C++ UpdatedOct 15, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJul 15, 2025 -
composable_kernel Public
Forked from ROCm/composable_kernelComposable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
C++ Other UpdatedJun 11, 2025 -
Catch2 Public
Forked from catchorg/Catch2A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)
C++ Boost Software License 1.0 UpdatedJun 2, 2025 -
rocDecode Public
Forked from ROCm/rocDecoderocDecode is a high performance video decode SDK for AMD hardware
C++ Other UpdatedMay 29, 2025 -
rocPyDecode Public
Forked from ROCm/rocPyDecoderocPyDecode is a set of Python bindings to rocDecode C++ library which provides full HW acceleration for video decoding on AMD GPUs.
C++ MIT License UpdatedMay 21, 2025 -
rocPRIM Public
Forked from ROCm/rocPRIMROCm Parallel Primitives
C++ MIT License UpdatedJan 23, 2025 -
-
-
-
hipBLASLt Public
Forked from ROCm/hipBLASLthipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Assembly MIT License UpdatedSep 11, 2024 -
HIPIFY Public
Forked from ROCm/HIPIFYHIPIFY: Convert CUDA to Portable C++ Code
C++ MIT License UpdatedAug 12, 2024 -
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedMay 16, 2024 -
-
-
rocFFT Public
Forked from ROCm/rocFFTNext generation FFT implementation for ROCm
C++ Other UpdatedJan 25, 2024 -
ELFIO Public
Forked from serge1/ELFIOELFIO - ELF (Executable and Linkable Format) reader and producer implemented as a header only C++ library
C++ MIT License UpdatedJan 9, 2024 -
mixbench Public
Forked from ekondis/mixbenchA GPU benchmark tool for evaluating GPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP)
C++ GNU General Public License v2.0 UpdatedSep 11, 2023 -
-
-
rocBLAS Public
Forked from ROCm/rocBLASNext generation BLAS implementation for ROCm platform
C++ Other UpdatedJul 27, 2023 -
occa Public
Forked from libocca/occaPortable and vendor neutral framework for parallel programming on heterogeneous platforms.
C++ MIT License UpdatedJul 27, 2023 -
rocSPARSE Public
Forked from ROCm/rocSPARSENext generation SPARSE implementation for ROCm platform
C++ MIT License UpdatedJun 27, 2023 -
HIP : Convert CUDA to Portable C++ Code
C++ MIT License UpdatedNov 22, 2021 -
PyHIP Public
Forked from jatinx/PyHIPPython Interface to HIP and hiprtc Library
Python MIT License UpdatedNov 2, 2021 -
pal Public
Forked from GPUOpen-Drivers/palPlatform Abstraction Library
C++ MIT License UpdatedSep 2, 2021 -
dbcsr Public
Forked from cp2k/dbcsrDBCSR: Distributed Block Compressed Sparse Row matrix library
Fortran GNU General Public License v2.0 UpdatedJul 16, 2021 -
gloo Public
Forked from pytorch/glooCollective communications library with various primitives for multi-machine training.
C++ Other UpdatedJun 30, 2021 -
llvm-project Public
Forked from llvm/llvm-projectThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at…
UpdatedJun 25, 2021 -