sleeping
LLM inference optimization
-
Tiktok
- Sunnyvale, CA
-
04:25
(UTC -07:00) - https://xsxszab.github.io/
- in/yifei--wang
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++ Apache License 2.0 UpdatedAug 29, 2025 -
lock_free_cuckoo_filter Public
A lock-free cuckoo filter implementation
-
-
-
-
-
-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Other UpdatedOct 25, 2020 -
first-contributions Public
Forked from firstcontributions/first-contributions🚀✨ Help beginners to contribute to open source projects
MIT License UpdatedOct 20, 2020 -
-
-
CPD-Pytorch Public
Pytorch implementation of CPD net(CVPR2019) for salient object detection.
-
Amulet-Pytorch Public
Pytorch implementation of Amulet(ICCV 2017).
-
DHSNet-Pytorch Public
Pytorch implementation of DHSNet(CVPR2016)