Rishi Swethan rishiswethan

🎯

Focusing

In love with code

48 followers · 7 following

Serna.ai
India
UTC +05:30
in/rishi-swethan
https://medium.com/@rishiswethan.c.r

Stars

innat / VideoMAE

[NeurIPS'22] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Jupyter Notebook · 22 stars · 3 forks · Updated Jan 19, 2024

rishiswethan / Cancer-detection-using-CNN

This CNN diagnoses breast cancer from eosin-stained images. The model was trained on 400 images and has an accuracy of 80%.

Python · 65 stars · 37 forks · Updated Apr 15, 2023

JuanBindez / pytubefix

Python3 library for downloading YouTube Videos.

Python · 1,349 stars · 173 forks · Updated Oct 18, 2025

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python · 2,080 stars · 128 forks · Updated Aug 7, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python · 29,045 stars · 3,469 forks · Updated Jan 26, 2025

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python · 3,079 stars · 282 forks · Updated Jun 4, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python · 5,930 stars · 562 forks · Updated Feb 26, 2025

open-mmlab / mmyolo

OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.

Python · 3,316 stars · 605 forks · Updated Jul 14, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 60,411 stars · 10,638 forks · Updated Oct 19, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python · 23,770 stars · 2,643 forks · Updated Aug 12, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python · 9,075 stars · 930 forks · Updated Aug 12, 2024

cool-xuan / BN-WVAD

The official implementation of "Divergence of Features and Mean: A BatchNorm-based Abnormality Criterion for Weakly Supervised Video Anomaly Detection"

Python · 64 stars · 15 forks · Updated Nov 30, 2023

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python · 2,417 stars · 197 forks · Updated May 14, 2025

zai-org / CogVLM

A state-of-the-art-level open visual language model | Multimodal pre-trained model

Python · 6,678 stars · 443 forks · Updated May 29, 2024

rishiswethan / Diabetic-Retinopathy-Detection-Retinal-Vessel-Segmentation

Classification of Fundus Images into 5 stages of Diabetic Retinopathy, and segmentation of blood vessels in fundus images

Python · 16 stars · 2 forks · Updated Sep 18, 2023

voxel51 / fiftyone

Refine high-quality datasets and visual AI models

Python · 9,962 stars · 676 forks · Updated Oct 18, 2025

capjamesg / zero-shot-crack-detection

Zero-shot crack detection with SAM and Grounding DINO.

Python · 4 stars · 1 fork · Updated Nov 9, 2023

luigifreda / plvs

PLVS is a real-time SLAM system with points, lines, volumetric mapping and 3D unsupervised incremental segmentation.

C++ · 524 stars · 79 forks · Updated Sep 21, 2025

mit-han-lab / efficientvit

Efficient vision foundation models for high-resolution generation and perception.

Python · 3,104 stars · 230 forks · Updated Sep 5, 2025

Deci-AI / super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook · 4,929 stars · 565 forks · Updated Sep 17, 2024

rishiswethan / Video-Audio-Face-Emotion-Recognition

The repo contains an audio emotion detection model, a facial emotion detection model, and a model that combines the two to predict emotions from a video.

Jupyter Notebook · 87 stars · 24 forks · Updated Sep 13, 2023

brianhill11 / ABPImputation

Package for imputing the arterial blood pressure (ABP) waveform from non-invasive physiological waveforms (PPG & ECG) using a deep neural network

Python · 33 stars · 6 forks · Updated Jul 24, 2022

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python · 6,844 stars · 502 forks · Updated May 31, 2024

brjathu / LART

Code repository for the paper "On the Benefits of 3D Pose and Tracking for Human Action Recognition", (CVPR 2023)

Jupyter Notebook · 282 stars · 31 forks · Updated Jan 19, 2024

qianqianwang68 / omnimotion

Python · 2,247 stars · 131 forks · Updated Jun 11, 2024

Cadene / pretrained-models.pytorch

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Python · 9,110 stars · 1,827 forks · Updated Apr 22, 2022

facebookresearch / ImageBind

ImageBind: One Embedding Space to Bind Them All

Python · 8,825 stars · 829 forks · Updated Oct 3, 2025

keshavbhatt / whatsie

Feature rich WhatsApp Client for Desktop Linux

C++ · 2,619 stars · 73 forks · Updated Nov 1, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook · 52,159 stars · 6,104 forks · Updated Sep 18, 2024

open-mmlab / mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Python · 9,304 stars · 2,772 forks · Updated Aug 13, 2024
