Stars
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Live-Transcription (STT) with Whisper PoC
Visualizer for neural network, deep learning and machine learning models
OpenAI Whisper ASR Webservice API
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
Faster Whisper transcription with CTranslate2
Real-time & local speech-to-text server.
Whisper realtime streaming for long speech-to-text transcription and translation
Port of OpenAI's Whisper model in C/C++
Production First and Production Ready End-to-End Speech Recognition Toolkit
Takes a number and converts it to Persian word form
Master programming by recreating your favorite technologies from scratch.
Text normalization for the Farsi (Persian) language.
pyclustering is a Python, C++ data mining library.
A Deep-Learning-Based Persian Speech Recognition System
A Persian LaTeX template for assignments, exams, and quizzes; not suitable for long documents such as theses or project reports.
Python script that helps enable or disable the shecan.ir DNS service
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Speech-to-text benchmark framework
Implementation of audio degradation processes
This project extends PureData to build a telephone simulator.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
This repository contains the SpeechBrain Benchmarks
🥇 A curated list of competitive math resources.
This SDK is now deprecated; use the new unified Google GenAI SDK.
LaTeX template for BSc/MSc/PhD theses of the University of Tehran
Fine-tune Wav2vec2, an ASR model released by Facebook