+
Skip to content
View WildHoneyPie's full-sized avatar
  • Academia Sinica
  • Taipei, Taiwan
  • 14:38 (UTC +08:00)
  • Instagram blueburnband

Highlights

  • Pro

Block or report WildHoneyPie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2m…

Python 98 12 Updated Feb 28, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 56,898 5,149 Updated Jul 10, 2025
Python 608 63 Updated Jul 2, 2025

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

Python 37 2 Updated Jun 19, 2025

A simple library for Fréchet Audio Distance (FAD) calculation

Python 223 24 Updated May 26, 2025

Official Repository for "Training-Free Multi-Step Audio Source Separation"

Python 46 4 Updated May 26, 2025

Official Repository for "Music Source Restoration"

Python 26 2 Updated Jun 1, 2025

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,736 331 Updated Jan 4, 2024

Code required to reproduce the experiments from our paper "Beyond Spectrograms: Rethinking Audio Classification from EnCodec’s Latent Space"

Python 2 2 Updated Feb 24, 2025

Separate Anything in Audio with Zero Training

Python 39 1 Updated Jun 1, 2025

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 54,665 9,026 Updated May 30, 2025

SoTA open-source TTS

Python 9,281 1,094 Updated Jun 13, 2025

Open-source Multi-agent Poster Generation from Papers

Python 2,282 134 Updated Jun 17, 2025

Natural Language Web

Python 5,532 568 Updated Jul 10, 2025

c/ua is the Docker Container for Computer-Use AI Agents.

Python 8,939 404 Updated Jul 9, 2025

Gemma open-weight LLM library, from Google DeepMind

Python 3,500 483 Updated Jul 9, 2025

State-of-the-art pretrained music models for training, evaluation, inference

Python 116 10 Updated Jul 2, 2025

Source code and complementary material for "Keep what you need : extracting efficient subnetworks from large audio representation models".

Python 5 1 Updated Feb 25, 2025

SALMONN family: A suite of advanced multi-modal LLMs

1,278 100 Updated Jul 8, 2025

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,794 135 Updated Apr 21, 2025
TypeScript 25,792 1,756 Updated Jul 5, 2025

DSPy: The framework for programming—not prompting—language models

Python 26,246 2,021 Updated Jul 9, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 2,666 265 Updated Jun 27, 2025

Use any LLMs (Large Language Models) for Deep Research. Support SSE API and MCP server.

JavaScript 3,504 880 Updated Jul 4, 2025

Codes for ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention

Python 110 23 Updated Apr 7, 2024

Chordify Annotator Subjectivity Dataset - A chord-Label harmony dataset with multiple reference annotations per song

Python 61 6 Updated Jun 14, 2019

ChoCo: the Chord Corpus

Jupyter Notebook 88 7 Updated May 15, 2025

飞书/Lark官方 OpenAPI MCP

TypeScript 143 25 Updated Jul 2, 2025
Next
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载