FunAudioLLM

CosyVoice Public

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

SenseVoice Public

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Python 8.6k 783

ThinkSound Public

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1.4k 82

FunMusic Public

A fundamental toolkit designed for music, song, and audio generation

Python 1.4k 139

Fun-ASR Public

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

Python 1.3k 126

Fun-Audio-Chat Public

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 971 105

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FunAudioLLM

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!