FunAudioLLM

CosyVoice Public

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

SenseVoice Public

Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

Python 8.2k 752

ThinkSound Public

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1.4k 82

FunMusic Public

A fundamental toolkit designed for music, song, and audio generation

Python 1.4k 137

Fun-ASR Public

End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

Python 1.2k 115

Fun-Audio-Chat Public

Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

Python 947 101

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FunAudioLLM

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!