FunAudioLLM

Popular repositories

  1. CosyVoice (public)

    Multilingual large voice generation model with full-stack support for inference, training, and deployment.

    Python, 21.3k stars, 2.5k forks

  2. SenseVoice (public)

    Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.

    Python, 8.2k stars, 752 forks

  3. ThinkSound (public)

    [NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

    Python, 1.4k stars, 82 forks

  4. FunMusic (public)

    A fundamental toolkit designed for music, song, and audio generation.

    Python, 1.4k stars, 137 forks

  5. Fun-ASR (public)

    End-to-end speech recognition large model: 31 languages, dialects, accents, lyrics, hotwords, timestamps, speaker diarization. Trained on tens of millions of hours.

    Python, 1.2k stars, 115 forks

  6. Fun-Audio-Chat (public)

    Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.

    Python, 947 stars, 101 forks
