Shanghai Innovation Institute (SII) · Fudan University · MOSI.AI
Open, collaborative research on Large Language Models and Multimodal Foundation Models.
OpenMOSS is a research group led by Prof. Xipeng Qiu, hosted at the Shanghai Innovation Institute (SII) and working in close collaboration with Fudan University and MOSI.AI. We conduct cutting-edge research across the full LLM stack — from model architecture and training to evaluation, interpretability, and real-world applications — with a strong commitment to open and reproducible science.
| Direction | Flagship Repositories |
|---|---|
| 🧠 Foundation LLMs | MOSS · BandPO |
| 👁️ Vision & Video | MOSS-VL · MOSS-Video-Preview · MOVA |
| 🔊 Speech & Audio | MOSS-TTS · MOSS-TTS-Nano · MOSS-TTSD · MOSS-Audio · MOSS-Speech · MOSS-Audio-Tokenizer |
| 🤖 Embodied AI & Robotics | Awesome-WAM · Embodied-Planner-R1 · RoboOmni · FRoM-W1 |
| 🔍 Interpretability | Llamascopium (formerly Language-Model-SAEs) |
- MOSS-TTS-Nano — 0.1B-param multilingual TTS, runs on CPU, 3.2k★
- MOSS-Audio — Unified audio understanding foundation model (4B / 8B, Instruct & Thinking variants)
- MOVA — Scalable and synchronized video–audio generation
- MOSS-VL — Multimodal model series with XRoPE architecture, full training stack open-sourced
- Awesome-WAM — Curated reading list for World Action Models in embodied AI
See the pinned repositories for quick access, or browse all 50+ repositories.
We welcome researchers, students, and collaborators who share our vision. For PhD/intern openings, research collaborations, or general inquiries, please reach us at openmoss@sii.edu.cn.
The Shanghai Innovation Institute (SII) is dedicated to fostering innovation in education and research in the field of artificial intelligence.