OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis
Paper
•
2501.04561
•
Published
•
17
This repository contains the model presented in OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis.
Project page: https://github.com/RainBowLuoCS/OpenOmni