GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 4 items • Updated 13 days ago
k2SSL Collection A Faster and Better Framework for Self-Supervised Speech Representation Learning • 5 items • Updated 13 days ago
SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing Paper • 2601.09385 • Published 19 days ago
CLSP Collection Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 13 days ago
CLSP Collection Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training • 4 items • Updated 13 days ago