Why do LLaVA Vision-Language Models Reply to Images in English? Paper • 2407.02333 • Published Jul 2, 2024
Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review Paper • 2502.19614 • Published Feb 26, 2025
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs Paper • 2406.19593 • Published Jun 28, 2024
Training-Free Mitigation of Language Reasoning Degradation After Multimodal Instruction Tuning Paper • 2412.03467 • Published Dec 4, 2024 • 1