Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Paper • 2501.07783 • Published Jan 14, 2025 • 8
OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 3
OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 5
OpenGVLab/PIIP-LLaVA-Plus_ConvNeXt-L_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 3
OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_13B Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 2
OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_13B Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 1