Instructions to use OpenMOSS-Team/MOVA-720p with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use OpenMOSS-Team/MOVA-720p with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("OpenMOSS-Team/MOVA-720p", dtype=torch.bfloat16, device_map="cuda")

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]
- Notebooks
- Google Colab
- Kaggle
Somebody please add support for the MLX framework on Apple devices
#4
by Narutoouz - opened
Thank you for open-sourcing this model. This will accelerate the development of video-audio generation models.
If they implement it in ComfyUI, everything will be supported, because everyone will then focus on this model.
Hi, you can use this: https://github.com/HM-RunningHub/ComfyUI_RH_MOVA
This just wraps your inference code. It still requires 48 GB of VRAM and 96 GB of RAM. A native ComfyUI implementation is required to make it work with 16 GB of VRAM and 32 GB of RAM.
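To make the memory figures above concrete, here is a minimal sketch that checks which setup a given machine could run. The 48 GB / 96 GB numbers are the ones quoted for the wrapper; the 16 GB / 32 GB numbers are only the target hoped for from a native ComfyUI implementation, not a measured requirement. The `MemoryProfile` class and the `fits` and `usable_profiles` helpers are hypothetical names used for illustration and are not part of MOVA or ComfyUI.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MemoryProfile:
    """A named pair of VRAM and system RAM requirements, in GB."""
    name: str
    vram_gb: int
    ram_gb: int


# Figures quoted in this thread for the existing wrapper node.
WRAPPER = MemoryProfile("ComfyUI_RH_MOVA wrapper", vram_gb=48, ram_gb=96)

# Assumption: a hoped-for target for a native port, not a measurement.
NATIVE_TARGET = MemoryProfile("hypothetical native ComfyUI port", vram_gb=16, ram_gb=32)


def fits(profile: MemoryProfile, vram_gb: int, ram_gb: int) -> bool:
    """Return True if the machine meets both the VRAM and RAM requirement."""
    return vram_gb >= profile.vram_gb and ram_gb >= profile.ram_gb


def usable_profiles(vram_gb: int, ram_gb: int) -> list[str]:
    """List the names of the profiles that the given machine satisfies."""
    return [p.name for p in (WRAPPER, NATIVE_TARGET) if fits(p, vram_gb, ram_gb)]


if __name__ == "__main__":
    # A 24 GB GPU with 64 GB of RAM falls short of the wrapper's requirements.
    print(usable_profiles(24, 64))
```

Under these assumed figures, a typical 24 GB consumer GPU only qualifies for the hypothetical native target, which is the gap this comment is pointing at.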