Kreshnik's Collections
OCR
3D
Language
Image
Voice
Papers
Model training

Voice

updated 3 days ago

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated 7 days ago • 329k downloads • 2.2k likes

  • FastVLM WebGPU 🍎

    Space • Running • Featured • 430 likes

    Real-time video captioning powered by FastVLM


  • openbmb/VoxCPM-0.5B

    Text-to-Speech • Updated Sep 19, 2025 • 1.12k downloads • 787 likes

  • MiMo-Audio-Chat 💬

    Space • Running on CPU Upgrade • 75 likes

    Chat with Xiaomi MiMo-Audio using voice


  • FlashLabs/Chroma-4B

    Any-to-Any • 6B • Updated about 21 hours ago • 5.78k downloads • 279 likes

  • numind/NuMarkdown-8B-Thinking

    Image-to-Text • 8B • Updated Nov 13, 2025 • 1.02M downloads • 373 likes