Miscellaneous - a GayatriValley Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

GayatriValley 's Collections

Miscellaneous

updated Mar 2

Running on Zero

Agents

Featured

793

Unique3D

⚡

793

Create a 1M faces 3D colored model from an image!
Running on Zero

Agents

53

Paligemma Doc

📚

53

Try PaliGemma on document understanding tasks
wangfuyun/PCM_Weights

Text-to-Image • Updated Oct 30, 2024 • 32 • 99
Running on Zero

Agents

467

Stable Audio Open Zero

🔥

467

Generate custom audio clips from text prompts
Paused

Agents

Featured

315

PaliGemma Demo

🤲

315

Annotate and describe images with text prompts
atcsecure/dolphin-2.9.2-qwen72b-8.0bpw-h8-exl2

Text Generation • Updated Jun 9, 2024 • 4 • 2
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 234k • 3.31k
DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • 8B • Updated Aug 13, 2024 • 354 • 40
SakanaAI/DiscoPOP-zephyr-7b-gemma

Text Generation • 9B • Updated Jun 13, 2024 • 23 • • 36
madebyollin/taesd3

Updated Jun 14, 2024 • 1.15k • 39
hpcai-tech/OpenSora-VAE-v1.2

0.4B • Updated Jun 17, 2024 • 8.38k • 57
Running

Agents

Featured

84

NaRCan

💊

84

Edit your video with text prompts and style control
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF

Text Generation • 73B • Updated Aug 2, 2024 • 289 • 13
Build error

Agents

Featured

93

DiffIR2VR

👌

93

Video upscaler/restorer
CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 60 • 480
dphn/dolphin-vision-72b

Text Generation • 73B • Updated Jul 16, 2024 • 79 • 134
Runtime error

Agents

Featured

72

Florence-2 for Videos

🎬

72

Annotate videos with object boxes and labels using captions
Running on Zero

Agents

131

FLUX.1-dev + Captioner

🐨

131

Generate images from captions or enhance prompts with AI
Running on Zero

Agents

Featured

368

Video Transcription Smart Summary

⚡

368

Generate transcription and summary from uploaded videos
qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 71 • 112
Runtime error

Agents

Featured

124

nanoLLaVA-1.5

🚀

124

Chat about images by uploading them
zai-org/codegeex4-all-9b

Text Generation • 9B • Updated Jul 18, 2024 • 11.4k • 269
Sleeping

10

Langflow Crewai

💻

10

Build and run language models visually
Running on Zero

Agents

Featured

994

Tile Upscaler

🚀

994

Upscale and enhance images with tile‑aware AI
Running

Featured

223

Whisper Timestamped

🕒

223

In-browser speech recognition w/ word-level timestamps
Running on Zero

Agents

Featured

2.11k

IDM VTON

👕

2.11k

High-fidelity Virtual Try-on
deepseek-ai/DeepSeek-V2-Chat-0628

Text Generation • 236B • Updated Jul 18, 2024 • 3.25k • 178
TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF

27B • Updated Jul 14, 2024 • 575 • 73
fal/AuraFlow

Text-to-Image • Updated Jul 18, 2024 • 264 • • 654
xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 129k • 1.76k
TheBloke/MythoMax-L2-13B-GPTQ

13B • Updated Sep 27, 2023 • 251 • 220
Gryphe/MythoMax-L2-13b

Text Generation • Updated Apr 21, 2024 • 3.44k • • 386
Gryphe/Pantheon-RP-1.0-8b-Llama-3

Text Generation • 8B • Updated May 13, 2024 • 17 • • 51
Gryphe/Tiamat-8b-1.2-Llama-3-DPO

Text Generation • 8B • Updated May 3, 2024 • 5 • 6
BeaverLegacy/Smegmma-9B-v1

Text Generation • 10B • Updated Jul 13, 2024 • 11 • 51
mradermacher/Nymph_8B-i1-GGUF

8B • Updated Aug 2, 2024 • 175 • 2
Runtime error

Agents

29

MusiConGen

🪩

29
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Text Generation • 8B • Updated Sep 14, 2024 • 4.15k • • 204
FunAudioLLM/SenseVoiceSmall

Automatic Speech Recognition • Updated 6 days ago • 5.1k • 408
Running on Zero

MCP

26

Video-to-Audio Ldm

🎧

26

Video-to-Audio Generation with Hidden Alignment
CofeAI/Tele-FLM-1T

Text Generation • Updated Jan 10 • 1.12k • 82
maxin-cn/Cinemo

Image-to-Video • Updated Aug 14, 2024 • 5 • 32
Runtime error

Agents

Featured

204

Cinemo

🎥

204

Multimodal Image-to-Video
Runtime error

Agents

20

Mms Zeroshot

🌍

20

Transcribe audio in any language using text data
Running on Zero

Agents

Featured

56

AccDiffusion

🏆

56

Generate high‑quality images from text prompts
Running on Zero

Agents

Featured

185

Artist

🎨

185

Aesthetically Controllable Text-Driven Stylization w/o Train
Running on Zero

Agents

95

EchoMimic

🐨

95

Generate lifelike video animations from images and audio
HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 341k • 304
parler-tts/parler-tts-mini-v1

Text-to-Speech • 0.9B • Updated Nov 25, 2024 • 28.9k • 153
parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 11.1k • 273
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • 8B • Updated Nov 20, 2024 • 5.79k • 171
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 702k • • 13k
Running on Zero

Agents

214

CatVTON

🐈

214

Try on clothes virtually on a photo using diffusion models
wanglab/ecg-fm

Updated May 5, 2025 • 16
XLabs-AI/flux-lora-collection

Text-to-Image • Updated Aug 14, 2024 • 587
Runtime error

Agents

58

Vgg Heads

🖼

58
migtissera/Tess-3-Mistral-Nemo-12B

12B • Updated Sep 4, 2024 • 17 • 13
nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 87 • 107
DAMO-NLP-SG/VideoLLaMA2-72B

Visual Question Answering • 75B • Updated Aug 14, 2024 • 17 • 10
answerdotai/answerai-colbert-small-v1

33.4M • Updated Feb 14 • 320k • 160
mlabonne/Hermes-3-Llama-3.1-8B-lorablated-GGUF

8B • Updated Aug 16, 2024 • 4.09k • 33
labotollama3/lobotollama-5.5b

Text Generation • 6B • Updated Apr 22, 2024 • 6 • 4
Mozilla/whisperfile

Updated Oct 2, 2024 • 2.66k • 256
Runtime error

Agents

45

FAI Fuzer Medium v0.3

🎨

45

Generate enhanced images by blending foreground with custom backgrounds
ZhengPeng7/BiRefNet

Image Segmentation • 0.2B • Updated Feb 4 • 945k • 587
Running on CPU Upgrade

Agents

10.1k

Kolors Virtual Try-On

👕

10.1k

Generate virtual try‑on images of clothes on a person
fal/AuraFace-v1

Updated Aug 26, 2024 • 149
dphn/dolphin-2.9.4-gemma2-2b

3B • Updated Aug 27, 2024 • 18 • 38
pzc163/MiniCPMv2_6-prompt-generator

Updated Aug 24, 2024 • 44 • 49
Running on Zero

Agents

1.04k

CogVideoX-5B

🎥

1.04k

Text-to-Video
yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • 4B • Updated Sep 6, 2024 • 21 • 129
InstantX/FLUX.1-dev-Controlnet-Union

Updated Aug 26, 2024 • 66.6k • 478
Running on Zero

Agents

Featured

88

Qwen2-VL-2B

🔥

88

Answer questions about uploaded images or videos
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12, 2025 • 3.78M • 506
Running

Agents

Featured

59

Groq Gradio Voice Assistant

👁

59

Turn spoken words into AI chat responses
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 1 • 30
facebook/sapiens

Updated Sep 20, 2024 • 15 • 246
Runtime error

Agents

28

Tb Ocr

📈

28

Convert image text to markdown format
YuWangX/memoryllm-8b-chat

10B • Updated Nov 17, 2024 • 792 • 20
Running

Agents

211

HivisionIDPhotos

🌖

211

Generate passport‑ready ID photos from a portrait
virtuals-protocol/mario-videogamegen

Updated Sep 6, 2024 • 13
Running on Zero

Agents

269

Qwen2-VL-7B

🔥

269

Answer questions about uploaded images
Running on Zero

Agents

Featured

282

Latent Navigation

🪐

282

Travel through the model latent space
mattshumer/Reflection-Llama-3.1-70B

Text Generation • 71B • Updated Sep 24, 2024 • 97 • • 1.71k
Running on Zero

Agents

Featured

115

ViewCrafter

🐨

115

Generate a novel-view video from a single image
Runtime error

Agents

18

Text Image Analyzer

💻

18

Analyse any image with Llama3.2
vidore/colqwen2-v0.1

Visual Document Retrieval • Updated Mar 21, 2025 • 7.57k • 195
Runtime error

Agents

12

Llama 3.2 Vision Free

🐢

12
facebook/Self-taught-evaluator-llama3.1-70B

Updated Sep 30, 2024 • 42
openai/clip-vit-large-patch14-336

Zero-Shot Image Classification • Updated Oct 4, 2022 • 3.39M • 307
jasperai/Flux.1-dev-Controlnet-Upscaler

Image-to-Image • Updated Mar 22, 2025 • 2.03k • 867
Running on Zero

Agents

Featured

324

Diffusers Image Fill

🏃

324

Fill and edit images using masks
Runtime error

Agents

37

PDF to Page Images Dataset

📂

37

Convert PDFs to individual page images
Runtime error

Agents

Featured

73

ColPali fine-tuning Query Generator

🔍

73

Generate document retrieval queries from a page image
Runtime error

Agents

10

Vision Pipeline

🌍

10

Answer questions about uploaded images and documents
nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14, 2025 • 138k • 776
Running on Zero

Agents

1.02k

Whisper Turbo

🤯

1.02k

Transcribe audio or YouTube videos into text
davanstrien/ufo-ColPali

Viewer • Updated Sep 23, 2024 • 2.24k • 241 • 26
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 39 • 73
Build error

Agents

214

OpenMusic

🎶

214

Generate music from text descriptions
Running

Agents

456

PDF2Audio

📚

456

Generate audio‑ready script from documents
Sleeping

Agents

236

Ultrapixel-demo

😻

236

Ultra-high resolution image synthesis
PleIAs/OCRonos-Vintage

Text Generation • 0.1B • Updated Aug 8, 2024 • 340 • 84
Runtime error

Agents

275

EzAudio

🟣

275

Generate or edit realistic audio from text prompts
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 147k • 1.54k
Running on CPU Upgrade

Agents

1.02k

Open VLM Leaderboard

🌎

1.02k

VLMEvalKit Evaluation Results Collection
Build error

Agents

65

ArxivCopilot

🏢

65

Generate personalized research profiles and chat with Arxiv Copilot
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 441
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 6.95k • 381
ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 614 • 418
fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 867 • 457
bartowski/Reflection-Llama-3.1-70B-GGUF

Text Generation • 71B • Updated Sep 7, 2024 • 1.07k • 53
lelapa/InkubaLM-0.4B

Text Generation • Updated Sep 5, 2024 • 1.23k • 62
Running

146

Qwen 2.5 Code Interpreter

🐍

146

Run code and get instant results with Qwen Code Interpreter
Runtime error

Agents

312

Virtual Try On

👕

312

High-fidelity Virtual Try-on
Runtime error

Agents

36

Ferret Demo

📚

36

Describe image contents with prompts
Running

Agents

64

ColPali 🤝 Vespa - Visual Retrieval

👀

64

Visual Retrieval with ColPali and Vespa
oxyapi/oxy-1-small

Text Generation • 15B • Updated Apr 30, 2025 • 222 • • 85
QuantFactory/MN-Chunky-Lotus-12B-GGUF

12B • Updated Dec 4, 2024 • 2k • 4
Running

25

ScholarCopilot

📊

25

Using RAG LLM to assist your academic writing
Running on Zero

Agents

616

Leffa

👗

616

Generate realistic person images with new clothes or poses
Lightricks/LTX-Video

Image-to-Video • Updated Jul 16, 2025 • 471k • 2.19k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs