PuLID-FLUX
Generate customized images from text and reference photos
Generate 3D models from images
Inpaint images with text prompts and custom masks
Generate music from text descriptions
Upscale low-resolution images to high-resolution with AI
Personalised Podcasts For All - Available in 13 Languages
Import a portrait, click to move the head!
Efficient T2V generation
Co-Speech Gesture Video Generation (ICLR 2025 Oral)
Generate images from text prompts
Generate music from text and optional melody
8B-parameter transformer model distilled from FLUX.1-dev
Detect human poses in images and videos
Generate 3D models from images
Make Custom Voices With KokoroTTS
In-browser unified multimodal understanding and generation.
Generate music from lyrics and genre tags
Remove background from images and videos
Audio Gen, Audio Style Transfer and Audio InPainting
Chat with an AI using text and images for visual answers
Generate images from text prompts
OmniParser: turn your LLM into a GUI agent
A Generalist Diffusion Model for Vision Perception
Blazingly Fast and Embarrassingly Simple Song Generation
Large Avatar Model for One-shot Animatable Gaussian Head
Generate realistic dialogue from a script, using Dia!
Ultra-fast video model, LTX 0.9.8 13B distilled
Demo for multimodal understanding and generation
Multimodal Instruction-based Editing and Generation
Fast 4 step inference with Qwen Image Edit 2509
Track and label objects in videos using text prompts or clicks
Generate sharp, focused images from blurry photos with interactive refocusing