BETA · We are currently in beta. Should you encounter any issues, please do not hesitate to contact us.

MODELS · ONE SUBSCRIPTION, ALL OF THEM

Every model. One bill.

getvivix runs 213+ frontier AI models for video, image, and audio in a single studio — with the exact credit cost shown before every render. Pick any model to see what it does, or start free.

Start free — 30 credits

Video models · 74

Kling VIDEO 3.0 4K · 1,260+ cr

4K multimodal video generation with native audio and richer visual detail

kling-ai
Kling VIDEO O3 4K · 1,260+ cr

4K variant of Kling O3 with native audio for premium production

kling-ai
LTX-2 Fast · 720+ cr

High speed cinematic text to video with synced audio

lightricks
LTX-2 Pro · 1,080+ cr

Cinematic LTX-2 Pro text and image to video generator

lightricks
PixVerse V6 · 76+ cr

Multi-shot cinematic video generation with native audio, 20+ camera controls, and character consistency

pixverse
Seedance 1.5 Pro · 180+ cr

Native audio-visual cinematic AI video generation

bytedance
Seedance 2.0 · 1,050+ cr

Premium multimodal video generation with native audio and cinematic motion

bytedance
Seedance 2.0 Fast · 900+ cr

Speed-optimized Seedance 2.0 for rapid iteration with native audio

bytedance
Veo 3.1 Fast · 2,401+ cr

High speed Google Veo 3.1 Fast text to video generation

google
Wan2.7 · 601+ cr

Multimodal video generation with reference consistency, video editing, and native audio

alibaba
Aurora v1 · 421 cr

High-quality audio-driven avatar video generation

creatify
Aurora v1 Fast · 211 cr

Fast audio-driven avatar video generation

creatify
Grok Imagine Video · 900+ cr

AI video generation with synchronized audio from text and images

xai
HappyHorse 1.0 · 1,260+ cr

Alibaba text-to-video and image-to-video at 720p or 1080p with seeded generation and frame conditioning

alibaba
HeyGen Avatar IV · 301 cr

AI talking avatar video from a HeyGen avatar or your own photo, driven by a script or audio

heygen
HeyGen Avatar V · 301 cr

Talking digital twins with sharper identity and motion coherence

heygen
HeyGen Video Agent · 103 cr

AI-powered prompt-to-video production with avatars, B-roll, and motion graphics

heygen
Kling VIDEO 2.6 Pro · 1,050+ cr

Kling VIDEO 2.6 Pro is a full audio-visual AI video model that combines cinematic-quality video generation with native audio (dialogue, sound effects, ambience), with optional Motion Control for precise character movement via the API.

kling-ai
Kling VIDEO 3.0 Pro · 336+ cr

High-fidelity multimodal video generation with native audio and advanced editing

kling-ai
Kling VIDEO 3.0 Standard · 252+ cr

Multimodal video generation with native audio and efficient performance

kling-ai
Kling VIDEO O3 Pro · 336+ cr

Unified multimodal video generation with native audio and higher-fidelity renders

kling-ai
Kling VIDEO O3 Standard · 252+ cr

Cost-efficient multimodal video generation with native audio and editing

kling-ai
KlingAI 1.6 Pro · 970 cr

High fidelity image to video model for dynamic 1080p clips

kling-ai
KlingAI 1.6 Standard · 555 cr

Mid-tier KlingAI 1.6 Standard text to video model

kling-ai
KlingAI 2.0 Master · 2,773 cr

KlingAI 2.0 Master for high control AI video generation

klingai
KlingAI 2.1 Master · 2,773 cr

Premium KlingAI 2.1 Master for high fidelity video

kling-ai
KlingAI 2.1 Pro · 970 cr

KlingAI 2.1 Pro for cinematic AI video generation

klingai
KlingAI 2.1 Standard · 555 cr

KlingAI 2.1 Standard for faster AI video generation

kling-ai
KlingAI 2.5 Turbo Pro · 1,050 cr

Cinematic text to video and image to video at scale

klingai
KlingAI 2.5 Turbo Standard · 630 cr

Fast cinematic image to video generation for creators

klingai
KlingAI Avatar 2.0 Pro · 261 cr

High fidelity avatar video generation with smoother motion and quality

kling-ai
KlingAI Avatar 2.0 Standard · 132 cr

Expressive avatar video generation from image and audio

kling-ai
KlingAI Lip-Sync · 139+ cr

Accurate AI lip sync for character driven video content

kling-ai
LTX-2 · 450 cr

Open-source AI video model with synchronized audio and high-fidelity output

lightricks
LTX-2 Retake · 301 cr

Segmented AI video retakes with precise in-shot control

lightricks
LTX-2.3 · 240+ cr

High-fidelity multimodal video generation with native audio

lightricks
LTX-2.3 Fast · 180+ cr

Fast multimodal video generation optimized for rapid iteration

lightricks
MiniMax 01 Director · 841 cr

Cinematic text to video with precise camera control

minimax
MiniMax 01 Live · 841 cr

Anime video model for expressive character animation

minimax
MiniMax Hailuo 02 · 1,470+ cr

Cinematic AI video model for viral and commercial clips

minimax
MiniMax Hailuo 2.3 · 841+ cr

High fidelity AI video generation from text or images

minimax
MiniMax Hailuo 2.3 Fast · 571+ cr

Fast MiniMax Hailuo 2.3 model for short cinematic video

minimax
OmniHuman-1.5 · 3,975 cr

Cognitive avatar video from image, audio, and text

bytedance
P-Video · 15+ cr

Real-time AI video generation with draft mode and native audio

prunaai
P-Video-Animate · 90+ cr

Reference-image animation driven by the motion, timing, and camera movement of a source video

prunaai
PixVerse LipSync · 41 cr

Realistic AI lip sync from audio for any video

pixverse
PixVerse V3.5 · 897 cr

PixVerse V3.5 early text to video effects model

pixverse
PixVerse V4 · 897 cr

PixVerse V4 AI text to video with pro camera control

pixverse
PixVerse V4.5 · 897 cr

PixVerse V4.5 cinematic text and image to video model

pixverse
PixVerse V5 · 897 cr

PixVerse V5 cinematic text to video and image to video

pixverse
PixVerse V5 Fast · 283+ cr

Fast text to video and image to video generation for rapid iteration

pixverse
PixVerse V5.6 · 310+ cr

Enhanced cinematic video generation with improved lip-sync and audio realism

pixverse
Runway Gen-4 Turbo · 151+ cr

High speed Gen-4 Turbo image to video generation

runway
Runway Gen-4.5 · 1,800 cr

Advanced multimodal video generation with text and image input

runway
Seedance 1.0 Pro · 1,452 cr

Seedance 1.0 Pro high fidelity 1080p text and image to video

bytedance
Seedance 1.0 Pro Fast · 92+ cr

Fast Seedance 1.0 Pro video generation for dance content

bytedance
SkyReels V4 · 330+ cr

Multimodal video-audio foundation model with 1080p cinematic output, inpainting, and video extension

skywork
Sora 2 · 2,401 cr

Next generation AI video and audio model from OpenAI

openai
Sora 2 Pro · 7,200+ cr

Premium Sora 2 Pro model for high fidelity AI video

openai
sync-3 · 399 cr

Full-scene lip synchronization with global face understanding and obstruction handling

sync
Veo 2 · 7,500 cr

High fidelity text to video generation with camera control

google
Veo 3 · 4,801+ cr

Cinematic video generation, now with native audio

google
Veo 3 Fast · 2,401+ cr

Fast Google Veo 3 video generation with native audio

google
Veo 3.1 · 2,401+ cr

Veo 3.1 cinematic AI video with native audio

google
Vidu 2.0 · 330+ cr

Fast 1080p AI video generation with strong consistency

vidu
Vidu Q1 · 660 cr

Vidu Q1 high fidelity reference to video generation model

vidu
Vidu Q2 Pro · 1,651 cr

High fidelity Vidu Q2 Pro model for cinematic AI video

vidu
Vidu Q2 Turbo · 495 cr

Faster Vidu Q2 video generation with advanced motion control

vidu
Vidu Q3 · 137+ cr

Multimodal video generation with native audio and intelligent shot planning

vidu
Vidu Q3 Turbo · 390+ cr

Low-latency multimodal video generation with native audio

vidu
Wan2.2 A14B · 1,350 cr

MoE video generation from text or images at 480p to 720p

alibaba
Wan2.5-Preview · 1,419+ cr

Wan2.5-Preview AI Text to Video with Native Audio

alibaba
Wan2.6 · 1,500+ cr

Multimodal video generation with multi-shot and native sound

alibaba
Wan2.6 Flash · 76+ cr

Fast distilled image-to-video generation model

alibaba

Image models · 83

GPT Image 1.5 · 27+ cr

GPT Image 1.5 flagship image model with faster generation and enhanced editing

openai
GPT Image 2 · 1+ cr

OpenAI GPT Image 2 — high-fidelity generation and editing with up to 16 reference images

openai
Grok Imagine Image Pro · 211+ cr

High fidelity AI image generation and editing with improved prompt control

xai
Kling IMAGE 3.0 · 84 cr

2K to 4K image generation with improved realism and practical image-to-image editing

kling-ai
Kling IMAGE O3 · 84+ cr

4K Omni image generation with strong consistency and reference control

kling-ai
Nano Banana 2 · 207+ cr

Gemini 3.1 Flash Image fast high quality AI image generation and editing

google
Seedream 5.0 Lite · 106 cr

Responsive text-to-image generation with real-time search and precise prompt adherence

bytedance
Wan2.7 Image · 90 cr

Unified image generation and editing with avatar customization, color control, and multilingual text rendering

alibaba
Wan2.7 Image Pro · 225 cr

Premium image generation with enhanced composition stability and precise prompt comprehension

alibaba
Bria 3.2 · 120 cr

Commercial-safe text to image model for production use

bria
Bria FIBO · 120 cr

Deterministic JSON native text to image for enterprises

bria
Bria FIBO Edit · 120 cr

Instruction-driven image editing with mask support

bria
Bria Fibo Edit Tools · 120 cr

Unified image editing foundation for recolor, relight, restore, blend, reseason, and sketch

bria
DALL·E 2 · 48 cr

DALL·E 2 AI image generator for text guided creation

openai
DALL·E 3 · 240 cr

DALL·E 3 high fidelity text to image generation API

openai
Exactly Bold Chromatics · 585 cr

Vibrant, high-contrast illustrative style with bold color palettes

exactly
Exactly Bright Pulse · 585+ cr

Bright, energetic photographic style with vivid lighting

exactly
Exactly Dark Comics · 585 cr

Dark, gritty comic art style with heavy shadows and noir aesthetics

exactly
Exactly Distant Reality · 585+ cr

Dreamy photographic style with surreal, distant atmosphere

exactly
Exactly Earthy Elegance · 585 cr

Warm, organic illustrative style with muted earth tones

exactly
Exactly Editorial Line · 585 cr

Clean, editorial-style line illustrations with refined detail

exactly
Exactly Extreme Contrast · 585+ cr

High-contrast photographic style with dramatic light and shadow

exactly
Exactly Grain Film Look · 585+ cr

Analog film photography style with natural grain and warm tones

exactly
Exactly Graphic Harmony · 585 cr

Balanced, harmonious graphic illustrations with cohesive composition

exactly
Exactly Graphic Novel · 585 cr

Comic book and graphic novel style with strong ink lines and dramatic shading

exactly
Exactly Graphite Creature · 585 cr

Textured graphite-style illustrations with creature and character focus

exactly
Exactly Journey · 585+ cr

Travel and adventure photographic style with rich, cinematic tones

exactly
Exactly Monochrome Café · 585 cr

Monochromatic illustrative style with warm café-inspired tones

exactly
Exactly Muted Modern · 585 cr

Contemporary illustrative style with soft, muted color palettes

exactly
Exactly Playful Line Adventures · 585 cr

Whimsical, playful line art with an adventurous character

exactly
Exactly Warm Light · 585+ cr

Soft, warm-lit photographic style with inviting golden tones

exactly
FLUX Virtual Try-On · 128+ cr

Low-latency virtual try-on for transferring garments onto a person image with strong identity and garment fidelity

black-forest-labs
FLUX.1 [dev] · 12+ cr

Open-weight 12B text to image model for rich visuals

black-forest-labs
FLUX.1 [schnell] · 4 cr

Ultra fast FLUX.1 text to image model for local use

black-forest-labs
FLUX.1 Kontext [dev] · 32 cr

Open image editing model for fast iterative workflows

black-forest-labs
FLUX.1 Kontext [max] · 240 cr

High fidelity FLUX.1 Kontext max for precise image edits

black-forest-labs
FLUX.1 Kontext [pro] · 120 cr

Context aware FLUX.1 image editing and generation model

black-forest-labs
FLUX.1 Krea [dev] · 30 cr

FLUX.1 Krea Dev for photorealistic open‑weight generation

black-forest-labs
FLUX.1.1 [pro] · 120 cr

FLUX.1.1 Pro high fidelity text to image generation

black-forest-labs
FLUX.1.1 [pro] Ultra · 180 cr

High speed 4MP FLUX image generation for production apps

black-forest-labs
FLUX.2 [dev] · 24 cr

FLUX.2 dev for controllable open text to image workflows

black-forest-labs
FLUX.2 [flex] · 180 cr

Configurable FLUX.2 Flex for precise text aligned images

black-forest-labs
FLUX.2 [klein] 4B · 2 cr

Fastest Klein model for real-time image generation and editing

black-forest-labs
FLUX.2 [klein] 4B Base · 6 cr

Compact undistilled model for efficient image generation and editing

black-forest-labs
FLUX.2 [klein] 9B · 3 cr

Ultra-fast image generation and editing with sub-second latency

black-forest-labs
FLUX.2 [klein] 9B Base · 13 cr

Undistilled foundation model for high-quality image generation and editing

black-forest-labs
FLUX.2 [klein] 9B KV · 3 cr

KV-cache accelerated image generation and editing for real-time multi-reference workflows

black-forest-labs
FLUX.2 [max] · 211+ cr

The latest state-of-the-art model from Black Forest Labs, generating images grounded in live web information.

black-forest-labs
FLUX.2 [pro] · 90 cr

High control FLUX.2 Pro image generation and editing

black-forest-labs
GPT Image 1 · 501 cr

GPT Image 1 high fidelity image generation for GPT-4o

openai
Grok Imagine Image · 60+ cr

AI image generation from text and images

xai
Grok Imagine Image Quality · 151+ cr

xAI's quality-focused image generation and editing — sharper realism, better text rendering, tighter prompt following

xai
HiDream-I1 Dev · 14 cr

HiDream-I1 Dev fast 17B text to image generation model

runware
HiDream-I1 Fast · 12+ cr

HiDream-I1 Fast for low latency text to image generation

runware
HiDream-I1 Full · 27 cr

HiDream-I1 Full high fidelity text to image generator

runware
Ideogram 2.0 · 240 cr

Ideogram 2.0 text to image model for sharp design work

ideogram
Ideogram 3.0 · 180 cr

Ideogram 3.0 text to image model for sharp design visuals

ideogram
Imagen 3 · 120 cr

High fidelity text to image generation with Imagen 3

google
Imagen 3 Fast · 60 cr

High speed Imagen 3 Fast model for rapid image generation

google
Imagen 4 Fast · 60 cr

High speed Imagen 4 Fast text to image generation

google
Imagen 4 Preview · 120 cr

High fidelity 2K text to image generation by Google

google
Imagen 4 Ultra · 180 cr

High fidelity text to image model with sharp typography

google
ImagineArt 1.5 Pro · 135 cr

Professional AI image generation with native 4K and refined visual control

imagineart
ImagineArt 2.0 · 151 cr

Reasoning-based text to image generation with vibrant true-to-life color

imagineart
Juggernaut Lightning Flux by RunDiffusion · 3+ cr

Ultra fast Flux-based model for high volume image generation

rundiffusion
Juggernaut Pro Flux by RunDiffusion · 11+ cr

Photorealistic Flux based text to image model for pros

rundiffusion
Kandinsky 5.0 Image Lite · 24 cr

Efficient text-to-image and image-to-image editing model

runware
Krea 2 Large · 180+ cr

Larger Krea 2 variant for rawer, more flexible outputs with stronger photorealism and weighted reference control

krea
Krea 2 Medium · 90+ cr

Faster Krea 2 variant for stable, consistent generation with controllable prompt strength and weighted reference guidance

krea
Nano Banana · 117 cr

High quality multi image generation for complex visuals

google
P-Image · 14+ cr

Real-time text-to-image model for production graphics

prunaai
P-Image-Edit · 27+ cr

High precision multi image AI editor for fast workflows

prunaai
Qwen-Image · 18 cr

Qwen-Image high fidelity text aware image generation model

alibaba
Qwen-Image-2.0 · 106 cr

Unified image generation and editing with professional text rendering

alibaba
Qwen-Image-Edit · 10 cr

High fidelity text guided image editing for Qwen

alibaba
Recraft V4 · 120 cr

Professional text-to-image model for brand and marketing design

recraft
Recraft V4 Pro · 750 cr

Advanced design-focused image generation with enhanced control and fidelity

recraft
Seedream 4.0 · 90 cr

High speed 4K AI image generation and editing model

bytedance
Stable Diffusion 3 · 4+ cr

Stable Diffusion 3 for sharper text and complex images

runware
Wan2.5-Preview Image · 81 cr

High fidelity Wan2.5 image generation for rich single frames

alibaba
Wan2.6 Image · 90 cr

High fidelity image generation built on the Wan2.6 visual stack

alibaba
Z-Image · 14 cr

Efficient high-quality image generation foundation model

alibaba
Z-Image-Turbo · 10+ cr

Fast photorealistic image generator with text control

alibaba

Audio models · 19

ACE-Step v1.5 Base · 14+ cr

Open-source music generation with voice cloning, lyric editing, and multilingual support

runware
ACE-Step v1.5 Turbo · 10+ cr

Fast music generation optimized for speed with reduced inference steps

runware
Eleven Flash v2 · 240 cr

Low-latency English TTS for real-time voice use-cases

elevenlabs
Eleven Flash v2.5 · 301 cr

Real-time TTS for voice agents, 32 languages, ~75ms latency

elevenlabs
Eleven Monolingual v1 · 330 cr

Legacy English-only TTS

elevenlabs
Eleven Multilingual v1 · 330 cr

Legacy multilingual TTS across 9 languages

elevenlabs
Eleven Multilingual v2 · 301 cr

High-fidelity multilingual TTS across 29 languages

elevenlabs
Eleven Music v1 · 1,201 cr

Generate studio quality music tracks from text prompts

elevenlabs
Eleven Turbo v2 · 240 cr

Low-latency English TTS for production

elevenlabs
Eleven Turbo v2.5 · 330 cr

Fast multilingual TTS across 32 languages

elevenlabs
Eleven v3 · 421 cr

Premium expressive TTS across 74 languages with audio tags

elevenlabs
Gemini 3.1 Flash TTS · 1+ cr

Expressive text-to-speech with audio tags, multi-speaker dialogue, and 70+ languages

google
Inworld TTS-1.5 Max · 151 cr

High-fidelity expressive text-to-speech with rich prosody and multilingual support

inworld
Inworld TTS-1.5 Mini · 76 cr

Low-latency expressive text-to-speech optimized for real-time apps

inworld
MiniMax Speech 2.8 · 180+ cr

High-quality text-to-speech with expressive, natural voice synthesis

minimax
Qwen3-TTS 1.7B Base · 45 cr

High-quality multilingual text-to-speech with voice cloning and ultra-low latency

alibaba
Qwen3-TTS 1.7B CustomVoice · 45 cr

Text-to-speech with preset premium timbres and precise style control

alibaba
Qwen3-TTS 1.7B VoiceDesign · 45 cr

Text-to-speech with voice creation from natural language descriptions

alibaba
xAI Text-to-Speech · 13 cr

Expressive text-to-speech with five voices, speech tags, and multilingual support

xai

Text models · 23

Claude Haiku 4.5 · 1+ cr

Anthropic's fastest Claude — latency-optimized for agentic sub-tasks and high-volume work

anthropic
Claude Opus 4.7 · 1+ cr

Anthropic's flagship — demanding coding, agent orchestration, multimodal reasoning

anthropic
Claude Sonnet 4.6 · 1+ cr

Anthropic's daily-driver Sonnet — coding, agents, long-context reasoning, computer use

anthropic
DeepSeek V4 Flash · 1+ cr

Budget-tier reasoning LLM with 1M context window and 384K max output

deepseek
Gemini 3 Flash · 1+ cr

Advanced multimodal text and reasoning model

google
Gemini 3.1 Flash Lite · 1+ cr

Advanced multimodal text and reasoning model

google
Gemini 3.1 Pro · 1+ cr

Advanced multimodal text and reasoning model

google
GLM-4.7 · 1+ cr

Z.ai's affordable mid-range LLM — 200K context and 73.8% on SWE-bench

zai
GLM-5.1 · 1+ cr

Z.ai's flagship LLM — premium reasoning, 200K context, JSON mode, agentic strength

zai
GPT-5.4 · 1+ cr

Flagship reasoning LLM with 1M context, native computer use, and high factual accuracy

openai
GPT-5.4 Mini · 1+ cr

Efficient reasoning LLM with 400K context for coding assistants and subagent workflows

openai
GPT-5.4 Nano · 1+ cr

Ultra-low-latency LLM for high-volume classification, extraction, and lightweight automation

openai
GPT-5.5 · 1+ cr

OpenAI's newest flagship LLM — deepest reasoning, computer-use, 1M+ context

openai
Kimi K2.6 · 1+ cr

Moonshot AI multimodal LLM with native image and video understanding, 262K context

moonshotai
LLaVA-1.6-Mistral-7B · 6 cr

Vision-language model for image understanding and captioning

runware
MiniMax M2.5 · 1+ cr

State-of-the-art agentic coding and office-work model, optimized for speed and cost

minimax
MiniMax M2.7 · 1+ cr

Long‑context agentic coding and office productivity model for fast, reliable tool use

minimax
MiniMax M2.7 Highspeed · 1+ cr

Faster throughput for agentic coding and tool‑driven automation

minimax
Open Age Detection · 2 cr

Facial age estimation model

runware
OpenAI CLIP ViT-L/14 · 10 cr

Vision encoder for text-image representation and similarity

openai
Qwen2.5-VL-3B-Instruct · 8 cr

Instruction-tuned vision-language model for image and text understanding

alibaba
Qwen2.5-VL-7B-Instruct · 6 cr

Instruction-tuned multimodal vision-language model

alibaba
ViT Age Classifier · 2 cr

Vision transformer model for estimating age from facial images

runware

Utility models · 13

3D models · 1