GPT Image 1.5
OpenAI's GPT Image 1.5 — multimodal image generation and editing with text rendering capabilities.
27 モデル
Suno
Suno AIFull-featured AI music generation platform. Supports text-to-music, extend tracks, upload audio covers, generate cover images, and timestamped lyrics. 7 API endpoints.

Kling 3.0
KuaishouState-of-the-art AI video generation with 4K quality support. Text-to-video and image-to-video capabilities.
Flux 2
Black Forest LabsHigh-quality AI image generation with excellent prompt following and photorealistic output.
ElevenLabs V3
ElevenLabsUltra-realistic text-to-speech with natural sounding voices. Supports multiple languages and voice styles.
Gemini
GoogleGoogle's most capable AI model for text generation, reasoning, and conversation.
Background Remover
API in OneRemove image backgrounds instantly with AI precision. Perfect for product photos, portraits, and e-commerce.
Hailuo 2.3
MiniMaxHigh-quality AI video generation by MiniMax. Supports image-to-video with standard and pro quality modes, cinematic motion quality.

Wan 2.6
AlibabaAlibaba's Wan 2.6 video generation model. Supports text-to-video and image-to-video with high-fidelity output.
Vidu Q3
Shengshu TechnologyVidu Q3 by Shengshu Technology — advanced AI video generation with strong motion understanding and character consistency.
PixVerse 5.6
PixVersePixVerse 5.6 — fast AI video generation with stylized output. Supports text-to-video and image-to-video.
Seedance 1.5 Pro
ByteDanceByteDance's Seedance 1.5 Pro — exceptional dance and human motion video generation with text and image input.
GPT Image 1.5
OpenAIOpenAI's GPT Image 1.5 — multimodal image generation and editing with text rendering capabilities.
Ideogram V3
IdeogramIdeogram V3 — industry-leading text-to-image generation with best-in-class typography and design capabilities.
Imagen 4
GoogleGoogle's Imagen 4 — photorealistic image generation with exceptional detail and prompt adherence.
Recraft V4
RecraftRecraft V4 — design-focused AI image generation with style control, vector output, and brand-consistent results.

Grok Imagine
xAIxAI's Grok Imagine — powerful AI image generation and upscaling from the creators of Grok. Fast, high-quality, and unrestricted.

Grok Video
xAIxAI's Grok Video — AI video generation powered by Grok Imagine. Supports text-to-video and image-to-video.
MiniMax Music
MiniMaxMiniMax's AI music generation — create high-quality songs with vocals and instrumentals from text descriptions.
Face Swap
WaveSpeedAI-powered face swapping tool. Replace faces in images with natural-looking results.
Image Upscale
KIEAI image upscaling — enhance image resolution up to 4x while preserving detail and sharpness.
LTX-2
LightTricks / WaveSpeedLTX-2 19B — advanced AI video generation with text-to-video, image-to-video, lipsync, and control modes. Up to 1080p, 5–20 seconds.
Kling Image O1
Kuaishou / WaveSpeedKling Image O1 — text+image guided image generation and AI multi-shot character consistency. Supports 1K/2K resolution and up to 9 output images.
Seedream 4.5
ByteDance / KIESeedream 4.5 — ByteDance's cutting-edge image generation model. Supports text-to-image and image editing with basic/high quality modes.
CogView 4
Zhipu AI / WaveSpeedCogView 4 — Zhipu AI's text-to-image model. Ultra-affordable at just 4 credits per image with standard and HD quality options.
Hunyuan Image 3
Tencent / WaveSpeedHunyuan Image 3 Instruct — Tencent's advanced text-to-image and image editing model. 25 credits per image with seed-based reproducibility.
Video Background Remover
WaveSpeedAI-powered video background removal — remove or replace video backgrounds instantly. Supports custom background images.
Video Upscaler Pro
WaveSpeedAI-powered video upscaling — enhance video resolution up to 4K. Multiple target resolutions with per-resolution pricing.