Gemini Omni is a multimodal AI video generator page for turning prompts, image references, avatar concepts, templates and remix ideas into video clips.

What can I create with Gemini Omni?

You can create social clips, product motion, cinematic concept videos, avatar-led scenes, text-to-video drafts, image-to-video clips and reference-guided videos.

Which model does Gemini Omni use now?

The current generator uses the Gemini Omni Video API, supporting prompt-to-video and image-guided multimodal video creation through the Gemini Omni workflow.

Can Gemini Omni create avatar videos?

Gemini Omni is designed around avatar-led video concepts. The current generator can use image references now, and the page is structured for richer identity and avatar workflows later.

What makes Gemini Omni different from a standard video generator?

Gemini Omni is packaged around multimodal creation: prompts, images, video references, templates, avatar concepts and iterative remixing, rather than only a single one-shot text-to-video form.

Gemini Omni - AI Video Generator

Create multimodal videos from prompts, images, references and avatar ideas with the Gemini Omni generation workflow.

Generation

Choose a mode and provide your description below

Generate videos from text descriptions

API parameters: prompt is required. Gemini Omni supports 4s, 6s, 8s and 10s generation durations, 16:9 or 9:16 aspect ratio, up to 7 public image URLs, 1 audio ID, and 1 public source video up to 30 seconds; the selected video segment must be 10 seconds or shorter.

Prompt

0 / 5000 characters

Model

Duration

Duration of the generated video: 4s, 6s, 8s or 10s.

Resolution

Credit cost changes by resolution. Video input uses the fixed video-input price.

Aspect Ratio

Audio ID (optional)

Audio ID from gemini-omni-audio. Max 1 item.

Video reference clip (optional)

Public video URL

Start seconds

End seconds

Source video must be 30 seconds or shorter; selected segment must be 10 seconds or shorter.

Cost: 50 creditsBalance: 0 credits

Insufficient creditsAdd credits

Video Preview

Your generated video will appear here

Configuration

Model: Gemini Omni Video

Duration: 8s

Resolution: 720P/1080P

Ratio: 16:9

Open in new tab

Gemini Omni video

Gemini Omni AI Video Generator
for Multimodal Creative Clips

Gemini Omni turns any idea into a video canvas. Combine text prompts, reference images, rough clips, audio direction, templates and avatar concepts in one creative workflow. Start generating now with the Gemini Omni video engine.

Create with Gemini Omni See workflows

Text + Image

multimodal prompts

Avatar

selfie-led concepts

Gemini Omni

generation model

Why creators use Gemini Omni

Gemini Omni is positioned as an all-in-one video creation page for fast prompts, visual references, avatar ideas, remixing and cinematic outputs.

A single canvas for text, images, audio and video

Bring prompts, image references, scene notes, sound direction and rough video ideas into one streamlined creation flow.

Chat-style creation, remixing and templates

Shape the first result, then keep refining: preserve the scene, adjust motion, change lighting, apply a template, or reframe the camera.

Avatar-first video concepts

Plan videos where a selfie or portrait guides a personal avatar, presenter, product host, virtual explainer, or recurring character.

Stronger object, motion and light direction

Write prompts around believable object interactions, clean scene composition, realistic motion and consistent light across the clip.

Gemini Omni workflows

Use Gemini Omni for fast creative tests, product motion, social clips, avatar scenes and remix-ready video concepts.

Idea to multimodal video

Start with a prompt, add a reference image or scene direction, and generate a cinematic clip that matches the target style.

Selfie to avatar scene

Plan videos where a selfie or portrait can guide a personal avatar, presenter, product host, or character insert.

Remix and refine by chat

Structure follow-up edits: keep the scene, adjust the motion, change lighting, swap the background, or reframe the camera.

Gemini Omni FAQ

Quick answers about Gemini Omni capabilities and the current Gemini Omni generation workflow.

Start a Gemini Omni video

Use the generator above to prototype multimodal video ideas with the Gemini Omni engine currently available on the site.

Back to generator