Gemini Omni is an independent platform for the chat-edit AI video workflow. It is not affiliated with, endorsed by, or sponsored by Google.
Gemini Omni
Open a chat. Describe a scene. Remix it until it's yours. The chat-edit video workflow Gemini calls 'Omni' — a powerful AI video generator live in your browser today. No waitlist, no API keys, no installs. Powered by the Gemini video stack (Veo 3.1, the model Omni extends); auto-upgrades to Omni the day Google ships it.
Powered by the Gemini video stack — describe, remix, render.
Verified leaks from 9to5Google, TestingCatalog, Chrome Unboxed and r/GeminiAI.
Real Gemini Omni Video Demos — Generated From the Leaked Workflow
These are the same prompts Reddit tester @Zacatac_391 ran inside the leaked Gemini Omni interface on May 11, 2026 — generated through the Gemini video stack the Omni model extends. Describe a scene in plain language, render, then remix camera, audio, or dialogue without leaving the chat.
Reported by the publications that broke the Gemini Omni leak
What Is Gemini Omni — The New Gemini Video Model
Gemini Omni (also referred to as Google Omni) is the new AI video generation model surfacing inside Google's Gemini app. In May 2026, a UI string reading 'Create with Gemini Omni — Meet our new video model. Remix your videos, edit directly in chat, try a template, and more.' was spotted by X user @Thomas16937378 and propagated through 9to5Google, TestingCatalog, and Chrome Unboxed. Metadata suggests Omni extends the existing Gemini video stack (internal codename Toucan, currently powered by Veo 3.1). Gemini Omni lets you practice the same chat-edit workflow today.


Chat-Native Video Editing
Don't open a timeline. Describe the change. Gemini Omni's defining UI string — 'edit directly in chat' — turns a video into a living document. Want a tighter shot, warmer lighting, a different line of dialogue? Type it. The Gemini video stack regenerates only what changed.

One-Click Remix of Any Clip
Every Gemini Omni generation becomes a remix seed. 'Remix your videos' was the second pillar Google teased in the Gemini app UI. Swap the protagonist, the camera angle, the time of day, the entire setting — in a single click. The original stays. The variant is yours.

Templates That Don't Feel Like Templates
'Try a template' was the third Gemini Omni pillar — but templates here are starting jokes, not finishing lines. Start from cinematic dialogue, anime opening, top-down ASMR, or product unboxing. Then bend it through chat until nobody can tell which template was the seed.

Native Audio, Multi-Camera, Ambient Music
Reddit tester @Zacatac_391 said the voice quality was 'much better than Veo by a large margin' and that Omni 'even added some light background music' during a restaurant scene. The seamless multi-camera transitions are Gemini Omni's standout signature — and they happen inside a single shot, no edit needed.
Use Gemini Omni Online — No Install, No Waitlist
Open a chat in your browser and generate the Omni-style video workflow today. Free generations to start, plans from $9.9/month for high-volume creators.
Why the Gemini Omni Workflow Matters
Most video AI today asks you to write a perfect prompt, hit generate, and pray. Gemini Omni's chat-edit pattern flips that loop: you start rough, refine in conversation, and never leave the canvas.
Generate & pray
Describe & refine
No Timeline. No Re-Renders.
Traditional video AI dumps you back into a timeline. Omni keeps every refinement inside the same chat — change a face, swap a line, tighten the shot — and regenerates only the differential. Testers reported camera changes 'frequently and with good coherence.'
Audio That Belongs in the Scene
Synthesized speech, ambient room tone and contextual background music render in one pass — no separate audio timeline. Testers said the voice quality beat Veo 'by a large margin.'
Auto-Upgrades to Official Omni
Practice the workflow today on the Gemini video stack (Veo 3.1). The day Google ships the public Omni model at I/O 2026, your workspace switches automatically — no migration.
How the Gemini Omni Workflow Works
Four steps. One canvas. The Omni workflow Gemini teased — described, demoed, and running in your browser today.
Describe a Scene
Open the workspace. Pick a template or type a scene from scratch — 'A professor writes a trigonometric proof on a chalkboard,' or 'Two men eating spaghetti at a seaside restaurant.' Plain language only. No prompt engineering required.
Generate the First Clip
The Gemini video stack renders a ~10-second clip with native audio and ambient music. Reddit testers reported this was the 'best video model I have seen' — particularly on prompt adherence.
Remix in Chat
Don't open an editor. Type 'tighter close-up on the actor's eyes,' 'swap the centerpiece for a candle,' or 'add light background piano.' Gemini Omni regenerates only the parts that changed.
Render & Share
Export your final clip. Share the chat thread as a public template. When Google officially ships Omni at I/O 2026, every clip in your workspace re-renders at higher fidelity automatically.
Gemini Omni AI Video Generator — Key Features
Every capability Gemini Omni teased in the leaked UI, available in the workspace today.
Chat-Edit Video
Type changes in plain language. Gemini Omni regenerates only the differential, preserving everything you didn't change.
One-Click Remix
Every generated clip becomes a seed. Spin variants in seconds — same scene, different lighting, different protagonist.
Leaked Omni Templates
Start from the same six prompts Reddit testers used in the leaked Gemini Omni interface, then bend them through chat.
Native Audio + Music
Synthesized speech, ambient room tone, and contextual background music — all rendered in one pass.
Seamless Multi-Camera
Multiple camera angles inside a single shot, with coherent action across cuts. The standout feature highlighted as Omni's signature.
Auto-Upgrade to Omni
Powered by the Gemini video stack today. The moment Google ships the public Omni model, your workspace switches automatically.
Gemini Omni vs Veo 3.1 vs Sora 2 — Which One Can You Use Today?
Where Gemini Omni sits in the May 2026 video AI landscape — alongside the Gemini Veo 3.1 stack it extends, and against OpenAI's Sora 2 which shut down its consumer app on April 29, 2026.
Gemini Omni — Chat-Edit Defining PatternUse today
Status: leaked May 2, 2026; expected official launch at Google I/O 2026 (May 19-20). Chat-edit video, one-click remix, native audio with ambient music, and seamless multi-camera. Use via Gemini Omni today on the Gemini video stack; auto-switches to the official Omni model the day Google ships it.
Veo 3.1 (Toucan) — Production Gemini Video Today
Internal codename Toucan. Currently powers Google's production Gemini video generation with 4K output and natively generated audio. Gated, region-locked, and limited to Gemini Advanced subscribers. Same model Omni extends — meaning the workflow you practice today runs on Omni's foundation.
Sora 2 — Shut Down April 29, 2026
OpenAI shut down the Sora 2 consumer app on April 29, 2026. Google responded publicly with 'video's here to stay' and accelerated the Omni rollout for I/O 2026. The Sora 2 era ended; the Omni era is the next chapter — and you can practice in it now.
Who Uses the Gemini Omni Workspace
The Gemini Omni workflow isn't just a faster Veo — it's a different relationship between you and a video model. Here's who benefits most from generating-then-remixing through chat instead of writing perfect prompts.

01
Short-Form Creators on TikTok, Reels, Shorts
Generate the first cut from a chat description. Remix in chat for vertical, horizontal, and square. Push BPM or change palette without leaving the canvas. Gemini Omni's one-click remix maps perfectly to a publishing cadence that demands variants, not perfection.

02
Indie Filmmakers and Spec Directors
Block a scene in 10 seconds before you scout a location. Test camera language, lighting, and dialogue beats through chat. The Omni workspace becomes a previz tool that costs nothing to iterate inside.

03
Performance Marketers and Brand Studios
Spin 30 ad variants from one chat thread. Each remix preserves the brand frame and swaps only the offer, headline, or hero shot. Native audio means no separate VO budget for early testing.

04
Educators and Course Creators
Generate an explainer with on-screen math, diagrams, or chalkboard text — the kind of content Reddit testers proved Gemini Omni handles well. Remix the language or pace through chat without re-recording.
Gemini Omni — The Numbers That Matter
Verified data points from the May 2026 Gemini Omni leak window and the broader video AI landscape Omni is launching into.
Day Gemini Omni UI was first spotted in 2026
Views on TestingCatalog's leak thread
Google I/O 2026 — expected Omni launch
What Early Testers Are Saying About Gemini Omni
Verified leaks and first-hand reactions to the Gemini Omni video model. Every quote below links back to its original source — Reddit, X, or a named publication.
I won't lie, this is one of the best video models I have seen — maybe not the best, but a really strong performance. The voice quality is much better than Veo by quite a large margin. It even added some light background music.
A new video generation model is apparently coming to Gemini, with 'Omni' producing some pretty impressive initial results. The video does a great job of handling text while putting out a fairly realistic video.
GOOGLE I/O: New evidence of the upcoming Gemini Omni video model has been spotted. Based on the description, we might be really talking about the true 'Omni' model based on Gemini, rather than Veo.
An impressive new Gemini 'Omni' video model just leaked ahead of Google I/O. The remix-your-videos, edit-directly-in-chat workflow is what makes this different from every other video AI we've seen this year.
The new Omni model isn't just good at video — it's also reasoning across these mediums. You got early access to the big new model that supposedly combines it all: video, audio, text, images.
Google appears to be testing a new video-generation model called Omni inside Gemini, surfaced via a UI string spotted ahead of Google I/O 2026: 'Start with an idea or try a template. Powered by Omni.'
Frequently Asked Questions about Gemini Omni AI Video Generator
Common questions about the Gemini Omni video model leak, how to use the workspace today, and how the auto-upgrade works at Google I/O 2026.
Gemini Omni (also called Google Omni) is a new AI video generation model leaked from the Gemini app in May 2026. Google describes it as: 'Meet our new video model. Remix your videos, edit directly in chat, try a template, and more.' Gemini Omni gives you the same chat-edit workflow today.
You can use the workflow today on Gemini Omni — open a chat, describe a scene, generate, then remix or edit directly in chat. Generations run on the Gemini video stack (Veo 3.1, the model Omni extends). When Google officially ships Omni, we auto-upgrade your workspace.
No. Gemini Omni is an independent platform and is not affiliated with, endorsed by, or sponsored by Google. 'Google', 'Gemini', 'Omni' and 'Veo' are trademarks of Google LLC. We aggregate verified leaks and provide a chat-edit video workflow built on the same Gemini video stack.
Verified reports include: 9to5Google's coverage by Ben Schoon (May 11, 2026), TestingCatalog's X thread that first surfaced 'Powered by Omni' UI strings (80.6K views), Chrome Unboxed's analysis, and Reddit r/GeminiAI tester @Zacatac_391's hands-on share links from the Gemini app.
OpenAI shut down the Sora 2 consumer app on April 29, 2026. Google publicly stated 'video's here to stay' and is rolling Omni out at I/O 2026 (May 19-20). For now, Veo 3.1 (codename Toucan) remains the production Gemini video model, with Omni positioned as its successor or extension.
Your prompts, generations and workspace stay yours. The moment Google ships the public Omni model, Gemini Omni switches your generations to it automatically — no migration, no re-onboarding, no waitlist. You keep practicing the workflow today; the engine upgrades itself.
Early leaked tests inside the Gemini app generated ~10-second clips. The current Veo 3.1 backbone supports 4K output with natively generated audio. Official Omni specs (resolution, duration, FPS, API limits) will be confirmed at Google I/O 2026 on May 19-20.
Yes. Reddit tester @Zacatac_391 reported the audio quality is 'much better than Veo by quite a large margin' and that Omni 'even added some light background music' during a restaurant scene generation. Native multi-camera transitions and ambient sound were highlighted as standout features.
Be ready before Google I/O 2026.
Open your workspace, generate your first Omni-style video, and have it rendered when Google flips the switch on May 19. The workflow is the same. The engine upgrades itself.