Meet Gemini Omni - Google's New Video-Generating AI

What Is Gemini Omni?


Gemini Omni is Google's newest AI model family, combining multimodal reasoning with generative creation. It accepts any combination of images, audio, video, and text as input and generates high-quality video output grounded in Gemini's real-world knowledge. Think of it as a natural evolution from Nano Banana, which focused on image generation: Omni takes the leap into video.

Key Capabilities


  • Conversational video editing — Edit videos through natural language prompts; each instruction builds on the last while maintaining character consistency and physics coherence
  • Physics-aware generation — Omni has an improved intuitive grasp of forces like gravity, kinetic energy, and fluid dynamics for realistic scenes
  • World knowledge-grounded storytelling — Goes beyond pattern matching by drawing on Gemini's knowledge of history, science, and cultural context to generate meaningful narratives
  • Any-input creation — Accepts image, text, video, and audio references simultaneously to produce a single cohesive output
  • Digital avatars — Users can create a digital version of themselves to generate personalized videos that look and sound like them

Multi-Turn Editing in Practice


One standout feature is multi-turn iterative editing. For example, you can start with a video of a violinist, then prompt it to transport the violinist to a new environment, make the violin invisible, and shift the camera angle — all without losing scene continuity. Each edit stacks naturally on top of the previous ones.
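
To make the "each edit stacks on the previous ones" idea concrete, here is a minimal Python sketch of how a multi-turn session could be modeled. This is purely illustrative and is not Google's API (the developer API has not been released yet): the class `EditSession`, its methods, and the file name `violinist.mp4` are all hypothetical names made up for this example.

```python
from dataclasses import dataclass, field


@dataclass
class EditSession:
    """Hypothetical model of a multi-turn editing session (NOT a Google API)."""

    source_clip: str  # placeholder name for the starting video
    instructions: list[str] = field(default_factory=list)

    def edit(self, instruction: str) -> "EditSession":
        # Each turn is appended to the history; earlier turns are never discarded.
        self.instructions.append(instruction)
        return self

    def context_for_turn(self, turn: int) -> list[str]:
        # Turn N sees the source clip plus every instruction up to and including N.
        return [self.source_clip, *self.instructions[:turn]]


# The violinist example from the post, expressed as three stacked turns.
session = EditSession("violinist.mp4")  # hypothetical file name
session.edit("transport the violinist to a new environment")
session.edit("make the violin invisible")
session.edit("shift the camera angle")

for turn in range(1, len(session.instructions) + 1):
    print(turn, session.context_for_turn(turn))
```

The point of the sketch is only that the third instruction is interpreted together with the first two, which is what lets the scene stay continuous across edits. How Omni actually tracks this internally is not described in the announcement.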

Responsible AI & Transparency


All videos generated with Gemini Omni include an imperceptible SynthID digital watermark, and users can verify AI-generated content directly through the Gemini app, Gemini in Chrome, and Google Search. Google is also taking a cautious approach to speech/audio editing features, still testing those capabilities before broader rollout.

Availability


Platform                              | Access          | Cost
Gemini App & Google Flow              | Rolling out now | Google AI Plus, Pro & Ultra
YouTube Shorts & YouTube Create App   | This week       | Free (no subscription needed)
API (developers & enterprise)         | Coming weeks    | TBD


The first model launched is Gemini Omni Flash, with image and audio output modalities planned for future releases.


Your feedback is highly appreciated

😎


Support my other posts 🙏
 
