Google Gemini can now turn your photos into videos with Veo 3

You can now bring your photos to life with Veo 3

If you’ve ever stared at an old vacation photo and wished it could just spring to life, complete with crashing waves, ambient chatter, or even whispered dialogue, Google’s making that daydream a reality.

The company has just rolled out a new photo-to-video feature for its Gemini AI platform, letting users transform static images into dynamic eight-second clips, complete with AI-generated audio.

This new capability is powered by Veo 3, Google’s latest-gen video model that’s quietly been shaping the future of generative filmmaking. Now, it’s landing directly in the hands of Gemini Ultra and Pro subscribers, as part of a wider rollout that hits web users today and mobile users over the next few days, though access is still limited to select regions.

How to turn your photos into videos using Veo 3 on Gemini

Here’s how it works: upload a photo and write a short description of the motion you want to see, whether that’s a wind-swept tree rustling in the breeze or a streetlamp flickering in the rain, and Gemini does the rest.

You can even add sound design details, like footsteps, dialogue, or environmental ambience. The system then generates a 720p MP4 video in 16:9, with the audio “perfectly synced with the visuals,” according to Google.

There’s no need to jump between platforms either. While a similar tool already lives inside Flow, Google’s filmmaking-focused AI tool released in May, this update brings visual storytelling straight into Gemini’s core UI. Users can find the new tool under the “video” option in the prompt bar.

All outputs come stamped with a visible watermark, as well as Google’s invisible SynthID digital watermarking tech to flag the content as AI-generated.

Why the feature is more important than you think

So, what’s the use case? Think less deepfake and more creative spark. You can animate your sketches, breathe life into a still frame of your pet, or make your childhood doodles move, sound, and feel like something out of a dream sequence. It’s storytelling at the intersection of imagination and machine learning.

As a bonus, Google also announced that Flow is expanding to 75 additional countries starting today, so if you’ve been waiting to dip your toes into AI video creation, now’s your moment.

Disclaimer: This post, as well as the layout and design of this website, is protected under Indian intellectual property laws, including the Copyright Act, 1957 and the Trade Marks Act, 1999, and is the property of Infiniti Retail Limited (Croma). Using, copying (in full or in part), adapting or altering this post or any other material from Croma’s website is expressly prohibited without prior written permission from Croma. For permission to use the content on Croma’s website, please write to contactunboxed@croma.com
