Top
Best
New

Posted by meetpateltech 4/15/2025

Generate videos in Gemini and Whisk with Veo 2(blog.google)
347 points | 132 commentspage 2
byearthithatius 4/15/2025|
Very impressive release compared to what was possible even a single year ago. It feels like we are in a great state right now with respect to ML where all the big companies are competing and pushing each other to make the tech better. This is rare nowadays in America (or in general).
kumarm 4/16/2025||
Pretty disappointed with content moderation on Veo2. Here are the steps I did:

1. Took a picture of me and asked to describe person in the image.

2. Used Imagegen to create the cartoon version using description.

3. Tried to use veo-2.0-generate-001 to generate video of person in image (holding a coffee cup in original image) drinking coffee and having a conversation.

Video generation is blocked by content moderation.

snappyleads 4/16/2025||
I been waiting along time for this - how long before we get to the 30sec - 1 min milestone for video generation - why is it capped - is it hardware limitations or software?
bk496 4/15/2025||
I wonder what takes more compute power: this or a blender render farm?
hu3 4/15/2025||
is there a tool to generate AI videos that doesn't change the original picture so much?

Whisk redraws the entire thing and it barely resembles source picture.

vunderba 4/15/2025||
Wan 2.1 can do a decent job with i2v.

https://comfyanonymous.github.io/ComfyUI_examples/wan

CSMastermind 4/15/2025|||
You want Kling: https://klingai.com/global/

Everything else performs terribly at that task, though a bunch including Sora technically have that functionality.

Google's tool forcing you to redraw the image is silly.

rishabhjain 4/16/2025||
Try Snowpixel https://snowpixel.app/
wewewedxfgdf 4/15/2025||
1: Press release about amazing AI development.

2: "Try it now!" the release always says.

3: I go try it.

4: Doesn't work. In this case, I give it a prompt to make a video and literally nothing happens, it goes back to the prompt. In the case of the breathtakingly astonishing Gemini 2.5 Coding - attach to source code file to the prompt "file type not supported".

That's the pattern - I've come to expect it and was not disappointed with Google Gemini 2.5 coding nor with this video thing they are promoting here.

throwup238 4/15/2025||
On the contrary I had completely written off Google until a few days ago.

Gemini 2.5 Pro is finally competitive with GPT/Claude, their Deep Research is better and has a 20/day limit rather than 10/month, and now with a single run of Veo 2 I’ve gotten a much better and coherent video than from dozens of attempts at Sora. They finally seem to have gotten their heads collectively unstuck from their rear end (but yeah it sucks not having access).

energy123 4/16/2025||
Gemini 2.5 Pro is smarter, faster, cheaper and longer context than o1.
martinald 4/15/2025|||
I really don't know why Google especially seems to struggle with this so much.

While Google have really been 'cooking' recently, every launch they do is like that. Gemini 2.5 was great but for some reason they launched it on web first (which still didn't list it) then a day or so later on app, at which point I thought it was total vapourware.

This is the same - I have gemini advanced subscription, but it is nowhere to be seen in mobile or app. If you're having scale/rollout issues how hard is it to put the model somewhere and say 'coming really soon'? You don't know if it's not launched yet or you are missing where to find it.

siva7 4/15/2025|||
you're using it wrong. change file ending to .txt instead
bornfreddy 4/15/2025|||
I can't tell if this is sarcasm or a helpful advice?
Workaccount2 4/15/2025||
It's how you have to do it. The gemini model is excellent, but the implementation/chat environment seems like it was thrown together in a weekend as an afterthought.

You cannot upload a .py file, but if you change the name to "main.txt" you can upload it, and it will automatically treat it as "main.py". Not sure how this hasn't been fixed yet, but it is google so...

bornfreddy 4/16/2025||
Thank you for explaining!
nolist_policy 4/16/2025|||
On Chrome you can share your whole Project directory to Gemini. I think it uses the File System Access api which Firefox doesn't support.
bredren 4/16/2025||
The UI on this product page does not make any sense to me. The three prompt workflows don’t stack in any obvious way, then seemingly combine on any submission to the main prompt area?

They generate independent images.

Gemini’s web interface is also way behind chatgpt and Claude. The mobile app is even worse.

This is while having the champ 2.5 pro model in the pocket.

It seems that web product resources are not getting adequate allocation to the AI group(s).

somishere 4/16/2025||
Two notes:

- is there a sly dig in there at Meta? Ice cream melting ... blue-suited hand

- the Ghibli style feels controversial

anonzzzies 4/16/2025||
I have Advanced but no Veo2 model; is it controlled rollout or something again?
tefkah 4/16/2025|
evil technology
More comments...