Posted by meetpateltech 4 days ago
1. Took a picture of me and asked to describe person in the image.
2. Used Imagegen to create the cartoon version using description.
3. Tried to use veo-2.0-generate-001 to generate video of person in image (holding a coffee cup in original image) drinking coffee and having a conversation.
Video generation is blocked by content moderation.
Whisk redraws the entire thing and it barely resembles source picture.
Everything else performs terribly at that task, though a bunch including Sora technically have that functionality.
Google's tool forcing you to redraw the image is silly.
- is there a sly dig in there at Meta? Ice cream melting ... blue-suited hand
- the Ghibli style feels controversial
They generate independent images.
Gemini’s web interface is also way behind chatgpt and Claude. The mobile app is even worse.
This is while having the champ 2.5 pro model in the pocket.
It seems that web product resources are not getting adequate allocation to the AI group(s).
2: "Try it now!" the release always says.
3: I go try it.
4: Doesn't work. In this case, I give it a prompt to make a video and literally nothing happens, it goes back to the prompt. In the case of the breathtakingly astonishing Gemini 2.5 Coding - attach to source code file to the prompt "file type not supported".
That's the pattern - I've come to expect it and was not disappointed with Google Gemini 2.5 coding nor with this video thing they are promoting here.
Gemini 2.5 Pro is finally competitive with GPT/Claude, their Deep Research is better and has a 20/day limit rather than 10/month, and now with a single run of Veo 2 I’ve gotten a much better and coherent video than from dozens of attempts at Sora. They finally seem to have gotten their heads collectively unstuck from their rear end (but yeah it sucks not having access).
While Google have really been 'cooking' recently, every launch they do is like that. Gemini 2.5 was great but for some reason they launched it on web first (which still didn't list it) then a day or so later on app, at which point I thought it was total vapourware.
This is the same - I have gemini advanced subscription, but it is nowhere to be seen in mobile or app. If you're having scale/rollout issues how hard is it to put the model somewhere and say 'coming really soon'? You don't know if it's not launched yet or you are missing where to find it.
You cannot upload a .py file, but if you change the name to "main.txt" you can upload it, and it will automatically treat it as "main.py". Not sure how this hasn't been fixed yet, but it is google so...