Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark

Posted by jetter 9 hours ago

Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark(modelrift.com)

295 points | 114 commentspage 2

thedougd 4 hours ago|

I've been trying out MCP servers for FreeCAD to mixed results.

One area I had near magic was providing a land survey which includes details in writing of the plat. It took those directions and beautifully reconstructed the boundaries to exact precision in CAD.

Where I ran into trouble was creating good constraints on sketches without being overly explicit. I kept running into it creating distance constraints from an arbitrary point instead of using other elements in the diagram that a human drafter would think to do by default.

debarshri 8 hours ago||

I have been using GPT 5.5 to build a video game. Benchmark sounds about right. It generates assets and sprite good enough, if not closer to AAA level games. Will check antigravity now.

phn 8 hours ago|

Would you be able to share a bit about your workflow? Have been meaning to try AI gen for game models, and would love to know how people are tackling this.

debarshri 7 hours ago||

I have alot to share. I'm writing a blog about it. I'll share along with the game.

roflcopter69 7 hours ago||

Sounds interesting! Please don't forget to link that in this comment thread :)

lithiumii 4 hours ago||

That's actually a reason for me to try it again. My past attempts to use LLM for OpenScad has greatly improved my own OpenScad skills.

usermac 4 hours ago||

I've been using LLM's to do my OpenSCAD work for over two years now. It's always where I start (and end).

emmanuelsemugga 4 hours ago||

This is a really important project. Preserving humanity’s knowledge and making it openly accessible,including in formats usable by AI systems feels like one of the most valuable things happening right now. Thank you for the clear technical instructions and the bulk download options.

Projects like Anna’s Archive make it much easier for researchers and builders to work responsibly with large datasets.

pshirshov 5 hours ago||

That's curious, I've been trying to do some parametric modeling with Claude - and its performance was abysmal.

plumeria 4 hours ago|

I've had a positive experience building a library of parametric HVAC duct parts using Claude, Gemini and Codex using build123d (they all review the specs and code collaboratively).

faangguyindia 8 hours ago||

Why are specialized CAD making LLM models not showing up? In future are we going to have same model for everything? from programming to creative writing to CADs?

embedding-shape 8 hours ago||

If you have a model that only know how to model CAD but also doesn't know history, and was trained on visual language of said history, how is it supposed to be able to model the Pantheon in the first place? It'd only be able to model exactly what you can describe with text, or even worse, exactly what it'd be able to visually extract from images via the vision encoders, for "vision models", but it'd be a far cry from what you see in this blogpost, would be my guess.

xnx 8 hours ago||

> In future are we going to have same model for everything?

A model that knows more in general, will often be better at specific tasks. e.g. If you ask a model to "make a program that estimates the annual production of a solar installation", it needs to have been trained on a lot more than just Python code.

JumpCrisscross 4 hours ago|||

> A model that knows more in general, will often be better at specific tasks

Is this your hypothesis or broad conclusion among AI experts?

lifty 7 hours ago|||

You might combine a general world model with a python coding model in that case. Not sure if it's better, just saying.

lukeschlather 4 hours ago||

What's the difference between a "general world model combined with a python coding model" and a multimodal LLM?

a3w 8 hours ago||

Claude Code 2.1 / Opus 4.7 looks best to me: Dome and ceiling structure is correcter than the others.

Why is this medium ranked, and not on par with the best two?

WarmWash 6 hours ago||

Look at a picture of the Pantheon, the dome isn't as dome-like as you would imagine. It's more like a hump shape.

andybak 6 hours ago||

Dome looks wrong to me. Look at a few other photos - it's far from being a hemisphere

megiddo 8 hours ago||

This would be the same Antigravity 2.0 that "surprise, no longer an IDE, did I forget to mention that? Lolol."

kyrra 7 hours ago|

I'm a googler, opinions are my own.

My take is that it's a fancy wrapper around the CLI tool. It's there to organize multiple conversations and see all the related output and generate files.

I've been using the internal version and I've actually liked it quite a bit. It's clear from when I started using it, it's not an editor, and they have ways to open your normal editor outside of it. They have turned it fully into an agent management tool.

When the antigravity development team doesn't have to focus on all the things that vscode is already good at, it lets them simplify the UI and do only agent related things. We'll see if this bet works out for them, but so far I like the idea.

jdw64 8 hours ago|

To be brutally honest, I'm disappointed with antiGravity. It feels incredibly unGoogle-like. The AI billing models are fragmented, and the AntiGravity IDE is currently tripping over something as trivial as a basic Electron deployment config bug.

Don't get me wrong, I don't think AI coding is a bad thing. For East Asians like myself, it levels the playing field with Westerners, so as long as you rigorously review the AI's output, it's a perfectly viable tool.

However, the absolute farce we just witnessed with the antiGravity2.0 update really raises doubts about whether 'vibe coding' can actually be trusted. If even a behemoth like Google is dropping the ball like this, it says a lot.

NortySpock 6 hours ago||

> I don't think AI coding is a bad thing. [...] it levels the playing field [...]

I'd like to put regional differences aside and say AI coding / LLMs are incredible tools.

While I'm nervous about my job as a programmer being able to pay a prevailing wage after the dust settles, I do hope that everyone gaining access to an AI coder / tutor will allow anyone to be able to achieve things they previously only dreamed of. If the tutor costs pennies per session, sure, the tutors are out of work, but I hope everyone can thus up-skill to work on the challenges they actually want to work on.

I'm taking baby-steps into coding in Elixir on the other monitor, a language I had only read about before, because an LLM is walking me through the changes, answering my questions, and accepting my rebuttals. There's no way I would have time to pick up the language otherwise.

Yesterday I vibe-coded some additions to the static site generator python script for my blog. It was awesome to be able to think in terms of desired features instead of digging around documentation for libraries and syntax.

embedding-shape 8 hours ago|||

> AI billing models are fragmented ... IDE is currently tripping over something as trivial ... farce we just witnessed with the antiGravity2.0 update

I'm sorry, but that sounds exactly like almost every single Google "product" out there, they seem to only care about throwing stuff over the wall as quickly as possible, and you'd have a hard time finding a single Google product that doesn't also feel filled with fragmented choices, like every project of theirs have a different project manager every week.

nutjob2 4 hours ago||

> For East Asians like myself, it levels the playing field with Westerners

Why do you say that? Are there language or cultural disadvantages to being East Asian?

jdw64 2 hours ago||

[flagged]

More comments...