Gemma 4 on iPhone - Hacker News

Posted by janandonly 1 day ago

Gemma 4 on iPhone(apps.apple.com)

829 points | 226 commentspage 3

sshrajesh 5 hours ago|

> Note: I tried to hook this one up to OpenClaw and ran into issues

Anyone worked on hooking up OpenClaw to gemma4 running locally?

carbocation 1 day ago||

It would be very helpful if the chat logs could (optionally) be retained.

davecahill 20 hours ago||

I really like Enclave for on-device models - looks like they're about to add Gemma 4 too: https://enclaveai.app/blog/2026/04/02/gemma-4-release-on-dev...

robbru 8 hours ago|

I've been using Enclave ever since, they have been the best App Store option for a long time.

rudedogg 20 hours ago||

This is fun, FYI you don’t have to sign in/up with a Google account. I hesitated downloading it for that reason.

satvikpendem 21 hours ago||

This is also on Android and has an option to use AICore with the NPU which can run much faster than even the GPU models.

nout 20 hours ago|

How do you get it running on Android?

satvikpendem 20 hours ago||

It's the same app, Google AI edge gallery.

dwa3592 1 day ago||

I think with this google starts a new race- best local model that runs on phones.

dwa3592 1 day ago|

I wonder why the cut off date for 3n-E4B-it is Oct, 2023. That's really far in the past.

satvikpendem 22 hours ago||

Because that's Gemma 3, not 4.

danielrmay 19 hours ago||

I spent some time getting Gemma4-e4b working via llamacpp on iPhone and I'm really impressed so far! I posted a short video of an example application on LinkedIn here https://www.linkedin.com/feed/update/urn:li:activity:7446746... (or x: https://x.com/danielrmay/status/2040971117419192553)

thot_experiment 23 hours ago||

Gemma 4 E4B is an incredible model for doing all the home assistant stuff I normally just used Qwen3.5 35BA4B + Whisper while leaving me with wayy more empty vram for other bullshit. It works as a drop in replacement for all of my "turn the lights off" or "when's the next train" type queries and does a good job of tool use. This is the really the first time vramlets get a model that's reliably day to day useful locally.

I'm curious/worried about the audio capability, I'm still using Whisper as the audio support hasn't landed in llama.cpp, and I'm not excited enough to temporarily rewire my stuff to use vLLM or whatever their reference impl is. The vision capabilities of Gemma are notably (thus far, could be impl specific issues?) much much worse than Qwen (even the big moe and dense gemma are much worse), hopefully the audio is at least on par with medium whisper.

derwiki 10 hours ago||

I asked it about the “Altamont Free Concert” (exact name of Wikipedia article), and it’s been a while since I’ve seen an hallucination this rich. Doesn’t give me confidence to use it.

totetsu 13 hours ago|

I have been looking at ARGmax https://www.argmaxinc.com/#SDK for running on apple devices, but not sure yet at whats involved in porting a model to work with their sdk

More comments...