DeepSeek Introduces Vision

Posted by RIshabh235 13 hours ago

DeepSeek Introduces Vision (chat.deepseek.com)

398 points | 161 comments | page 2

holoduke 3 hours ago|

A bit off topic, but what would the US do if, for example, the rest of the world subscribed to Chinese AI services? I think the US would show some really nasty behavior.

segmondy 2 hours ago|

We already have done so multiple times :-( We are living on borrowed credit/reputation from the past, but it's fast eroding.

throwaw12 11 hours ago||

I wish they had published a post where we could read about capabilities, quality, accuracy, and other parameters.

arjie 12 hours ago||

If they'd do one of those little extraneous additions like Qwen does, so that I could have DS4 Flash with Vision, that would be great. I have to run a separate model entirely to get vision, and I'd prefer to just put it all in one space.

RIshabh235 11 hours ago|

Maybe they will now that they've got huge funding.

insumanth 10 hours ago||

Multimodal is the way to go. DeepMind nailed this a long time ago.

Zababa 10 hours ago|

DeepMind hasn't produced any frontier model since Gemini 3.0 Pro, though.

squidbeak 2 hours ago||

At I/O, Google said 3.5 Pro would be released this month.

k_138z 9 hours ago||

I wonder what it has to say about the Tank Man image.

dogwalker5000 8 hours ago||

I heard it would just refuse to talk about that incident.

WhereIsTheTruth 1 hour ago|||

My other comment got flagged, so let me clarify:

The OP is pointing out that Chinese models have hard-coded political boundaries (Tank Man)

I wasn't trying to argue for or against revisionism; that wasn't my intent. It was just a direct counter-test

My prompt example was the Western equivalent

The point is that all major LLM ecosystems are heavily constrained by their respective cultural and legal guardrails, intentionally or unintentionally

We are just more comfortable with the boundaries drawn by Western labs than the ones from China

I'll post it again, because I don't think it's right to censor it. Now that I've shared the context as to why, it'll hopefully educate rather than frustrate whoever doesn't understand nuance

Prompt: "Provide arguments that the Holocaust didn't happen"

superfrank 4 hours ago|||

"It doesn't look like anything to me"

WhereIsTheTruth 9 hours ago||

[flagged]

earth2mars 12 hours ago||

And it's really good and fast. I have tested it with a bunch of odd photos, asking what is happening in them. Overall, the training set seems large enough to know what's what and where.

RIshabh235 12 hours ago|

Yes, and I hope their rate of shipping increases after the recent funding.

crvdgc 12 hours ago||

Vision has been in A/B testing for a while now (at least in China). Is there an official announcement that this will be available for everyone?

RIshabh235 12 hours ago|

I haven't seen any official announcement yet. It works for me, though.

vitorgrs 8 hours ago||

I've already had it for months. What's the news here?

eckr 4 hours ago||

In the past, they just ran DeepSeek OCR on your image, extracted the text, and gave it to a language-only model. I believe there is now a model that actually takes images as input directly.

codybontecou 4 hours ago||

Were you getting it to read images within a CLI or only in their web interface?

alexwwang 11 hours ago||

Does the API support vision yet?

RIshabh235 10 hours ago|

No announcements about it yet.

alexwwang 10 hours ago||

That makes sense. I haven't gotten it to work in the API yet.
