DeepSeek Piles Pressure on AI Rivals With New Image Model Release
Chinese AI startup DeepSeek has launched Janus Pro, a new family of open-source multimodal models that it claims outperforms OpenAI's DALL-E 3 and Stable Diffusion's offering on key benchmarks. The models, ranging from 1 billion to 7 billion parameters, are available on Hugging Face under an MIT license for commercial use.
The largest model, Janus Pro 7B, surpasses DALL-E 3 and other image generators on GenEval and DPG-Bench tests, despite being limited to 384 x 384 pixel images.
Interesting (Score:5, Informative)
Re: (Score:2)
Re: (Score:3)
Re: (Score:2)
How much video card RAM do you need for a smaller model like that?
Re: (Score:2)
Re: (Score:2)
Ok, thanks.
Re: (Score:2)
It all depends on what quantization version of the model you use, of course. The lower the quantization, the less VRAM you need, but the more you risk compromising model quality.
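As a rough back-of-the-envelope sketch of that trade-off (my own assumption, not anything from DeepSeek's docs): the weights alone take about parameters times bits-per-weight divided by 8 bytes. The function name below is made up for illustration, and the estimate ignores activations, caches, and framework overhead, so real usage will be higher.

```python
def weight_vram_gib(params_billions: float, bits_per_weight: int) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Assumption: every weight is stored at the same bit width; activations,
    caches, and runtime overhead are NOT included.
    """
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # Compare a 7B-parameter model at a few common bit widths.
    for bits in (16, 8, 4):
        print(f"7B @ {bits}-bit: ~{weight_vram_gib(7, bits):.1f} GiB")
```

For a 7B model that works out to roughly 13 GiB at 16-bit and a bit over 3 GiB at 4-bit for the weights, which is why the quantized versions fit on consumer cards.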
Re: (Score:1)
Re: (Score:2)
Re: (Score:3)
Re: (Score:3)
Impressive (Score:4, Interesting)
Maybe this will drive down the training cost for models and the multi-$billion AI shops won't have to build as many nuclear reactors to power their data centers now? It seems like it would be a lot cheaper to figure out how to build these things less expensively than to just dump $500 billion into acres of installations housing Blackwell systems.
Re: (Score:3)
Re: Impressive (Score:5, Informative)
Re: (Score:2)
Company Foo now: "We're an AI company."
Company Foo after poppage: "We're a server-farm parts distributor".
Re: (Score:2)
>> a power purchase agreement which is a key element to actually building a large project
I think they have to lock in the power arrangements in order to get financing? Most new electricity generation in the US these days is renewable, and the tech industry needs lots of new electricity.
"Solar accounted for 79.3% of all new utility-scale generation placed into service in the first ten months of 2024. In October alone, solar comprised 91.8% of all new capacity added."
https://www.solarpowerworldonl... [solarpower...online.com]
Re: Impressive (Score:2)
Re: (Score:2)
Seems more likely that of all those hundreds of billions that are being invested in AI, a big chunk will get diverted to R&D so they can clone DeepSeek's work. Or maybe the NSA will get involved and try to steal it.
This is good news though. AI used to be only available to massive corporations with deep pockets to invest in training. Thanks to DeepSeek and the fact that they open sourced much of their work, now startups can afford to build their own models. More choice, more variety, and less environment
With really small means ? (Score:4, Interesting)
Isn't all this just psychological warfare to make investors panic? We have no proof that the model was trained as cheaply as the Chinese claim. We do not know if they really used such a small number of H800 GPUs.
Re: (Score:3)
Re: (Score:1)
Re: With really small means ? (Score:1)
From my limited experience, I must admit the resources needed for ChatGPT do seem extreme.
Re: (Score:2)
China vs copyright law (Score:2)
Any details on how the training data were curated?
Re:China vs copyright law (Score:4, Informative)
From the tech report:
> In Janus-Pro, we incorporate approximately 72 million samples of synthetic aesthetic data, bringing the ratio of real to synthetic data to 1:1 during the unified pretraining stage. The prompts for these synthetic data samples are publicly available, such as those in [43]. Experiments demonstrate that the model converges faster when trained on synthetic data, and the resulting text-to-image outputs are not only more stable but also exhibit significantly improved aesthetic quality.
Re: (Score:2)
Great news, now we can have 10x the disinformation and fakery for the same amount of AI-dollars.
Economic Warfare (Score:2)
The only thing China needs to do is to keep throwing out the possibility that their AI is on par with or exceeds that of their Western counterparts and well . . . . you see what it did to the markets today.
It's crazy how the markets react to the mere mention / possibility of a thing these days :|
Hell, it doesn't even have to be true or verified. Just the suggestion is enough to wreak havoc.
You Keep Using That Word (Score:2)
I do not think it means what you think it means.
Looks unimpressive, IMHO (Score:2)
Most people's whelm seems to be very much under [reddit.com].
" I tried it on huggingface and its...average to good, but I wouldn't say its groundbreaking from what I seen with the few trials I gave it."
"I tried the huggingface stable diffusion(sic) demo and was completely underwhelmed for realistic images. I can only assume config or user error because it can't possibly that bad"
"Does it support nsfw content too?" "If you manage to generate anything that resembles a human, yes"
"Good to see this. But just for text2img pu