OpenAI's DALL-E 2 Produces Fantastical Images of Most Anything You Can Imagine (engadget.com) 10
On Wednesday, the OpenAI consortium unveiled (PDF) the next iteration of the DALL-E machine learning system, which can draw anything you'd like but bigger, better, and faster than before. Engadget reports: The first DALL-E (a portmanteau of "Dali," as in the artist, and "WALL-E," as in the animated Disney character) could generate images as well as combine multiple images into a collage, provide varying angles of perspective, and even infer elements of an image -- such as shadowing effects -- from the written description. [...] DALL-E was never intended to be a commercial product and was therefore somewhat limited in its abilities given the OpenAI team's focus on it as a research tool. It's also been intentionally capped to avoid a Tay-esque situation or the system being leveraged to generate misinformation. Its sequel has been similarly sheltered, with potentially objectionable images preemptively removed from its training data and a watermark indicating that it's an AI-generated image automatically applied. Additionally, the system actively prevents users from creating pictures based on specific names.
DALL-E 2, which utilizes OpenAI's CLIP image recognition system, builds on those image generation capabilities. Users can now select and edit specific areas of existing images, add or remove elements along with their shadows, mash up two images into a single collage, and generate variations of an existing image. What's more, the output images are 1024px squares, up from the 256px avatars the original version generated. OpenAI's CLIP was designed to look at a given image and summarize its contents in a way humans can understand. The consortium reversed that process, building an image from its summary, in its work with the new system.
Unlike the first, which anybody could play with on the OpenAI website, this new version is currently only available for testing by vetted partners who themselves are limited in what they can upload or generate with it. Only family-friendly sources can be utilized, and anything involving nudity, obscenity, extremist ideology or "major conspiracies or events related to major ongoing geopolitical events" is right out. [...] The current crop of testers are also banned from exporting their generated works to a third-party platform, though OpenAI is considering adding DALL-E 2's abilities to its API in the future. If you want to try DALL-E 2 for yourself, you can sign up for the waitlist on OpenAI's website.
OpenAI... (Score:3, Insightful)
Only family-friendly sources can be utilized, and anything involving nudity, obscenity, extremist ideology or "major conspiracies or events related to major ongoing geopolitical events" is right out.
... is not very open, is it?
Can't this thing be leaked, or forked?
Re: (Score:2)
This isn't the first time OpenAI has failed to live up to its name. When I tried to get the source code to GPT-3, I found it wasn't available.
Well, this is legal, even under the GPL, but it's not very open.
Re: (Score:2)
i can't believe it (Score:2)
Re: (Score:2)
I think you need 2FA.
As usual the R18 4Chan gang will spoil the party (Score:2)
with their vivid interpretation of anything they can imagine using this technology for.
Every day we stray further from Nude Tayne (Score:1)
Re: (Score:2)
> stop treating me like I belong on Disney.com
Have you seen the Disney indictments?
you know how old people marvel at modern tech (Score:2)
In Soviet Russia (Score:2)
... mushrooms hallucinate you. [slashdot.org]