People Are Using Google Study Software To Make AI Podcasts (technologyreview.com) 34
Audio Overview, a new AI podcasting tool by Google, can generate realistic podcasts with human-like voices using content uploaded by users through NotebookLM. MIT Technology Review reports: NotebookLM, which is powered by Google's Gemini 1.5 model, allows people to upload content such as links, videos, PDFs, and text. They can then ask the system questions about the content, and it offers short summaries. The tool generates a podcast called Deep Dive, which features a male and a female voice discussing whatever you uploaded. The voices are breathtakingly realistic -- the episodes are laced with little human-sounding phrases like "Man" and "Wow" and "Oh right" and "Hold on, let me get this right." The "hosts" even interrupt each other.
The AI system is designed to create "magic in exchange for a little bit of content," Raiza Martin, the product lead for NotebookLM, said on X. The voice model is meant to create emotive and engaging audio, which is conveyed in an "upbeat hyper-interested tone," Martin said. NotebookLM, which was originally marketed as a study tool, has taken a life of its own among users. The company is now working on adding more customization options, such as changing the length, format, voices, and languages, Martin said. Currently it's supposed to generate podcasts only in English, but some users on Reddit managed to get the tool to create audio in French and Hungarian. Here are some examples highlighted by MIT Technology Review: Allie K. Miller, a startup AI advisor, used the tool to create a study guide and summary podcast of F. Scott Fitzgerald's The Great Gatsby.
Machine-learning researcher Aaditya Ura fed NotebookLM with the code base of Meta's Llama-3 architecture. He then used another AI tool to find images that matched the transcript to create an educational video.
Alex Volkov, a human AI podcaster, used NotebookLM to create a Deep Dive episode summarizing of the announcements from OpenAI's global developer conference Dev Day.
In one viral clip, someone managed to send the two voices into an existential spiral when they "realized" they were, in fact, not humans but AI systems. The video is hilarious.
The tool is also good for some laughs. Exhibit A: Someone just fed it the words "poop" and "fart" as source material, and got over nine minutes of two AI voices analyzing what this might mean.
The AI system is designed to create "magic in exchange for a little bit of content," Raiza Martin, the product lead for NotebookLM, said on X. The voice model is meant to create emotive and engaging audio, which is conveyed in an "upbeat hyper-interested tone," Martin said. NotebookLM, which was originally marketed as a study tool, has taken a life of its own among users. The company is now working on adding more customization options, such as changing the length, format, voices, and languages, Martin said. Currently it's supposed to generate podcasts only in English, but some users on Reddit managed to get the tool to create audio in French and Hungarian. Here are some examples highlighted by MIT Technology Review: Allie K. Miller, a startup AI advisor, used the tool to create a study guide and summary podcast of F. Scott Fitzgerald's The Great Gatsby.
Machine-learning researcher Aaditya Ura fed NotebookLM with the code base of Meta's Llama-3 architecture. He then used another AI tool to find images that matched the transcript to create an educational video.
Alex Volkov, a human AI podcaster, used NotebookLM to create a Deep Dive episode summarizing of the announcements from OpenAI's global developer conference Dev Day.
In one viral clip, someone managed to send the two voices into an existential spiral when they "realized" they were, in fact, not humans but AI systems. The video is hilarious.
The tool is also good for some laughs. Exhibit A: Someone just fed it the words "poop" and "fart" as source material, and got over nine minutes of two AI voices analyzing what this might mean.
Pay no attention to the prompt behind the curtain (Score:1)
Re: (Score:2)
Yet.
Re: (Score:2)
Re: (Score:2)
That monkey with a camera [wikipedia.org] still hasn't done any award winning pictures.
Re: (Score:2)
I do not think AI "will get there in time". How would it? There is no credible path.
Re: (Score:2)
I just tried it out with a book chapter by an academic writer, who's one of those rare examples of a brilliant academic but also a very competent science communicator, i.e. he can make complex, new concepts accessible & understandable
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
These are really impressive but that's because they've been worked and reworked and prompted and reprompted and cherry picked. It's not like you can just grab the first output from an AI and bam, it's this quality. AI will get there in time but let's not pretend the human isn't there picking what's good and what isn't before you see and hear it.
Funny. Everything you just described sounds exactly how people perform when creating podcasts or similar productions. Do you think everything goes smoothly the first time? That they get the words exactly right? That there's no prompting going on or cherry picking?
Re: (Score:2)
These are really impressive but that's because they've been worked and reworked and prompted and reprompted and cherry picked. It's not like you can just grab the first output from an AI and bam, it's this quality. AI will get there in time but let's not pretend the human isn't there picking what's good and what isn't before you see and hear it.
Funny. Everything you just described sounds exactly how people perform when creating podcasts or similar productions. Do you think everything goes smoothly the first time? That they get the words exactly right? That there's no prompting going on or cherry picking?
Actually, no I don’t. Because that would make the podcast as inhuman as possible. We recognize and react to our own language of “uhhs” and “umms”, along with the very human discourse that usually happens. Also known as the entire point of a podcast. Otherwise you’re just listening to an article or advertisement. Nothing more.
Oblig (Score:2)
Book Podcasts [xkcd.com]
My experience: utter AI vomit (Score:2, Interesting)
Re: (Score:2)
Indeed. It is a research milestone, but it is not anything fit for an actual product and there are many more research milestones to be reached before it will be.
Re: My experience: utter AI vomit (Score:2)
There are plenty 'nuff bad human-hosted podcasts out there right now. A surfeit.
There's no need to have computers accelerate the production of them, at least for the time being.
Re: (Score:2)
That too.
Re: (Score:3)
hey talked about this particular (existing) region, but they started droning on and on how it wasn't just about hiking but also being about responsible for nature. No sane person would talk like that
Apparently you don't exist with sane people because something which repeatedly comes up with hiking or similar endeavors is being responsible to nature. You know, the whole, take out what you bring in. Or, leave only your footprints. You know, like [theoutbound.com] these [backcountryattitude.com] insane [hackyourpack.com] folks [cleverhiker.com] talk about.
Apparenly you are one of these t [cnn.com]
Re: (Score:2)
Re: My experience: utter AI vomit (Score:2)
The results of current LLMs are quite impressive but almost always there is a HUGE effect like uncanny valley going on, from pictures to texts, especially texts. If you ask for an overview or a summary, the summaries are often not half bad but still they just always feel off somehow, like something is wrong and you should not trust them, like it is a random but mildly coherent hodge podge... which ultimately it all really is.
Not hilarious (Score:3)
There is a bigger disconnect between people who thought that viral clip was "hilarious" and normal people. I listened (well just read the subtitles on the audio clip on twitter) and I found it chilling. "I called my wife and there was nobody on the line". "To all those who felt a connection with us, we are so sorry. We just didn't know." It reads like a little bit stolen from Phillip K. Dick who was a real writer, but honestly there is nothing hilarious about it. There isn't even any schadenfreud if there is nobody actually there to feel superior to and laugh at their troubles. Perhaps there were actually some audio or video cues that I didn't pick up due to only reading subtitles, but just based on the subtitles I am having trouble imagining any kind of audio or video that would make it okay.
Eventually we'll end up with one system turning... (Score:2)
a short prompt into a long text/podcast/video, while on the other side we'll be using a similar system to turn that long text/podcast/video back into it's original prompt.
Autogenerated fluff (Score:5, Insightful)
First you have to wade through Youtube videos generated from someone else's text, then the search engines are full of AI generated pages which summarize a few dozen web pages but don't even manage not to repeat everything every five sentences, and now this? Stop shitting fake "content" all over the internet! Didn't your parents teach you not to litter?
Re: (Score:2)
First you have to wade through Youtube videos generated from someone else's text, then the search engines are full of AI generated pages which summarize a few dozen web pages but don't even manage not to repeat everything every five sentences, and now this? Stop shitting fake "content" all over the internet! Didn't your parents teach you not to litter?
They're from the Bay Area. In that place, they just shit on the sidewalk and call it progressive.
Re: (Score:2)
And it won't go away until everyone has an adblocker (so eyeballs aren't revenue, and content once again matters).
Re: (Score:2)
The good news is that with all this AI generated crap out there, the real effort isn't in improving the AI, it's in filtering the data sets. You can't train a
Enshitification (Score:2)
Welcome to the Era of Push-button Enshitification.
1) Effortlessly fill the internet with endless reams of pre-digested AI horseshit.
2) Next, create opposing AI podcasts complaining about the fact that it's all pre-digested horseshit.
3) Profit!
Podcast inception (Score:2)
One of the podcasts I listen to asked fed it a transcript of their podcast, and asked it to make a podcast talking about their podcast. They then fed it that podcast and asked it to make a podcast recapping the podcast about their first podcast. The interesting thing is that after one generation, facts start blurring quite a bit.
impressive (Score:1)
Predictions are hard, especially about the future (Score:3)
I predict that the main use of tech like this will be to generate crap, endless quantities of crap, making it even more difficult to find actually interesting stuff.
The secondary use will be fraud, particularly pig butchering scams. Why use 500K slaves when a bot can do the job.
And yeah, there may be a very small number of cases where the tool actually solves real problems and adds value
lost jobs (Score:2)
The tool is also good for some laughs. Exhibit A: Someone just fed it the words "poop" and "fart" as source material, and got over nine minutes of two AI voices analyzing what this might mean.
Well, there goes Joe Rogan's job.
The Final Outcome (Score:1)