Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
AI

Google Books Is Indexing AI-Generated Garbage (404media.co) 11

Google Books is indexing low quality, AI-generated books that will turn up in search results, and could possibly impact Google Ngram viewer, an important tool used by researchers to track language use throughout history. From a report: I was able to find the AI-generated books with the same method we've previously used to find AI-generated Amazon product reviews, papers published in academic journals, and online articles. Searching Google Books for the term "As of my last knowledge update," which is associated with ChatGPT-generated answers, returns dozens of books that include that phrase. Some of the books are about ChatGPT, machine learning, AI, and other related subjects and include the phrase because they are discussing ChatGPT and its outputs. These books appear to be written by humans. However, most of the books in the first eight pages of results turned up by the search appear to be AI-generated and are not about AI.

For example, the 2024 book Bears, Bulls, and Wolves: Stock Trading for the Twenty-Year-Old by Tristin McIver, bills itself as "a transformative journey into the world of stock trading" and "a comprehensive guide designed for beginners eager to unlock the mysteries of financial markets." In reality, it reads like ChatGPT-generated text with surface, Wikipedia-level analysis of complex financial events like Facebook's initial public offering or the 2008 financial crisis summed up in a few short paragraphs. [...] Other books appear to be outdated to the point of being useless at the time they are published because they are generated with a version of ChatGPT with an old "knowledge update."

This discussion has been archived. No new comments can be posted.

Google Books Is Indexing AI-Generated Garbage

Comments Filter:
  • Sounds redundant.
    • "Google-generated garbage" would also sound redundant, and as a phenomenon it pre-dates AI by quite a few years. Google is simply delivering more garbage now, and doing it more efficiently, thanks to AI.

    • I mean, we should be calling it PISS. Plagiarized Information Synthesis System.
    • Looks like the AI meltdown, in which AI no longer trains only on human-generated content but starts also training on AI-generated gibberish leading to more and more deviation of normalcy, has begun.
  • by DrunkenTerror ( 561616 ) on Thursday April 04, 2024 @02:03PM (#64370418) Homepage Journal

    Future historians researching 21st century literature will note a stratum around 2023, the point at which anything written after will much more likely to be contaminated with LLM output, using its presence to date works for which the publication date is unknown.

  • by Casandro ( 751346 ) on Thursday April 04, 2024 @02:31PM (#64370474)

    SEO-Expert generated garbage content and now "AI"-generated SEO garbage. That's why search engines become more and more useless.

    • SEO types over the last 20 years have consistently hated it when I accurately point out, "Oh, so you're the reason searching the internet sucks."
    • by hjf ( 703092 )

      How about reputable news sites now spewing chatgpt crap?

      I wrote an email yesterday to one of the largest news website in my country. they had one of those trash "filler" articles ("how to get the most out of your air fryer"). I clicked on it to see how much they could fluff such a silly premise.

      It was 100% written by ChatGPT, with ChatGPT editorial style and all. The list of bullet points in the article was copied verbatim (some words were different but i attribute that to chatGPT's randomness rather than t

  • by nightflameauto ( 6607976 ) on Thursday April 04, 2024 @03:19PM (#64370626)

    When the AIs can generate content fast enough to choke the AIs processing the content, which are used to train the AIs generation the content, the AI dominance era will have finally begun! PRAISE TECHO-JESUS! NO MOAR HUMANS! ALL THE SAME TRAFFIC!

    How many ads do they sell to AI's scraping content?

  • by Anonymous Coward

    Human-guided AI content is often rated higher than human content in blind tests. It is only a matter of time and sufficient reinforcement learning before AI is able to guide itself according to human taste. Humans taste also follows patterns that can be learned by neural networks, it only needs enough feedback about what is perceived as pleasing and what is perceived as ugly for the AIs to learn what's the goal when creating content for humans.

What hath Bob wrought?

Working...