Google

Google's Personal Data Removal Tool Now Covers Government IDs (blog.google) 14

Google on Tuesday expanded its "Results about you" tool to let users request the removal of Search results containing government-issued ID numbers -- including driver's licenses, passports and Social Security numbers -- adding to the tool's existing ability to flag results that surface phone numbers, email addresses, and home addresses.

The update, announced on Safer Internet Day, is rolling out in the U.S. over the coming days. Google also streamlined its process for reporting non-consensual explicit images on Search, allowing users to select and submit removal requests for multiple images at once rather than reporting them individually.
AI

Perplexity Launches Subscription Program That Includes Revenue Sharing With Publishers (pymnts.com) 7

An anonymous reader quotes a report from PYMNTS: Artificial intelligence startup Perplexity has announced a new subscription program called Comet Plus that it said gives users access to premium content from trusted publishers and journalists, while providing publishers with a better compensation model. "Comet Plus transforms how publishers are compensated in the AI age," the company said in a Monday blog post. "As users demand a better internet in the age of AI, it's time for a business model to ensure that publishers and journalists benefit from their contributions to a better internet."

Comet Plus is included in Perplexity's Pro and Max memberships and is available as a standalone subscription for $5 per month. Perplexity introduced its Comet AI-powered browser in July, saying the tool lets users answer questions and carry out tasks and research from a single interface. Bloomberg reported Monday that Perplexity has allocated $42.5 million for a revenue sharing program that compensates publishers when their content is used by its Comet browser or AI assistant. The program will use funds that come from Comet Plus and will deliver 80% of the revenue to publishers, with Perplexity getting the other 20%, the report said, citing an interview with Perplexity CEO Aravind Srinivas. "AI is helping to create a better internet, but publishers still need to get paid," Srinivas said in the report. "Sowe think this is actually the right solution, and we're happy to make adjustments along the way."

The Internet

Reddit Wants To Be a Search Engine Now (theverge.com) 41

Reddit wants to become a full-fledged search engine, leveraging its vast repository of human-generated content and expanding its AI-powered Reddit Answers tool. In its latest note (PDF) to investors, CEO Steve Huffman says the company is "concentrating our resources on the areas that will drive results for our most pressing needs," including "making Reddit a go-to search engine." The Verge reports: Huffman says that "every week, hundreds of millions of people come to Reddit looking for advice, and we're turning more of that intent into active users of Reddit's native search." Reddit's core search has more than 70 million weekly active unique users -- Reddit overall averages 416.4 million weekly active unique users -- and Reddit Answers, the platform's AI search tool that it launched in December, has 6 million weekly users, up from 1 million weekly users in the first quarter of this year. To continue to build out search, Reddit is "expanding Reddit Answers globally, integrating it more deeply into the core search experience, and making search a central feature across Reddit," Huffman says.
Google

Google's Exclusive Reddit Access (404media.co) 43

Google is now the only search engine that can surface results from Reddit, making one of the web's most valuable repositories of user generated content exclusive to the internet's already dominant search engine. 404 Media: If you use Bing, DuckDuckGo, Mojeek, Qwant or any other alternative search engine that doesn't rely on Google's indexing and search Reddit by using "site:reddit.com," you will not see any results from the last week.

DuckDuckGo is currently turning up seven links when searching Reddit, but provides no data on where the links go or why, instead only saying that "We would like to show you a description here but the site won't allow us." Older results will still show up, but these search engines are no longer able to "crawl" Reddit, meaning that Google is the only search engine that will turn up results from Reddit going forward. Searching for Reddit still works on Kagi, an independent, paid search engine that buys part of its search index from Google. The news shows how Google's near monopoly on search is now actively hindering other companies' ability to compete at a time when Google is facing increasing criticism over the quality of its search results.
The news follows Google signing a $60 million deal with Reddit early this year to use the social network's content to train its LLMs.
The Internet

Archie, the Internet's First Search Engine, Is Rescued and Running (arstechnica.com) 35

An anonymous reader quotes a report from Ars Technica: It's amazing, and a little sad, to think that something created in 1989 that changed how people used and viewed the then-nascent Internet had nearly vanished by 2024. Nearly, that is, because the dogged researchers and enthusiasts at The Serial Port channel on YouTube have found what is likely the last existing copy of Archie. Archie, first crafted by Alan Emtage while a student at McGill University in Montreal, Quebec, allowed for the searching of various "anonymous" FTP servers around what was then a very small web of universities, researchers, and government and military nodes. It was groundbreaking; it was the first echo of the "anything, anywhere" Internet to come. And when The Serial Port went looking, it very much did not exist.

While Archie would eventually be supplanted by Gopher, web portals, and search engines, it remains a useful way to index FTP sites and certainly should be preserved. The Serial Port did this, and the road to get there is remarkable and intriguing. You are best off watching the video of their rescue, along with its explanatory preamble. But I present here some notable bits of the tale, perhaps to tempt you into digging further.

Google

Revolutionary New Google Feature Hidden Under 'More' Tab Shows Links To Web Pages (404media.co) 32

An anonymous reader shares a report: After launching a feature that adds more AI junk than ever to search results, Google is experimenting with a radical new feature that lets users see only the results they were looking for, in the form of normal text links. As in, what most people actually use Google for. "We've launched a new 'Web' filter that shows only text-based links, just like you might filter to show other types of results, such as images or videos," the official Google Search Liaison Twitter account, run by Danny Sullivan, posted on Tuesday. The option will appear at the top of search results, under the "More" option.

"We've added this after hearing from some that there are times when they'd prefer to just see links to web pages in their search results, such as if they're looking for longer-form text documents, using a device with limited internet access, or those who just prefer text-based results shown separately from search features," Sullivan wrote. "If you're in that group, enjoy!" Searching Google has become a bloated, confusing experience for users in the last few years, as it's gradually started prioritizing advertisements and sponsored results, spammy affiliate content, and AI-generated web pages over authentic, human-created websites.

Google

Google Tests Removing the News Tab From Search Results (niemanlab.org) 37

An anonymous reader shares a report: News publishers are worried -- with good reason -- about changes coming to Google Search. AI-generated content replacing links on some of the most valuable space on the internet, in particular, has left media types with a lot of questions, starting with "is this going to be a traffic-destroying nightmare?" The News filter disappearing from Google search results for some users this week won't help publishers sleep any easier. Google confirmed some users were not seeing the News filter as part of ongoing testing. "We're testing different ways to show filters on Search and as a result, a small subset of users were temporarily unable to access some of them," a Google spokesperson confirmed via email.
The Courts

Judge Rules Against Users Suing Google and Apple Over 'Annoying' Search Results (arstechnica.com) 22

An anonymous reader quotes a report from Ars Technica: While the world awaits closing arguments later this year in the US government's antitrust case over Google's search dominance, a California judge has dismissed a lawsuit from 26 Google users who claimed that Google's default search agreement with Apple violates antitrust law and has ruined everyone's search results. Users had argued (PDF) that Google struck a deal making its search engine the default on Apple's Safari web browser specifically to keep Apple from competing in the general search market. These payments to Apple, users alleged, have "stunted innovation" and "deprived" users of "quality, service, and privacy that they otherwise would have enjoyed but for Google's anticompetitive conduct." They also allege that it created a world where users have fewer choices, enabling Google to prefer its own advertisers, which users said caused an "annoying and damaging distortion" of search results.

In an order (PDF) granting the tech companies' motion to dismiss, US District Judge Rita Lin said that users did not present enough evidence to support claims for relief. Lin dismissed some claims with prejudice but gave leave to amend others, allowing users another chance to keep their case -- now twice-dismissed -- at least partially alive. Under Lin's order, users will not be able to amend claims that Google and Apple executives allegedly sealed the default search deal on the condition that Apple would not create its own general search engine through "private, secret, and clandestine personal meetings." Because plaintiffs showed no evidence pinpointing exactly when Apple allegedly agreed to stay out of the general search market, these meetings, Lin reasoned, could just as easily indicate "rational, legal business behavior," rather than an "illegal conspiracy."

Users attempted to argue that Google and Apple intentionally hid these facts from the public, but Lin wrote that their "conclusory and vague allegations that defendants 'secretly conducted meetings' and 'engaged in conduct to obfuscate internal communications' are plainly insufficient." Sharing bystander photos documenting Google's Sundar Pichai and Apple's Tim Cook meeting at a restaurant with a manila folder tucked under Pichai's elbow did not help users' case. Lin was also not moved by users demonstrating that Google has a history of destroying evidence, because "they put forth no specific factual allegations that defendants did so in this case." However, users will have 30 days to amend currently "inadequately" alleged claims that "Google's exclusive default agreement, under which Apple set Google as the default search engine for its Safari web browser, foreclosed competition in the general search services market in the United States," Lin wrote. If users miss that deadline, the case will be tossed with no opportunities to further amend claims.

Google

Google Search Really Has Gotten Worse, Researchers Find (404media.co) 58

An anonymous reader quotes a report from 404 Media: Google search really has been taken over by low-quality SEO spam, according to a new, year-long study by German researchers (PDF). The researchers, from Leipzig University, Bauhaus-University Weimar, and the Center for Scalable Data Analytics and Artificial Intelligence, set out to answer the question "Is Google Getting Worse?" by studying search results for 7,392 product-review terms across Google, Bing, and DuckDuckGo over the course of a year. They found that, overall, "higher-ranked pages are on average more optimized, more monetized with affiliate marketing, and they show signs of lower text quality ... we find that only a small portion of product reviews on the web uses affiliate marketing, but the majority of all search results do."

They also found that spam sites are in a constant war with Google over the rankings, and that spam sites will regularly find ways to game the system, rise to the top of Google's rankings, and then will be knocked down. "SEO is a constant battle and we see repeated patterns of review spam entering and leaving the results as search engines and SEO engineers take turns adjusting their parameters," they wrote. They note that Google, Bing, and DuckDuckGo are regularly tweaking their algorithms and taking down content that is outright spam, but that, overall, this leads only to "a temporary positive effect."

"Search engines seem to lose the cat-and-mouse game that is SEO spam," they write. Notably, Google, Bing, and DuckDuckGo all have the same problems, and in many cases, Google performed better than Bing and DuckDuckGo by the researchers' measures. The researchers warn that this rankings war is likely to get much worse with the advent of AI-generated spam, and that it genuinely threatens the future utility of search engines: "the line between benign content and spam in the form of content and link farms becomes increasingly blurry -- a situation that will surely worsen in the wake of generative AI. We conclude that dynamic adversarial spam in the form of low-quality, mass-produced commercial content deserves more attention."

Privacy

Face Search Engine PimEyes Blocks Searches of Children's Faces (nytimes.com) 25

PimEyes, a search engine that relies on facial recognition to help people scan billions of images to find photos of themselves on the internet, announced that it has banned searches of minors as part of the company's "no harm policy." The New York Times reports: PimEyes, a subscription-based service that uses facial recognition technology to find online photos of a person, has a database of nearly three billion faces and enables about 118,000 searches per day, according to [PimEyes CEO Giorgi Gobronidze]. The service is advertised as a way for people to search for their own face to find any unknown photos on the internet, but there are no technical measures in place to ensure that users are searching only for themselves. Parents have used PimEyes to find photos of their children on the internet that they had not known about. But the service could also be used nefariously by a stranger. It had previously banned more than 200 accounts for inappropriate searches of children's faces, Mr. Gobronidze said.

"Images of children might be used by the individuals with twisted moral compass and values, such as pedophiles, child predators," Mr. Gobronidze said. PimEyes will still allow searches of minors' faces by human rights organizations that work on children's rights issues, he added. Mr. Gobronidze said that blocking searches of children's faces had been on "the road map" since he acquired the site in 2021, but the protection was fully deployed only this month after the publication of a New York Times article on A.I.-based threats to children. Still, the block isn't airtight. PimEyes is using age detection A.I. to identify photos of minors. Mr. Gobronidze said that it worked well for children under the age of 14 but that it had "accuracy issues" with teenagers.

It also may be unable to identify children as such if they're not photographed from a certain angle. To test the blocking system, The Times uploaded a photo of Mary-Kate and Ashley Olsen from their days as child stars to PimEyes. It blocked the search for the twin who was looking straight at the camera, but the search went through for the other, who is photographed in profile. The search turned up dozens of other photos of the twin as a child, with links to where they appeared online. Mr. Gobronidze said PimEyes was still perfecting its detection system.

Privacy

Brave Cuts Ties With Bing To Offer Its Own Image and Video Search Results (theregister.com) 14

Brave Software, maker of the Brave web browser, has tuned its search engine to run on a homegrown index of images and videos in an effort to end its dependency on "Big Tech" rivals. The Register reports: On Thursday, the company said that image and video results from Brave Search -- available on the web at search.brave.com and via its browser -- will be served from Brave's own index. Search indexes are made by visiting online resources -- typically web pages, images, videos, or other files -- with a crawler bot and recording the locations of these resources in a database. And when an internet user submits a query to a search engine, the search engine checks its index (and possible other sources) to find the addresses of resources that correspond to the query keywords. There's actually a lot more to it but that's the basic idea.

Brave now aims to ride the wave of discontent with "Big Tech" by highlighting its commitment to privacy and independence â" small tech. "Brave Search is 100 percent private and anonymous, which sets a high bar for image/video search to meet," the company said in a blog post provided to The Register. "Whether it's a matter of personal safety or personal preference, users should be able to discover content without their search engine reporting and profiling those results to a Big Tech company." [...] Brave argues that having its own index frees the company from content decisions made by others.
"Brave is on a mission to build a user-first Web," the company said in its blog post. "That mission starts with the Brave browser and Brave Search. With the release of image and video search, we're continuing to innovate within the search industry, providing viable and preferable products for users who want choice and transparency in their search for information online."
The Internet

Brave Releases Its Search API (thurrott.com) 8

Brave has launched its Brave Search API, allowing third parties to integrate its privacy-preserving and ad-free search results into their applications through a simple API call. Thurrott reports: Brave notes that its Search API is inexpensive and that it's a great fit for Artificial Intelligence (AI) and Large Language Models developers in particular because it provides access to a collection of high-quality, Web-scale data including recent events. Brave claims that its standalone Brave Search offering now delivers over 8 billion annualized queries, which makes it the fastest-growing search engine since Microsoft Bing. And in sharp contrast to the market leaders, Brave Search is private and transparent. Plus, it's fueled by opt-in users of the Brave browser's Web Discovery Project, which adds millions of new web pages to the index every single day and keeps it current and fresh. The Brave web browser has over 60 million active users now, the company adds.

A free version of the Brave Search API provides one search query per second and up to 2,000 queries per month. Paid tiers start at $3 CPM (cost per one thousand) for 20 queries per second and up to 20 million queries per month, with access to web search, Goggles, news cluster, and videos cluster, plus added cost access to autosuggest and spellcheck at $5 per 10,000 requests. Higher-price tiers add more queries per second and per month, plus additional capabilities like schema-enriched web results, infobox, FAQ, discussions, locations, and more.

AI

Google Search Starts Rolling Out ChatGPT-style Generative AI Results (arstechnica.com) 14

Google's "Search Generative Experience" is a plan to put ChatGPT-style generative AI results right in your Google search results page, and the company announced the feature is beginning to roll out today. At least, the feature is rolling out to the mobile apps for people who have been on the waitlist and were chosen as early access users. From a report: Unlike the normally stark-white Google page with 10 blue links, Google's generative AI results appear in colorful boxes above the normal search results. Google will scrape a bunch of information from all over the Internet and present it in an easy list, with purchase links to Best Buy and manufacturers' websites. If this ever rolls out widely, it would be the biggest change to Google Search results ever, and this design threatens to upend the entire Internet. One example screenshot of a "Bluetooth speaker" search on desktop shows a big row of "Sponsored" shopping ads, then the generative AI results start to show up in a big blue box about halfway down the first page. The blue box summarizes a bunch of information harvested from somewhere and lists several completely unsourced statements and opinions about each speaker. In Google's example, users are never told where this information comes from, so they can't make any judgment as to its trustworthiness.
Google

Google Is Spared a Search-Engine Switch by a Major Partner (wsj.com) 13

Samsung Electronics won't be swapping out the default search engine on its smartphones from Google to Microsoft's Bing any time soon, WSJ reported Friday, citing people familiar with the matter. From the report: Samsung, the world's largest smartphone maker, has suspended an internal review that had explored replacing Google with Bing on its mobile devices, the people said. The potential switch would have swapped out Google as the go-to search engine on Samsung's "Internet" web-browsing app, which comes preinstalled on the South Korean company's smartphones. Any imminent breakup would have handed Bing a coveted victory in a search-engine space that has long been dominated by Alphabet-owned Google. This year, Bing gained some fresh momentum as it adopted the features of ChatGPT, the chatbot that has surged in popularity and is run by Microsoft-backed OpenAI.
AI

Google Search Gets AI-Powered 'Snapshots' (theverge.com) 14

"The AI takeover of Google Search starts now," writes The Verge's David Pierce. At Google I/O today, the company demoed a new opt-in feature called Search Generative Experience (SGE). The new experience generates AI "snapshots" that appear at the top of the search results page consisting of an AI-generated summary about your query, with links to sources of information and shopping. From the report: To demonstrate, Liz Reid, Google's VP of Search, flips open her laptop and starts typing into the Google search box. "Why is sourdough bread still so popular?" she writes and hits enter. Google's normal search results load almost immediately. Above them, a rectangular orange section pulses and glows and shows the phrase "Generative AI is experimental." A few seconds later, the glowing is replaced by an AI-generated summary: a few paragraphs detailing how good sourdough tastes, the upsides of its prebiotic abilities, and more. To the right, there are three links to sites with information that Reid says "corroborates" what's in the summary.

Google calls this the "AI snapshot." All of it is by Google's large language models, all of it sourced from the open web. Reid then mouses up to the top right of the box and clicks an icon Google's designers call "the bear claw," which looks like a hamburger menu with a vertical line to the left. The bear claw opens a new view: the AI snapshot is now split sentence by sentence, with links underneath to the sources of the information for that specific sentence. This, Reid points out again, is corroboration. And she says it's key to the way Google's AI implementation is different. "We want [the LLM], when it says something, to tell us as part of its goal: what are some sources to read more about that?"

A few seconds later, Reid clicks back and starts another search. This time, she searches for the best Bluetooth speakers for the beach. Again, standard search results appear almost immediately, and again, AI results are generated a few seconds later. This time, there's a short summary at the top detailing what you should care about in such a speaker: battery life, water resistance, sound quality. Links to three buying guides sit off to the right, and below are shopping links for a half-dozen good options, each with an AI-generated summary next to it. I ask Reid to follow up with the phrase "under $100," and she does so. The snapshot regenerates with new summaries and new picks.
"This is the new look of Google's search results page," concludes Pierce. "It's AI-first, it's colorful, and it's nothing like you're used to. It's powered by some of Google's most advanced LLM work to date, including a new general-purpose model called PaLM 2 and the Multitask Unified Model (MUM) that Google uses to understand multiple types of media."

"In the demos I saw, it's often extremely impressive. And it changes the way you'll experience search, especially on mobile, where that AI snapshot often eats up the entire first page of your results."
The Internet

DuckDuckGo's Building AI-Generated Answers Into Its Search Engine (theverge.com) 26

DuckDuckGo announced a new tool called DuckAssist that "automatically pulls and summarizes information from Wikipedia in response to certain questions," reports The Verge. From the report: DuckAssist's beta is live on the search engine right now -- but only through DuckDuckGo's mobile apps and browser extensions. Gabriel Weinberg, the founder and CEO of DuckDuckGo, says the company will add it to the web-based search engine if the trial "goes well." When you enter a question that DuckAssist can help with, you'll see a box that says "I can check to see if Wikipedia has relevant info on this topic, just ask" at the very top of your search results. Hit the blue "Ask" button, and you'll get an AI-generated answer using summarized information from Wikipedia. If DuckAssist has already answered a question once before, that response will automatically appear, which means you won't have to "ask" it the same thing multiple times.

While the tool's built upon language models from OpenAI, the company that makes ChatGPT, and the Google-backed Anthropic, Weinberg says it'll retain the same focus on privacy as DuckDuckGo. According to the announcement, DuckAssist won't share any personally identifiable information with OpenAI and Anthropic, and neither company will use your anonymous questions to train their models. DuckDuckGo says the feature uses the "most recent full Wikipedia download available," which is around a few weeks old, so it might not be able to help if you're searching for something later than that. However, the company plans to update this in the future, as well as add more sources for DuckAssist to draw from.

Google

Google Will Soon Blur Explicit Images By Default in Search Results (theverge.com) 67

Google is introducing a new online safety feature to help users avoid inadvertently seeing graphically violent or pornographic images while using its search engine. From a report: Announced as part of the company's Safer Internet Day event on Tuesday, the new default setting enabled for everyone will automatically blur explicit images that appear in search results, even for users that don't have SafeSearch enabled. Google has confirmed to The Verge that, should they wish, signed-in users over 18 will be able to disable the blur setting entirely after it launches in "the coming months."
Google

Google Is Making It Easier To Find Search Results From Reddit and Other Forums (engadget.com) 41

Google is making it easier to find search results from Reddit and other forum sites. Engadget reports: The search engine is adding a new module that will surface discussions happening on forums across the web for queries that may benefit from crowd-sourced answers. The "discussions and forums" module will surface relevant posts from sites like Reddit and Quora alongside more traditional search results. It's not clear exactly how Google is determining what types of searches are best suited to forum posts. The company says the new "forum" results will "appear when you search for something that might benefit from the diverse personal experiences found in online discussions."

Google is also adding a new feature to news-related searches that will make it easier to browse international headlines that are published in languages other than English. With the change, news-related searches will also turn up relevant local coverage translated by Google.
In other Google Search-related news, the company announced that starting today people in the U.S. will be able to use their new "Results About You" feature, "which aims to provide a simpler way for people to get their sensitive personal information out of the company's search results," reports Gizmodo. "Next year, Results About You will become proactive and allow users to opt in to alerts when new personal information related to them appears in search results, enabling users to request removal more quickly."

Slashdot Top Deals