Forgot your password?

typodupeerror

SPAM: Google's Supplemental Index

Submitted by seobuddy
seobuddy writes "The Big Daddy update of late 2005 to early 2006 was largely about installing a new Supplemental index. The new version is so different to the old version that it shouldn't now be called the Supplemental index. The old Supplemental index was a repository for garbage webpages and such, and was accessed for the search results only when a reasonable number of results couldn't be found in the regular index. The new version is very different because many millions of perfectly good pages are put in it.

Many, perhaps most, websites have plenty of their pages in the Supplemental index because their linkage profiles don't score well enough. Even Google has pages in there — hundreds of thousand of them. A site's linkage profile is an evaluation of the links into and out of the site. Things like linking to off-topic sites, and too high a percentage of a site's inbound links being reciprocals, lowers the score of a site's linkage profile, and reduces the number of pages that it can have in the Regular index, which means that more of its pages are placed in the Supplemental index. Improving the linkage profile brings pages out of the Supplemental index and into the Regular one.

Before Big Daddy, pages in the Supplemental index had been given the kiss of death — they rarely came out, and were rarely seen in the search results. But that has changed, and is continuing to change. It is now possible to bring pages out of theÃÂSupplemental index byÃÂgetting some goodÃÂlinks to the site, and the continued improvement is in the way that the Supplemental index is used by Google's system.

Right now, most of the datacenters are using the new Supplemental index inÃÂthe same way as the old one was used; i.e. get a results set from the Regular index and, if the set isn't large enough, add to it from the Supplemental index. The quality of the results from the Regular index doesn't come into it. If the results set is large enough, the Supplemental index is ignored.

But at least one datacenter operates differently. It operates along the lines of, get a results set from the Regular index. Sometimes many of those results will be poor quality matches (e.g. they only match one word of a three word query), so get some better matches from the Supplemental index. The use of the Supplemental index in a way something like this is likely to spread across the datacenters in 2007.

The new way makes a lot of sense. Since many of the results that are acquired from the Regular index are often poor matches for the query, and since millions of perfectly good pages are now stored in the Supplemental index, some of which will be good matches for many queries, it makes good sense to pull results from the Supplemental index when there are some poor matches from the Regular index.

It's good news for website owners who have large numbers of pages in the Supplemental index. As the new way of operating spreads, more of their pages will rightly find their way into the search results, even though they are in the Supplemental index."

Link to Original Source

Photoshop Disaster Spawns DMCA Lawsuit

Submitted by pickens
pickens writes "Pickens writes:

Cory Doctorow writes that Ralph Lauren issued a DMCA takedown notice after Boing Boing republished the photoshop disaster contained in a Ralph Lauren advertisement in which a model's proportions appear to have been altered to give her an impossibly skinny body with the model's head larger than her pelvis. Doctorow says that one of the things that makes their ISP Priority Colo so awesome is that they don't automatically act on DMCA takedowns and proceeded to dare Lauren to sue. "This is classic fair use: a reproduction "for purposes such as criticism, comment, news reporting," etc," writes Doctorow. "Copyright law doesn't give you the right to threaten your critics for pointing out the problems with your offerings." Doctorow adds that every time Lauren threatens to sue he will "reproduce the original criticism, making damned sure that all our readers get a good, long look at it," "publish your spurious legal threat along with copious mockery," and "offer nourishing soup and sandwiches to your models.""
Security

Why the FBI director doesn't bank online-> 1

Submitted by angry tapir
angry tapir writes "The head of the U.S. Federal Bureau of Investigation has stopped banking online after nearly falling for a phishing attempt. FBI Director Robert Mueller said he recently came "just a few clicks away from falling into a classic Internet phishing scam" after receiving an e-mail that appeared to be from his bank."
Link to Original Source
Google

Brin, Schmidt eye further Google expansion->

Submitted by
angry tapir
angry tapir writes "Despite Google's phenomenal growth, the Internet search giant does not appear to be worried about taking on too many projects, judging from comments made at a media roundtable with company cofounder Sergey Brin and CEO Eric Schmidt. Though Google is expanding into multiple areas such as operating systems, applications, online books and display advertising, more than 90 percent of company revenue comes from keyword-related search advertising, according to Schmidt. Other comments at the roundtable indicated that the Google Chrome OS is still on track to appear on devices in 2010, and that the company is trying to adjust its underlying network architecture to prevent outages having such a widespread impact on users."
Link to Original Source
Technology (Apple)

Mac|Life tours Mythbusters workshop->

Submitted by JaxTJ
JaxTJ writes "In a teaser for an upcoming interview with Mythbuster Adam Savage, the folks at Mac|Life took a tour of M5 and picked their 10 favorite things. From the article:

"We were prepared to be awestruck by the chance to get up close and personal with recognizable MythBusters artifacts. But since the M5 building, which belongs to Hyneman, also serves as the show's office and a place for Hyneman to keep his sizable collection of flotsam and jetsam from past projects throughout his illustrious career in special effects and model building and as the principle of M5, we got so much more than we bargained for in the awe department.""

Link to Original Source

Comment: Re:Because if only.. (Score 4, Interesting) 225

by JaxTJ (#28435493) Attached to: Watch TV On Your Satnav
Absolutely. I used to have a 45-minute commute to work when I lived in New Jersey and I honestly can't remember the number of times that I was almost run into by someone in a 3-ton SUV that was talking or texting instead of driving.

Japanese taxi drivers frequently have the TV playing on their nav units while driving, but they are among the safest drivers I've ever ridden with. I think two way communication is far more taxing to a driver's attention than a receive-only medium.
Google

Copyright legality of google cache?

Submitted by Anonymous Coward
An anonymous reader writes "I've wondered for a long time how it is legal for google to download and redistribute documents that they do not own automatically and presumably without the copyright owner's consent via google cache. The Internet Archive [archive.org] seems like another obvious violator. Even 1998 slashdot [archive.org] is archived and redistributed complete with the copyright notice.
I searched the Internet Archive for a few sites that I know forbid crawlers via robots.txt and they are not in the archive (at least not displayed). I've also searched for sites that I know the owner hasn't given permission for redistribution and the site is displayed.
Do google and the archive interpret a non-existent or permissive robots.txt as permission to redistribute copyrighted works? Does failure to deny their crawlers constitute granting permission?"

The Osmonds! You are all Osmonds!! Throwing up on a freeway at dawn!!!

Working...