Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×

El Reg Says Google Choking on Spam Sites 234

Grubby Games writes "The Register is reporting that Google is full, and in trouble." From the article: "Recently, we featured a software tool that can create 100 Blogger weblogs in 24 minutes, called Blog Mass Installer. A subterranean industry of sites providing 'private label articles,' or PLAs exists to flesh out 'content' for these freshly minted sites. And as a result, legitimate sites are often caught in the cross fire. But the new algorithms may not be solely to blame. Google's chief executive Eric Schmidt has hinted at another reason for the recent chaos. In Google's earnings conference call last month, Schmidt was frank about the extent of the problem. 'Those machines are full,' he said. 'We have a huge machine crisis.'" James Robertson points out that's a fairly selective bit of quoting.
This discussion has been archived. No new comments can be posted.

El Reg Says Google Choking on Spam Sites

Comments Filter:
  • by jamie ( 78724 ) * <jamie@slashdot.org> on Friday May 05, 2006 @04:35PM (#15273181) Journal
    Meanwhile, for no good reason, here's some gorgeous stats porn [drunkmenworkhere.org] on how Google (and Yahoo and MSN) crawled a sample website. The animations and closeups of the trees are very cool.
  • by NewWorldDan ( 899800 ) <dan@gen-tracker.com> on Friday May 05, 2006 @04:37PM (#15273202) Homepage Journal
    Over the past 6 months or so, I've been finding a lot of link farms in my search results. Oh, irony or irony, SEOs are making search results worthless.
  • by TheNoxx ( 412624 ) on Friday May 05, 2006 @04:39PM (#15273210) Homepage Journal
    You know, writing code and assuming that an end user somewhere will do the dumbest thing imaginable, but I guess nobody ever imagined the possible effects of collusion between extreme stupidity and cleverness (spammers). I know I'd never would have thought that someone would go to such lengths and spend so much time to barely scrape out a living while pissing off countless hordes of people. How do you go about creating enough international legislation and cooperation to catch these guys without crippling the internet with regulation? Are third world countries even capable of compliance? All I can think of is that we need something on the level of the UN where tech-heavy countries are given jurisdiction over other nations that don't have the resources needed to police these kinds of things in exchange for a fee , or maybe a guarantee that said nation will dedicate x amount of troops to any areas needing occupation to stop civil war or genocide or something. Am I over-reacting here? I just can't help but think that dealing with this problem without any legal consequence for the spammers is just encouraging and allowing them to come up with ways around whatever solution is currently in place.

    Eh, or I could be completely off my rocker, and just not competent enough to see a simple and effective method of combating these guys.
  • Fud Light (Score:2, Interesting)

    by Loconut1389 ( 455297 ) * on Friday May 05, 2006 @04:40PM (#15273219)
    I do hate it when searching for something about 4-10 pages in a row are purely sites that pretend to have what you're looking for but are merely meta dumps with adwords or other advertising mechanisms on them. Some of them even have valid cached pages. That said, this article, while certainly Fud, is only Fud Light. I personally prefer Fud Dark- at least I can generally laugh at the article's absurdity. This one was more or less just plain retarded.
  • by merreborn ( 853723 ) on Friday May 05, 2006 @04:56PM (#15273325) Journal
    ...It's not like google invented internet advertising.

    Banner ads were taking the same path. If anything, we should thank google for making internet advertising less intrusive.
  • by s-gen ( 890660 ) on Friday May 05, 2006 @05:18PM (#15273486)
    ...then eventually the spam sites will actually contain the information you were looking for.
  • Re:One idea? (Score:3, Interesting)

    by IamTheRealMike ( 537420 ) on Friday May 05, 2006 @06:01PM (#15273775)
    How does a moderator prove they are in fact a legit human and not a bot?

    I foresee a time when to access large parts of the net you will be required to use some central "proof of life" system. The current mish-mash of captchas isn't working. We have custom English captchas on a forum I admin and it doesn't seem to stop the bots: presumably when they get stuck they call for help.

    It's hard to believe a third of Googles index is auto-generated crap, but then I couldn't really believe the "50% of net traffic is spam or viruses" claim either and I'm pretty sure that one turned out to be true. It appears that an unregulated commons will always degenerate into a wasteland without some form of governance and law enforcement; perhaps rather than an arms race the only solution is for the internet to grow its own legal system and police force (how that'd work is left as an exercise to the imagination)

  • by Anonymous Coward on Friday May 05, 2006 @07:17PM (#15274213)
    Interesting though that they index fairly different things.

    Top 10 results for "slashdot poneys" on yahoo:

    1. slashdot.cuteness.org (not on google)
    2. jfaughnan.blogspot.com (#1 on google)
    3. jfaughnan.blogspot.com (#1 on google)
    4. index.cristal-trace.com (not on google, outdated link)
    5. mfrost.typepad.com (#22 on google)
    6. pcdq.blogspot.com (not on google)
    7. www.ninme.com (#15 on google)
    8. www.firstworld.biz (not on google, spam)
    9. musicindustry.firsindustry.com (not on google, spam)
    10. girls-having-sex-with-horses.danielblog.info (not on google, spam)

    Top 10 on google:

    1. jfaughnan.blogspot.com (#2 on yahoo)
    2. slashdot.org (not on yahoo)
    3. slashdot.org (not on yahoo)
    4. linux.slashdot.org (#27 on yahoo)
    5. linux.slashdot.org (#27 on yahoo)
    6. mitternachts-lied.net (#22 on yahoo)
    7. interviews.slashdot.org (not on yahoo)
    8. linuxfr.org (#19 on yahoo)
    9. www.releton.com (not on yahoo)
    10. www.japancar.fr (not on yahoo)

    Both yahoo and google are missing pages from their indexes. Some appear on one but not the other. Yahoo was slightly worse at indexing spam sites. (Is www.releton.com spam?)

    I'd say both are 'full' in the sense that neither seems to have enough capacity to index everything.

One way to make your old car run better is to look up the price of a new model.

Working...