Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

[ Create a new account ]

Google De-indexes Talk.Origins, Won't Say Why UPDATED

Posted by kdawson on Sun Dec 03, 2006 11:21 PM
from the honest-webmasters-go-fish dept.
J. J. Ramsey writes "Talk.Origins is an archive with thousands of pages exposing creationist pseudoscience. Rather mysteriously, Google pulled the plug on its search engine, giving only the vague reason: 'No pages from your site are currently included in Google's index due to violations of the webmaster guidelines.' This was apparently triggered by a recent cracking of the site that added 'hidden links to non-topical sites,' but Google won't say just what the violations were. Talk.Origins webmaster Wesley R. Elsberry believes that this Google policy harms honest webmasters." From the article: "My mission, whether I liked it or not, was to find and fix whatever problem the [Talk.Origins Archive] might have, with no guidance as to what the problem was and nothing at all about where to start looking... I was extremely lucky. The damage to my site was limited and in the first place that I happened to look. Other honest webmasters might not be so lucky. They may have to undertake an arduous process of vetting pages, essentially having to second-guess the mind of the cracker in trying to locate a problem that Google knows the exact location of." Thanks to an alert reader who sent in Matt's blog posting about how Google handles hacked sites.
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • huh? (Score:3, Funny)

    by Average_Joe_Sixpack (534373) on Sunday December 03 2006, @11:25PM (#17095420)
    What's this? [google.com]
    • Re:huh? by fxm87 (Score:1) Sunday December 03 2006, @11:31PM
    • Re:huh? by anagama (Score:2) Sunday December 03 2006, @11:36PM
    • Re:huh? (Score:5, Informative)

      That's the Google Groups archive of the talk.origins newsgroup, which is a different animal (an ancestral form, one might say) from the Talk.Origins Archive web site. It was the site that was delisted. [talkorigins.org]

      And indeed, as of right now (10:35 PM CST) a Google search for "talk.origins" doesn't show any links at all to the Talk.Origins Archive. In fact, the first link that comes up is to a young-Earth creationist site which claims to offer "intellectually honest responses to the claims of evolutionism's proponents, including--but not limited to--the 'Talk.Origins' newsgroup and the 'Talk.Origins Archive' website."

      Conclusions about species competing in crowded niches are left as an exercise to the reader.
      [ Parent ]
      • Re:huh? by TapeCutter (Score:2) Monday December 04 2006, @12:32AM
        • Re:huh? by misleb (Score:2) Monday December 04 2006, @01:45AM
          • Re:huh? by sumdumass (Score:2) Monday December 04 2006, @01:56AM
        • Re:huh? (Score:4, Funny)

          by bytesex (112972) on Monday December 04 2006, @06:08AM (#17097412)
          (http://ufy.sourceforge.net/)
          evolutionism, homosexualism - they're all new, and evil ! Better stick to the good old isms such as religionism, wifebeatism and alcoholism.
          [ Parent ]
          • Re:huh? by TapeCutter (Score:2) Monday December 04 2006, @07:49AM
        • 1 reply beneath your current threshold.
    • groups are not google. by www.sorehands.com (Score:2) Sunday December 03 2006, @11:41PM
    • 2 replies beneath your current threshold.
  • Hmm (Score:3, Insightful)

    by Herkum01 (592704) on Sunday December 03 2006, @11:26PM (#17095424)

    While, I have some sympathy for the guy, just because you think your an honest webmaster does not mean that Google should have to vet you and your content. They have a business to run too. At some point a webmaster has to put themselves in a position to recognize and address these sorts of problems BEFORE Google gets involved.

    • Re:Hmm by arun_s (Score:3) Sunday December 03 2006, @11:56PM
      • Re:Hmm by Yusaku Godai (Score:2) Monday December 04 2006, @01:35PM
    • Re:Hmm by UglyTool (Score:1) Monday December 04 2006, @12:03AM
      • Re:Hmm by sjbcfh (Score:1) Monday December 04 2006, @12:26AM
        • Re:Hmm by bladesjester (Score:2) Monday December 04 2006, @12:43AM
          • Re:Hmm by Brushfireb (Score:2) Monday December 04 2006, @09:29AM
      • Re:Hmm by pimpimpim (Score:2) Monday December 04 2006, @07:39AM
    • Re:Hmm by jkauzlar (Score:2) Thursday December 07 2006, @04:15PM
    • 1 reply beneath your current threshold.
  • Google censoring Usenet? Not! (Score:5, Insightful)

    by BorgCopyeditor (590345) on Sunday December 03 2006, @11:35PM (#17095502)
    The writeup sucks. It implies that Google is censoring Usenet.
  • Backups? (Score:3, Insightful)

    by TubeSteak (669689) on Sunday December 03 2006, @11:38PM (#17095518)
    (Last Journal: Saturday February 25 2006, @11:02PM)
    You'd think they'd keep regular "Last Known Good" backups and just be able to do a simple diff between the current page & their backup.

    Or even just MD5 sums of all their pages, once a day, with known updates marked as such.

    There should be no reason anyone has to even contemplate manually digging through thousands of pages if they've prepared sufficiently beforehand.

    Maybe they'll take some very simple & no-cost precautions now that they've been burned.
  • Whine, Whine, Whine (Score:5, Insightful)

    by MDMurphy (208495) on Sunday December 03 2006, @11:48PM (#17095590)
    (http://www.KateTheDog.com)
    So many people refer to Google as if it were a human looking at web sites and giving it the big thumbs up or down. As part of the indexing if the spider finds "violations" such as presenting a different page to spiders than to humans, it risks being dropped from the index. To expect a human response to why each site triggered the de-indexing is not reasonable.

    In the webmaster's whining about Google, he complains about the request to be re-indexed containing:

                        *I believe this site has violated Googles quality guidelines in the past.

                        * This site no longer violates Googles quality guidelines.

    He thinks these are "an admission of guilt", but they dont' say "I violated" they say "the site violated". So, if the site were hacked and did violate their indexing policy, fix it, say you've fixed it and move on. How many hits has he had over the years that came directly from Google? And did they come from Google due to all those people choosing Google to search for his site or it's topics? But now he whines about being delisted for the time it takes him to fix a site he should have kept unhacked in the first place.

    • Re:Whine, Whine, Whine by TodMinuit (Score:1) Sunday December 03 2006, @11:51PM
    • Re:Whine, Whine, Whine by kjart (Score:2) Monday December 04 2006, @12:25AM
    • Sites don't do things by btempleton (Score:2) Monday December 04 2006, @01:56AM
    • Re:Whine, Whine, Whine (Score:4, Funny)

      by identity0 (77976) on Monday December 04 2006, @02:27AM (#17096430)
      (Last Journal: Monday March 31 2003, @01:23AM)
      Heresy! Google sees all, Google knows all! Google is a man, a spider, and the holy page in one!

      Brin 3:14 "And Google so loved the internet, that he sent his only-born son Larry Page to it so that any who believe in him shall not perish but have ever-lasting life in the Googleplex."

      So you see, there *is* a person, Larry Page, who is also the spider that indexes everything and is also the page that serves up results. Only through this holy trinity could results as good as Google's result, thus proving Google's divinity. If the almighty Google has delisted this sinner's page, then we should not be looking at it in the first place, yes? To go against the wishes of Google brings hellfire!
      [ Parent ]
    • Re:Whine, Whine, Whine by CAIMLAS (Score:2) Monday December 04 2006, @06:40AM
  • Same thing happened to the first wiki (Score:1, Interesting)

    by Tablizer (95088) on Sunday December 03 2006, @11:56PM (#17095650)
    (http://www.geocities.com/tablizer | Last Journal: Saturday March 15 2003, @01:22PM)
    The first wiki, c2.com, also has a similar problem. Google stopped indexing it (or at least listing it), and nobody is sure why. It may be a side-effect of anti-spam features that c2 added, but this is just speculation. Site custodians debated removing the anti-spam features because of this, but it has yet to be settled.
  • Synopsis (Score:5, Insightful)

    by operagost (62405) on Sunday December 03 2006, @11:59PM (#17095670)
    (http://operagost.com/ | Last Journal: Monday May 01 2006, @12:08PM)

    "Talk.Origins is an archive with thousands of pages exposing creationist pseudoscience"
    This article is a submission containing a biased summary which has little to do with the actual topic, which is the enigmatic status of Google's search algorithms.
    • Re:Synopsis by ksalter (Score:1) Monday December 04 2006, @12:56AM
      • Re:Synopsis by nacturation (Score:2) Monday December 04 2006, @03:55AM
        • Re:Synopsis by Tyreth (Score:1) Monday December 04 2006, @06:00AM
          • Re:Synopsis by coastwalker (Score:2) Monday December 04 2006, @10:53AM
            • Re:Synopsis by eluusive (Score:2) Monday December 04 2006, @12:28PM
      • Re:Synopsis by StoatBringer (Score:1) Monday December 04 2006, @07:21AM
      • 3 replies beneath your current threshold.
    • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @01:17AM
      • Re:Synopsis by Legion303 (Score:2) Monday December 04 2006, @04:02AM
        • Re:Synopsis by Angostura (Score:1) Monday December 04 2006, @06:37AM
          • Re:Synopsis by Legion303 (Score:2) Monday December 04 2006, @07:37AM
        • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @07:23AM
          • Re:Synopsis by Legion303 (Score:2) Monday December 04 2006, @07:40AM
      • Re:Synopsis by Guuge (Score:1) Monday December 04 2006, @02:09AM
        • Re:Synopsis by NMerriam (Score:2) Monday December 04 2006, @03:18AM
          • Re:Synopsis (Score:5, Interesting)

            by Tyreth (523822) on Monday December 04 2006, @06:13AM (#17097426)
            I think that the person you are replying to is referring to the big bang. If you look far back enough into the history of the universe, you get to a point where everything began to exist. At the singularity of the big bang, we find that both time and space began. There is no "before" the big bang, as time did not exist. This is a central part of the cosmological argument for God's existence:
            1. Everything that begins to exist has a cause
            2. The universe began to exist
            3. Therefore, the universe has a cause
            What came before the big bang? That question is meaningless, as time did not exist. So you have a few options, only one of them feasible. The first is that the universe is infinitely old and had no beginning. Once a view of atheists, this is no longer scientifically plausible. The second answer is that the universe came into existence from nothing - absolutely nothing. The third, and most reasonable, is that something else caused the universe to be created. This cause must itself be timeless, and spaceless, as time and space began to exist with the big bang.

            So the atheist must either claim the absurdity that the universe came from nothing, or he(/she) must acknowledge that there was something that created it. And that *something* is inaccessible from scientific analysis. It is not, however, too far from the reach of philosophy and logic. We can draw reasonable conclusions about this entity.

            [ Parent ]
            • Re:Synopsis by CmdrGravy (Score:3) Monday December 04 2006, @06:59AM
              • 1 reply beneath your current threshold.
            • Re:Synopsis by TapeCutter (Score:3) Monday December 04 2006, @07:03AM
              • Re:Synopsis by Tyreth (Score:2) Monday December 04 2006, @08:41PM
                • Re:Synopsis by TapeCutter (Score:2) Tuesday December 05 2006, @04:49AM
            • Re:Synopsis by Dragonslicer (Score:1) Monday December 04 2006, @07:49AM
              • Re:Synopsis by Coryoth (Score:2) Monday December 04 2006, @12:28PM
              • Re:Synopsis by Tyreth (Score:2) Monday December 04 2006, @08:54PM
            • Re:Synopsis (Score:4, Interesting)

              by AlXtreme (223728) on Monday December 04 2006, @08:31AM (#17098184)
              (http://www.aperte.nl/ | Last Journal: Monday July 07 2003, @05:11AM)
              What came before the big bang? That question is meaningless, as time did not exist. So you have a few options, only one of them feasible. The first is that the universe is infinitely old and had no beginning. Once a view of atheists, this is no longer scientifically plausible.
              Sorry, you are incorrect. There wasn't any time before the big bang, thus nothing could have come before it as you yourself stated. Not even time, as the big bang was the beginning of time. However one alternative view is that the universe oscillates between a big bang and a big crunch, thus from a timeless point-of-view the universe would be infinite.

              What I really find worrying (hello, 1500's are calling) is the method of reasoning by creationists, like yourself.
              A: There was a big bang.
              B: We currently don't know what was the cause of this.
              C: There must be 'some higher being' that created the universe.

              Now A and B do not lead to C, no matter how you reason. If you want to have a drop of credibility, you'll have to support your claims. However, you can not, thus your logic is flawed. What created the 'entity' you speak of? What came before it? Why did it create the universe? If you want to play the science game, you should be answering those questions. Science allows questions to be left open, but tries to answer as many as possible by using facts. Creationism is not, and is unlikely to ever be, scientific or logical. You are allowed to believe in the toothfairy for all I care, but unless you have evidence that a mystical entity is willing to pay for your teeth: keep your belief to yourself.

              [ Parent ]
              • Re:Synopsis by Ogive17 (Score:1) Monday December 04 2006, @11:25AM
                • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @08:10PM
              • Re:Synopsis by eluusive (Score:2) Monday December 04 2006, @12:44PM
                • Re:Synopsis by eluusive (Score:2) Tuesday December 05 2006, @02:47PM
                • 1 reply beneath your current threshold.
              • Re:Synopsis by vakuona (Score:1) Monday December 04 2006, @07:11PM
              • Re:Synopsis by Tyreth (Score:2) Monday December 04 2006, @09:12PM
            • Bigger universe by AlpineR (Score:2) Monday December 04 2006, @09:21AM
            • Ah, yes. The "forced choice"... by Dr. Manhattan (Score:2) Monday December 04 2006, @10:23AM
            • Re:Synopsis by Laur (Score:2) Monday December 04 2006, @11:09AM
            • Re:Synopsis by NMerriam (Score:2) Monday December 04 2006, @12:26PM
            • Oh my "god" you are... by rmdyer (Score:2) Monday December 04 2006, @12:38PM
            • Re:Synopsis by Yusaku Godai (Score:2) Monday December 04 2006, @01:48PM
            • Re:Synopsis by heinousjay (Score:1) Monday December 04 2006, @04:36PM
            • Re:Synopsis by dentin (Score:2) Monday December 04 2006, @07:02PM
              • Re:Synopsis by Tyreth (Score:2) Monday December 04 2006, @08:30PM
            • Re:Synopsis by tehcyder (Score:1) Tuesday December 05 2006, @07:36AM
              • Re:Synopsis by Tyreth (Score:2) Tuesday December 05 2006, @06:06PM
            • Re:Synopsis by Rufty (Score:1) Tuesday December 05 2006, @08:44AM
            • Re:Synopsis by Tyreth (Score:2) Monday December 04 2006, @09:17PM
            • 3 replies beneath your current threshold.
        • Re:Synopsis by brado77 (Score:1) Monday December 04 2006, @03:20AM
          • Re:Synopsis by TapeCutter (Score:3) Monday December 04 2006, @06:11AM
            • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @07:10PM
            • Re:Synopsis by Rufty (Score:1) Tuesday December 05 2006, @08:41AM
            • 2 replies beneath your current threshold.
        • Re:Synopsis by Guuge (Score:1) Monday December 04 2006, @03:45AM
        • Re:Synopsis by Harry Coin (Score:1) Monday December 04 2006, @12:53PM
        • 1 reply beneath your current threshold.
      • Re:Synopsis (Score:4, Interesting)

        by TapeCutter (624760) on Monday December 04 2006, @04:29AM (#17096998)
        (Last Journal: Tuesday February 13 2007, @05:31PM)
        To paraphrase your post: "science is a religion". I disagree, please correct me if I misunderstood.

        Science is based on a single "article of faith", ie: I belive the real world exists as I and others who inhabit said "real world" collectively percieve it. I can only prove with certainty that I exist and furthermore can only prove it to myself. If I do not have faith in the real world then "others" must be a figment of my imagination, a troublesome state of affairs since the imaginary/real others will declare me a psychopath [google.com] and lock me up in a real/imaginary padded room.

        Since I and "others" can observe and agree on things in and about the real world we can create testable theories that can be refined to better fit our observations and accurately predict outcomes. ie: We can practice the scientific method and refine our theories until we reach a (possibly non-existant) point where the only "assumption" is that the real world exists, or as I like to put it the Universe "just is".

        So regarding a belief in evolution - The only faith required is the faith that the real world exists.

        As for religion, it is based on blind faith, blind since I and "others" cannot percieve the same observations, these observations and associated theories fail the "real world" test because they cannot be demonstrated to "others" using their own perceptions. This does not mean religion is pointless or even psuedoscience, it simply means religion is not comprable to science (apples vs oranges). In my mind making such comparisons entirely misses the point of both endevours.

        Psuedoscience, litteraly "fake science" is blind faith dressed in a lab coat. Sure creationism is a theory but it's NOT a scientific one, claiming otherwise is by definition, psudoscience.

        Finally the lack of a strong scientific theory for the origin of life does not validate creationisim, nor does it invalidate the theory of evolution.

        Bias: I suppose you could argue on some deep philosophical level that faith in the "real world" makes me biased toward...um...the real world, I can only wonder if that automatically means psychopath are unbiased? What does "science is a religion" prove? - I'm biased because science has a demonstratably superior track record of explaining and predicting the real world's behaviour whereas blind faith performs no better than random chance. Is that the kind of "bias" we are talking about here? - Because if it is, I am wondering how a non-phycopath can go to bed confident they will awake on the same planet the next morning?

        Short cut to scientific enlightenment: Carl Sagan's book "A demon haunted world". It's also serves as an outstanding example of what a skeptic should be.
        [ Parent ]
        • Re:Synopsis by The One and Only (Score:2) Monday December 04 2006, @10:01AM
        • Re:Synopsis by JonathanBoyd (Score:2) Monday December 04 2006, @10:52AM
          • Re:Synopsis by CapedOpossum (Score:1) Monday December 04 2006, @12:43PM
            • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @08:21PM
            • 1 reply beneath your current threshold.
        • Two kinds of faith by Russ Nelson (Score:2) Monday December 04 2006, @01:42PM
        • Re:Synopsis by RobertLTux (Score:1) Monday December 04 2006, @09:54PM
        • Re:Synopsis by TapeCutter (Score:2) Monday December 04 2006, @06:39PM
        • 1 reply beneath your current threshold.
      • Re:Synopsis by Sr. Zezinho (Score:1) Monday December 04 2006, @05:38AM
      • Re:Synopsis by Conanymous Award (Score:2) Monday December 04 2006, @05:42AM
      • 1 reply beneath your current threshold.
    • That's not biased. by StoatBringer (Score:1) Monday December 04 2006, @04:55AM
    • Re:Synopsis by iainl (Score:1) Monday December 04 2006, @05:24AM
    • Re:Synopsis by DrXym (Score:1) Monday December 04 2006, @05:26AM
    • Re:Synopsis by Conanymous Award (Score:1) Monday December 04 2006, @05:27AM
      • Re:Synopsis by alexgieg (Score:2) Monday December 04 2006, @06:01AM
        • Re:Synopsis by JohnFluxx (Score:3) Monday December 04 2006, @06:26AM
          • Re:Synopsis by Dragonslicer (Score:1) Monday December 04 2006, @07:56AM
          • Re:Synopsis by alexgieg (Score:2) Monday December 11 2006, @08:06AM
        • Re:Synopsis by Conanymous Award (Score:3) Monday December 04 2006, @06:32AM
  • by Nevyn (5505) * on Monday December 04 2006, @12:05AM (#17095712)
    (http://www.and.org/ | Last Journal: Thursday December 07 2006, @05:00PM)
    They may have to undertake an arduous process of vetting pages, essentially having to second-guess the mind of the cracker in trying to locate a problem that Google knows the exact location of.

    Bzzt. The website admin needs to locate one or more problems (== however many the cracker planted), and Google knows the exact location of at least one. "one or more" >= "at least one". If google tells people where their problems are, google will be playing whack a mole for eternity. There are contractors/services that should be able to help them/anyone, google is not one of them.

  • Caped Hacker (Score:4, Funny)

    by derubergeek (594673) on Monday December 04 2006, @12:23AM (#17095822)
    (http://slashdotislame.org/ | Last Journal: Monday March 17 2003, @08:15PM)
    This was quite obviously the work of the Flying Spaghetti Monster.
  • probably just bad algorithms (Score:5, Insightful)

    by martin-boundary (547041) on Monday December 04 2006, @12:27AM (#17095844)
    While it's natural to sympathise with the victimized website, it doesn't follow that Google is doing something Evil(TM) in this instance, rather it's most likely that their current algorithms are badly tuned.

    With the index sizes that are being collected by search engines these days (on the order of 10 billion entries), it's completely naive to think that some humans are sitting at a terminal choosing to delist websites for some policy reason or other. It's also completely naive to think that a human email monkey can do any sort of digging to find out the exact reason that Google's automated algorithm has censored this particular site.

    Instead, Google's engineers have automated algorithms which do all the censorship, and the policy is just there as a thin cover for whatever the algorithm happens to be doing today. It's worse of course, because 1) algorithms change every few months and 2) there's simply no comprehensive way to test the quality of the implementation.

    Anyone who's programmed a nontrivial algorithm knows that obscure edge cases are a bitch, and with 10 billion websites, any algorithm will have plenty of obscure edge cases which nobody has ever tested, nor ever will. The most likely explanation is that the website in TFA is a false positive of some subsystem, but fixing it will require changes to the algorithms, and Google don't want to risk that, would you? The problem will probably go away in a few months when the algorithms are scheduled to be updated.

  • by medge_42 (173874) on Monday December 04 2006, @12:35AM (#17095884)
    (http://www.users.on.net/~medge)
    Look for the webmaster's email address on the pages, assume it's webmaster@domain.com or try root@domain.com? All these could bounce or simply disappear. Should Google put the effort into finding out who the webmaster really is?
  • The Truth (Score:2, Funny)

    by Anonymous Coward on Monday December 04 2006, @12:49AM (#17095972)
    Let's face it. It's all a vast right wing conspiracy by fundamentalist Christians to remove any website that counters their beliefs... either that or it's simple Google policy and posting this story was a waste of everyone's time and only served to try and stir up debate.
    • 1 reply beneath your current threshold.
  • It's not like google is running his board or that he is paying google to debug his site.

    I felt like making some silly comment about how lost the site is with out creationism and science.
    Without either the site couldn't exist, or at least there would be nothing to "talk" about.

    Some paradoxs:

    We have science and use it to do things, to create things. Someday we may have the know how to create a galaxy and care for it lie a farmer does his crop. To inject life and help mold it to a conscious life from. But what would hat life form believe? Creationism, that they can never be such creator we have become? Maybe they would rather develope science and learn how to.

    One cannot exist without the other.

    Right to life vs. freedome of choice is another paradox exposed by a starving child who doesn't seem to have the freedome of choice to eat some food and there for will lose his right to life. Imagin all teh children that could have been feed, clothed, taught, given shelter and medicine with all the resources that have been wasted in the arguement....

    People who something for themselves can sometime take advantage of something symbotic by falsely splitting it to create a public arguement that sells their book or presence, wins them an election, etc..
    .

     
  • Or at least like a particular ex of mine... (I know -- I shouldn't generalize)

    "If you don't know what you did wrong, I'm not going to tell you!"

    cue jokes about /.ers and girlfriends...

  • Google Webmaster Tools (Score:5, Informative)

    by RockoW (883785) on Monday December 04 2006, @01:32AM (#17096168)
    (http://steve.blogdns.org/)
    Google have a set of http://www.google.com/webmasters/tools/ [google.com] tools for webmasters. essencially it give out every diagnostic needed to fix your site for Google. Additionaly you have statistics for searches and how GoogleBot see your site. So, you shouldn't blame until you googled for the answer! Searching for "Google index tool" shows up "Google Webmaster Central"...
    • Re:Google Webmaster Tools (Score:4, Interesting)

      by DrXym (126579) on Monday December 04 2006, @08:03AM (#17097958)
      Google certainly has some useful tools, but when they don't work you are screwed. I have a site which I won't name which is not indexed by google and I have absolutely no idea why. I've submitted the url, built a sitemap using their own tools, validated it and even submitted the site for relisting. It still isn't there. What have I done wrong? The tools say everything is fine except it isn't. I could go to the web forum but other postings suggest the employees will likely just tell me wait for indexing. Except its not indexing me.

      The sick thing is that I have Google Adwords on that site so each day that Google don't list me, THEY are losing money. I estimate I get 10x the click through business from MSN search than I do from Google. I'd probably make 3x the profit (as would Google) if they'd index.

      [ Parent ]
  • Easy to check. (Score:2)

    by truckaxle (883149) on Monday December 04 2006, @01:34AM (#17096176)
    (http://perlworks.com/)
    I recently had several sites that I host find their way into Google hell. I wrote a small 15 line Perl script (using File::Find) to hunt thru thousands of pages and return all url patterns.

    Filtered out local domains and found easily found the problem with a run away forum that spammer zeroed in. Looks like they were also using a Perl script :{

    They had uploaded over 3000 links - the bastards.

  • Ironical (Score:2)

    by nsingapu (658028) on Monday December 04 2006, @01:50AM (#17096240)
    (http://www.businessphonesdirect.com/)
    A site gets delisted by google.
    Then scores a slashdot homepage link.
  • by Trojan35 (910785) on Monday December 04 2006, @01:52AM (#17096244)
    Really. Google has no obligation to anyone but its shareholders. You can start calling google evil when it pulls a microsoft and says "Redhat, install us as your default search engine and delete yahoo from your computers or we'll de-list your site".
  • by SpaghettiPattern (609814) on Monday December 04 2006, @02:03AM (#17096306)
    with no guidance as to what the problem was and nothing at all about where to start looking...
    So you're a web master and you don't know how to check if the content you're mastering is OK. We clearly have to redefine the word master .
  • Talkorigins hacked by porn spammers (Score:5, Informative)

    by Mouth of Sauron (196971) on Monday December 04 2006, @02:07AM (#17096334)
    The site www.talkorigin.org is not the only site to have been de-indexed by Google.


    This is a google cache of talkorgins.org [72.14.203.104] showing the porn spam links.


    However, I checked on deepx.com [deepx.com] and it is *not* a porn site.


    From DeepX.com's about page:


    XML provides an open and flexible language for the creation, management and exchange of electronic content. Founded in 2000, deepX has an experienced team of consultants and developers, who specialise in the design and development of solutions using XML and the emerging technologies related to XML.


    Also, another link shows www.theoi.com [theoi.com] and it is *not* a porn site, either:


    Here's how THEOI used to look via the Wayback machine. [archive.org]


    Theoi.com has been banned by Google (no reason given) and forced to close down as a result. There are no plans to re-establish this site in the future.


    wu.edu.gh is Valley View University is a Seventh Day Adventist college in Ghana.


    Both deepx.com and wu.edu.gh redirect to porn sites.


    Unsurprisingly, wu.edu.gh, theoi.com and deepx.com have been de-indexed by google.


    I speculate that all these sites that have been de-indexed were tagged by automated processes.

  • No Free Consulting (Score:4, Insightful)

    by rossz (67331) <ogre @ g e ekbiker.net> on Monday December 04 2006, @02:20AM (#17096390)
    (http://geekbiker.net/ | Last Journal: Thursday July 01 2004, @05:57PM)
    Basically, this "so called" webmaster wanted free consulting from Google. I don't think so. My personal response would have been, "I'll be happy to supply you with the information you request. It will, however, cost you my standard consulting rate of $xx/hour, two hour minimum."

    Only friends and family get free computer help from me, but I'm rethinking that policy since I spent half a day cleaning the malware off my brother's computer during the last family holiday. He probably won't ask me to do it again, though. When he asked how his system got so infected, I answered (in front of the entire family), "You got infected from all those lesbian porn sites you've been visiting."

  • happens all the time (Score:2, Interesting)

    by Bauguss (62171) on Monday December 04 2006, @03:31AM (#17096734)
    This is how google does business

    I used to work on a site that had this happen. It ends up that past practices from the company led to the penalty and delisting. Unfortunately, google will not tell you exactly what you are doing wrong.

    It pretty much led to the demise of the company. Sales plummeted so far that the investor pulled the plug. We did actually end up fixing the issue and relisted but the damage was done. (amongst other problems the company had...wasn't only google that did them in)

    There really should be a tool provided by google that tests your site and tells you if and what it finds wrong. You would think this would be easy considering the code already exists.

    Perhaps it could even just be a tool provided only to advertisers.

  • Welcome to the Real World (Score:4, Insightful)

    by Evets (629327) on Monday December 04 2006, @04:44AM (#17097060)
    (http://www.stevekallestad.com/ | Last Journal: Thursday May 31, @03:02AM)
    Have a peek over at the forums at WebmasterWorld, DigitalPoint, SearchEngineWatch, or any number of other webmaster related sites. This happens all the time. It is an issue that webmasters have had to deal with for some time now. Google at least provides some input for you if you can be bothered to register a sitemap with them.

    Google has several billion pages in their index, and a significant portion of them are spam. Their business model relies on them having internal methods of dealing with web spam and it is not feasible or desirable for them to produce a list of violations to each and every person who runs afoul of their algorithms.

    This is far from the most popular or important site this has happened to. Wordpress was delisted, as was BMW, Syndic8, and many others. This guy is using the controversial nature of his subject matter in an attempt to draw more attention. Get in line buddy, there is a long list of people whining all over the web about the same thing. Are you more important because the word Christianity is loosely affiliated with your site? Nope.

    Do a little googling yourself and you can pretty easily figure out how to resolve the problem. It takes some time, and there are ways to accelerate the process. If you are that reliant on Google, it is time to start participating in some webmaster communities and figure out how to play ball with the Search Engines. Just like everybody else.
  • Google emailed this site (Score:5, Informative)

    by GoogleGuy (754053) * on Monday December 04 2006, @05:44AM (#17097318)
    (http://www.google.com/webmasters/)
    If you dig deeper, it turns out that Google emailed talkorigins.org to alert the site that it had been hacked and was stuffed with rape and animal porn spam. Google's head of webspam has posted a full write-up [mattcutts.com].
  • The only reason this article appeared on the /. is because crusaders against creationism (rightfully) suffered (for the reasons unrelated to the content), and the viciously atheistic vigilante crowd of alarmists started to moan and moarn. The /. editors smelled the opportunity to add some hits (why? /. is very popular without this sensationalism).
  • evolution (Score:3, Funny)

    by dwater (72834) on Monday December 04 2006, @07:26AM (#17097732)
    So, if evolutionary theory is correct, it seems to have favoured a line of cry babies. There's evidence against, if ever there was any.

    I suppose he could be a mutant....and his predecessors are all non-cry babies.
  • in their website. But if you wait a million years, Google's algorithm could randomly change.
  • by RyuuzakiTetsuya (195424) <taiki @ c o x.net> on Monday December 04 2006, @07:47AM (#17097870)
    After seeing the headlines, I popped "talk.origins" into my Google search bar and got "trueorigin.org." Which claims to "Expose the myth of Evolution."

    WHAT THE FUCK.
  • by Secrity (742221) on Monday December 04 2006, @09:15AM (#17098518)
    This whole article assumes that Google exists for the convenience of websites -- it doesn't. Google exists for the convenience of USERS who, as a group, view advertisers' paid links. As a user, I don't give a shit if Google notifies website owners about their reasons for delisting.
  • by mlwmohawk (801821) on Monday December 04 2006, @09:22AM (#17098584)
    The assumption that google "knows" where the problems are is based on a probably false assumption. Many times these algorithms use merely ranking and automated inspection.

    Google has billions of pages indexed, there isn't enough time or manpower to have humans inspect pages. Even if they could elevate to a human, google could not possibly inspect for every domain.

    The truth is that the webmaster let his site get hacked, Google delisted the site to protect the integrity of its product. It is the responsibility of the WEBMASTER to protect the integrity of their site. He may complain that he has to assume "guilt" (He doen't really, he merely has to affirm he has corrected the problems and believes that there are no more.) The problem is he IS guilty of being a bad admin that allowed someone to hack his site, and he wants to blaim google and not himself.
  • by Jahz (831343) on Monday December 04 2006, @09:42AM (#17098770)
    (http://www.adkap.com/ | Last Journal: Thursday August 10 2006, @04:10PM)
    If Google revealed the core techniques used by its index to filter out hacked pages, then it would be effectively making those techniques useless. Its the same reason Microsoft and Firefox have not told the world how its anti-phishing features work, even if you have a domain that appears erroneously as phished. Its also the same reason that only dozens Google employees REALLY know how AdSense works. We know it depends on the context of the page, but clearly there is more to it (this confirmed by an Google employee friend of mine who is under NDA; could noy say more).

    Its security through obscurity, sure... A valid form of security when the goal is to prolong discovery of methods until new ones are in place.
  • by Ranger (1783) on Monday December 04 2006, @11:31AM (#17100174)
    I hope this gets fixed soon. Because now when you search for talk.origins the first thing that pops up is true.origins. www.trueorigin.com which I won't link to. They are a bunch of willfully ignorant creationists. Sorry that was redundant.

    Anyway, here's the proper link http://www.talkorigins.org/ [talkorigins.org] if you can't find it with google. I occassionally use other search engines like AllTheWeb [alltheweb.com] if I can't find what I'm looking for with google.

    And there's nothing on the Talk.origins news page [talkorigins.org] about the delisting.
  • Nazis on Usenet (Score:1)

    by bayers (155001) on Monday December 04 2006, @01:53PM (#17102388)
    (http://specfic.com/)
    Has anyone called anyone a nazi yet?

    http://en.wikipedia.org/wiki/Godwin's_law [wikipedia.org]
  • Re:Words are Meaningless (Score:5, Insightful)

    by Baricom (763970) on Sunday December 03 2006, @11:30PM (#17095450)
    Nobody was evil here. The guy's site got hacked and spam links added, Google rightfully de-listed him, and then the webmaster found the problem, fixed it, and asked Google to re-list. Am I missing something?
    [ Parent ]