Google's Site Ranking Secrets
Posted by
samzenpus
on Thu Jun 16, 2005 07:04 AM
from the more-google dept.
from the more-google dept.
vivin writes "Ever wonder how Google's site ranking works? Wonder no more. Google recently filed United States Patent Application 20050071741 on March 31, 2005. This patent reveals a great deal of information about Google's site ranking algorithm and makes very good reading. For example, one of the criteria that they use is the number of years that your site has been registered. If your site has been registered for less than a year, then it counts against you. A site registered for a longer period of time means that the owner is probably serious about the site, and the site is probably legitimate. Google's Site Ranking algorithms reveal how hard they are making it for spam sites to get listed (on Google). This information will also make it easier for you to make sure that you get listed well in Google."
This discussion has been archived.
No new comments can be posted.
Google's Site Ranking Secrets
|
Log In/Create an Account
| Top
| 309 comments
(Spill at 50!) | Index Only
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Note (Score:5, Insightful)
(http://nullref.se/)
Re:Note (Score:4, Insightful)
(Last Journal: Sunday November 11, @03:52AM)
Uhhh...to prevent others from benefiting from it? That's what patents are for. They say it there to promote innovation. It protects the owners exclusive control in the hope that he might reveal his idea to the world. More often than not, what really happens is that the owner will put the invention on the shelf because a)it competes with other inventions the owner may have on the market, or b)like a land or commodities speculator, he's holding out for an exorbitant price. Note that also more often than not, the owner of the IP privileges is not the creator. Patents are bought and sold like poker chips. While the actual device rots. Only the paper pushers benefit.
what are they trying to protect then?
Their advantage over everybody else.
or (Score:3, Insightful)
(http://www.lostpixel.co.uk/)
Real Explanation (Score:5, Funny)
http://www.google.com/technology/pigeonrank.html [google.com]
Speed of gaining links? (Score:4, Insightful)
Re:Speed of gaining links? (Score:5, Insightful)
However, if you have gotten 1000 links at once, and for the next months noone else is linking to you - then you have probably bought the initial links, but nobody real considers the content worthy of attention.
Spammers killing Google (Score:5, Interesting)
(Last Journal: Friday December 24 2004, @08:49PM)
Sometimes when I search for something specific, I get a bunch of useless links that have results of other "search engines" that invariably show something similar to "0 results for your search terms 'sheep+barn+slashbot+erotica'"
How do these sites get on the first page of Google results?
Re:Spammers killing Google (Score:5, Interesting)
(http://jasonrumney.net/)
Doesn't work, see explanation (Score:5, Interesting)
(http://bonoki.com/)
A while back I proposed a distributed approach like this in the Nutch mailing list [mail-archive.com]. The problem is that it would be hard to implement and it may not be worth the effort, since there are cheaper ways to fight spam.
Re:Spammers killing Google (Score:5, Informative)
(http://www.myplugins.info/ | Last Journal: Tuesday January 13 2004, @08:30AM)
They get to the top through link spamming, 302 hijacks, "scaping" content from other sites, search engine optimisation etc etc etc.
They are sites "made for adsense" as its called, whereby they exist for the sole purpose to be highly ranked in google and get ad clicks from people looking for something else. Effectively 'doorway' pages, which make a shitload of money, as people that land on such pages don't find what they really want, so click through on the ads in hopes of finding it there instead.
The crap of the internet, many hundreds of thousands of such sites run by only a hanful of thousand very rich people.
Re:Spammers killing Google (Score:5, Insightful)
(http://205.205.253.95/Crackster | Last Journal: Wednesday September 22 2004, @09:57PM)
Re:Spammers killing Google (Score:5, Funny)
(http://nullref.se/)
Hey, there's a help button... *clicks*... Oh God...
Its a patent... and a laundry list... (Score:5, Insightful)
(http://www.geoexpat.com/)
Some of these techniques are just plain old bizzare and might be way too difficult to approach algorithmically.
Oh well
Re:IMNAPL (Score:4, Informative)
(http://jasonrumney.net/)
SEOs make me barf (Score:5, Insightful)
Re:SEOs make me barf (Score:5, Interesting)
(http://www.ausedcar.com/ | Last Journal: Monday August 22 2005, @10:29PM)
Not that simple (Score:5, Insightful)
(http://www.lireland.com/)
No, its not that simple. Lets say I have a small business, I sell garden tools, lawnmowers,etc, in a certain region. And yet I do a search on google for garden tools + region, I am nowhere to be found. What do I do? I optimise the hell out of my site, caking it with region name + garden tools information, and I set up a links exchange program, getting in links left right and centre from related sites. This is SEO, and it will only affect people that enter a search for "garden" "tools" "my region". In other words, those that actually want to find my site.
Theres a distinction between SEO and spamming; if I was to optimise for a garden tools site and set up a poker site there, that would be spamming.
Re:Not that simple (Score:4, Insightful)
The way I see it, SEO is a tool - nothing more, nothing less. It isn't inherently evil or inherently good - it's how you use it and what you use it for that matters.
If you've got a good site on... i dunno... aardvark polishing for fun and profit, then you should rank highly on Google. If you don't rank well on Google, it's probably because your site is lacking one of fame, content or clean code. All of these are necessary for (or inevitable side-products of) a good site that does what people want.
Conversely, a good site will probably have many inbound links, clean semantic markup, well-focused pages full of good content and so on. This is simply good site design (or, like the links, a side-effect of it), but it's also the very ethical end of the SEO spectrum.
Now, you also get evil scumbag fuckwits-for-hire who specialise in link-farming, keyword stuffing, cloaking and other black-hat techniques, and sell their services to shitty pr0n or spam sites. This is spam - no doubt about it - but it only represents the black-hat side of SEO.
The black-hat SEOers, it must be admitted, are the one which gets all the attention. They're the ones advertising like mad, making overblown claims, spamming search engines with crap listings and generally getting in people's faces. However, just because these people use SEO doesn't make SEO bad. Before SEO they were likely sending e-mail spams until that got too hard, but you don't unilaterally brand professionally-looking e-mails or people who sell mailing-list managers as evil, do you?
As Google et al. get their acts in gear and revamp their algorithms, "SEO" is increasingly overlapping with "good site design" - this was always the intention, and even now "white-hat SEO" and "good site design" are pretty much synonymous.
SEO isn't the problem - the problem is a combination of shithead black-hat SEOers, Search Engines inadequately assessing a page's worth and ill-educated types who shortsightedly blame the gun or bullet instead of the guy who fired it at them.
Re:Temporal data. (Score:5, Funny)
(http://www.supergameworld.com/)
I thought so .. changed my site from .ro to .com (Score:5, Interesting)
(http://www.interaktonline.com/)
Instantly, our ranking went from number one (for "Dreamweaver Php" for example, we were number one there instead of Macromedia itself a long time), to page 10.
Now, we're working hard to promote our site, we have links all over the place, but still our site don't get up again to page 1 (search for "dreamweaver extensions" - we have to pay to get our site in the first position). I even thought that they do this on purpose for us to continue to pay on Google Ads
Probably they say it too in the patent, but the best ranking tool is to use the right "title" tag in your pages. It's invaluable how well this scores as compared to the page content.
Alexandru
Re:I thought so .. changed my site from .ro to .co (Score:5, Informative)
(http://www.interaktonline.com/)
Alexandru
Foiled Again Google! (Score:4, Funny)
(http://sandfly.net.nz/)
PageRank (Score:5, Informative)
About the autor (Score:5, Insightful)
(http://linux-studie.nl/ | Last Journal: Wednesday April 21 2004, @01:22PM)
So that explains a lot. What a crappy article, I wonder if the submitter is the same as the Author?
Registration Age vs. Registration Duration (Score:5, Informative)
(http://www.hydrous.net/)
one of the criteria that they use is the number of years that your site has been registered
is not the same thing as (from the article):
How many years did you register your domain name for?
Though the summary suggests that older sites do better, the article is stating that, in order to improve one's Google ranking, domain owners should purchase longer domain registrations.
Impossible? Spyware? (Score:4, Interesting)
(http://www.bogado.net/ | Last Journal: Tuesday June 14 2005, @09:40PM)
Google does have a click-through engine attached to the results, but many people find this in adition to the single identifier cookie that googles push into you abusive already.
We all thing google is doing a good job, and it did managed to incorporate adds and an add service that is well accepted by the people. (I wonder why people still think it is a good idea to make blinking and noisy flash adds?) The point is how much we trust google? I personaly don't mind very much the click through, but do not accept the cookie and will not install a toolbar.
more on the subject (Score:5, Informative)
(http://www.ubuntunews.info/)
Some more on info the subject:
1. U.S. Patent Application [uspto.gov] - it's best to read what's exactly been patented.
2. interesting discussion on webmasterworld [webmasterworld.com]
Personally I think that while some of the stuff is interesting, most of it is made up rather to confuse SEOs (google doesn't quite like them, you know that, right?). Before that, they had couple factors to think about and work on. Now, there's a shitload of stuff that just makes their work harder. Also, more factors influencing SERPS means it's much, much harder to make a trial-an-error research on what works well and what doesn't.
Spammers (Score:3, Insightful)
Won't this information now make it easier for spam sites to get listed?
Just look at the patent application (Score:4, Informative)
Just look at the patent application yourself [uspto.gov].
I haven't read the whole thing, but just having taken a quick look at it, I have to agree with the posters who said that Google purposefully tried to cover any conceivable technique to index and rank pages. The application discusses multiple implementations of the various techniques that could be used to rank a page. Therefore analysis of the patent application is probably of limited utility for those trying to game PageRank (which was certainly a factor that Google's very competent IP lawyers considered before prosecuting the patent).
For those who are worried that Google is doing evil with this patent application, given the breadth of the patent and the fact that it discusses a plethora of techniques which Google may or may not be using, I will be surprised to see Google try to use this patent (or be able to use this patent) to push another search engine out of the market. More likely, I think, is that this will constitute prior art to enable Google to withstand challenges from other patent applicants for infringement. Of course, if you know anything about PageRank, you know that it was getting published in Scientific American long before Google was the dominant search engine. So this patent application is probably more to prevent allegations that Google infringed by adding on all the other checks and balances to the original PageRank technology to discourage spam sites.
Moiche
I find this quote funny: (Score:3, Insightful)
(http://www.slappyjack.com/ | Last Journal: Monday July 18 2005, @03:37PM)
If any of you have worked in a small online shops you know what a fucking holy war this is between marketing and pretty much everyone else. I specifically remember saying at one point, "Do we have to make ALL of the money RIGHT NOW?"
Good for Google for coming forward and telling peole they won't be a part of that slimy shit.
Bad for Google for saying all of this to drive up prices on their AdWord sales.
Doesn't mean it really works this way (Score:4, Interesting)
My experience with FunWithHeadlines.net (Score:3, Interesting)
(http://www.funwithheadlines.net/)
OK, so there aren't that many sites like mine, let alone sites that update daily over a period of years and include their entire archive on the site that grows daily. On the other hand, to my knowledge from doing searches on Google, I have very few site that link to mine, and I thought that counted highly with Google. So basically without trying to game the system, let alone advertise my site (other than incidentally in comments like this), I've been treated really well by Google.
In my case, it must be the longevity issue coupled with the scarcity of sites like mine. It sure ain't the links to my site.
So... (Score:3, Funny)
(Last Journal: Thursday July 12, @12:30PM)
Uh oh, Google applied for a patent! (Score:3, Funny)
(http://www.ferion.net/ | Last Journal: Monday May 06 2002, @02:16AM)
Other, better approaches to search engine spam (Score:3, Insightful)
(http://www.animats.com)
For example, let's search Google for "london hotels", a common search phrase. The first return is LondonNights.com [londonnights.com]. "Whois" returns "Worldview Ltd, 16 Marine Road West, Morecambe, LA3 1BS, Lancs, GREAT BRITAIN (UK)."
That's a UK company, so we look it up at Companies House. [companieshouse.gov.uk], where we find "WORLDVIEW LIMITED, 16 MARINE ROAD WEST, MORECAMBE, LANCASHIRE LA3 1BS, Company No. 04588973". So we have a match on a registered company.
We check further with Dun and Bradstreet [dnb.com], which has a worldwide database of companies. We find "WORLDVIEW LTD 16 MARINE RD WEST MORECAMBE , UK Type of Location: single"
So they pass company validation, and we can get financial information about them.
Now let's try a domain that just appeared in a spam: "fleagroups.com". "Whois" gives us "Flea Market Groups. 126 73rd Ave N., Coral Springs, Florida 34992. US" So we go to Sunbiz, the Florida State Division of Corporations [sunbiz.org], and search. No "Flea Market Groups" under fictitions names. No match on address under anything beginning with "Flea". No "Flea Market Groups" under corporations, and no "Flea Market *" address matches.
Looking in Dun and Bradstreet, there are "Flea Market *" hits, but no exact match and no address match.
So they fail company validation. Add to probable spammer list, drop search engine ranking.
This is a reasonable test for any site that appears to be selling something.
Number of years the site has been registered (Score:3, Interesting)
(http://www.vad1.com/)
So I get the following:
cat got my tongue (Score:3, Insightful)
(Last Journal: Tuesday November 26 2002, @05:46PM)
Google's Site Ranking algorithms reveal how hard they are making it for spam sites to get listed (on Google).
And provides a list of techniques for spam sites to use that guarantee them positions on every search engine but Google (in fact, if you use these techniques it's illegal for other search engines to penalize you for them.
This could be an especially evil technique for spammers.
Re:ATTENTION (Score:4, Funny)
I want to change. Please help me--I don't think I can do it on my own.