Google's Search Results Degraded? 209
scrm writes "According to this Wired article, recent tweaks to Google's PageRank search algorithm have degraded rather than improved the accuracy of the results." I noticed this firsthand the other day, but only when I was searching for pictures of famous people, but all my technical queries came back fine.
Famous People? (Score:4, Funny)
Re:Famous People? (Score:2)
Re:Famous People? (Score:5, Funny)
Re:Famous People? (Score:2)
What's his name again?
Yeah sure! (Score:5, Funny)
They did it to protect Microsoft and Adword$ (Score:5, Interesting)
To test if we really are in front of a Dmoz dominated Update, we have set up a Aspseek based small search engine, a GNU search engine with a crude PageRank-like ranking system. We have indexed around 1.500.000 pages, using as the starting point 700 Dmoz very competitive dmoz cats, including up to 250 pages per site, following up to 10 outside link, with up to 100 pages per outside link. What we found was that 59% of our top 20 results on the 100 cat-related competitive Keywords where also top 20 in Googles new index, and 26% of our Top 10 where also Googles Top 10.
But we must also said that we have not been able to find a so-compelling relationship using no-competitive categories. A 2.000.000 pages index with non-competitive regional cats, using non-competitive Keywords, showed a very small correlation between top 10, top 20, and even top 50 results.
So, our working theory right now is, yes, small changes, probably committed in order to fight both googlebombing and Pagerank commoditization, have affected the index accuracy in many different ways.
We think the index is unbalance, or unless much more unbalanced than the last one, and, as a result, the weight of some previously no-so important characteristics are souped-up, opening the door for abuse.
The main effect of all of this is small well managed sites at competitive categories can not relay anymore in good content + good linking to get a good listing. They will have to pay Google using Addwords program. That is the main change here. Forcing small business to pay to get at the top, using advertising space.
But we do think this update and the changes committed are, to say the least, unbalanced, and the new algo is rampantly open to easy abuse. Lets hope Google good old Phd common sense returns soon, and a new, improved update takes place as soon as possible. Lets hope they are not tring to make a easy killing by forcin small popular sites, all of the sudden deprived from traffic, to pay google using Adwords.
Re:They did it to protect Microsoft and Adword$ (Score:2)
actually ... (Score:5, Interesting)
Re:actually ... (Score:2)
Re:actually ... (Score:5, Informative)
They index lots of other sites pages and when google comes around spidering they return random content from them. If you follow one of these links from google you will be redirected to their portal.
No doubt it gives them quite a bit of traffic.
Re:actually ... (Score:2)
"Come on, Slashdot me! Gimme your best shot! What, are you yellow? Chicken! I've got bandwidth coming out of my ears! Slashdot me! Just you try!"
Re:actually ... (Score:1)
Not quite (Score:1)
The spammer with the greater number of Dmoz Editor accounts.
Re:actually ... (Score:5, Interesting)
Re:actually ... (Score:2)
Re:actually ... (Score:2)
The Date on Google. (Score:2, Informative)
They want us to pay (Score:1)
Most of dose categories are now dominated by garbage sites and spamers. Good sites with a lot of content and a lot of popular links who used to be #1 or #2 are now #60 or #100, precede with spammers and affiliation programs and 1 page redirection sites.
Those sites are being forced to pay Googles "affordable" Adwords program to stay alive.
Quote ... (Score:4, Funny)
Famous people Google Search: "Courtney Cox getting a facial"
Results: "Courtney Cox advertises facial creme from Loreal"
Technical Google Search: "How do best use lube and type one handed"
Results: "www.pinkbits.com"
Good Overview and Links to Discussion... (Score:5, Interesting)
My pagerank... (Score:1)
Re:My pagerank... (Score:1)
Re:My pagerank... (Score:1)
I absolutely agree. The people who go down will say that the new algorithm is worse, and the people who go up will say it's an improvement.
In my case, I've been steadily going up for a while, and so the fact that I went up doesn't seem obviously related to a change in algorithm.
For a while I've held the #1 spot for 'Andromeda' and I just went to #5 from #7 for 'MP3 server'.
Personally, I can't tell the difference in searches for other sites.
From what I understand, the change is supposed to help new sites, which seems like a good thing (unless it's easy to exploit).
Pideon (Score:5, Funny)
Re:Pideon (Score:4, Funny)
So for GOD'S SAKE MAN, SMILE!
Re:Pideon (Score:1)
other search engines/ They all need to get better (Score:5, Interesting)
I had an instructor point us to a page on networking that was amazing good but not found on any of those 3 search engines, at least in the top 30. Most of those top 30 hits wern't very good either.
Maybe yahoo has it right, The web should be indexed by people.
Re:other search engines/ They all need to get bett (Score:5, Informative)
But how about not one that requires sites to pay to get in? (Yahoo)
At Google, go to the "Directory" tab, or go to DMOZ.org (Open Directory) itself. DMOZ is bigger, better organized, has fewer broken links, no ads, and is built by hand by people who know their categories and are interested in keeping them linking only to sites with meaningful content.
Semi-mindles search spiders are not all there is in finding stuff on the Internet.
Re:other search engines/ They all need to get bett (Score:5, Informative)
At Google, go to the "Directory" tab, or go to DMOZ.org (Open Directory) itself. DMOZ is bigger, better organized, has fewer broken links, no ads, and is built by hand by people who know their categories and are interested in keeping them linking only to sites with meaningful content.
First, I would suggest going directly to the categories at dmoz.org rather than the Google relistings. Google picks up revised RDF dumps from DMOZ whenever they please, but the lag in the cycle is pretty long. If you are looking for the "fresher" data, go directly to the source.
Second, DMOZ can become what you say it is only with proper editing. The project itself may list 50000+ editors [dmoz.org], but they're volunteers and there is a lot of ground to cover. A large number of edits are made by those "high up" in the directory structure to "lower"/"deeper" categories less well understood. Certain branches of the project are neglected; others eat editors for breakfast with the amount of work that needs to be done. Volunteer and help out.
You may also want to investigate ChefMoz [chefmoz.org] and MusicMoz [musicmoz.org], too.
Re:other search engines/ They all need to get bett (Score:2)
Re:other search engines/ They all need to get bett (Score:2)
Re:other search engines/ They all need to get bett (Score:1)
Alltheweb (Score:2)
Don't get the wrong idea, I like google, but having just one search engine that's commonly used is a bad idea. No other company can really compete due to the fact that google has so much traffic to sell ads on.
So anyway, that's why I pimp alltheweb whenever this topic comes up.
Re:Alltheweb (Score:2)
I use it if google gives me less than a dozen hits for a query.
For Me: Google First, Alltheweb Second (Score:5, Interesting)
Background: Among other things, I am always trying to discover music from independent (especially blues) artists that post mp3's of their stuff on the web. I have been boycotting the major record labels 100% for about 15 years (hooray for independents!) for several reasons: 1: CD prices have always been a rip-off, 2: most major label artists suck, 3: I have worked in music business (artist/bands/production) and detest the industry for the way that it exploits artists, 4: I have always loved discovering talented "unknowns" and turning other people on to them. I went through a new music dry spell until the web started to become a vehicle for independent artists to promote themselves.
It's amazing what's out there now - I've found great artists from all over the Americas, Europe, Australia, Russia and even a few from Asia. I have found a lot of crap, too. The mp3 search engines are essentially useless for this purpose (I don't want major label music) and I have never used Napster or any of the off-spring. Links pages are more often out of date than not and webrings have similar problems. I have contrived several search techniques that try to exploit the strengths of search engines and the likely information on an artist's site. One very simple one is to look for "mp3 +(insert name of a well-known blues standard) -(a lot of keywords to exclude the many sites that put "mp3" on every page that simply lists a song title just to pull in traffic) -(specific sites that pollute the searches)", to find artists that cover the song and also have their own tunes.
I have been a proponent of Google for many years. It came along just as I really started to dislike Altavista and I was an almost instant convert. But I am always on the lookout for a backup or something better. I have tried Teoma several times in the last year (as recently as last night), but I'm not terribly impressed. I find its interface and the way it presents results simplistic and dumbed down and it appears to have indexed far less of the web than Google. I got turned off Lycos years ago, when it seemed to want to become another portal/Yahoo (as if we need another one).
The one search engine that I do use as a regular alternative to Google is Alltheweb [alltheweb.com]. For one thing, IMO, its advanced search is currently better than Google's (I swear that I have brought Google to its knees by entering too many keywords - it stops responding and is inaccessible for several minutes thereafter - this has happened several times). When I've done back to back comparisons with Google, Alltheweb seems to fare pretty well and seems to find more international pages than Google. The difference in top rankings can also be useful. Google has some nice features that Alltheweb does not, such as the elimination of duplicate pages.
For one-stop searching, I find Google best for me, but Alltheweb is a good alternative.
Hype hurts (Score:4, Insightful)
AlltheWeb.com must be soaking this up with glee.
-- jetlag --
Goatse.cx no longer in googles search results (Score:1, Funny)
Re:Goatse.cx no longer in googles search results (Score:3, Insightful)
Re:Goatse.cx no longer in googles search results (Score:1)
Maybe this shows that google really doesn't give a link more weight if it is repeated often on a popular site.
Re:Goatse.cx no longer in googles search results (Score:3, Informative)
I wonder what the folks at Google are thinking when that shows up as a high-ranking source of traffic... I also wonder why somebody felt that was neccesary to even put. Ah well.
Who cares (Score:1)
This seems to be the most evidence presented in the article. Looks like baseless whining to me.
Google Classic? (Score:2, Funny)
Don't Panic (Score:4, Insightful)
There are glitches in every complicated software. Maybe they were trying out some new algorithm that wasn't completely refined yet. Maybe it was a random off-by-one bug that has been already fixed. Shit happens all the time, Google is no different.
There will probably be many people who try to see a conspiracy theory behind this and say that Google has sold out.
This is very unlikely. The nature of the described flaw suggests that all queries are affected. Now why should they skew the results of everything to appease a single entity who might have given them some money? That just doesn't make sense.
Dmoz is King (Score:5, Interesting)
We who are editors at dmoz hold a lot of power right now. Its time for you to share in some of that power. Head over to dmoz and apply to edit your favorite category.
Can't decide where to apply?
Re:Dmoz is King (Score:2, Interesting)
We who are editors at dmoz hold a lot of power right now.
I fear that is all too true. I have encountered Google entries ranked "top five" in result sets of greater than 10,000 members where: (a) the search term(s) are not present in the document; (b) no other Google-indexed document links to the one in question; (c) sole use of the search term(s) is in a DMOZ listing of the document.
It would seem that getting listed on DMOZ can really affect Google "placement" -- I cannot determine whether this is the result of simply weighting DMOZ or due to a "multiplier effect" as Google indexes all those many, many sites who utilize the RDF generated by the Open Directory Project.
Then again, Google top-ranks a Web site that hasn't been on the Web in over a year for one query I run with frequency. I don't think I can point the finger at the ODP for that one.
Re:Dmoz is King (Score:2)
That's why I prefer yahoo.
"Dead pages" complaint is real (Score:5, Interesting)
(I still haven't found what I was looking for
Re:"Dead pages" complaint is real (Score:1)
Re:"Dead pages" complaint is real (Score:3, Informative)
Tho I did try in Mozilla as well, same result. THEN I checked docsource in both browsers, and saw what I noted in my previous post.
The live pages on the SAME server were behaving normally in Netscape, so it's clearly not a connexion problem (in NS, that usually produces the "connexion reset by peer" complaint). When I went to their root site and tried looking thru their links and site search, it appeared that the entire branch I'd hoped to read had been deleted (anyway, no reference to it anywhere in sight) -- assuming it ever existed!! There was no scripting involved, and no reason for anyone at this site to give a damn where they're ranked.
If you're wondering, I was looking for historical maps incrementally showing the shoreline for Devil's Lake, North Dakota, to supplement what I've found in ancient atlases.
Re:What? (Score:2)
Re:"Dead pages" complaint is real (Score:4, Interesting)
I got a helluva lot of dead links and blank pages.
Definitely so [google.com] - check the first result. This is nothing new. For the referenced query, the faulty top-ranking has existed for a long time, though the site in question hasn't existed for over a year. [archive.org] I've even written support a number of times (blatant errors such as these are shameful). Google is far from perfect.
Re:"Dead pages" complaint is real (Score:2)
Re:"Dead pages" complaint is real (Score:2)
The first result to that query [...] does appear to be up...
You're posting two plus days, one more email to support (see thread), and a direct removal request (see thread) by me later ... they did finally kill the errant entry. One year after it was removed from the 'Net, but still it's gone.
One small step for...
Re:"Dead pages" complaint is real (Score:2)
But the next few pages are full of good hits. So is your point that people never check anything but the first result, or that google is not allowed to have 1 page out of several dozen be a bad hit?
My point is that Google decidedly indexes and top ranks "dead" sites. Check that. *Long* dead sites. I can accept a dead entry for a recently pulled site. I might even be tempted to accept a dead entry for a site pulled several months ago should it be ranked deep within a given set of results. But to top rank a site gone for twelve months or more points to exceedingly flaky behind-the-scenes behavior on the part of Google. Whether results [2 .. n] are valid has no bearing on my position.
As to whether people only check the first result: Judging from traffic to my sites through Google and the corresponding ranking of them over time, it would seem that people do frequent top-most entries more than bottom-most. That's common sense. Regardless, I would hazard the guess that leaving a "first" entry broken for so long collectively wastes many people's time. It certainly doesn't put a best foot forward in quelling concerns about the relevance of Google search results, does it?
Re:"Dead pages" complaint is real (Score:2)
As to indexing dead links, since the alternative of finding a dead page in Google's cache is useful, I can understand doing this for that purpose. However -- perhaps Google should INDICATE known dead links, so we don't waste our time chasing them, and can immediately go to the cached page instead.
Re:"Dead pages" complaint is real (Score:2)
I hadn't really thought about it til I read your replies, but I hardly ever click the top result.
Sounds like a variant of "banner blindness" ... Google Goggles?
As to indexing dead links, since the alternative of finding a dead page in Google's cache is useful [...]
Ah, but the cited example doesn't offer a cached version. So much for that.
Google cache is only beneficial in cases where the page no longer exists or cannot be accessed. Neat and all, but of use to only a small portion of users, I would think. For sites which update frequently (given the general lag in Google indexing), a cached version is detrimental, since the content would not accurately reflect the "true" site. The benefit in other cases is broadly questionable, as "dead" sites are (usually) pruned from Google's index, along with cached contents.
Apart from Slashdotted servers where Google already indexed the content (not likely for new documents), the window of utility is limited -- assuming that people can keep their servers up. I would suggest that archive.org serves the same purpose nicely, but that collection has its own troubles.
Perhaps Google should INDICATE known dead links.
Perhaps Google shouldn't index dead links at all. There's geek chic in placing document copies in cache, but that's about it. Google only caches the document, not the images or other embedded content, or referenced stylesheets or script elements. For many, many sites (unfortunately) much is lost in retrieving only the base document. This again reduces utility.
Favoring inclusion of "dead" documents will quickly lead to an immense repository of this and that, some live and some not. Sounds an awful lot like archive.org and I don't really understand why Google would even bother. Indexing dead pages (cache or no cache) only clutters up results and confuses what one is actually searching.
Flagging the dead from the living isn't a whole solution, but it's a start. Although, were Google to team up with archive.org to permit cross-linking and separation of duties (live Web / dead Web) that might make a bit more sense (to me). I doubt that will happen, given the ties of archive.org to another search technology.
I noticed this myself (Score:2, Informative)
They Kill PageRank to protect Microsoft (Score:2, Informative)
Re:They Kill PageRank to protect Microsoft (Score:1)
Re:They Kill PageRank to protect Microsoft (Score:2, Insightful)
registro wrote:
This doesn't make sense. The current top result on a search for `correo gratis' [google.com] is LatinMail - Tu Correo Gratuito En Español [latinmail.com]. It seems to me that the listings have gotten more accurate, not less, if my search in Spanish for free mail returns a Spanish company instead of a Yankee one (Hotmail is Microsoft, in case you weren't aware). LatinMail belongs to eresMas [eresmas.com], who are headquartered in Madrid.
Re:They Kill PageRank to protect Microsoft (Score:2)
You have a good point there, but that should reasonably weight the results. I checked correo gratis [google.com]. Hotmail.com/ES [hotmail.com] was ranked five hundred and eighty seven. In my oppinion that result is broken.
-
I doubt it (Score:2)
If that was all they were worried about, they could simply have manually changed those searches to exclude MS (as they have done for people as small as Bernie Shiftman [alltheweb.com]), that guy who spammed his resume around everywhere. Searches on his name would turn up pages bitching about him.)
Btw, you definitely deserve a +5 for plagiarizing this post verbatim [webmasterworld.com]. Well, except for the paragraph breaks, I guess.
How exactly is this a bad thing? (Score:2)
I noticed the same thing (Score:1)
xenu.net (Score:1)
I encourage people not to judge Google's update before a few patches. All x.0 releases are non-functional. All you software engineer should know.
-- nyri
End paragraph says it all (Score:4, Insightful)
Wow, can you people even read? (Score:4, Interesting)
Actualy, according to the artical, a few people who run blogs seem to think that google has been degraded, while google itself has not seen a higher number of actual complaints.
Basicaly what happened is that google took some mesures to reduce the effects of "googlebombing" by bloggers.
Re:Wow, can you people even read? (Score:2)
I'm not a blogger and I've noticed more and more crap results on the first pages of my google searches in the past month or so. Unfortunately for Google, I haven't really stopped to think about why my search results have been crappy (and thus, complained to Google about it), but I have noticed I've been getting better results from alltheweb.com.
The silent majority aren't going to complain about bad search results, they'll just move to another site that does better. How do you think Google managed to steal the crown from Altavista, et al. in the first place?
Re:Wow, can you people even read? (Score:2)
The results are hosed, the timing of the article is interesting for me, because I just sent a complaint to google about 30 days ago. Any search that contains the term "wholesale" in it contains pages and pages of results (some times as many as ten consective pages) that all redirect to ebay. Now obviously "wholesale" is going to be a spam magnet, but to have in excess of 75% of the first 10 pages of results all link to some sort of redirection script indicates some sort of page ranking issue.
maru
People are cheating (Score:5, Insightful)
I work for a company that hosts pr0n sites. Maybe 95% of our partneers are cheating that way. Fake sites, fake auto-generated HTML pages (with pseudo real sentences), cloaking (what Google sees is not what visitors see), javascript tricks, etc. are a must. They spend most of their time on trapping google, it brings more money than working on the site itself.
The company I'm working for has even a team working full-time on this (spamming search engines, and creating thousands of fake sites just to promote one real site) .
darn (Score:4, Funny)
Search Engine Optimisation - Don't waste your time (Score:5, Insightful)
Your job, as a webmaster, is to produce a user friendly, useful, maybe informative website.
Google's job, as a search engine, is to find the sites most likely to be of interest to a user, based on their search terms.
Therefore,
To get good rankings, all you have to do as a webmaster is produce a user friendly, useful, maybe informative website.
It is Google's job to optimise to the web, not the web's job to optimise to Google.
So,
Search Engine Optimisation is big massive NET LOSS to you, because all it results in is getting visitors who aren't the slightest bit interested in your website or product.
It also results in a soon to be pretty useless Google, so please don't do it.
Re:Search Engine Optimisation - Don't waste your t (Score:5, Insightful)
Then, one day, someone would rediscover that by putting "Anna Kournikova Blowjob" on their site, they'd get some hits they wouldn't have otherwise. No harm, they think, it's a few bytes and it's not hurting anyone. Webmaster tells his friends, they tell their coworkers, and this continues until the search engines have to begin working around the issue.
Sound familiar?
Unfortunately, people aren't always honest, especially when it's something they perceive as benefiting them without hurting anyone else, especially if the benefit is financial. They don't care about the integrity of some other website's engine, they care about profits. Realize also that even personal sites do this; people like other people to see what they've made. While your idea is a great fantasy, near-perfect search results will only come from human-edited sites, or a better algorithm than we have now.
Re:Search Engine Optimisation - Don't waste your t (Score:2)
Your job, as a webmaster, is to produce a user friendly, useful, maybe informative website.
Google's job, as a search engine, is to find the sites most likely to be of interest to a user, based on their search terms.
And somewhere in the middle, the two concerns meet. There are a *lot* of unintentionally odd entries in Google that should be cleared out as misleading. Three examples: (1) Redirection URLs from one site but content from the targeted site; (2) Past versions of multi-version documents such as Wiki revision URLs; and (3) Static redirection documents from relocated sites.
All should probably be combatted by site maintainers through application of robot exclusion rules and a little redirection know-how. However, given the frequency with which I've encounted these awkward entries, I'm guessing that attention to such detail has gone by the wayside.
How will I trick kids into looking at my Butt?? (Score:2)
Your job as a webmaster: Generate massive quantities of traffic... While figuring out how to be profitable.
Google's job, as a search engine: Generate massive quantities of traffic... While figuring out how to be profitable.
Now, the webmaster could accomplish that by offering tons of great content at a better price, and Google can accomplish this by having the most relevant results. That takes care of the generating traffic.
Now, profitability... Tons of great quantity at a reasonable price will not be supported by 100 users... You need hundreds of thousands for the economies of scale to work in your favor. End result, you do everything in your power to drive traffic at your website. Top of mind awareness, etc... What you postulate is that they should stop sending me Credit Card offers because if I wanted one, I'd call. True, but would I call your company or another one??
SEO actually shows that we are achieving a Search engine monoculture. Back in the day when we had varied competing engines, bombing one site hardly had any impact. (Actually, most results were bombed so you got used to checking several engines.) And these techniques were around about two minutes after the first Search Engine went online.
Google was an evolution beyond the early Excite, Lycos, et. al. If an unusable Google is the result, new ideas will surface that are evolutionary. This shows that there are areas where you could improve on Google search technology and either compete or license it to Google. Then we will have an improvement over Google like Google was over Infoseek, like Infoseek was an improvement over Yahoo.
Webmasters are NEVER going to play nice with the search engines. The job of the search engine designer will be to limit the ways in which his system can be abused. XHTML should be a big help, as most search engines get easily fooled by having their parsers parse verrrrrrry loose HTML. When they start actual content checking and checking for invisible divs and other HTML tricks, the engines will evolve again. I'm suprised with the increase in processor speed it hasn't happened more recently, but the extra horsepower has been dedicated to indexing PDF etc... Next generation maybe.
Hammy
Re:Search Engine Optimisation - Don't waste your t (Score:2)
On the real web, the one where I make my living building and promoting commerce sites, the truth is that no matter how aesthetically pleasing, no matter how well engineered for human use, no matter how informative your site is or how low your prices are, if your site is not easily located by the methods people actually use, it is worthless.
I don't waste my time doing search engine optimization -- I make very good use of my time doing search engine optimization. I don't use doorway pages, spamdexing, googlebaiting, or any such cheats, but I do build every site from the ground up with the primary focus being search engine spidering and indexing. That's why I'm still in business, and my clients are not only still in business but are spending right now on expansions and upgrades to their sites.
Your utopian vision would be fine if the real world would just consent to be ruled by it, but the real world is a fickle and contrary organism that defies such simplistic wishfulness.
observations... (Score:5, Interesting)
* worse rankings with a particular keyword mean that a company will seriously consider using AdWords to maximize the traffic gain from google - so hey, it's good for google
* the big mistake is to use a "static" relationship between websites as a measure for a site's traffic or importance - better offer a "google counter" (google has the resources, I suppose)
All things considered, Google is still doing pretty well.
-lj
Been slipping for awhile (Score:2, Interesting)
Was having problems with a program last night and kept finding links from 1996. I kept saying to myself, "My good man, surely those bugs have been fixed since then!"
Looking for things on the net with any search engine is a distinctly unpleasant experience. Kind of an impotent feeling to type in an explicit search with all the trappings ("bla bla" -foo -gar..etc), and to get a bunch of nothing in the results.
And don't ask me why, but I've been getting an uneasy feeling about google. Where do they get all the money such a company needs to stay afloat? They're in a pretty good position to learn all kinds of neat details about everybody. Pretty powerful position to hold.
My rank has gone down! (Score:2)
"tim ward cambridge" always used to get me in seven of the top ten places - now it's down to six!! Better get my mates to put up a few more links
But as I'm still on the first page with "accommodation cambridge" I can agree - famous people are worse off, but technical queries are unaffected.
They obviously DO tinker... (Score:5, Interesting)
For a long time, a search on "Samuel Johnson" returned Frank Lynch's "Samuel Johnson Sound Bite Page" as the first hit. And, flatteringly, but mysteriously, a search on "Eyeglass Prescription" returned a web page of mine as the first hit. (I say "mysteriously" because the only page that Google reports as linking to my page is... my own home page! So it is not PageRank that accounted for its ranking).
About a month ago, Frank's page dropped to #3 and mine dropped to about #20. In Frank's case, the #1 spot went to a fine Samuel Johnson web site at Rutgers; in mine, I was edged out by a bunch of commercial sites selling eyeglasses.
The interesting thing is that two or weeks ago both sites popped back to number 1.
And then a few weeks later, Frank's is again at #3 and mine is down around #10 or so.
I don't think there's any reason why eyeglass prescriptions and Samuel Johnson would be connected. (And, no, Frank's page and mine do NOT link to each other!) So the changes must reflect tinkering by Google.
Neither Frank nor I use any kind of "cheating" to boost our ratings. And I don't think the sites that climbed above our did, either. Nor do I think many of the sites involved changed ANYTHING significant that would have altered their rankings.
(BTW I'm NOT giving URL's because the contents of these pages are irrelevant to my observations, I don't want them slashdotted, and this is NOT an attempt to boost the rankings of either page).
They do tend to oscilate: cache effect? (Score:2)
Stanford's not #1 any more (Score:3, Interesting)
He still beats Tracey Ullman, though.
Plural oddity (Score:4, Informative)
It does not seem like a very good idea to me in most cases.
Re:Plural oddity (Score:2, Informative)
Re:Plural oddity (Score:2)
If it is an English word, then they should ingore plurality. Generally, it should be pretty easy to determine the language of a page anyhow using statistical probabilities, etc. If the metric suggests that it is mixed, then maybe be more literal.
Re:Plural oddity (Score:2, Funny)
And Latin stole its words from medical texts.
Will we will pay for our relevance? (Score:2, Interesting)
The service search engines provide to all of us, in the face of what can only be charitably called a glut of data (some of it actually useful to some of us) is remarkable. (And let's be clear: We're talking about Google, possibly the only thing keeping the net from collapsing under it's own weight in ether.)
It's great that Google is effectively "free" to us, but should that be so? I'm personally willing to pay a nominal charge -- perhaps in the form of a subscription -- for access to the most effective search algorithms. It has that much value to me. If someone wants to create a free system -- as Google has done -- that's great. But as the web gets bigger and hairer, we might need to think about ways to support (read, fund) continued development of effective search/navigation tools.
There's some serious linear algebra in publicly disclosed portions of the PageRank(TM) algorithm, and the really good stuff is probably even funkier. This sort of thing isn't going to be hacked together on the weekend by your average penguinista. Could the brains behind Google go diving for herring in richer seas? I suspect so.
Perfect search results? (Score:4, Informative)
The pagerank algorithm is one of the most important reasons why it is so good in bringing up relevant and popular results. But, this is just one of the ways of searching for good results and will not always work to your satisfaction.
Google gives preference to the number and quality of links to a particular site rather than the content of the site itself. One can easily come up with cases when this is probably not the best approach.
For example, consider a portion of the web containing lyrics of songs. If you search by artist name or song name, google will return excellent results, because the pages are probably linked using the names. However, if you only know the soneg from radio, you might want to search for songs containing a few particular lines. The pagerank algorithm might not be the best fit here.
Re:Perfect search results? (Score:5, Informative)
Google should get spiteful (Score:5, Insightful)
Result: pr0n sites will be too terrified of deletion to munge their ratings.
Re:Google should get spiteful (Score:2, Interesting)
Second: for persistent offenders, don't just delete the spamming site, delete all the sites they own or operate. Or threaten the ISP with black-holing.
Open source software for PageRank (Score:2, Interesting)
Observations on Google cache (Score:3, Interesting)
A few quick observations (Score:2)
When I just search and get multiple results, on average one of the page 1 links is faulty (dead/404/not the same page anymore/spam). But this is not new or news, I have run into this for years with Google.
There is still no better option out there, regardless of a few new quirks in PageRank that might be noticed by those trying to bomb it anyways. Of course, if you taunt the monkeys, they'll scream and complain.
Punishing META tags? (Score:5, Insightful)
Now, I have written some software for a reasonably obscure research area in computer science. It used to be that when I searched Google for "obscure area" (where that's the name of my obscure area, not the actual words), I would rank somewhere in the 40's. This seemed quite reasonable to me. My software is fairly widely used, but is by no means the number one concern in that research area. It's free, GPL software so I have no monetary stake here.
Now if I search for "obscure area", I rank somewhere around 140! Worse still, if I search for "obscure area software", I still don't rank in the top 100! There are questions on mailing list archives from people _looking_ for "obscure area software" that rank in the top 10!
Only a handful of research groups have software in this area, certainly less than 10. The #1 ranked site in that search is actually based on my software, mentions it, and links to my software's site! A few other pages link to my site, including some that are ranked very high in that research area (including the #1 page).
So here's the kicker. My pages have keyword metatags that honestly provide information about what's on the page. I'm wondering if Google has started treating such keywords as manipulation attempts and is punishing my pages.
Since I doubt anyone can say for sure, I guess I'll have to remove the tags and see if my ranking rises. The ranking is not, frankly, terribly important to me. People who want such software can ask on a mailing list, but it does raise concerns that some honest attempts to accurately categorize pages could now result in lower ranks. It's now significantly harder to find something that is exactly what it claims to be!
Then again, maybe the tags have nothing to do with it and ranking is just broken. But it seems awfully strange...
Re:Punishing META tags? (Score:2)
If you don't get it, just move on.
Irish Porn Star (Score:2)
Re:Vanity, thy name is me (Score:1)
Re:Ssshhh... (Score:2)
Re:Ssshhh... (Score:2)
Any Google folk reading here today who'd care to comment on the situation??