Slashdot Log In
Modern Day Search Engine Manipulations
Posted by
CmdrTaco
on Wed Aug 14, 2002 06:46 PM
from the what-happened-to-just-being-popular? dept.
from the what-happened-to-just-being-popular? dept.
An anonymous reader writes "I fondly recall the days of yore when search engines could be manipulated just by sticking thousands of extraneous filler words in the META tags or hidden at the bottom of the page. Nowadays search engines work by more advanced techniques that generally don't fall prey to these simplistic tactics, but it'd be folly to presume them impervious. Does it still happen?"
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
of course. (Score:2)
yep (Score:5, Informative)
Much like security, I think this is the kind of thing that hackers and tinkerers will always find a way to exploit. The question is who can stay ahead in the race?
Re:yep (Score:2)
the new status quo (Score:5, Interesting)
-- Greg
Re:the new status quo (Score:4, Informative)
It's also worthwhile to mention that Yahoo's not really a search engine in the sense of something that crawls the internet looking for info; they generally rely on submissions, with which they're surely inundated, and that tiny subset of the internet is what they search.
As for sponsored links, 75% of the "sponsored links" on search engines are culled from Overture (formerly goto.com). Goto took a lot of heat back in the day for selling search results, but they've found a market in selling these results to other engines. Until like 3 or 4 months ago, their results were on Yahoo, AOL, Netscape, Altavista, and most other search engines. Then Google got into the bid-for-keywords market with their Adwords Select program. Now in addition to searches on google.com, Google's adwords show up on searches on AOL, Earthlink, and a few others. The process is basically as you described - bidding for keywords. Usually it's not worth bothering unless you're in the top 3 for that keyword on Overture, as those are the ones that show up on Yahoo (I think #4 and #5 show up at the bottom of the page). On Google I've seen up to 8 ads for a given keyword (e.g. computers [google.com]) but AOL only takes the top 3 for its "sponsored matches" as well.
On Google it's important to note that the sponsored sites and the real search results are completely separate (dependent on how much you trust google, of course, but they have a lot of karma built up), and google's results are gleaned from having their robot (Googlebot [google.com]) crawl the web, not from submissions; and the algorithm that ranks sites is another matter entirely. E.g. a search for "ass grabbing computers" [google.com] predictably has 0 results, but there are plenty of ads for the word 'computer' that pop up.
It's doubly important to note the above about google since many Yahoo searches fall through to google when there aren't any results in yahoo's (IMO Lame) directory, so the results from yahoo are not as paid-for as you seem to imply.
Parent
who cares? (Score:5, Insightful)
I had a boss that was asking me "How do we improve our site on google?"
Answer: Provide actual information instead of some glossy maketrdroid garbage that is so prevalent in webpages today and you wouldn't have to worry about the search engines would you?
Re:who cares? (Score:3, Informative)
Let's say you had the best article in the world about installed redhat, but the link was to www.fperkins.com/tip.cgi?101
Forget about it, you just won't get linked in the top 10. A good trick is to have your dynamic content create a static page which is, of course, dynamically created from the database. Then you would get something simliar to what allrecipes.com does.
Ie their recipe for "African Chicken Soup" is not recipe_view.asp?id=100 but rather http://chicken.allrecipes.com/az/africanchickenst
Smart. Notice how they even have a subdomain to chicken.allrecipes.com which can be setup really easily for most sites, especially those that can alias any subdomain to the main domain.
Regardless, getting ranked in the top20 in Search Engines is some skill and knowledge and a lot of luck.
Fixing Google (Score:2)
I've heard accusations that Google can be "fixed" by creating lots of phony sites that link to your site. Scientology sites are famous for that. I'm sceptical -- thousands of links from sites nobody visits have less impact than one link from a site everybody visits.
Re:Fixing Google (Score:2, Insightful)
Google Limitations (Score:5, Informative)
For example, a search on google for "plaid socks [google.com]" yields only 1 or 2 sites out of 100 that have 3 or more CGI parameters, when I'm sure there are many sites using very complicated urls (with session IDs, etc). Sure, this is just anecdotal evidence, but as someone whose product catalog was listed by urls that had at least 3 CGI parameters (and sometimes 5 or 6 depending on the referring URL) I can say with 90% confidence that having a "complicated" URL severely hurt us. What I ended up doing recently was using mod_rewrite to change all the listed URLs on our site from site.com/product.cgi?sku=something§ion=2&styl
What does this have to do with Google's relevance? Sure, they are returning relevant results when you search, but if they are arbitrarily not listing a site because its URL structure is too "complex" then there's a ton of possibly relevant content that they're missing. If you're someone who sells plaid socks for $10 less than your nearest competitor but Google isn't indexing your plaid socks page because of URL structure (exactly what was happening to us, except not for plaid socks) then you're really not getting the most relevant results. Which is not to say that what you DO see isn't relevant, it's just that there's possibly MORE relevant stuff that you won't ever see.
Fortunately Google has something in the works to cover this particular situation, but it doesn't really have anything to do with fixing their URL complexity policy.
Parent
Re:Google Limitations (Score:2, Insightful)
Re:who cares? (Score:2, Interesting)
Sometimes that's true, but not always. I created a site for a small business that sells fireplaces. When doing a google search for "fireplace", hundreds of sites show up before ours. One that especially irks me is a site that has about 6 pictures of fireplaces
My point is that providing *real* information helps you *none* in relation to Google rankings.
Re:who cares? (Score:2)
Basic Marketing (Score:2)
It really becomes a question of what kind of market searches to you want to show up in.
Random Searches? File searches? product searches?
What is your market? If you do not know what searches you want to show up in, then how can you push yourself higher in google?
You got fired, right? (Score:3, Funny)
Re:who cares? (Score:2)
At a guess, Google does something clever to try and ignore banner ads, so you make some guesses as to how Google spots an ad, and only pay for ads that don't look like ads... (like, what if you paid for a "banner" ad, and then requested that the site just stick in an ordinary text link rather than a banner graphic?).
Another way to promote your site (Score:2)
this trick works every time (Score:5, Interesting)
Here's one I use all the time.. just follow these easy steps:
Now, watch your Google ranking rise to the top! IT'S THAT EASY! And you'll laugh all the way to the bank!
No! Wrong! (Score:4, Funny)
Parent
Re:this trick works every time (Score:3, Insightful)
You forgot to say "make sure it works in lynx because all disabled people use lynx as their browser."
Who makes the "guidelines" for usability. For accesibility? Do all disabled people get lumped together so that one guideline fit's all? Each disabled person has their own difficulties and there is no one size fits all approach. Disabled people are no different that any other person and it is up to them to empower themselves with the technology to view any webpage regardless of guidelines used.
Maybe we can use the gubment's guidelines and use PDF files which rate along with Flash as major web annoyances. I mean, so what if a disabled person gets annoyed having thir computer freeze because some clueless moron decided that the best way to give out a 1 page brochure was to put it into a 2 mb PDF. Don't you think disabled peopel get annoyed at this crap also. But it's okay, because it fit's the disability guidelline.
The best guideline any web developer can use is both common sense and do not interfere with the user regardless if they are disabled or not.
Re:this trick works every time (Score:2)
product quality has nothing to do with popularity, you should know this - we're on a Linux-centric site after all
Re:this trick works every time (Score:4, Funny)
Create a well-designed, easy-to-use web site that follows accessibility and useability guidelines.
Fill the web site with useful, relevant information on a selection of topics.
Make sure the information is kept up to date, and don't let it become stale.
Allow this web site to become popular and authoritative, so lots of people link to it and reference it.
?????
Profit!!!
Parent
Search Engine Optimization (Score:2, Interesting)
The Britney Spears mystery (Score:3, Informative)
The relevant bit on one of the Britney Spears pages seems to be:
Which, yeah, seems to be a roundabout bit of Google bombing.
The question is -- how does this help Shavlik? Presumably there aren't that many people searching for Britney Spears content who say, "Oooh, a way to push Windows patches through a network! I want that!" You'd think the Google algorithm would weight links according to their relevance to the search criteria.
Re:The Britney Spears mystery (Score:2)
anyone else agree/
Re:My guess is that it's a problem with IP numbers (Score:2)
Oh yes, this was a real problem for me. I run poetrycontestonline.com [poetrycontestonline.com] and I also registered psychicweb.net in an insane fit, thinking I could capitalize on the Ms. Cleo and John Edward syndrome. I let psychicweb.net expire after pointing it to the same IP address as poetrycontestonline.com as a virtual host. For months after psychicweb.net expired, google thought that poetrycontestonline.com was psychicweb.net. A search for poetrycontestonline.com would yield cache links that had psychicweb.net as the domain name. Also, searches on things like "Free Poetry Contest" would yield links to psychicweb.net and not poetrycontestonline.com, which means that after the domain expired, I was effectively removed from google for almost a year.
I hope they got it fixed now, because this behavior was very annoying. Had my site been more of a real business site, I would have been pretty pissed off about it.
Can't have it both ways... (Score:2)
How To Promote Your Own Site
Clearly there is some awareness out there as to how to manipulate the search rankings, and following are a few methods that I think are common:
In no way am I promoting any method that encourages false search rank increases.
Is it just me or is there something just slightly contradictory about these statements?
The Church of Scientology (allegedly) Does It (Score:2, Interesting)
not right (Score:3, Informative)
There may be some confusion because the Google Toolbar, when viewing a page that hasn't been indexed, tries to "guess" what it's PageRank would be based on the site PageRank... but that's not "real".
If you want to know more about Google, the place to go is the Webmaster World Google forum [webmasterworld.com].
Danny.
eBags (Score:3, Interesting)
Effectively, they circumvented Google's "site grouping" wherein all hits from one site get clustered under a smaller group. I got fed up with it and resolved not to buy anything from eBags.
But I thought to myself, "maybe they're Scientologists..."
Re:eBags (Score:2)
Not that I should admit to this... (Score:4, Interesting)
For instance. Webcam32 Crack [google.com]
Yes, I OWN webcam32. So there.
The point is, the first THREE PAGES are
Re:Not that I should admit to this... (Score:2)
[snip]
The point is, the first THREE PAGES are
"Five minutes later, 'webcam32' became the most popular search term in Google history..."
Re:Not that I should admit to this... (Score:2)
That's because for that type of search you're supposed to use "astalavista [astalavista.box.sk]." Really. Google's great, but the real hax0rZ know how to filter their own...
More background reading (Score:4, Interesting)
Here are some more URLs that might be of interest:
When my websites needed to be ranked high... (Score:3, Informative)
Multiple sites with same content... (Score:2)
Links on Google would show up under one site name, but not the other. Apparently Google does something on the back-end to determine that the contents are identical and assign the listing to one of the domain names (in this case the older one).
Only after feeding all visits to the old domain with a 301 and then sending them along to the new domain name did Google's results update to only indicate the new one.
Google ranking tips (Score:2, Informative)
It all boils down to everything in moderation.
So you have 'normal' amount of meta-keywords, say about 5-9, and the same effect in the title.
Another one that is debated to work is
http://keyword1.keyword2.com/keyword3
Basically, IMO google trys to limit results to 'real' pages.
Easy. (Score:4, Funny)
Put a META tag containing the follow words:
grain, rice, corn, worms, wheat - worked like a charm. You get the idea.
HUMANS do it better... (Score:4, Informative)
Just a shameless plug here for the Open Directory Project [dmoz.org]. Leaving aside occasional occurances of editor-fraud or editor-abuse (which are quickly tracked down by the meta-editors), this is the best way to determine a site's real value.
A human looking at the page to subjectively/objectively determine its value is something that can't be replaced by a spider and an AI program.
URL cloaking, hidden text, keyword tricks, etc... don't matter. =)
-jc
Re:HUMANS do it better... (Score:4, Informative)
If humans ever got around to doing it.
I know MANY webmasters still waiting for the sites to be reviewed, months later.
Cheers,
Backov
Parent
I wonder why this is the first link to my page is (Score:2, Interesting)
Google Bombing (Score:4, Interesting)
Re:Google Bombing (Score:2)
YAGA - Yet Another Google Article (Score:3, Funny)
Oh yeah, it sure does get manipulated. (Score:2)
Check the results. Are there _any_ relevant ones?
Pretty much nope.
Getting a good Google rank is child's play (Score:2, Informative)
How did he do that? Here's the explanation [blogspot.com] - far shorter and clearer than that article.
Re:An interesting question (Score:2, Informative)
Re:"invisible links" (Score:2, Funny)
You should try this Google search! [google.com]
Re:And I thought I could read English! (Score:2)
fondly
My heart, untraveled, fondly turns to thee. --Goldsmith.
yore
prey
One that is defenseless, especially in the face of attack; a victim.
folly
impervious
Incapable of being affected: impervious to fear.
looks like fine english to me.