WebCrawler Turns 10 Today

Posted by timothy on Tuesday April 20, 2004 @07:10PM from the young-at-art dept.

Brian Pinkerton writes "WebCrawler, one of the first search engines on the 'Net, turns 10 today. You can read a short history of WebCrawler. When I wrote WebCrawler, one could do a credible job of crawling, indexing, and searching the Web from a single desktop PC. Today, the reality is a little bit different."

This discussion has been archived. No new comments can be posted.

Guess this celebration... (Score:5, Funny)

by oberondarksoul ( 723118 ) writes: on Tuesday April 20, 2004 @07:11PM (#8923332) Homepage

...won't have an accompanying Google Doodle? [google.com]

e-mailing results (Score:4, Interesting)

by qewl ( 671495 ) writes: on Tuesday April 20, 2004 @07:12PM (#8923339)

Ah the nostalgia of receiving search results via e-mail :)

- Re:e-mailing results (Score:5, Informative)
  
  by Brianwa ( 692565 ) writes: <<ten.tsacmoc> <ta> <aw-nairb>> on Tuesday April 20, 2004 @08:03PM (#8923709) Homepage
  
  You can be emailed results from Google as well.
  Simply email google@capeclear.com with the search terms in the subject line, and you will soon receive a response with the results. I think there is a limit to how many times a day you can use this, but I cannot find the link to the project webpage.
  
  - And from there it goes to spam lists, right? (Score:2)
    
    by melted ( 227442 ) writes:
    
    People, don't be stupid, don't send your emails to people you don't know.
    - Re:And from there it goes to spam lists, right? (Score:2)
      
      by Anonymovs Coward ( 724746 ) writes:
      
      People, don't be stupid, don't send your emails to people you don't know.
      So you don't post to mailing lists either?
      BTW, the google-by-mail thing's webpage is here [capeclear.com].
Birthdays (Score:4, Funny)

by 7Ghent ( 115876 ) writes: on Tuesday April 20, 2004 @07:12PM (#8923341) Homepage

Happy birthday to Webcrawler AND Hitler! Hurray!

- Re:Birthdays (Score:4, Funny)
  
  by stevejsmith ( 614145 ) writes: on Tuesday April 20, 2004 @07:22PM (#8923427) Homepage
  
  ...and 4/20.
  
  - Re:Birthdays (Score:1, Funny)
    
    by Anonymous Coward writes:
    
    I'll Toke to that
- Barney Gumble too! (Score:1)
  
  by imlepid ( 214300 ) writes:
  
  Simpsons #AABF06 Homer: Let's see, what's Marge's birthday? Barney is April twentieth, same as Hitler's, so Marge must be fifty ...oh, forget it. Flanders, what's your birthday? (source [snpp.com])
- Godwin's Law Invoked (n/t) (Score:1)
  
  by achaudhary ( 461062 ) writes:
  
  n/t
- Re:Birthdays (Score:2)
  
  by steeef ( 98372 ) writes:
  
  And me (honest), and Crispin Glover!
They used to be my google.... (Score:5, Interesting)

by wo1verin3 ( 473094 ) writes: on Tuesday April 20, 2004 @07:12PM (#8923343) Homepage

I remember when webcrawler was the only search engine I touched...

In 1996 it was nice and simple [archive.org]. Then as the time went on [archive.org] it got a bit too cluttered for my liking [archive.org]. Now looks like they're trying to googlize themselves with the current interface [webcrawler.com].

- Re:They used to be my google.... (Score:5, Funny)
  
  by Basehart ( 633304 ) writes: on Tuesday April 20, 2004 @07:18PM (#8923391)
  
  I really like their pantyhosecrawler [pantyhosecrawler.com] companion site.
  
  Very cool research tool.
  
  - Re:They used to be my google.... (Score:2, Funny)
    
    by antic ( 29198 ) writes:
    
    Pantyhose research? One moderation point left and there's no option for +/-1 WTF!?
    - Re:They used to be my google.... (Score:2)
      
      by Tackhead ( 54550 ) writes:
      
      > > I really like their pantyhosecrawler [pantyhosecrawler.com] companion site.
      > >
      > > Very cool research tool.
      >
      > Pantyhose research? One moderation point left and there's no option for +/-1 WTF!?
      For every kink, fetish, and perversion, there exists at least one adherent with a website. Proof is left as an exercise for the Internet.
      And thus, web crawlers were born.
      - Re:They used to be my google.... (Score:1)
        
        by mog007 ( 677810 ) writes:
        
        Sigmund Freud said that the only unnatural sexual urges were none at all.
  - Re:They used to be my google.... (Score:2)
    
    by MouseR ( 3264 ) writes:
    
    Since when does linking to a porn site get modded up "interesting"??
    
    Gee. Pimple out folks.
    - Re:They used to be my google.... (Score:1)
      
      by wo1verin3 ( 473094 ) writes:
      
      I find porn interesting.. but my guess is that since +1 funny doesn't increase karma, people are using others :0
  - I really like the... (Score:2)
    
    by Altima(BoB) ( 602987 ) writes:
    
    I like one of Webcrawler's featured searches today: Camel Spiders. [webcrawler.com]
    
    Those things may have urban legends surrounding them or whatever, but they are GODDAMN SCARY!!!
- Re:They used to be my google.... (Score:2)
  
  by Kenja ( 541830 ) writes:
  
  "I remember when webcrawler was the only search engine I touched..."
  Touched what? No, never mind. I don't want to know.
- Re:They used to be my google.... (Score:1)
  
  by docmittens ( 529542 ) writes:
  
  One sort of interesting note:
  
  sometime between May 6, 1999 [archive.org] and Oct. 9, 1999 [archive.org], WebCrawler stopped pitching the Netscape Now! and Microsoft Internet Explorer buttons at the bottom of the page.
  
  an interesting milestone, to say the least
Whoa! (Score:5, Interesting)

by outZider ( 165286 ) writes: on Tuesday April 20, 2004 @07:13PM (#8923347)

Holy crap!

I remember WebCrawler, but lost touch with it in around 1996, when I started religiously using AltaVista. They sure have changed a bit. ... but do they have any relevance anymore? They're owned by InfoSpace. :P

- Re:Whoa! (Score:1)
  
  by Deraj DeZine ( 726641 ) writes:
  
  They're owned by InfoSpace
  
  I don't think InfoSpace has pwned anyone.
- Re:Whoa! (Score:1)
  
  by Hettch ( 692387 ) writes:
  
  Yeah, i am totally in your shoes. Somebody above posted some way-back links from 96, and it brought back some fond memories of my first searchings. I went to the page now, and couldn't believe it was actually webcrawler! I moved from webcrawler to hotbot to google. I actually thought webcrawler had just sort of dropped off the face of the net, but i guess that never really happens
- Re:Whoa! (Score:1)
  
  by HD Webdev ( 247266 ) writes:
  
  I remember WebCrawler, but lost touch with it in around 1996, when I started religiously using AltaVista. They sure have changed a bit. ... but do they have any relevance anymore? They're owned by InfoSpace. :P
  
  A lot of usenet people that I knew dropped WebCrawler shortly after the AOL deal went through.
  
  AltaVista then ended up being the search engine that most people recommended.
Wow (Score:5, Interesting)

by z0ink ( 572154 ) writes: on Tuesday April 20, 2004 @07:13PM (#8923355)

Does anybody else remember getting a WebCrawler promotional CD 10 years ago? I didn't even have a CD-ROM then!

- Re:Wow (Score:1)
  
  by dolo666 ( 195584 ) writes:
  
  I think we played Frisbee with ours... seriously. But I do remember using WebCrawler quite frequently at the Internet Cafe in Kingston. Oh those were the days.... :-)
  
  I can remember really digging the simple search interface.
  - Re:Wow (Score:2)
    
    by BTWR ( 540147 ) writes:
    
    everyone always suggests this as a possible use of AOL cds, but I've NEVER been able to accurately throw one more than 3 feet before it turns from horizontal to vertical and crashes (and hurts!). Any tips?
    - Rage (Score:1)
      
      by dolo666 ( 195584 ) * writes:
      
      > Any tips?
      
      Put your rage in it and then throw. Oh, and flick your wrist. Oh, and take the packaging off first! :-)
I remember using Webcrawler before google... (Score:5, Insightful)

by John Seminal ( 698722 ) writes: on Tuesday April 20, 2004 @07:17PM (#8923384) Journal

It was a good search engine. I dunno why I stopped using it, I think it was a bit on the slow side and Google had more pages.
Heck, while reminiscing, I remember when excite was my start page, and when I used them for email. I remember they were the first "start" page to have groups. I stopped using them 4 years ago when their email stopped working.
I guess if anything, we can learn the web is not going to be the same in 5 years as it is today. My question is, "is it better"? Personally, I think it was better back in the day. I would like to see a search engine that does not display any spam or sales or sex sites as hits. I now do most of my searches on google doing "search parameters site:edu".

- Re:I remember using Webcrawler before google... (Score:5, Interesting)
  
  by Anonymous Coward writes: on Tuesday April 20, 2004 @07:33PM (#8923506)
  
  I think I remember why I left WebCrawler.
  
  WebCrawler was simple and effective. But then AltaVista emerged. I started using AltaVista.digital.com, and from there WebCrawler went downhill - lots of advertising and junk that kind of made me hate it. What was once seamless and simple became noisy.
  
  I used AltaVista for a number of years, but once again advertising got the best of it. It turned super-sophisticated, with a lot of advertising fluff and "features". AltaVista was becoming overly commercialized. They had a "simple" version that was better (I forget the name [begins with an "R"?]), but soon the result sets were skewed towards advertisers and abusers.
  
  In 2001, I made the switch to Google. It was everything that WebCrawler once was in terms of ease of use and quality of results. I've been more or less happy with Google ever since.
  
  - Re:I remember using Webcrawler before google... (Score:5, Informative)
    
    by qodfathr ( 255387 ) writes: on Tuesday April 20, 2004 @08:03PM (#8923710)
    
    You are remembering raging.com [raging.com], still up-and-running today.
    
  - Re:I remember using Webcrawler before google... (Score:2)
    
    by zsau ( 266209 ) writes:
    
    ... but now Google's results are getting skewed in favor of abusers/googlebombers and such, though they've managed to steer clear of integrating advertising into the page. You're thinking about using AllTheWeb or Teoma, but they too will succumb to the ever-present abusers. Eventually, you'll return to WebCrawler and wonder why you left... :)
/me 's jaw hits the floor (Score:3, Funny)

by Stalin ( 13415 ) writes: on Tuesday April 20, 2004 @07:18PM (#8923388)

I can't believe it is even still around.

Then and now... (Score:5, Funny)

by jdreed1024 ( 443938 ) writes: on Tuesday April 20, 2004 @07:19PM (#8923403)

When I wrote WebCrawler, one could do a credible job of crawling, indexing, and searching the Web from a single desktop PC. Today, the reality is a little bit different.
No kidding. Back then, one could serve a website from most any machine, and it would be there for all to see. Today only the largest websites can avoid a slashdotting with only 9 posts in the thread.

- Re:Then and now... (Score:5, Funny)
  
  by mph ( 7675 ) writes: <mph@freebsd.org> on Tuesday April 20, 2004 @07:30PM (#8923488)
  
  Today only the largest websites can avoid a slashdotting with only 9 posts in the thread.
  
  Imagine how bad it would be if everyone actually read the articles.
  
  - Re:Then and now... (Score:2)
    
    by zsau ( 266209 ) writes:
    
    I actually specifically don't, often as not. If it's a simple page, I'll often (not always) look for mirrors/copy-and-paste here first, and if I can't find them, then sometimes I won't open the source anyway...
- Re:Then and now... (Score:1, Insightful)
  
  by /dev/trash ( 182850 ) writes:
  
  In all fairness, if the majority of the websites that get Slashdotted today did not use a dynamic MySQL-backed solution to serve pages, they'd be okay.
- Re:Then and now... (Score:1)
  
  by mnewton32 ( 613590 ) writes:
  
  Not only did we /. his web server, but his DNS server too! Probably the same box....
- Re:Then and now... (Score:3, Informative)
  
  by berenddeboer ( 305245 ) writes:
  
  Today only the largest websites can avoid a slashdotting with only 9 posts in the thread.
  
  Not true, see Surviving Slashdotting with a Small Server [slashdot.org]. Lots of people tried to bring it down (see comments), but it survived with no trouble at all.
Holy search engines batman! (Score:3, Interesting)

by ylikone ( 589264 ) writes: on Tuesday April 20, 2004 @07:20PM (#8923405) Homepage

Who uses webcrawler anymore? I didn't even know they still exist. Anybody remember opentext.com search?

Birthday party (Score:5, Funny)

by jacobhoupt ( 728382 ) writes: on Tuesday April 20, 2004 @07:20PM (#8923409)

I'll be hosting my tenth annual WebCrawler birthday party tonight in the back of my Yugo.

Feel free to drop in, there should be plenty of seating available for those interested.

my new hero (Score:5, Funny)

by theMerovingian ( 722983 ) writes: on Tuesday April 20, 2004 @07:20PM (#8923411) Journal

Some guys are too cool for their own good. Brian Pinkerton has the domain 'thinkpink.com', AND he wrote his own search engine.

I bet he even has a 3-digit UID, a beowulf cluster of Xboxes running linux, and he sold all his stock options during the bubble. :)

- Re:my new hero (Score:2)
  
  by antic ( 29198 ) writes:
  
  Think Pink is a bizarre brand of clothing with a semi-cult following in Slovenia. Pinkerton can expect a lawsuit, and some abusive phone calls in a language he (probably) cannot understand!
  - - Re:OT, your sig (Score:3, Funny)
      
      by antic ( 29198 ) writes:
      
      If I updated the sig, would as many people click it?
      - Re:OT, your sig (Score:1)
        
        by quinto2000 ( 211211 ) writes:
        
        yay for SLOPS.
Already Slashdotted (Score:3, Funny)

by MrRuslan ( 767128 ) writes: on Tuesday April 20, 2004 @07:22PM (#8923429)

Here is the google cache
http://216.239.39.104/search?q=cache:-vPR77Hq9OYJ:www.thinkpink.com/bp/WebCrawler/History.html+&hl=en&ie=UTF-8

- Well isn't that ironic (Score:5, Funny)
  
  by Zygote-IC- ( 512412 ) writes: on Tuesday April 20, 2004 @07:36PM (#8923527) Homepage
  
  So, to read a story celebrating an anniversary about a search engine, we have to go through the cache of another search engine?
  
  Go figure.
  
  - Re:Well isn't that ironic (Score:3, Insightful)
    
    by Adam9 ( 93947 ) writes:
    
    No worries, just go here [209.24.201.206]
- Re:Already Slashdotted (Score:1)
  
  by elsilver ( 85140 ) writes:
  
  Doesn't that just deserve a +1, Ironic?
  
  E.
old search engines (Score:2)

by Coneasfast ( 690509 ) writes:

ah yes, i remember the old days, search engines like webcrawler, altavista, magellan, and infoseek. in those days they wouldn't help you find what you want most of the time. now with google we need not worry :)
When did they give up.... (Score:5, Interesting)

by David Hume ( 200499 ) writes: on Tuesday April 20, 2004 @07:23PM (#8923435) Homepage

...on their own web search technology and become a metasearch engine? From the WebCrawler About Page [webcrawler.com]:

WebCrawler uses innovative metasearch technology to search the Internet's top search engines, including Google, Yahoo, Ask Jeeves, About, Teoma, FindWhat, LookSmart, and many more.

With one single click, WebCrawler searches the best results from the combined pool of the world's leading search engines -- instead of results from only one single search engine.

And WebCrawler makes it easy to refine your search so you can find the most meaningful results right away. No wonder it's a leader in the search industry.

Was it 2001? The History [thinkpink.com] states:

2001 InfoSpace acquires WebCrawler. Excite, now Excite@Home, went belly up. In the bankruptcy, Infospace acquired WebCrawler. Today Infospace runs WebCrawler as a meta-search engine. And they've given Spidey a new name and turned him purple!

Oh, and if it is not being otherwise used, has the code for the WebCrawler spider been open-sourced? :)
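
The metasearch idea quoted from the About page (send one query to several engines, then pool the results) can be sketched roughly as below. Everything here is an assumption for illustration: the engine names and result lists are hypothetical stand-ins rather than real API calls, and reciprocal-rank scoring is just one common way to merge ranked lists, not something WebCrawler or InfoSpace is known to use.

```python
from collections import defaultdict


def merge_results(results_by_engine, k=60):
    """Merge ranked URL lists from several engines into one ranking.

    results_by_engine maps a (hypothetical) engine name to its ranked
    list of URLs. Each URL earns 1 / (k + rank) per engine that returned
    it, so URLs found by several engines float to the top.
    """
    scores = defaultdict(float)
    for urls in results_by_engine.values():
        for rank, url in enumerate(urls, start=1):
            scores[url] += 1.0 / (k + rank)
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores, key=lambda url: (-scores[url], url))


# Hypothetical result lists for a single query.
example = {
    "engine_a": ["a.example", "b.example"],
    "engine_b": ["b.example", "c.example"],
}
print(merge_results(example))
```

A URL returned by both engines outranks one returned by only a single engine, which is the "combined pool" effect the About page describes.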

- Re:When did they give up.... (Score:3, Informative)
  
  by The Bungi ( 221687 ) writes:
  
  There's MetaCrawler [metacrawler.com]. If my memory serves me correctly, it appeared before WebCrawler went to this format.
  I honestly don't remember the first time I saw MetaCrawler (but it used to be much simpler back then!) so I don't know if it predates Google. WebCrawler's idea however is not new, AFAIK.
  - Re:When did they give up.... (Score:1)
    
    by Derek Pomery ( 2028 ) writes:
    
    Metacrawler easily predates Google.
    I was using it in, like, '95 or '96. (Webcrawler *was* my first, though)
    
    I seem to remember it didn't even have a domain name back then, it was a page on some university site.
    
    I know Google has some history, but I only started using Google Beta sometime in '99 or so.
    I'm sure the initial engine wasn't around in '95.
- The more things change... (Score:4, Interesting)
  
  by Old Man Kensey ( 5209 ) writes: on Tuesday April 20, 2004 @08:51PM (#8924137) Homepage
  
  Originally there was WebCrawler (among others). In late 1996, AOL acquired WebCrawler and turned it into AOL Netfind. Later, apparently, Excite bought it from AOL, made it a separate service, and Excite became the engine that powered AOL Netfind. After that apparently InfoSpace bought it in the Excite sell-off.
  But after AOL bought it I lost track of it, because it started sucking (returning lots more stale links than before), and altavista.digital.com burst upon the scene (anyone else remember "kayak sailing San Juan islands"?).
  My guess would be that the meta-search switch initially happened when Excite bought them.
  
Boy, does this take me back... (Score:5, Interesting)

by Faust7 ( 314817 ) writes: on Tuesday April 20, 2004 @07:23PM (#8923440) Homepage

...to the days when the search engine market resembled the microcomputer market of the '80s. Several competitors, all with (roughly) the same market share, each with a certain number of hits that the others didn't have. I had to use at least a few of them to assure myself that I was getting something reasonably close to what the whole Web could offer on my search topic (even though no search engine comes close to penetrating all of the pages out there).

If I was looking for something, I'd query Lycos, AltaVista, Infoseek, Excite, Webcrawler, and Magellan. And, later on, Google. Vastly different results, site designs, site objectives. I won't say it was the most streamlined, elegant experience, but it was kind of fun.

- Oh, Yahoo too. (Score:3, Interesting)
  
  by Faust7 ( 314817 ) writes:
  
  Oh yeah, and Yahoo as well. Forgot to include them.
  
  Interestingly, their look has changed very, very little from their olden days.
  - Re:Oh, Yahoo too. (Score:3, Interesting)
    
    by sflory ( 2747 ) writes:
    
    Yahoo in the past has never done their own search engine. They've used a number of backends, including Google. This was true up until they acquired Inktomi. Late last year they launched Yahoo search using Inktomi's search engine.
  - - Re:Speaking of Yahoo (Score:2)
      
      by Syre ( 234917 ) writes:
      
      akebono.stanford.edu
  - - - Re:Imagine if Copernic had become the standard (Score:2)
        
        by bhtooefr ( 649901 ) writes:
        
        Copernic [copernic.com] still exists... They offer a meta-searching APP called Copernic Agent, available in free and pay-for versions.
- Re:Boy, does this take me back... (Score:2)
  
  by System.out.println() ( 755533 ) writes:
  
  If I was looking for something, I'd query Lycos, AltaVista, Infoseek, Excite, Webcrawler, and Magellan.
  
  That's why metasearch engines popped up. Can't remember any of their names though... Metacrawler maybe? I can't bring myself to Google for them. ;-)
- Re:Boy, does this take me back... (Score:2)
  
  by Accipiter ( 8228 ) writes:
  
  I typically swore by HotBot, since I had the most luck with them. I never had a use for the directory sites (Yahoo) though.
- Re:Boy, does this take me back... (Score:2)
  
  by zsau ( 266209 ) writes:
  
  Dogpile! I used to do the multi-search thing, then I happened upon Dogpile, then I noticed I always went straight to Google's results and got it from the source...
Wow. Just. Wow. (Score:5, Interesting)

by TWX ( 665546 ) writes: on Tuesday April 20, 2004 @07:26PM (#8923460)

I remember using Webcrawler back when I got my first 14.4 Slirp connection back in 1994. It was the only way to search!

and then came the marvels of altavista.digital.com.

I'm so glad that google came along...

WebCrawler was the best, back in the day... (Score:1, Interesting)

by Anonymous Coward writes:

WebCrawler was the best search engine, back in the day when search engines were all new and Yahoo wasn't a search engine but a human-moderated list of sites.

--
Callas
Wow - the 1996 wayback WebCrawler page STILL WORKS (Score:5, Interesting)

by Anonymous Coward writes: on Tuesday April 20, 2004 @07:29PM (#8923482)

http://web.archive.org/web/19961023234707/http://w ww.webcrawler.com/

Presumably connects to the current crawler which still accepts the old format :)

--
Callas

- Re:Wow - the 1996 wayback WebCrawler page STILL WO (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  1996 WebCrawler [archive.org]
  
  I have NO idea how that space got in there...
  
  --
  Callas
  - Re:Wow - the 1996 wayback WebCrawler page STILL WO (Score:2, Informative)
    
    by Bullet-Dodger ( 630107 ) writes:
    
    I have NO idea how that space got in there...
    Not your fault. Slashcode does that itself whenever there's a long enough unbroken string of characters, to stop page-widening posts.
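
The space-insertion behaviour described above can be sketched in a few lines of Python. This is only an illustration, not Slashcode's actual implementation (Slashcode is written in Perl); the function name `break_long_runs` is made up, and the 50-character threshold is an assumption inferred from where the space lands in the archive.org URL posted earlier in this sub-thread.

```python
import re


def break_long_runs(text, max_run=50):
    """Insert a space after every max_run consecutive non-space characters.

    Hypothetical helper: max_run=50 is a guess based on the broken URL
    in this thread, not a documented Slashcode setting.
    """
    # Match max_run non-space characters that are followed by yet another
    # non-space character, and put a space after them.
    pattern = re.compile(r"(\S{%d})(?=\S)" % max_run)
    return pattern.sub(r"\1 ", text)


url = "http://web.archive.org/web/19961023234707/http://www.webcrawler.com/"
print(break_long_runs(url))
```

With the assumed threshold of 50, the space falls in the middle of "www", as in the posted link; short words and ordinary sentences pass through unchanged.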
- Re:Wow - the 1996 wayback WebCrawler page STILL WO (Score:1)
  
  by t1m0r4n ( 310230 ) writes:
  
  Presumably connects to the current crawler which still accepts the old format :)
  
  Ya, but I followed the link to get my free copy of Internet Explorer that is advertised on the page and got
  
  P3P: CP='ALL IND DSP COR ADM CONo CUR CUSo IVAo IVDo PSA PSD TAI TELo OUR SAMo CNT COM INT NAV ONL PHY PRE PUR UNI'
  
  Whatever that means.
First query? (Score:5, Funny)

by Xzzy ( 111297 ) writes: <sether@tru7h.DALIorg minus painter> on Tuesday April 20, 2004 @07:34PM (#8923516) Homepage

So who remembers the first search query they typed into Webcrawler?

I was just crawling out of the gopher world, a short period where I was getting turned on to the web but there was no way to find links, almost everything came through the university homepage or word of mouth. Then someone pointed me to webcrawler.

What did I search for first? "fart jokes". No kidding.

"boobs" was second.

Google's next step... (Score:2, Funny)

by pVoid ( 607584 ) writes:

Skynet!
Engineer: ARGGHHH... IT'S ALIIIIIVE... <BANG> <CRACK>
<STATIC>
WebCrawler on NeXTStep - before Open Source (Score:5, Interesting)

by ben_kelley ( 234423 ) writes: on Tuesday April 20, 2004 @07:39PM (#8923543)

It is scary to think that at one point I e-mailed the WebCrawler people to ask them how it worked. In response they sent me a copy of the source (Objective C for NeXT) so I could compile it up on my NeXT PC (I had a "black" NeXT - 68000 based) to index my intranet web server.

I doubt that someone like Google would send you a copy of their source these days - even if you asked nicely.

I could never get it to compile, and I deleted it long ago, but I kind of wish I had kept it now. An interesting piece of internet history.

- Re:WebCrawler on NeXTStep - before Open Source (Score:4, Funny)
  
  by houseofmore ( 313324 ) writes: on Tuesday April 20, 2004 @07:48PM (#8923598) Homepage
  
  Ah it's just a perl module I think. Google::Search or something or other...
  
- Re:WebCrawler on NeXTStep - before Open Source (Score:1, Funny)
  
  by Anonymous Coward writes:
  
  Y'know, I was about to call that a bet. I had this mail typed out to help@google.com and everything, but I realized my company does search, too. They might have taken it the wrong way.
- Re:WebCrawler on NeXTStep - before Open Source (Score:2, Insightful)
  
  by ArbitraryConstant ( 763964 ) writes:
  
  I doubt that someone like Google would send you a copy of their source these days - even if you asked nicely.
  
  The next best thing.
  
  search appliance [google.com]
- Re:WebCrawler on NeXTStep - before Open Source (Score:5, Interesting)
  
  by sacremon ( 244448 ) writes: on Tuesday April 20, 2004 @10:40PM (#8924949)
  
  I sometimes sit back and think about some of the various projects that first saw life on the NeXT platform:
  
  the first web server
  Webcrawler
  Doom and DoomII
  
  Pretty good for a machine that only sold ~70,000 units total, not including the versions of NEXTSTEP for ix86/SPARC/PA-RISC.
  
  I still have a Color NeXTStation stashed away in a closet. I was using it as a print server till about two years ago.
  
DNS failure (power outage) (Score:1, Insightful)

by Anonymous Coward writes:

Try http://209.24.201.206/bp/WebCrawler/History.html for the history.
Worth Remember (Score:2, Funny)

by Anonymous Coward writes:

I've reminisced before on slashdot about the beautiful geeky girl that introduced me to hotbot. [hotbot.com] Glasses, long blond hair, full breasts... cute sandals, short skirts... those silk panties.

Fuck WebCrawler. hotbot. [hotbot.com]
The WebCrawler Search Voyeur (Score:5, Funny)

by Faust7 ( 314817 ) writes: on Tuesday April 20, 2004 @07:46PM (#8923584) Homepage

Anyone remember the WebCrawler Search Voyeur?

It was a little Java applet that sat on your screen and displayed the pseudo-real-time search queries of other people.

When I was a computer lab monitor at my college, we used to note in the log book any particularly amusing queries that we'd seen.

"hairy woman"... "squirrel torture"... "tom AND cruise AND foot AND odor"... "asian girl underage spanking"...

- Almost forgot this (Score:2)
  
  by Faust7 ( 314817 ) writes:
  
  I'll never forget reading this entry in the monitor log book:
  
  "Search Voyeur query of the day:
  'Why does poop stink?'"
- Re:The WebCrawler Search Voyeur (Score:4, Funny)
  
  by intangible ( 252848 ) writes: on Tuesday April 20, 2004 @08:46PM (#8924100) Homepage
  
  I remember that it wouldn't show every search of course, but you could verify it was working by searching for the same phrase over and over again. About 10 seconds later, you could see your search phrase. You could actually use it to communicate with other people, albeit a little slow, but it was amusing. I would type in silly things just so others watching the voyeur would see them.
  I bet you guys recorded some of my stuff :P
  
- Re:The WebCrawler Search Voyeur (Score:3, Informative)
  
  by Anonymous Coward writes:
  
  It's still there in a slightly different incarnation.... http://www.metaspy.com [metaspy.com]
- Re:The WebCrawler Search Voyeur (Score:1, Interesting)
  
  by Anonymous Coward writes:
  
  yeah, I remember that. I think I ended up staring at the screen for the afternoon with my jaw on the floor in between giggle fits...
  
  It was like a really horrible glimpse inside the mind of my fellow man... but, funny...
Webcrawler is a crawler no more.... (Score:2, Redundant)

by Camel Pilot ( 78781 ) writes:

I just did a search on webcrawler for "digital camera" and the results were 70% pay-per-click advertising with a small and hardly noticeable "Sponsored by:" disclaimer. Worse yet, the paid links are intermingled with the indexed hits.

Looks like Webcrawler is now more of a pay-per-click dispenser than a search engine... No thanks!

I think google has done a good job of clearly identifying what is relevant and what is paid for.
public search engine (Score:5, Interesting)

by jacquesm ( 154384 ) writes: <j AT ww DOT com> on Tuesday April 20, 2004 @07:57PM (#8923645) Homepage

I'd happily contribute cash to a publicly funded and publicly run search engine.

Anyone game ?

- Re:public search engine (Score:1, Informative)
  
  by Anonymous Coward writes:
  
  yeah, it's called dmoz [dmoz.org]
- Re:public search engine (Score:3, Funny)
  
  by Chess_the_cat ( 653159 ) writes:
  
  Wait until Google goes public and buy stock.
I remember the exact day (Score:2, Funny)

by Anonymous Coward writes:

The exact day that I stopped using webcrawler. It happened to coincide with the day that AOL was announced as the new owner.
I didn't even know they were around anymore. (Score:3, Interesting)

by Captain Rotundo ( 165816 ) writes: on Tuesday April 20, 2004 @08:01PM (#8923686) Homepage

And the odd part is I don't even remember the interface being as cluttered as the very early one linked through the archive in an earlier post. I suppose I moved on very early, although I remember when as far as I was concerned they were the only game in town.

Takes me back (Score:3, Interesting)

by SuperBigGulp ( 177180 ) writes: on Tuesday April 20, 2004 @08:11PM (#8923812)

I remember using WebCrawler on my very first SLIP dial up account and thinking "How cool is this?" I had used AOL for a couple years prior but was hoping to trade in their UI (and limitations) for Netscape. The funny thing is that I wasn't sure if I could find enough content on the web.

Also a great testament to the original design and concept that search engines still look and work a lot like WebCrawler, 10 years on.

Happy birthday, and thanks for the walk down 32K memory lane

Wasn't WebCrawler "Powered By NEXTSTEP" ? (Score:2)

by green pizza ( 159161 ) writes:

I seem to remember that the WebCrawler site used to have a "Powered By NEXTSTEP" badge on it. I can't verify this with web.archive.org as it doesn't seem to go back that far (I started using WebCrawler in 1995). I can't RTFA at the moment, does anyone know what sort of hardware powered the WebCrawler site originally? Did it run on black NeXT hardware? White box PCs running NeXTSTEP? Did they ever utilize the WebObjects framework that NeXT (and later, Apple) used to sell?
- Re:Wasn't WebCrawler "Powered By NEXTSTEP" ? (Score:2)
  
  by globalar ( 669767 ) writes:
  
  Sounds like white boxes instead...
  
  "In the current implementation, the WebCrawler builds an index at the rate of about 1000 documents an hour on a 486-based PC running NEXTSTEP."
  
  "The full-text index is currently based on NEXTSTEP's IndexingKit [NeXT]"
  
  - from Experiences with Webcrawler [uiuc.edu]
  
  I think Webcrawler used CERN's WWW library, but I can't say this made its way into WebObjects.
Hardly one of the first (Score:5, Informative)

by btempleton ( 149110 ) writes: on Tuesday April 20, 2004 @09:22PM (#8924343) Homepage

Internet searching way predates 1994. Archie by Peter Deutsch (the one from Montreal, not the American one) was one of the most popular applications on the internet in the early '90s. The http search engines like Webcrawler and Lycos came much, much later on internet time scales.

The one before WebCrawler? (Score:2, Insightful)

by SnappingTurtle ( 688331 ) writes:

I seem to remember that before WebCrawler there was actually a "big" search engine run by a non-profit. For the life of me I can't remember what it was, but I seem to remember one day going "Wow, this webcrawler thing is great, I'm never touching [whatever] again."
Of course a few years later I said "Wow, this AltaVista thing is great. I'm never touching WebCrawler again." And then I went "Wow, this Google thing is great. I'm never touching AltaVista again."
Wow... (Score:4, Interesting)

by }InFuZeD{ ( 52430 ) writes: on Tuesday April 20, 2004 @10:21PM (#8924825) Homepage

I think WebCrawler was my first search engine ever...

From there I graduated to MetaCrawler, which parsed WebCrawler and all the other popular web search engines of the time.

For some reason or another MetaCrawler started sucking and I used InfoSeek for quite some time... then they were acquired by Go.com and it went downhill from there.

I remember what I'd search the internet for back in those days tho. It was always "jedi knight" and "giga pets" (remember those cute tamagotchi rip-offs? =p)

- More WebCrawler History (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  I used to be one of the Excite@Home engineers who looked after Webcrawler. WebCrawler and the Excite front end all belonged to the same code base called My Excite Start Page (known internally as MESP at Excite).
  
  The WebCrawler at Excite was pretty much an unsupported product when I was there. All I ever did were maintenance releases, never any new stuff for WebCrawler. WebCrawler was actually the Excite front end, except it had the WebCrawler logos instead of Excite.
  
  The search engine was the Excite search
WebCrawler Sale Sensation (Score:4, Interesting)

by Anonymous Coward writes: on Tuesday April 20, 2004 @11:44PM (#8925315)

WebCrawler's Brian Pinkerton formerly worked at NeXT, and I remember being in the NeXT kitchen when news arrived in 1995 of his sale of WebCrawler to AOL. The sale price was around $1 million, and everyone was absolutely awed that a software company could sell for so much. This marked the beginning for me of the dot-com era: Just a few months later, other companies started or run by ex-NeXTers sold for millions, then tens of millions, and at least one for hundreds of millions. Soon after that, NeXT CEO Jobs took Pixar through an IPO, for a personal gain of about $1 billion!

Ah... back in the day (Score:3, Interesting)

by StefanSavage ( 454543 ) writes: on Wednesday April 21, 2004 @01:34AM (#8925940)

I remember back in 1994 WebCrawler was running on three machines in the corner of Sieg Hall 433. They were rigged up so one could reboot the others via a serial line, but occasionally that machine would crash too. That was when Brian would call in and say "Hey, Webcrawler is hung. Could you go reboot it?". I'm guessing this doesn't happen much at Google...

nope (Score:5, Informative)

by millette ( 56354 ) writes: <robin@@@millette...info> on Wednesday April 21, 2004 @03:01AM (#8926307) Homepage Journal

You just need 8 desktop machines [gigablast.com] and you can index a 10th of what google does. From a recent article:

Gigablast runs on eight desktop machines, each with four 160-GB IDE hard drives, two gigs of RAM, and one 2.6-GHz Intel processor. It can hold up to 320 million Web pages (on 5 TB), handle about 40 queries per second and spider about eight million pages per day. Currently it serves half a million queries per day to various clients, including some meta search engines and some pay-per-click engines.
I also read it was going to expand its index this year, but I wasn't able to find where I read that.
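
The quoted figures can be sanity-checked with some quick arithmetic. The sketch below is only a back-of-the-envelope calculation from the numbers in the quote, not Gigablast's own accounting; it assumes decimal units (1 GB = 1,000,000 KB) and that load is spread evenly over a day, and the function and key names are made up for illustration.

```python
# Back-of-the-envelope check of the Gigablast figures quoted above.
# All inputs come from the quote; decimal units are an assumption.
MACHINES = 8
DRIVES_PER_MACHINE = 4
DRIVE_GB = 160
MAX_PAGES = 320_000_000
PAGES_SPIDERED_PER_DAY = 8_000_000
QUERIES_PER_DAY = 500_000
SECONDS_PER_DAY = 24 * 60 * 60


def gigablast_estimates():
    """Derive rough per-page and per-second figures from the quoted specs."""
    total_gb = MACHINES * DRIVES_PER_MACHINE * DRIVE_GB
    return {
        # Raw disk across the cluster, in GB (the quote rounds this to 5 TB).
        "total_gb": total_gb,
        # Disk budget per indexed page, in KB.
        "kb_per_page": total_gb * 1_000_000 / MAX_PAGES,
        # Average crawl rate if spidering is spread evenly over the day.
        "pages_per_second": PAGES_SPIDERED_PER_DAY / SECONDS_PER_DAY,
        # Average query load, to compare with the quoted 40 queries/second.
        "avg_queries_per_second": QUERIES_PER_DAY / SECONDS_PER_DAY,
        # Days of crawling needed to reach the maximum index size.
        "days_to_fill_index": MAX_PAGES / PAGES_SPIDERED_PER_DAY,
    }


if __name__ == "__main__":
    for name, value in gigablast_estimates().items():
        print(f"{name}: {value:.2f}")
```

Under those assumptions the cluster has about 16 KB of disk per indexed page, and the average query load works out to well under the quoted 40 queries per second.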

- Re:world wide worm? (Score:2, Informative)
  
  by Captain Kangaroo ( 445932 ) writes:
  
  The WWWW (World-Wide Web Worm) pre-dated WebCrawler (and Jumpstation pre-dated it.) Jumpstation indexed only titles, while the Worm indexed both titles and anchor text (IIRC).
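
To make the title versus anchor-text distinction concrete, here is a minimal sketch of pulling both out of a page with Python's standard `html.parser`. It is not the actual Jumpstation or Worm code; the class and function names are hypothetical, and a real crawler would also need to handle nested or malformed markup.

```python
from html.parser import HTMLParser


class TitleAnchorParser(HTMLParser):
    """Collects the page title and the (href, text) pair of each link."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.anchors = []      # list of (href, anchor text) tuples
        self._in_title = False
        self._href = None      # href of the <a> currently open, if any
        self._text = []        # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            # Links without an href are ignored (href stays None).
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag == "a" and self._href is not None:
            self.anchors.append((self._href, "".join(self._text).strip()))
            self._href = None

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._href is not None:
            self._text.append(data)


def extract(html):
    """Return (title, anchors) for one HTML document."""
    parser = TitleAnchorParser()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.anchors
```

A title-only index would keep just the first element of the result, while an index that also covers anchor text would file each link's text under the URL it points to.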
