Google to Offer API 218
philipx writes "From the ruby-talk archives here's a little interesting snippet from a post you have to check out:
"Here at Google, we're about to start offering an API to our
search-engine, so that people can programmatically use Google through
a clean and clearly defined interface, rather than have to resort to
parsing HTML." It goes on talking about SOAP and I think this is utterly cool."
Cool, but.... (Score:4, Insightful)
Re:Cool, but.... (Score:2, Informative)
Re:Cool, but.... (Score:2, Interesting)
Re:Cool, but.... (Score:1)
Re:Cool, but.... (Score:1, Troll)
-Restil
Re:Cool, but.... (Score:2)
That's the beautiful thing about a site that doesn't fund themselves through the use of banner ads. It doesn't MATTER if you access their content through the main website, an affiliate like yahoo, or an api interface.
They sells text ads on their main site. Yahoo pays them a fee. So how will the API users pay for themselves?
Re:Cool, but.... They never said if was free! (Score:3, Insightful)
I just wonder how it will tie into my app. Will it open my browser? Will the Google Bar plugin be the foundation?
We'll just have to wait and see...
Re:Cool, but.... They never said if was free! (Score:4, Informative)
I just wonder how it will tie into my app. Will it open my browser? Will the Google Bar plugin be the foundation?
The post describes a SOAP web service which in most cases is an RPC call in your application of choice. However unlike RPC in days of yore using SOAP to do RPC in applications is relatively easy. If you want to learn more about SOAP I suggest reading A GEntle Introduction To SOAP [weblogs.com] by Sam Ruby for an overview of the protocol and A Busy Developer's Guide to WSDL 1.1 [weblogs.com] to see how one could go from defining a WSDL file (as the Google sys admin is trying to do) to actually accessing the web service remotely from a Java application.
There is also a grab bag of resources on XML webservices [gotdotnet.com] at the
To answer your question, if the Google API is available as a web service then it can be intergrated into any application at all from command line to dynamic web page to GUI application as long as there is network availability on the host machine.
Re:Cool, but.... They never said if was free! (Score:2)
Dissing the rule of Satan, only to advocate the
rule of a loving God.
reductio ad absurdum
Re:Cool, but.... They never said if was free! (Score:2)
I think of Scott as more like Jim Barksdale
on acid than God.
Re:Cool, but.... (Score:1)
Re:Cool, but.... (Score:3, Informative)
- Amit
Re:Cool, but.... (Score:1)
Re:Cool, but.... (Score:1)
Barter worked for a long time... (Score:3, Insightful)
Google benefits from the monetery system in an obvious way. They also benefit from the barter system by vastly adding to the crunch power which hopefully improves their indexing/grading system. Unused clock cycles which would otherwise be wasted can now earn some value for the users and at the same time give google the 'value' for providing their service.
So their 'open' system if presented in the form of barter could actually work for the advantage of both parties involved.
It could be an improvement for them! (Score:2)
And as for "not having to visit their site," remember that they're not doing huge amounts of banner ads. It's not totally evident that this "destroys" any of their business.
They still get to collect statistics on what queries come from where on what, which doesn't change terribly much whether they're receiving queries as HTML FORMs or XML SCHEMAs, and there's only a little reason for them to care about folks receiving back HTML versus XML
Re:Cool, but.... (Score:2)
Yeah, it makes it hard/impossible to make money on it with current models (no ads etc), but it could evolve into a system where sites can exchange data for mutual benefit, giving them all data they need while cutting back on transmission overheads.
Think of it as being ahead of the curve. This is where MS is pushing with SOAP, and others are going with XML-RPC. It can lead us to the point where the "noosphere" of web info can be universally accessed without having to run a clunky old web browser to do it (cell phone, palm pilot, and other alternative access users of the world, rejoice! :).
Google is clever. They're trying to set themselves up as all purpose, all access information scroungers, and they're doing a hell of a good job at it while making a nice profit besides. I don't doubt that they'll be able to parlay this into a revenue stream if it takes off -- just give it a chance to flourish...
Cool feature (Score:4, Insightful)
Google topic icon (Score:3, Funny)
Re:Cool feature GREAT IDEA! (Score:2, Insightful)
all of us do nothing but rave about google day and night
for it is a search engine we love, with a company many of us have come to love
I for one would love to see google have its own slashdot icon
Come to think of it, there are plenty of USELESS icons none of us give a damn about
the following are a few:
Heres hoping for a new google icon!!
Just my two cents, all taxes included
Sunny Dubey
DoS Google? (Score:3, Interesting)
With the exposed API I could see, by malice or sheer accident, floods of queries coming in...
Re:DoS Google? (Score:1)
I wonder if they have a [business] plan for this feature or if it's just the brain child of their gurus in the back room. Actually, I'm quite intriqued by the possibilities, so I'm going to keep on eye on it...
-
Re:DoS Google? (Score:4, Informative)
In fact, an attack through the front door will be more likely to succeed because you're hitting the rendering engine, which takes a lot more CPU time (believe it or not) than the search engine.
OTOH the back door is lightweight and is as such advantageous for not only third parties but also Google itself to employ.
Besides, if you're being abused, if you don't want to use technological avenues to keep miscreants away, you can always use legal ones.
This is great. (Score:2, Interesting)
Re:This is great. (Score:2)
Ah, but they do [slashdot.org]
Re:This is great. (Score:2, Interesting)
Re:This is great. (Score:1)
squid [squid-cache.org] plus ad-zap [zip.com.au] (my choice) ;
junkbuster [junkbuster.com] ;
proxomitron [thewebfairy.com].
:)
There may be others. Some tweaking required (not for subscribing, obviously
Re:This is great. (Score:2)
Re:This is great. (Score:2)
Re:This is great. (Score:2)
No, theft is theft. This would be using the offered services. Much like buying food from the University of Maryland food services on your point card, and then giving it to charity for the homeless isn't theft either. (of corse the UofMD food services folks claimed it was though)
Reducing someone's revenue is not theft, even if it involves using someone else's services (so long as you use them within the bounds they have been offered). A Mobil gas station next to a Shell gas station is likely to reduce the Shell's revenue. Not theft. If the Shell is dumb enough to to sell the gas at half price to the Mobil, it still isn't theft. If the Shell sells it half price but with a no resale rider it is a contract violation though.
Re:This is great. (Score:2)
Re:This is great. (Score:2)
No, that is still not theft, it is a contract violation. Not everything bad is theft. Spray painting my house isn't theft, even if it lowers the property value of my house, or is done just before an open house and in fact costs me potential buyers.
Kicking my dog wouldn't be assault either, it would be animal cruelty (and if I catch you kicking my dog, you may get to file an assault charge against me).
Re:This is great. (Score:2)
Re:This is great. (Score:2)
Whoever runs alterslash is a dirty thief though...
And I think it's a perfect example of fair use... The purpose and character is non-commercial news and education. The nature is a published document freely distributed over the internet. The amount and substantiality is minor; slashdot owns copyright in about 1% of the work. The effect of the use on the potential market is minor.
Re:This is great. (Score:3, Insightful)
Text ads... Open standards for content distribution... If only certain other sites would follow...
Apples and oranges... Google's bread and butter is their patented PageRank technology, which they license for what I'm sure is a lot of money. Slashdot, having made the decision to opensource slashcode do not have this option, therefore we're forced to endure banner ads and subscriptions as their only source of revenue. Ironic, eh? The people that screamed so loud about how long it took ./ to release the source for slash are now bitching about subscriptions and banner ads.. Like it or not, if slashcode was proprietary it could be sold and licensed and you wouldn't have to see ads here (or at least not the larger ones). Sourceforge figured this out too late, and are now trying to sell the SourceForge software as a source of income.
Hopefully ./ will wise up and figure out if they ever want to make any real money they'll have to offer a real service.. Like consulting to companies/webmasters to setup slashcode for customers (like MySQL AB does)... Too bad VA Linux went out of the hardware market. I think a pre-configured "Slash Appliance" (sort of like google's Search Appliance [google.com]) would be cool as hell for companies needing an internal collaboration system. ./ has really missed the boat here, IMHO.
Shayne
This is the beginning of the revolution (Score:5, Insightful)
They'll need a business model of some sort -- without the ads, and with the potential this has to hammer their servers, they'll need to meter access to the API in some way. But I'll pay -- where do I sign up?
I'll bet that this is how they'll end up making most of their money a couple of years from now.
Re:This is the beginning of the revolution (Score:3, Interesting)
I'm curious as to whether people would actually want such functionality from MSDN. It's one thing to be able to do a Google search from a function call and get the results back as XML but do people want API docs and technical articles retrieved via getArticle() and getAPI() webmethods?
One place where it might be useful however is KnowledgeBase articles [microsoft.com]. Perhaps a web service that retrieves a KB article given the Q number (e.g. Q123456) might be useful.
Disclaimer: This post is my opinion on doesnot reflect the thoughts, strategies, intentions or opinions of my employer.
Re:This is the beginning of the revolution (Score:2)
Re:i'd settle (Score:2)
Believe me, the folks at MSDN are well aware of how unsatisfied some people are by their search results. If and when any drastic change will occur is in doubt so for now I suggest using the Microsoft-related search on Google [google.com]
Re:This is the beginning of the revolution (Score:2)
Re:This is the beginning of the revolution (Score:2)
Another option is to give better access to paying customers: a paying customer is given unlimited use of the search, while private individuals (distinguished via IP/registration/...) would be limited to, say, one search per 5 seconds. It would be great to be able to use this API for some small things without having the hassle of paying. A 5-10 second delay isn't very bad in a small home situation, but is out of the question for any larger-scale applications.
I'd say it would also be consistent with their current user-friendly business model, and give another jolt of good PR for them.
Re:This is the beginning of the revolution (Score:2)
* WWW::Search::Google [cpan.org] by Jim Smyser [cpan.org]
* WWW::Cache::Google [cpan.org] by MIYAGAWA Tatsuhiko [cpan.org]
* Apache::No404Proxy::Google [cpan.org] by MIYAGAWA Tatsuhiko [cpan.org]
* WWW::Search::Scraper::Google [cpan.org] by Glenn Wood [cpan.org]
Yeah, but shouldn't those modules be re-written, or at least patched, every time Google decides to change the HTML format of the result page?
I think that accessing the results using a standard API would be much better.
Re:This is the beginning of the revolution (Score:2)
I don't remember any major HTML change since I started using Google few years ago. See the WWW::Search::Google change log [cpan.org], I thing that about one minor HTML change per year is not much.
I didn't say it's not better, of course it is. I just said it's not any revolution. People could write their own programs searching Google using WWW::Search::Google since 1999, and before that using just a simple regex.
To demonstrate that it's not a rocket sciense to parse Google results, I just wrote this quick and dirty, nasty hack in the command line:
(it's all in one line) And this is what I have in my new perl-links.txt file:
In 180-character command line I wrote a Google frontend, and the parser itself takes only 75 characters including "perl -ne", so it's a real one-liner. I'm sure I could optimize the whole damned thing to less than 100 characters. But why the hell I'm writing this?! I must be very tired... I guess, my point is, that I don't need no stinkin API! (Note to self: must decrease daily 1,3,7-trimethylxanthine dose below LD50 as soon as possible)
To moderators (Score:2)
Redundant?! *Please* answer and tell me how the hell was it redundant while 95 minutes later a comment [slashdot.org] duplicating a subset of mine [slashdot.org] was suddenly interesting. Oh, I get it, you mean redundant in a sense, that others said the same later? I see, it is redundant for some reversed definition of redundancy.
I don't care about the stupid karma, but I do care about people reading and replying to my comments. Now I have karma 50 again, thanks to my other post [slashdot.org], so you can safely mod my post [slashdot.org] up as +2 informative, +1 insightful and +1 interesting, not being affraid that I might get some karma. Thank you for your attention. I'm sure no one will read it now with Score: 1. Of course I'm currently not eligible to Meta Moderate so I can't even complain.
Cool! (Score:2, Insightful)
Could this be in response to the supposed competition from tokohma? open up thier results in some way to increase thier usage?
Re:Cool! (Score:1, Funny)
Had you enough sense to use Perl, you would have spent a lot less time on that thing. What crack smoking teacher would force someone to do string manipulation in Java...sadistic.
Re:Cool! (Score:2)
Re:Cool! (Score:2)
I think your high school programming teacher was one of those Java programmers with a sense of humor [theonion.com]. There are languages where you just have to write one line, and this is only when you want to reinvent the wheel [cpan.org].
m|^<p><a href="?([^">]+)"?>(.+?)</a>| and print "$2\n$1\n\n" while <>;
Those Java programmers really does have a sense of humor... Or maybe he was one of those CIPA guys [slashdot.org], because if you had used Perl, you would have spent 3/4 of your time watching pr0n.
Contradicts the terms of use (Score:4, Interesting)
So how useful might that API be if you can't do anything with it...
Re:Contradicts the terms of use (Score:5, Interesting)
I'd assume that the API would be subject to a different set of terms and conditions than those for the main site. Given that it'll probably be a pay-for-use service (as another poster hinted at), it'd most certainly be that way.
Re:Contradicts the terms of use (Score:1, Interesting)
Re:Contradicts the terms of use (Score:3, Informative)
Re:Contradicts the terms of use (Score:1)
"I wonder if they have a [business] plan for this feature..."
The more I think about it, the less I think this is going to be a "free" feature. It seems to encourage "automated queries" and that doesn't fit with any business model I've thought of (for Google) and, as you've pointed out, it's against their TOS.
I'm guessing that, if this feature comes to fruition and stays, its usage will come under some sort of paid model.
-
Re:Contradicts the terms of use (Score:2)
I suspect that, despite the outcry and outrage from some quarters, they're not simply going to give away their entire search engine API connected to their search farm. Perhaps they'll limit it, meter it, and even charge for it. All would be more then fair.
No contradiction (Score:5, Insightful)
On the other hand, Google would obviously not want you to set up your own search site that passes queries to their engine, harvests the results, and presents them on your own site. That is the obvious target of the "Personal Use" restriction.
As for the "Automated Query" restriction -- well, what do you think they mean by "Automated"? Programmatic access to their engine? They couldn't prevent that even if they wanted to. "Automated" obviously means programs that issue hundreds of queries for data mining purpose. Example: crawling the Groups archives to harvest email addresses.
(This was a matter of some concern to me, when I noticed that the Google Usenet archives included all my company's private groups. I'd innocently used by real corporate email, innocently thinking that the groups weren't accessible outside the company. But the spam volume is still very low. Their bot detection software must be quite good.)
Note that making a simple API available doesn't enable any new kind of access to the Google engine. A clever programmer can already parse the HTML results. The API just makes it easier -- and gives Google another product they can sell licenses for.
Re:No contradiction (Score:2)
Re:No contradiction (Score:2)
ote that making a simple API available doesn't enable any new kind of access to the Google engine. A clever programmer can already parse the HTML results. The API just makes it easier -- and gives Google another product they can sell licenses for.
I've written a script to pull from google's search results so that I could have a site-only search function built into my page. I've had to rework it about 3-4 times as they keep adjusting the way to connect.
This is definitely about money, and I don't blame them. As long as their pricing model is decent, then I think they'll make a killing. Information is power, being able to find that information is even more power, and when you hold a monopoly because you're so much fucking better than everyone else... you can command money for your services.
I'll probably buy into it if they come up with a decent pricing scheme. I'd like to see something relating hits to cash, but also have some cut-off switches (so you can protect yourself against runs and slashdottings).
Re:No contradiction (Score:2)
Symbiosis can be viewed as mutual parasitism. There are good reasons for corporate shirts to pay good money for what hackers download for free.
Re:No contradiction (Score:2, Interesting)
The search results that appear from Google's indices are indexed by Google's automated machinery and computers
The User Agreement precludes you from automatically querying their site:
You may not send automated queries of any sort to Google's system without express permission in advance from Google.
The google agreement demands express permission to automatically scan its site, while Google assumes the permission to index all sites on the net.
Google also pretty much make the claim that if it is on the Internet they will index it. Their terms of service states that the only way not to have a site indexed is remove it from the net:
For each web site reflected in Google's indices, if either (i) a site owner restricts access to his or her web site or (ii) a site is taken down from the web, then, upon receipt of a request by the site owner or a third party in the second instance, Google would consider on a case-by-case basis requests to remove the link to that site from its indices.
I think Google is providing a great service, but I hope you can see the subtle contradictions in their product. They basically are saying that anything on the web is fair game for Google. Yet, Google is on the web, it is not fair game for other organizations. This is a very blatant double standard.
Google is a derivative work. The product model of Google is to determine expert sites by aggregating the link lists on other expert sites. In other words, they are taking other people's work, aggregating it and providing the results. Google's aggregation program is a derivative work. Not only that, they fail to give any compensation to the expert sites.
As for the issue of intergallactic karma, they actually expect the expert site to pay for the bandwidth needed by Google to aggregate the site. They then use this information to draw human traffic from the expert site.
Again, to the Google worshippers, I am not complaining or flaming Google, but simply pointing out a logical contradiction. Jack's Expert Site is harmed by Google in two ways: The googlebots take up a great deal of bandwidth that Jack pays for. Google then uses this information to draw actual human traffic from Jack's Expert Site. From this vantage Google is a big guy stomping on the small guy. When Microsoft does this type of stuff, we call it evil.
robots.txt? (Score:2, Informative)
Re:Contradicts the terms of use (Score:2)
The mean it. The version of WWW::Search::Google [cpan.org] on CPAN sends a User-Agent header of "WWW::Search/2.33", which Google rejects, saying: "Your client does not have permission to get URL...". Changing the User-Agent to "lynx" allows the query to be answered.
Of course, the question of how their terms of service become binding upon us is yet to be answered. They say: By using Google's search engine services ("Google Search Services"), you agree to be bound by the following terms and conditions (the "Terms of Service"). Yeah, right. I don't accept that from microsloth, and I don't accept it from Google.
This sounds cool, but.. (Score:2, Insightful)
Ok, it can be done already, but this would make it possibly too easy...?
Also, this will miss out their ads etc that they get revenue from, I wonder what their long term stratagy is?
Re:This sounds cool, but.. (Score:2)
Or lets say Google spikes the search request at some competitors to prove they are using Google.
So sure they could do it, but I doubt any popular site could get away with it for long.
Re:This sounds cool, but.. (Score:2)
Some unscrupulous players could surely abuse this by 'making their own' search engines that essentially rip off google without any hassle what so ever?
Yahoo [yahoo.com] hasn't had enough problems with it to take it down. It's really nice being able to make my own PHP script to display customized stock quotes on my PDA.
Re:This sounds cool, but.. (Score:1)
Actually, it works quite OK. (Score:3, Interesting)
If you run the Ruby script, as is, the result is thus:
#: Exception from service object: Invalid authorization key: xxxxxxxxxxxxxxx (SOAP::FaultError)
If somebody starts abusing a particular key, it's a no-brainer for Google to shut the key off.
The app I'm working on could use this (Score:2)
Explanation / Keep Supporting Google (Score:3, Interesting)
---
The following is the preliminary code that a particular Google sysadmin (ian@) is trying out. He'd prefer to have a single WSDL file do all of the configure (from Google's end to client), but he first needs to get some advice from an experienced Ruby hacker.
Also, let's keep in mind that this API will actually be decreasing Google pageviews and hits, which will in turn make their AdWords, AdWordsSelect, and textads less effective. So, it's our duty to continue to support Google and show them that the free/open source software people are behind them 100%. We know that Teoma just doesn't deliver, and Google's already got 3 billion pages indexed and cached.
Support Google today, because they're the future of information indexing on the Web!
--- begin code ---
#!/usr/bin/ruby
require 'soap/driver'
endpoint = 'http://api-ab.google.com/search/beta2'
ns = 'urn:GoogleSearch'
key = 'xxxxxxxxxxxxxxx'
service = 'file:GoogleSearch.wsdl'
query = ARGV.shift || 'foo'
soap = SOAP::Driver.new(nil, nil, ns, endpoint)
# uncomment the next line to dump the traffic on the wire
#
#soap.setWireDumpDev(STDERR)
soap.addMethodWithSOAPAction('doGoogleSearch', ns, 'key', 'q', 'start',
'maxResults', 'filter', 'restrict',
'safeSearch', 'lr', 'ie', 'oe')
r = soap.doGoogleSearch(key, query, 0, 10, false, nil, false, nil,
'latin1', 'latin1')
printf "Estimated number of results is %d.\n", r.estimatedTotalResultsCount
printf "Your query took %6f seconds.\n", r.searchTime
Re:Explanation / Keep Supporting Google (Score:2)
Maybe. However, you can already use screen-scraping to query Google today. The difference is that Google will better know the difference between a program and an actual user. So each ad-word view may be more effective. Of course, now that Google offers pay-per-click (in place of pay-per-impression), that might not be an issue anyway.
Will the number of actual ad impressions go down? Could be. But maybe by being a better product, Google will gain popularity and increase impression counts.
As for supporting Google: Definitely; they make an unbeatable product.
Is this a subscription service? (Score:4, Interesting)
I havent tried to get it to work yet, due to not having ruby installed, but does this imply some sort of subscription service?
Possibly a new way for them to raise revenue? Im assuming that the bold line means the authors key has been blanked out so other people cant abuse this service for free?
Lameness filter encountered. Post aborted! Reason: Too much repetition. :/
OpenGoogle? (Score:2, Funny)
We can finally find out how to implement their PigeonRank system...
Re:OpenGoogle? (Score:2)
Good to see this idea return (Score:4, Interesting)
Of course, now it's just forbidden. I am surprised they would go back to such a service, it would seem to wind up losing revenue for them depending upon whether or not people are good about passing along whatever Ad-words Google returns. They could expect the traffic to be low enough to not matter compared to the continued word-of-mouth benefit. Or access to the SOAP interface could be offered as a subscription model (pure speculation on my part).
-Robert
Re:Good to see this idea return (Score:2)
If the results are returned using SOAP, then the backend surely would want to display the ads because a lot of the time, they are what the user is looking for.
I know if I am looking to buy something search Google for vendors, I am more likely to choose a vendor from the Ads on the side. I figure it is a bit safer since these people actually have something invested in it.
The only reason I can think that someone would filter out the ads is simply because they want to hurt Google. Who wants to hurt Google though?
The click through rate is probably going to make things hard since there is no way to tell if a user clicked an ad. That just means a different guage...
Ode to Google :) (Score:4, Insightful)
They have always made the right decision ! they have offered internet users an incredible asset ! and I was so much grateful when they decided to rescue Deja, a site something I just don't know how I can leave without !
I view them as the most "honest and fair" site on the Net ! and without any doubt the most useful too.
Go Google ! you are showing the right way ! to all these stupid-crapy-portal sites which have invaded the net, I just hope you manage to stay in business and prosper for a loooooong, looooong time
Very Useful (Score:1)
wow -- wish i'd had this earlier (Score:2)
Pay-per-placement will pay for this... (Score:1, Insightful)
Re:Pay-per-placement will pay for this... (Score:2, Informative)
That was the very thing that turned people onto Google. I very much doubt that they would change that.
Google returns 2 distinct sets of results (Score:2)
Google does not have a pay for placement plan - if you are making reference to the practice of changing the order of search results based on advertiser dollars.
Yes it does. When you search Google, it displays two distinct sets of results side-by-side. One set is based solely on PageRank values; the other (clearly marked "Sponsored Links") is based on advertising dollars. The problem with GoTo [overture.com] was that you had to scroll and click past pages and pages of sponsored links to get to the results scrolled by relevance.
what about the law suit (Score:2)
Not publically accessible (Score:2)
at com.google.soap.search.QueryLimits.lookUpAndLoadF
...
Alas, looks like the rest of us won't be able
to play with Google's beta SOAP service. Which makes quite a bit of sense - this would be a great way for Google to allow people to resell Google in a standardized way, be it from inside a program (scary, too easy to reverse engineer) or from some other web service (less scary.
These guys drive me crazy. (Score:5, Funny)
Already something like this... (Score:3, Informative)
http://www.google.com/search?hl=en&q=blah&outpu
with
http://www.google.com/search?hl=en&q=blah&outpu
with
http://www.google.com/search
-nonymous
API for Biz Partners ($$$) Only ??? (Score:3, Informative)
http://www.google.com/xml?q=slashdot [google.com]
You'll (probably) get an error page.
I read about this on Scripting News [scripting.com] in February:
Dave Winer made an inquiry [userland.com] to Google about accessing this XML API.
Their initial response [weblogs.com] was not very helpful, asking for the link to be removed, and saying that the link is "obviously reserved for Google partners." Eventually, Google let Dave access the API. Now, he sounds like he's under NDA [userland.com] about this.
been done... (Score:2, Interesting)
Re:been done... (Score:2, Insightful)
Terms of service (Score:2, Interesting)
Re:Terms of service (Score:2)
Re:Terms of service (Score:2)
Slash could use this. (Score:4, Insightful)
Need to reference John Carmack's comments? Sorting him out of the masses is next to impossible. Even a comment poster as prolific as Signal 11 (arguabley slashdots first and greatest Karma Whore) is nearly impossible to find. First 30 matches of how many? You want to sort through jeffy124's 700+ comments and 24 submitted stories just to find the pertinate one I need by hand? Not to mention the benefit to Slashdot's editors, being able to follow a clear history of articles on a given subject to look for repeats and make more informed editorial commentary. If 90% of readers never read the comments, the editors owe that 90% the sort of editorial commentary attached to each story that only good research can provide.
In fact, the editors could try it on an interim basis immediately, and provide the service to readers only if they had the resources. I sort of get the feeling that the editors are still thinking of slashdot as a small time blog run out of their apartment closet server.
Run google on slashdot now and you get the news from three weeks ago. Incorperate a google box or google APIs into Slash so I could search today's news and I would Pay 10 cents of subscription funds per search in a heartbeat.
Editors: look at the number of hits to your current broken search engine. Double that number because a dedicated google box would be so much better it would get used a whole lot more. Multiply that by 10 cents per search. See if the numbers work to afford the initial expenditure to get a nice yellow rack mount google box. Slashdot is sitting on a goldmine of data and no one can search it and Slashdot cannot profit from it without a nice pay per search subscription using the best engine available.
Re:Slash could use this. (Score:2)
Re:Slash could use this. (Score:2)
Results 1 - 10 of about 51. Search took 0.12 seconds.
51 of 621 is no where close to good enough. The entire database must be indexed and parsed. Magic Google algorithms must dance upon the text and context.
I want a search to say:
results 1-50 of 621.
sort by relavance.
sort by score.
sort by date.
sort by number of replies.
sort by total moderation done to comment.
sort by the number of author's posts in story.
So I can say, "gee, I remember p3d0 had at least 5 comments on that story and two of them were at least score 5" and the search engine finds all data sets that match, sorted by how closely they match. Slashdot is a convoluted enough system that the Slash crew may have to extend even Google a bit. But they are bright lads and if not too busy with day to day editing would find such a task within their abilities.
Re:Slash could use this. (Score:2)
I don't know what became of him. I shure wish he (god, for that matter signal might be a she) would make a slashdot submission story letting everyone know how he's doing.
To give the kids an idea of his abilities, he would have at least one 5 post on every single story. It's like he was able to prepare posts for stories before they were even posted. He always had the first informative or insightfull comment in every single article.
Signal 11 has posted 2099 comments.
He was the Karma Whore for which the term was invented. He has HUNDREDS of imposters and may have had the first imposter, Signail11, a troll that knew moderators would get confused and occasionally mod him up. The man never slept and his insistance on posting early and often made slashdot consider headline beepers.
One of the things I miss the most was his ability to deep link to a NY Times article. No one ever had to register as long as he was around.
sensible bandwidthwise (Score:2)
SOAP me up baby ! (Score:2)
This is even better. With this feature, I'll be able to SSI and/or push results using something as simple as SoapLite [soaplite.com] to get the job done.
I sure hope other content providers are taking note. Imagine how useful (not to much fun) it would be to snap up stuff from places like MoreOver.Com [moreover.com]?
Re:Since when do we like Google? (Score:3, Funny)
We like them Monday, Wednesday, and Friday.
We love them Tuesday, Thursday, and Saturday.
And we alternate Sundays.
Get with the program.
Re:Ron Hubbard's role in this (Score:2)
Google Hosted SiteSearch (Score:2)
They admit it: see Google Hosted SiteSearch [google.com]. Pay Google the $$$, and Google will index all your site's pages as often as you want.
It's primarily intended for webmasters of commercial sites who want to outsource their "Search This Site" functionality, but the index does leak into the general web search database.
Re:Command line google is already here! (Score:2)
Very cool. Makes it very straightforward to automate navigation/downloading of web pages based on a query, a la the Matrix. I need to fish out my dark glasses.