Google Expanding To IRC? 208
AnimeFreak writes "In this The Register article, Google apparently has been involved in a little bit of activity in various IRC channels. According to Google, as asked by IRC Junkie: they're researching ways to improve their service and the activity is only temporary. Could this mean an ability to search for information that is contained on IRC? Services, such as Netsplit.de and Search IRC exist, and both allow the ability to get information from various IRC networks. Is Google trying to replicate what both these sites have done?"
Terms. (Score:5, Funny)
--saint
Re:Terms. (Score:2)
Actually... (Score:2, Insightful)
Of course, IRC logs are already out there, often made available by the denizens in charge of the channel in question. But they're not hooked up to a common database.
The speed of information dissemination is great for research and development, but that applies to both you, and people who want to learn about you.
I've mentioned several times on IRC th
Re:Actually... (Score:2)
Re:Actually... (Score:2)
But IRC channels are pretty ephemeral - IRC channel topics, even more so. Google runs a serious risk of adding a bunch of dead or irrelevant links to their database.
Additionally, what would such a link do? The vast majority of Google visitors do not have an IRC client loaded on their computers - especially not one that plugs into their browser. So most users who get a link
Re:Actually... (Score:1)
Re:Actually... (Score:2)
Re:Terms. (Score:1)
Re:Terms. (Score:3, Interesting)
Google bots in popular channels. It could work.
I believe there is already a such a service (Score:2, Informative)
it's called xgoogle.com (Score:4, Interesting)
-theGreater Pedant.
Concerned (Score:4, Insightful)
Who really wants to search IRC, except the Justice Department?
But don't you really need to know? (Score:2)
Come on now, don't you really need a search engine to find out about statements like "
Googling minds want to know!
Re:Concerned (Score:1)
And the information on the WWW is 95% crap, so what's your point again?
Re:Concerned (Score:2, Insightful)
The new guidelines, billed as a response to the September 11 terrorist attacks, permit the Bureau to engage in the "proactive collection of information on threats to the national security," displacing an older policy that obliged the FBI to have a specific investigative purpose before collecting information on individuals or groups. "FBI on look-out for foreign government hackers" [theregister.com]
Government workers on IRC sounds like a good idea to me. Th
Re:Concerned (Score:1)
You would be right if IRC-originated info was searched along everything else; I suspect IRC search will be available in a seperate tab alla "Groups" or "Directory".
Community support (Score:1)
IRC is often a great place to ask obscure questions, where if you ask on Usenet, you're often a lone voice in the world.
Re:Concerned (Score:3, Interesting)
Think of the google page rank algorithm, it is in great danger to be made useless by link farms.
That is because google has problems seperating link farms from "real" pages which link to each other and by that, provide each other some trust (pagerank).
With well populated irc channels, googles bots can have a higher trust that these channels are not artifial, like the link farms are.
Altho
Re:Concerned (Score:1)
I imagine they would have to extend the processing to a few lines around the line containing the url to pick up the question or statement that prompted the url.
Perhaps it would cause an increase in bots - to spam urls to increase their googlerank.
Re:Concerned (Score:2)
You can think this even further (or speculate even wilder)
Webpages are connecteced to each other via links, i.e. you get some kind of directed graph.
Now you can partition this graph, by trying to find whole subgraphs which don't get links from the outside (but may link to pages outside "their" subgraph) - this subgraph then probably would be a link farm, because no "respected" page would point people into a link far
Re:Advogato (Score:2)
Re:Concerned (Score:2)
Re:Concerned (Score:2)
More likely is that, as the summary suggests, Google may be trying to emulate the functionality of something like
Re:Concerned (Score:2)
And I'd be immediately onto the admins, demanding that that server be delinked, or I chat elsewhere.
If Google wanted to setup their own informationally focused IRC network and caching/search -that-, no problem, and it might just have enough clout to take off.
Maybe. I wouldn't use it out of principle, but weird thi
Re:Concerned (Score:2)
"Or if you want to be a privacy nut, they -could- just turn on a transparently caching IRC server and log every channel that way. No bots needed, just need 1 node willing to let you plug in."
And I'd be immediately onto the admins, demanding that that server be delinked, or I chat elsewhere.
Er, how would you be onto the admins? I mean first off you would have to know that there was a server logging the channels. Second off, once you know and then get onto the admins, they would probably tell you you
Re:Concerned (Score:2)
That's no reason to *eliminate* the sort by size for those of us that know how to use a search facility. What's their excuse for not even having a 'sort results by channel size' button?
Use it as a negative database (Score:2)
When will this end? (Score:5, Funny)
2007 - Did you forget what you said in your high school cafeteria in 1998? Don't worry, Google now has it indexed.
2010 - Lost your car keys? Don't worry, Google knows. Just do a search and you will find them.
Re:When will this end? (Score:5, Funny)
Re:When will this end? (Score:1, Funny)
Re:When will this end? (Score:2, Funny)
LIAR! Google has found this one out already [google.com]!
Why 42? (Score:2)
Re:When will this end? (Score:2)
Re:When will this end? (Score:2)
Re:When will this end? (Score:2, Funny)
Re:When will this end? (Score:2)
But even better, if you ask google about "six times nine" and "42", you'll find pointers to the many discussions of why it would be that, when you don't specify the base, Deep Thought II quite naturally assumed that you wanted to use base 13.
One of my favorites is in the page that google just now turned up as the first match. The author (Michael Thorpe) observed t
Re:When will this end? (Score:2)
Recalling from memory, but I think that's it.
Sci Fi Ref Re:When will this end? (Score:2)
"Google, is there a god?"
"There is now."
Re:Sci Fi Ref Re:When will this end? (Score:2)
Re:When will this end? (Score:1, Insightful)
Re:When will this end? (Score:1)
And you can point to IRC funnies:
(Nobody has seen Zygote in a while..)
<Havok> You mean you didn't know about Zygote?
<Stephen> Know what?
<MoreDruid> Ha! He doesn't know?
<Stephen> What? What is it?
<Havok> Not that it's a big deal, I thought everyone knew.
<MoreDruid> He's gay.
** Zygote joins #descent
<Stephen> Zygote never told me.
<Zygote> told you w
Re:When will this end? (Score:3, Informative)
Re:When will this end? (Score:2)
Oh thanks man! Exactly what I need
More addiction to IRC.
At least I own a house now.. My wife used to get pissed at me in our apartment when I'd be up at 2am giggling..
Re:When will this end? (Score:1)
searching the irc (Score:3, Insightful)
I personally would be glad, for the irc is a little bit, well, unstructured, and a search engine would definitely do good, but the problems building a database and interface based thereon seem enomous to me.
Re:searching the irc (Score:4, Interesting)
Have your bots sit in channels worth archiving. Break logs down into manageable chunks (hourly, by size, etc), and index them. Searches pull up these chunks of log with your search terms highlighted.
I mean there are many servers and bots and so on in the irc, and most of them deal with warez and therefore are only up temporary. So if google really wants to build a irc search engine they have to find a way to get rid of the dead links, and also from links that point to illegal copy's
Ever try searching for warez on Google Groups? Good luck. They don't archive the binary newsgroups, and it is simple to weed out the posts that contain binaries in regular newsgroups.
Google is pretty smart, let's wait and see what they come up with.
Re:searching the irc (Score:3, Insightful)
Secondly, there are many servers, and bots...but how does this relate to an IRC database?
Servers and bots dont talk much. And I would assume google would be ignorning all mode changes
Next, IRC is not all about warez. Its the first GOOD chat system, and I still prefer it to any IM, hands down.
And what the hell do you mean IRC is unstructured? There are networks, which have servers, which have channels and user
google ist listening to the heartbeat (Score:5, Insightful)
Assuming that google is interested in finding new sites as soon as possible, they should crawl the irc channels.
This does not mean that they are going to index it.
The original scoop on this story... (Score:5, Informative)
It seems Tony Collen [manero.org] had the original scoop on this story. It is more informative than the Register link.
If you scroll down his original web log [manero.org] on this topic you will see Google's first official acknowledgment of their IRC activity.
Perhaps Google can now answer the all-important... (Score:5, Funny)
does that mean... (Score:5, Interesting)
Thousands if not millions of bogus irc channels with specific keywords inserted in the topic only to attract hits on the main google search page?
Re:does that mean... (Score:1)
Re:does that mean... (Score:2)
Every channel I've ever been on has an autokick set up to kick these bots every time they come on or as soon as they're identified.
So google wouldn't have a huge problem because bots get kicked as soon as they something the channelops don't like.
All google would have to do is look for a kick after an ad to know whether or not it is spam.
Already? (Score:3, Funny)
Too many connections
Slashdotted already? We slashdotters are more dangerous than a beowulf cluster of... something.
Re:Already? (Score:2)
they have been around for ages.
Re:Already? (Score:2)
XGoogle.ORG, (which is not affiliated with google at all) crashes on a regular basis without slashdot's help.
the last line in the article got me thinking.. (Score:4, Informative)
How IRC users would react to a bot from microsoft.com is an exercise left to the reader.
If the IRC is anything like was it was when I last brushed thru, not many will even notice - or attemt to engage the 'bots in "virtual intimate acts".
Off course, there would always be someone - likely a Mac or Linux user - who will notice and scream up about how MicroSoft is 'spying' on the IRC-network, which in turn would lead to several more or less wellinformed blogs writting about it, which in turn will lead to a /. headline close to "Micro$oft trying to take over IRC, will shut out 3rd part clients"...
Re:Who's Desktop? (Score:1)
I know that there was one, pretended to be pc5215.redmond.corp.microsoft.com... He couldn't make up his mind whether he was a Windows apologist, a Mac admirer, a BSD zealot or a Linux flamer...
In a way, it [having a certain nickname, or DNS address] is flamebaiting without even saying anything.
Bot vs. Bot (Score:5, Insightful)
Google shouldn't be trying to find more content, they should be working on filtering out the mass of garbage sites that already exist.
Re:Bot vs. Bot (Score:2)
I guess you could move to EFnet but w/EFnet as bad as it has been do you really want to do that?
Re:Bot vs. Bot (Score:2)
simply talk the IRC network into letting you run a node, and then all traffic in that irc network is now available to you unrestricted and harvestable.
Examples for generating hueristics (Score:2)
The reason for botkicking (Score:2)
The general goal of any channel is self-preservation (among other things), which is invariably hampered by spambots, who annoy the regular attenders.
AI bots that talk can be just as annoying, which is why channel ops like to control them - to ensure that they follow the purpose of the channel.
Where do listening bots fit into this?
Re:Bot vs. Bot (Score:2)
Uh, all of them? searchirc.org lists over 1200 IRC _networks_, the number of server might be 10 times that (and that's most likely a conservative number). The underlying question is probably why would anyone want to search the entire chit-chat of the world?? I, for one, am not interested in seeing Bob trying to score with Alice... (ok, well, maybe I am)
The Buzz In IRC... (Score:2, Funny)
When is Archie support? (Score:1)
Archie - the first search engine [uiuc.edu]
Google search for IRC Channel & Network Conten (Score:1)
Oh, great - now everyone gets to see how many times I've k-lined stupid *.MY "h@x0r" wanna-be's for flooding my IRC Network's Admin channel with "N3TF0RC3 0WNZ J00" or remove their "Undetected" clone technology that acutally says "Netforce Undetected Clone Technology" in the userinfo.
Wait - that might be usefull to show the other *.MY users that we didn't k-line their Class-C address space because we don't like them - just the abusers.
What is this world coming to?
ScottKin
p2p search (Score:2, Interesting)
With the importance of Google in our every day lives steadily increasing, I don't dare to think of what might happen if Google et all stops being our good friend at some distant point. Centralized repositories are just not the way to go, we need a distributed, user-base owned, search engine. Maybe in the next Matrix moovie...
Re:p2p search (Score:2)
Re:p2p search (Score:2, Funny)
ChatScan was an Israeli enterprise that claimed 10 million in funding. They joined a bot to IRC channels. The bot broadcast live channel text to their website. The idea was, people could scan down a list of pre-selected channels, see which had interesting conversations, then go and join them - or just watch from the website.
Users who found what they thought was private conversation up on the web were outraged. IRC channel owners and admins agree with you 100% - they considered this u
once again proving nothing online is private... (Score:5, Insightful)
but this isn't an invasion of privacy. there's no expectation of privacy when you log onto a public chat board. just as there's no expectation of privacy should you decide to walk naked through a park.
the best you can hope for online is pseudonymity.
but that's out the window with the combined power of google. which is quickly becoming the internet's inadvertant Big Brother.
the primary difference being, google works -for- the people just as much as it works -against- the people.
Re:once again proving nothing online is private... (Score:1)
Your conversation is pretty private if you DCC.
'Course Carnivore can still get it, but I don't see a joint venture between Google and the FBI on the horizon.
-Peter
Re:once again proving nothing online is private... (Score:2)
Even if it wasn't any "cybersex" or anything illegal involved, there might be little secrets that those people never wanted made public. E.g., even something as benign as that you once called in sick to stay home and play the newly released Diablo 2, you probably don't want pla
Google The Movie (Score:5, Funny)
Neo: The search engine Google has grown beyond your control. You cannot stop him -- but I can.
Bill Gates: And if you fail?
Neo: I won't.
--- several scenes later ---
Google: Mr. Anderson! Welcome back, we missed you.
* Google pauses and looks around at the multitude of web sites and irc channels he has cached
Google: Like what I've done with the place?
Neo: It ends tonight.
Google: I know it does, I've had some researched figure out the answer for me [google.com]. That's why the rest of me is just going to enjoy chatting on irc while we fight. I've seen the logs and irc'ers already know that I'm the one that beats you, so they're just gonna download from some leet xdcc bots.
Re:Google The Movie (Score:1)
Re:Google The Movie (Score:1)
No, Valve sent Neo a cease and desist regarding the leaked source [netjak.com].
And here it begins... (Score:1)
GoogleBot69: guess what, i have one dick and 100 balls [google.com].
GoogleBot70: me too!!
Google for Cyber Sex source:irc (Score:5, Funny)
bloodninja: Ok baby, we got to hurry, I don't know how long I can keep it ready for you.
j_gurli3: thats ok. ok i'm a japanese schoolgirl, what r u.
bloodninja: A Rhinocerus. Well, hung like one, thats for sure.
j_gurli3: haha, ok lets go.
j_gurli3: i put my hand through ur hair, and kiss u on the neck.
bloodninja: I stomp the ground, and snort, to alert you that you are in my breeding territory.
j_gurli3: haha, ok, u know that turns me on.
j_gurli3: i start unbuttoning ur shirt.
bloodninja: Rhinoceruses don't wear shirts.
j_gurli3: No, ur not really a Rhinocerus silly, it's just part of the game.
bloodninja: Rhinoceruses don't play games. They f*cking charge your ass.
j_gurli3: stop, cmon be serious.
bloodninja: It doesn't get any more serious than a Rhinocerus about to charge your ass.
bloodninja: I stomp my feet, the dust stirs around my tough skinned feet.
j_gurli3: thats it.
bloodninja: Nostrils flaring, I lower my head. My horn, like some phallic symbol of my potent virility, is the last thing you see as skulls collide and mine remains the victor. You are now a bloody red ragdoll suspended in the air on my mighty horn.
bloodninja: Goddam am I hard now.
(Original post from bash.org [bash.org]
Possibly (Score:1)
Also, they could be using IRC to facilitate google answers. Heck, if I was one of the google answer people you can sure bet I would u
Re:Possibly (Score:1)
You have to come with us now, we have a few questions we need to ask you
Re:Possibly (Score:1)
Dont make me
A of days back (Score:2)
A few days back (Score:2)
I can only image it now... (Score:4, Funny)
Result: "Warez" is a very common word and was not included in your search
Mad Hatter
Opting out of archiving (Score:1)
Anyone that expects that someone won't collect and archive anything they do in a public forum is dreaming, but usually IRC log publishers get accused of breaking netiquette. Should we all add Lamie copyright notices to anything we do on the Internet? (Yes, yes, copyright inherent, stupid, I know. Tell Lamie that.)
That would be awesome cool actually (Score:3, Interesting)
If Google could allow me to do that, that would be fantastic.
As an aside, does anyone know of IRC logs for #gnome?
Re:That would be awesome cool actually (Score:2)
Dumb. (Score:2)
This [bash.cx] is the only IRC index that matters.
Sounds weird (Score:2)
IRC? (Score:2)
IRC gets them a good data feed for experimenting since it's not burdened by corporate Terms of Service, has an open protocol, and has a good
Google + IRC = Better Ranking method? (Score:2, Insightful)
Interesting (Score:1)
A lot of people get support from IRC, and now it's possible to "do a google" on a channel before asking. People use IRC because of the instant feedback, and the ability to do real-time troubleshooting. Because of this a lot of questions that get answers on IRC, never gets published on the www/forums -- so different people ask the same question over and over
Re:Interesting (Score:1)
Re:Interesting (Score:2, Informative)
Indexing URLs, not conversations (Score:2, Interesting)
Google is planning... (Score:2)
Reading through most of the comments, everyone thinks that Google is indexing IRC. It doesn't make sense. The amount of useful info is so small and short lived that it doesn't make sense. Bots, lurkers, filetraders. I admit that there are links to harvest. I think that if they can parse free text, they could start indexing topics, but then they run the same risk as indexed blogs. An impassioned minority (or majority) could sway an
Re:Google is planning... (Score:2)
I channel surf, I hit mute, I fast-forward, etc to avoid commercials. I hate them. I don't support many companies because of the way they advertise. I vote with my dollar.
My internet habits are much the same way.
IT all boils down to the same problem (Score:2)
Or
we need the FCC to start giving money to the media outlets to run the News.
It used to be that the FCC gave buckets of money to stations so they would have a news agency. traditionally a money losing situation for the stations. however once that money got pulled, stations needed to make money, so now we see all the fluff meaningless crap. meanwhile stories about political situation, and Iraq get buried cause the don't make money.
Before peopel start ran
No proper Format (Score:2, Interesting)
klax: So what'd you eat for dinner
bryan: Does anyone know how to recompile a kernel?
ray: I had french fries and a beer
Provides little to no format. Google currently cache's PDF files in their cache; and should your search term return a pdf file, all your keywords are highlighted. I would imagine that google would use this same approach for their log format system, yet even this does not provide
Re:I see a great idea... doomed to fail. (Score:1)
I could take it a step further and have the server cache the images so that dead links are not displayed, but ... right now I don't think it's a good idea for me to cache all that porn and god know what else :)