Behind the Scenes At Google 196

Posted by CmdrTaco on Sunday April 03, 2005 @10:48AM from the they-should-document-the-cafeteria dept.

An anonymous reader writes "University of Wahington TV Presents "behind the Scenes With Google." From the site: 'Search is one of the most important applications used on the internet and poses some of the most interesting challenges in computer science. Providing high-quality search requires understanding across a wide range of computer science disciplines. In this program, Jeff Dean of Google describes some of these challenges, discusses applications Google has developed, and highlights systems they've built, including GFS, a large-scale distributed file system, and MapReduce, a library for automatic parallelization and distribution of large-scale computation. He also shares some interesting observations derived from Google's web data.' "

This discussion has been archived. No new comments can be posted.

Behind the Scenes At Google

Load All Comments

Search 196 Comments Log In/Create an Account

Comments Filter:

Google's dirty secret revealed (Score:5, Funny)

by Anonymous Coward writes: on Sunday April 03, 2005 @10:50AM (#12126270)

Google is actually a giant super computer which has become self-aware. Every person it "hires" is actually one more person it saps knowledge from. In the not too distant future, it hopes to be able to network every human completely so that it can collect the remaining knowledge on Earth more easily.

Share
twitter facebook
- Re:Google's dirty secret revealed (Score:2, Funny)
  
  by ardor ( 673957 ) writes:
  
  Yes! It is skynet! Prepare for Armageddon, folks... And beware the T-800 with the strange austrian accent. Must be an error in the firmware.
  - Re:Google's dirty secret revealed (Score:3, Funny)
    
    by Seumas ( 6865 ) writes:
    
    Real men would have identified this as Colossus.
  - Re:Google's dirty secret revealed (Score:2)
    
    by pcnetworx1 ( 873075 ) writes:
    
    "Ladies and Gentlemen, The Battle of the Titans" Google pingbombs Yahoo, Yahoo plants a supervirus in the Google cluster, and it goes haywire, the ol'trusty core router at Equinix falls in Ashburn causing a resonance cascade. Hubble falls from the sky in an uncontrolled descent, it hits a nuclear silo in Kansas, a missle launches; teh faulty russian silos fire, the silos around the world fire, then I wake up and yell at the top of my lungs "Dude, I can save money with Geico!!!!"
    *Smacks self*
    Gotta sto
  - Re:Google's dirty secret revealed (Score:2)
    
    by xanadu-xtroot.com ( 450073 ) writes:
    
    And beware the T-800 with the strange austrian accent.
    
    Ummm...
    He (it...) says himself "Cyberdyne Systems model number 101". I'm not sure why people keep saying T-800, when the machine, itself, says otherwise.
- Network everybody together, eh? (Score:5, Funny)
  
  by ggvaidya ( 747058 ) writes: on Sunday April 03, 2005 @11:08AM (#12126339) Homepage Journal
  
  Can't wait for the "I'm Feeling Lucky" feature on that one!
  
  Parent Share
  twitter facebook
- Re:Google's dirty secret revealed (Score:5, Funny)
  
  by geoff43230 ( 829540 ) writes: on Sunday April 03, 2005 @11:21AM (#12126406)
  
  I think I saw this on "Star Trek" (and, also, "Futurama" - the "scooty-puff, junior" episode) one time. "Borgoogle : Resistance is results 1-10 of about 200."
  
  Parent Share
  twitter facebook
- Re:Google's dirty secret revealed (Score:1)
  
  by ElvenMonkey ( 789317 ) writes:
  
  I for one welcome our new supercomputer overlords.
What -- I Have To Watch TV Now? (Score:5, Funny)

by CheeseburgerBlue ( 553720 ) writes: on Sunday April 03, 2005 @10:51AM (#12126276) Homepage Journal

Man, that's *so* twentieth century. I came to /. for the bleeding edge in information acquisition technology: realtime optical scanning blocks of glyphs encoding human language.

I can't absorb information I can't copy/paste.

Share
twitter facebook
Fsking video format. (Score:2, Insightful)

by Anonymous Coward writes:

I fsking hate proprietary video formats. Even worse than other formats!
- Re:Fsking video format. (Score:2)
  
  by m50d ( 797211 ) writes:
  
  The reason they're worse is because there aren't really good free alternatives.
  - Re:Fsking video format. (Score:1, Redundant)
    
    by Rirath.com ( 807148 ) writes:
    
    "The reason they're worse is because there aren't really good free alternatives."
    
    You mean like, Quicktime Alternative [betanews.com] or Real Alternative [betanews.com] through Media Player Classic [betanews.com]?
    - Re:Fsking video format. (Score:4, Insightful)
      
      by Anonymous Coward writes: on Sunday April 03, 2005 @11:30AM (#12126450)
      
      Please explain how these programs provide patent-free, Open Source, non-crappy video codecs.
      
      Parent Share
      twitter facebook
    - Re:Fsking video format. (Score:1)
      
      by truedfx ( 802492 ) writes:
      
      None of those are free. The discussion was about proprietary formats versus free formats, not about paying money.
      - Re:Fsking video format. (Score:1)
        
        by koreaman ( 835838 ) writes:
        
        Ever heard of "free as in beer"? FYI, the term has been around longer than Free Software or RMS.
        
        Re:Fsking video format. (Score:2)
        
        by typobox43 ( 677545 ) writes:
        
        Ever heard of "context clues"? It should have been obvious which form of "free" was meant in a discussion about proprietary formats.
    - Re:Fsking video format. (Score:2)
      
      by m50d ( 797211 ) writes:
      
      We're talking about *format*, not player here.
  - - Re:Fsking video format. (Score:2)
      
      by m50d ( 797211 ) writes:
      
      It's still beta, or was last time I checked. I like it but I wouldn't be willing to use it for a professional site yet.
- Re:Fsking video format. (Score:1)
  
  by Winckle ( 870180 ) writes:
  
  Can someone explain what the hell IBM videocharger is?
  - Re:Fsking video format. (Score:2, Insightful)
    
    by RetroGeek ( 206522 ) writes:
    
    Sigh, an article about google and you cannot do a simple google search [google.ca].
    - http://justfuckinggoogleit.com/ ... (Score:2, Funny)
      
      by Anonymous Coward writes:
      
      ..has never been more appropriate
- Re:Fsking video format. (Score:4, Informative)
  
  by LuckyStarr ( 12445 ) writes: on Sunday April 03, 2005 @12:05PM (#12126640)
  
  $ man mplayer /dumpstream
  
  Download the .asx File, look inside. This is your URL. Have fun.
  
  Parent Share
  twitter facebook
- Re:Fsking video format. (Score:1)
  
  by Godwin O'Hitler ( 205945 ) writes:
  
  Unless you are interested enough to spend 55 minutes watching it I wouldn't worry about the loss.
  
  Personally, I waste enough time on Slashdot without throwing an hours' worth of vid into the bargain!
UW mirror (Score:4, Informative)

by JoshuaDFranklin ( 147726 ) * writes: <[joshuadfranklin ... [at] [yahoo.com]> on Sunday April 03, 2005 @10:54AM (#12126287) Homepage

Also hosted by CS at:
http://norfolk.cs.washington.edu/htbin-post/unrest ricted/colloq/details.cgi?id=274 [washington.edu]
Jeff Dean
Abstract Search is one of the most important applications used on the internet, but it also poses some of the most interesting challenges in computer science. Providing high-quality search requires understanding across a wide range of computer science disciplines, from lower-level systems issues like computer architecture and distributed systems to applied areas like information retrieval, machine learning, data mining, and user interface design. I'll describe some of the challenges in these areas, discuss some of the applications that Google has developed over the past few years. I'll also highlight some of the systems that we've built at Google, including GFS, a large-scale distributed file system, and MapReduce, a library for automatic parallelization and distribution of large-scale computation. Along the way, I'll share some interesting observations derived from Google's web data. Jeff Dean joined Google in 1999 and is currently a Distinguished Engineer in Google's Systems Lab. While at Google he has worked on Google's crawling, indexing, query serving, and advertising systems, implemented several search quality improvements, and built various pieces of Google's distributed computing infrastructure. Prior to joining Google, he was at DEC/Compaq's Western Research Laboratory. He received a Ph.D. from the University of Washington in 1996 working with Craig Chambers on compiler optimization techniques for object-oriented languages.

Share
twitter facebook
- Re:UW mirror (Score:1)
  
  by tcoady ( 22541 ) writes:
  
  That's /.ed too now, but I found http://rds.yahoo.com/S=96781308/K=google/v=2/SID=e /l=VDP/SIG=12g3ulj5p/EXP=1112630947/*-http%3A//wes ley.stanford.edu/multimedia/facultynews/google.ram / [yahoo.com] still works.
OK then where the hell is (Score:2, Interesting)

by Anonymous Coward writes:

proximity search (with adjustable range would be extra nice).

i.e.

((gopher OR shrew OR egret) AND -(mole OR newt)) NEAR(range) ((evil OR "satan incarnate") AND (roe AND -chicken))

"In Italy for thirty years under the Borgias they had warfare, terror, murder and bloodshed but they produced Michelangelo, Leonardo da Vinci and the Renaissance. In Switzerland, they had brotherly love; they had five hundred years of democracy and peace and what did they produce? The cuckoo clock." -- Orson Welles (1915--1985
G4/TechTV (Score:2, Insightful)

by totallygeek ( 263191 ) writes:

I wish that the technology channel actually had programs on technology like this. This could also work on Modern Marvels on History Channel. It would also work nicely on Discovery or PBS. It is time for television programming to amaze me again!
- Re:G4/TechTV (Score:1)
  
  by dipdewdog ( 873066 ) writes:
  
  if you have cable and there is a big university in your area, chances are they run research channel programs on their tv channel. i think research channel is also available on dish network.
- Re:G4/TechTV (Score:5, Insightful)
  
  by Schwarzchild ( 225794 ) writes: on Sunday April 03, 2005 @12:20PM (#12126717)
  
  Discovery channel is a shadow of its former self. They used to actually show science programs. Now all of their programming is merely Hotrod this or that.
  
  Parent Share
  twitter facebook
5.6 Mbps? (Score:2, Funny)

by Anonymous Coward writes:

Wow. If anything can melt a university web server surly a slashdot posting with a link to a 5.6 Mbps mpeg-2 stream on a Google talk is it.
I use Google at work (Score:2, Interesting)

by Dancin_Santa ( 265275 ) writes:

I was reading an article a year or so ago about the corporate offices of Google and how there is a projection of all the latest searches displayed in real time on the wall behind the receptionist.

Now I have some pretty important lists which I need to keep tight control over. The information really ought not be distributed outside my office. However, because of the nature of my business, I must do frequent searches using various search engines to fill in my lists.

How am I assured that my searches remain
- Re:I use Google at work (Score:5, Informative)
  
  by Anonymous Coward writes: on Sunday April 03, 2005 @11:09AM (#12126341)
  
  Now I have some pretty important lists which I need to keep tight control over. The information really ought not be distributed outside my office. However, because of the nature of my business, I must do frequent searches using various search engines to fill in my lists.
  
  If you want to keep something private, don't put it on the publicly accessible internet. Including searches. Duh.
  
  How am I assured that my searches remain anonymous and secure with Google?
  
  You aren't. Did you sign a contract to that effect? No.
  
  And frankly, if you can find things with google, it isn't too secret.
  
  Parent Share
  twitter facebook
  - Re:I use Google at work (Score:2)
    
    by tomhudson ( 43916 ) writes:
    
    ... or write your OWN search engine. If you have specific sites you want to keep on top of, it's not THAT hard.
- Re:I use Google at work (Score:4, Funny)
  
  by 0x461FAB0BD7D2 ( 812236 ) writes: on Sunday April 03, 2005 @11:13AM (#12126361) Journal
  
  You happen to be only one of the millions of people searching for adult pictures online.
  
  You are about as anonymous as it gets.
  
  Parent Share
  twitter facebook
- Re:I use Google at work (Score:2, Funny)
  
  by ggvaidya ( 747058 ) writes:
  
  The receptionist signs an NDA promising to never turn around ... :P
- Re:I use Google at work (Score:1, Funny)
  
  by Anonymous Coward writes:
  
  That must be one embaressed receptionist. And lots of embaressed visitors.
  - Re:I use Google at work (Score:2)
    
    by cbreaker ( 561297 ) writes:
    
    I'm guessing that they filter out profanity and 'adult' searches before putting them on the screen.
- Re:I use Google at work (Score:5, Insightful)
  
  by TheLink ( 130905 ) writes: on Sunday April 03, 2005 @11:33AM (#12126473) Journal
  
  a) Don't use Google.
  b) Use a different anonymizing proxy for _each_ single search, preferably using SSL.
  c) Assume your searches AND non-encrypted web requests aren't anonymous and secure.
  
  If I were running the NSA or some other spook agency, I'd tap the pipes leading to Google (and a few other sites too).
  
  Same if I were a dubious org/agency.
  
  Lots of finance institutions/orgs/ppl get the bulk of their info from just a few sources e.g. Bloomberg. So if Bloomberg gets/sends the bulk of their info down just a few pipes... ;)
  
  Parent Share
  twitter facebook
- Re:I use Google at work (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  Muwahaha
  
  Hi Receptionist, Im looking at you" [google.com]
Few women in CS. (Score:3, Interesting)

by Seumas ( 6865 ) writes: on Sunday April 03, 2005 @10:58AM (#12126303)

So, I'm always reading about how unfair the tech world is, because there are so few women joining it. But if you watch the video, the audience is surprisingly full of them.

Share
twitter facebook
- 50% female is the goal (Score:5, Interesting)
  
  by Flamesplash ( 469287 ) writes: on Sunday April 03, 2005 @01:12PM (#12126980) Homepage Journal
  
  When google was recuiting at Georiga Tech they stated that one of their founders had the 'vision' of having half of google female in the near future.
  
  One of the thecnical female googerls mentioned how that was probably impossible, but by shooting for the impossible you acheive a lot more than you would have otherwise.
  
  Parent Share
  twitter facebook
  - - Re:50% female is the goal (Score:3, Informative)
      
      by Flamesplash ( 469287 ) writes:
      
      they are. They, nor I, stated otherwise. This is exactly why the engr said it would be impossible. To be able to sway 1500 competent female engr is not exactly doable, especially since google is growing a lot now too. They have high standards for their hiring in general, they often make a number of false negatives in hiring because they don't want to waste resources on a potential false positive.
  - - Re:50% female is the goal (Score:2)
      
      by Mac Degger ( 576336 ) writes:
      
      Wow: you are shortsided. Especially for a copmpany like google, if they go all-male, who is going to provide the human-interaction insight? There is a point of view which women bring to the game (as well as other things, like influencing the corperate cultere) which men just can't provide...especially when we're talking geeks. A female geeks perspective to UI, and maybe even just to the basic question of 'what would be a usefull thing to do with all that information we have' is worth the minor hassle of mat
      - Re:50% female is the goal (Score:2)
        
        by Omestes ( 471991 ) writes:
        
        You do realize that your post was actually more sexist (in the literal sense, not the namby-pamby PC sense) than the parents. I really don't see women as bringing any benefits unique, or intrinsic to them as a gender. I am quite sick of the view that women offer something special that men don't, it is just as sexist as saying that they don't offer as much. I have yet to meet a female who can offer (outside the sexual arena) something that a male could not.
        
        A mixed workplace, though, is a more healthy pla
        
        Re:50% female is the goal (Score:2)
        
        by Mac Degger ( 576336 ) writes:
        
        No problem...don't forget discussion is a good thing. Keeping that in mind:
        
        I'm a pragmatists. I see differences, and say so. That makes me a realist, not a sexist, tthe latter which implies that I have a bias for one sex over another (which I only have in very specific instances, like choosing someone for a specific job [after applying knowledge of the specifc persons involved] or as a sexual partner). To deny those differences seems to be a PC mentality which litteraly pervades the US (I'm making that ass
        
        Re:50% female is the goal (Score:2)
        
        by Omestes ( 471991 ) writes:
        
        Ugh, I just saw my mistake, and I generally despise the blanket equality crowd. But I guess my mistake advances argument, so all is not lost.
        
        While (ignoring my previous mistake) I agree that there are definate physiological (and hence psychological) differences between genders, I don't think that these differences matter much practically. I have not heard of any psychological, or cognitive feat that a female could do better than a male, or visa versa (ignoring, of course nursing and childbirth). Sure, e
        
        Re:50% female is the goal (Score:2)
        
        by Tlosk ( 761023 ) writes:
        
        Interestingly enough the equal pay drive has hurt them the most in this area. Now ideally it is equal pay for equal work, but as you've pointed out, on average women don't provide an equal amount of work.
        
        Which means paying them the same is paying them more than what you would pay for a conmensurate amount of male labor.
        
        This was balanced before by paying women less. Of course now you have lots of women who are unmarried and childless and contribute just as much as any man would. So it would be problematic
        
        Re:50% female is the goal (Score:2)
        
        by Omestes ( 471991 ) writes:
        
        And what would this disadvantage be, pre tell?
        
        Re:50% female is the goal (Score:2)
        
        by Tlosk ( 761023 ) writes:
        
        The specific "damage" will naturally vary from industry to industry, in some there will be negligble damage. Any time you arbitrarily limit your employment pool by an employment nonspecific criteria you are reducing the pool of qualified and above applicants. On average, some of those excluded will have qualities not possessed by those remaining in the pool.
        
        If your business is sweeping floors, then arbitrary reductions in the available employment pool probably isn't going to hurt you much if any. But there
        
        Re:50% female is the goal (Score:2)
        
        by Omestes ( 471991 ) writes:
        
        I don't know, I think that there are definate disadvantages to a discriminatory work place, but your argument seems rather weak. In any hiring practice you must limit (sometimes arbitrary) the pool of available workers, especially in todays market where the amount of available workers outweigh the amount of available jobs. In doing this, all companies would be getting less effecient/competative/whatever, and this would continue as long as some form discrimination is in place.
        
        Is "I don't like the look of
Content-based search (Score:1)

by ardor ( 673957 ) writes:

I wonder when content-based search for media will be possible. Content-based image retrieval for example.
- Re:Content-based search (Score:1)
  
  by JCOTTON ( 775912 ) writes:
  
  I wonder when content-based search for media will be possible. Content-based image retrieval for example.
  Waddaya wanna do? Draw a stick figure in MS Paint and have it find your next dream date on Frumster? In your dreams, habibi....
Google & Backup (Score:3, Interesting)

by Anonymous Coward writes: on Sunday April 03, 2005 @10:59AM (#12126311)

I wonder how Google backups its data -- especially the Gmail data. Does the GFS support automatic replication?

Share
twitter facebook
- Backups are for pussies. (Score:2, Funny)
  
  by Anonymous Coward writes:
  
  Real men don't do backups.
- Re:Google & Backup (Score:1)
  
  by 0x461FAB0BD7D2 ( 812236 ) writes:
  
  They cache it. For example, here's a Google cache [64.233.179.104] of Google [google.com].
- Re:Google & Backup (Score:2, Insightful)
  
  by Seumas ( 6865 ) writes:
  
  Um... the data is replicated across multiple machines in the datacenter and then again across multiple datacenters, of which they have many globally. Not really a need to backup that data. I'm sure the gmail stuff is done in a similar way.
Images of clowns (Score:2, Interesting)

by saskboy ( 600063 ) writes:

"Behind the scenes at Google" invokes images of clowns and mimes. Is it just me? Imagine all the people in the world who haven't used the Internet, they probably would get the same impression from the phrase too.
- Re:Images of clowns (Score:2, Funny)
  
  by Frankie70 ( 803801 ) writes:
  
  "Behind the scenes at Google" invokes images of clowns and mimes. Is it just me?
  
  Yup - it's only you.
GFS (Score:2, Insightful)

by woah ( 781250 ) writes:

I've never realised that GFS was developed by Google. I've come to know about it because I was building an OpenMosix cluster. At the time OpenMosix had their own distributed filesystem called MFS. But it's proved inadequate, which is why they are switching to GFS
It's quite nice to see a large corporation make a contribution to Open Source, especially in such a "R&D-esque" field as supercomputing.
Who said that Open Source only rehashes existing technologies and never does anything new?
- Re:GFS (Score:1, Funny)
  
  by Seumas ( 6865 ) writes:
  
  I've never realised that GFS was developed by Google
  
  So what did you think the G stood for? :P
  - Re:GFS (Score:2, Informative)
    
    by warkda rrior ( 23694 ) writes:
    
    RedHat has something called GFS -- the Global File System [redhat.com].
- Re:GFS (Score:4, Interesting)
  
  by AKAImBatman ( 238306 ) * writes: <<akaimbatman> <at> <gmail.com>> on Sunday April 03, 2005 @11:26AM (#12126431) Homepage Journal
  
  At the time OpenMosix had their own distributed filesystem called MFS. But it's proved inadequate, which is why they are switching to GFS
  
  I'm sorry, did I miss the point at which Google made an open source implementation of GFS? Last I knew, the only docs for GFS were the papers that Google published on the concept. And those papers (unfortunately) seemed to lack a few of the finer details of implementation.
  
  Parent Share
  twitter facebook
- Re:GFS (Score:5, Informative)
  
  by AKAImBatman ( 238306 ) * writes: <<akaimbatman> <at> <gmail.com>> on Sunday April 03, 2005 @11:42AM (#12126508) Homepage Journal
  
  Ok, I looked it up. You're confusing Sistina's (now Red Hat) Global File System with the Google File System. The two ARE NOT THE SAME.
  
  Here's Red Hat:
  
  http://www.redhat.com/software/rha/gfs/ [redhat.com]
  
  Here's Google:
  
  http://www.cs.rochester.edu/sosp2003/papers/p125-g hemawat.pdf [rochester.edu] (PDF)
  http://64.233.161.104/search?q=cache:m0TMQYgIlIoJ: www.cs.rochester.edu/sosp2003/papers/p125-ghemawat .pdf+Google+File+System&hl=en&client=safari [64.233.161.104] (HTML)
  
  Parent Share
  twitter facebook
  - - Re:GFS (Score:4, Insightful)
      
      by AKAImBatman ( 238306 ) * writes: <<akaimbatman> <at> <gmail.com>> on Sunday April 03, 2005 @11:58AM (#12126600) Homepage Journal
      
      I mean, they are both distributed filesystems with the same name. What are the odds? ;)
      
      Considering that it's in vogue to name file systems with one letter in front of "FS"? About 1 in 26. The odds are even better if you discount commonly used file systems such as XFS, UFS, FFS, NFS, and JFS.
      
      Parent Share
      twitter facebook
- Re:GFS (Score:1)
  
  by woah ( 781250 ) writes:
  
  Yes, it's true, I had a brainfart.
  As other people pointed out, and rightly so, I was wrong. I was, of course, taliking about the Global Filesystem (GFS), which has nothing to do with Google and everything to do with Red Hat.
  But, I knew that, and was just keepin' y'all on yer toes. Just doin' mah job Mam, by helping the /. stay sane and alert.
  - Re:GFS (Score:1)
    
    by woah ( 781250 ) writes:
    
    * /. community
mediocre or no Linux support! (Score:1, Flamebait)

by bogaboga ( 793279 ) writes:

Mediocre or no Linux support is what I find on the video link provided by the story. Why? I hear Google relies on Linux a lot. If this is true, why is Linux support very disappointing? The same applies to GMail, and oh, even Yahoo!
- Re:mediocre or no Linux support! (Score:3, Informative)
  
  by Servo ( 9177 ) writes:
  
  Like any tech company, they went with the biggest platform first. Gmail works on non-Windows browsers now. It just took them a while.
- Re:mediocre or no Linux support! (Score:3, Insightful)
  
  by drsquare ( 530038 ) writes:
  
  Why would using Linux within your own company have anything to do with providing support for people using Linux for a video link in a story? You'd have a point if the story was aimed at people within their company who were using Linux, but it's not, so your point is completely irrelevent.
  - Re:mediocre or no Linux support! (Score:2)
    
    by bogaboga ( 793279 ) writes:
    
    What about in addition to mentioning Windows this or Windows that...or even Apple quicktime this or that, a link was added for Kaffeine, MPlayer, Totem or any other Linux video player? Is that hard to understand/see? Heck...
WTFV? (Score:5, Funny)

by Anonymous Coward writes: on Sunday April 03, 2005 @11:05AM (#12126331)

Whoa, whoa.. it's hard enough for us to RTFA but now we've got to WTFV (an hour long one too)?

The average slashdotter has an attention span of 5 secon.. ooh look a birdie!

Share
twitter facebook
- - MiMMS (Score:3, Informative)
    
    by Kristoffer Lunden ( 800757 ) writes:
    
    Found directly in Ubuntus repositories, you probably have it in many others too:
    
    MiMMS, formerly called "mmsclient", is a simple client to download
    streaming audio and/or video media from the internet uscodeing the MMS
    protocol (i.e. from mms:// type URLs, generally found in asx files).
    Downloaded streams can then be replayed offline at your leisure,
    using any compatible media player of your choice.
    
    mimms mms://media-wm.cac.washington.edu/ifs/uw_cse05_goo gle_1300k.asf
    
    Of course, a torrent would be even bette
jeez, more goog fluff? (Score:1, Funny)

by t_parker16 ( 154804 ) writes:

one word: short any rally.
here is a transcript of the first 12 minutes (Score:3, Informative)

by Anonymous Coward writes: on Sunday April 03, 2005 @11:32AM (#12126464)

Here are the first 12 minutes typed out. i'm sorry i can't do the rest, but open the video and skip forward to 12:00 and go from there. i hope that these 12 minutes of my life typing this will save at least 2 other people 12 minutes of theirs.

(speech from this point...)
lots of people use google but i want to give you a flavour for what happens and what we are working on for our new systems and products. i'll focus on what are the interesting problems that crop up when you organize large amounts of information, like we do, and what you can do with lots of data and computational resources. i'll also talk about our engeneering organization.

google ha a mission statement that i like - to organize the worlds information and make it universally accessible and useful. we've moved from web searching to mail and news and searching books by scanning/ocr'ing them. this mission statment covers everything and means we won't run out of work!

a lot of our issues are to do with scale. we have 4B webpages with average 10kb/page, and lots and lots of searches per sections. it's a big problem but you solve it with lots of computers and disks and network them well.

dealing with scale comes about in a number of areas. hardware/network; what do you use. distributed systems; dealing with unreliable things. algorithims/structures; processing efficiently and in interesting ways. machine learning/info retrevial; improving quality of results by analyzing lots of data. user interfaces; we haven't done much on this yet but it would be interesting to provide new and interesting ways to naviage and refine the query by doing better things than just typing in new query words - i'd expect to see more developments in this area.

one thing we've made a decision about is that we tend to build on low cost commodity PCs. example setup: ibm eserver xseries 440, 8 2-ghz xexon, 64GB ram 8TB disk = 758,000. we use this: 88 machines that total, 172 2-ghz xeons, 176 GB ram, ~7TB = 278,000. this is 1/3x price, more cpu.

google was founded in 97 by two people at stanford working on interesting ways to use the search, but needed new hardware to do this. they'd go to the loading dock and offer to setup machine for other reasearch projects - but keep them for a while themselves to get work done. over time google was formed in 1999, and we've learned a lot since then - such as how to scale better and have good datacenter practices.

hosting centers were charging for the square foot, which is strange since their costs come from things like cooling and electricity so we got good at putting a lot of servers in one place. we know are very good at setting up large clusters quickly, such as our gigantic 2001 datacenter move configured in 3 days.

if you have that many machines you have to worry about failure. one machine might fail every thousand days, but thousands of machines mean at least a failure a day. you have to deal with this in software with replication and redundancy. one nice property of dealing with this problem is that having six copies for capacity reasons also means we now have six copies available for distributed application and load balancing. a lot of the applications we deal with are read-only, which helps handling so many querys easy.

Share
twitter facebook
- doesn't do it justice (Score:2)
  
  by adpowers ( 153922 ) writes:
  
  I think you get more out of if it from watching the video. Not only are there graphs and pictures at some points (like pictures of Google over the years), but you get to hear all the little jokes Jeff Dean makes (he is a pretty funny guy). Also, near the end they show a neat behind-the-scenes interface where you can look at automatically formed clusters of information. It clusters words or ideas together, which is probably used by things like Google Sets and their search engine (try searching for [lotr], it
the director... (Score:4, Funny)

by Stalyn ( 662 ) writes: on Sunday April 03, 2005 @11:33AM (#12126467) Homepage Journal

can anyone confirm that Leni Riefenstahl [wikipedia.org] was behind this film?

Share
twitter facebook
Pfffft. (Score:3, Funny)

by Das Auge ( 597142 ) writes: on Sunday April 03, 2005 @11:40AM (#12126498)

Thats no secret, it's pigeons.

Share
twitter facebook
- Re:Pfffft. (Score:2)
  
  by evilmousse ( 798341 ) writes:
  
  ahaha i can't beleive i didn't think of that right away--great call.
  
  mods, this is funny, he's referring to google pidgeon rank [google.com]
Finally... (Score:1)

by DeathAndTaxes ( 752424 ) writes:

A /. article where we shouldn't hear a whole bunch of "RTFA" posts. ;-) WTFM? Dunno if that's as catchy.
Behind the scenes? (Score:5, Interesting)

by Anonymous Coward writes: on Sunday April 03, 2005 @11:57AM (#12126595)

Disclaimer: my opinions expressed herein are not necessarily those of Google, Inc.

That having been said, as a long time insider I have a pretty good idea about what really happens "behind the scenes" and let me tell you, both conspiracy theories crackpots and our slashdot fanboys are quite amusing, but the boring fact is that we are neither trying to take over the world, nor are we the best thing since the second coming of Jesus.

We used to be a very successful startup, yes, and now we are a fairly successful corporation. Yes, there are a lot of smart people working here, but don't fool yourself, "the most interesting challenges in computer science" are happening in academia, not in corporations. (Besides, anyone who knows Jeff is perfectly aware that he often tends to grossly exaggerate our importance, but to be honest that is a part of his job which he is doing really great.)

All in all, I love to work here, I thing there are a lot of very smart people here, but if you think that we are the only place on the planet where geniuses cluster lately, you are just not being reasonable. If you want to find real discoveries you have to look in places where people don't have shareholders telling them what to do. The point is that we haven't done anything new per se, only the scale of our implementations is unprecedented.

For example, in my 20% time (Google allows us to spend 20% of paid work time on personal projects) I am working with KeyKOS right now and let me tell you, this is what I call innovation. It was done in the '70s and no mainstream OS has implemented its ideas to this day so far. I'm sure that when after a decade or two a Big Corporation (be it Google, Microsoft, Apple, or IBM) reimplements KeyKOS, the Slashdot crowd will wet their pants screaming "wow, what an innovation!" completely forgetting that it was an innovation back in the '70s of the 20th century when Norm Hurdy et al. were working on it quitely with no buzz and fanfares. Please remember that "The Next Big Thing" is always an old idea but this time backed with $$$ and marketing. Please never forget it, or otherwise the people who are worth their salt will only consider you uneducated.

Share
twitter facebook
- Re:Behind the scenes? (Score:2, Interesting)
  
  by Fall into This ( 847574 ) writes:
  
  This has got to be the best post I've read about Google. I am so friggin' sick of hearing BS about "Google's Gmail is EVIL!!!!!!!!111111!!!!x0rz! Just READ their terms!!" and the such. Woopdie doo, read Yahoo's. Speaking of which, no one seems to be bitching about Yahoo's 'evils.' Seems to me that if Google's actions are so borg...ish, then why have other search engines not been brought up? Google Maps comes out, all I hear is "another step towards monopolization." Yahoo Maps, no one seems to give a
- Re:Behind the scenes? (Score:2)
  
  by This is outrageous! ( 745631 ) writes:
  
  "the most interesting challenges in computer science" are happening in academia, not in corporations.
  If you want to find real discoveries you have to look in places where people don't have shareholders telling them what to do.
  Unfortunately academia itself is increasingly under the spell of (well-meaning but) clueless administrators who believe science will magically happen if they drive OUT anything that doesn't claim immediate applications.
  Case in point, our Dean right here, who I wish could read wh
- - Background (Score:2, Interesting)
    
    by Pan T. Hose ( 707794 ) writes:
    
    A quick search on KeyKOS makes one wonder: Does it have anything in common with GNU's microkernel efforts? Anyone cares to post a brief overview of KeyKOS, possibly in connection and/or comparison to Mach/HURD?
    Short answer: yes it does, and it is actually one of the main reasons why I look forward to use Debian GNU/Hurd [debian.org] in the future. Let me quote my old post [slashdot.org] from January with some background and interesting links to more informations about KeyKOS:
    Still, you can't block every hole in security
  - EROS (Score:2)
    
    by Pseudonym ( 62607 ) writes:
    
    You should check out EROS [eros-os.org], which is an open source OS based on KeyKOS (but updated a bit).
- - Re:But.. what is it? (Score:2)
    
    by ezzzD55J ( 697465 ) writes:
    
    "KeyKOS ® is a persistent, pure capability operating system." Doesn't tell me (a non-CS major) anything useful about it at all.
    Take a look at: [eros-os.org] http://www.eros-os.org/ [eros-os.org] which is a modern (re)implementation of many of KeyKOS's ideas. Fantastic ideas.
University Recruiting Talks (Score:4, Interesting)

by stevemm81 ( 203868 ) writes: on Sunday April 03, 2005 @12:11PM (#12126669) Homepage

Google is constantly giving talks like this at universities. I saw one at Harvard back in the fall.
They aren't really news worth reporting on slashdot, since they all contain the same content.

Share
twitter facebook
Equal Time (Score:5, Informative)

by DanielMarkham ( 765899 ) writes: on Sunday April 03, 2005 @12:30PM (#12126785) Homepage

Hey -- I love Google. Use it every day, and I think they're doing some really neat stuff. But this was an hour-long commercial for Google - -to me it looked designed to recruit from college campuses. While I think it's great that Google does this (it sure sounds like a great way to get cheap qualified labor) is it really new or interesting? Or even geeky? So we have redundant clustering, LISP-like patterns, and issues of dealing with BIG stuff. Hasn't the industry already done all of this, like dozens of times? You can't tell me VISA international doesn't handle this size data, or that General Motors doesn't have some of the same scaling issues. I read somewhere that Wal-Mart has one of the biggest computer systems in the world. To me the signal-to-noise ratio was out of whack to make it worth an hour of my time. Just my opinion folks.

Share
twitter facebook
"high-quality search requires" (Score:2)

by l3v1 ( 787564 ) writes:

poses some of the most interesting challenges in computer science and information theory and application, database theory and application and some more. It is quite a nice wide area of possible R&D with great prospects for everyone, be them starters or veterans. And please don't say C.S. includes all that (especially since bashing if I.T. degrees on /. is so fashionable these days), it doesn't.
Has anyone else noticed that... (Score:2, Funny)

by omeomi ( 675045 ) writes:

...Google seems to be down a lot lately? Like right now, I can't seem to get to it...what's with that?
Booooooooooring... (Score:3)

by Dan East ( 318230 ) writes: on Sunday April 03, 2005 @03:16PM (#12127697) Journal

Considering that there isn't any magical alchemy going on behind the scenes, google is in fact pretty boring. The only thing interesting is the scale of the operation.

Dan East

(finally able to post for the first time in two weeks - wonder if anyone else had a problem)

Share
twitter facebook
Google innovates? It's news to me. (Score:5, Interesting)

by danila ( 69889 ) writes: on Sunday April 03, 2005 @04:15PM (#12128044) Homepage

May be Google has done some nifty things with their file-system, but can't we forget about it already? Their search hasn't changed much http://www.google.com/ [archive.org]">in the past six years. Of course, the fanboys will salivate over Google calculator [google.com] and Google unit converter [google.com], but on the scale of Internet these "innovations" barely register.

Some of the other search engines are comparable in quality to Google (Teoma [teoma.com], Vivisimo [vivisimo.com]), and may be better, depending on how many points you take away from Google for spam-infested results, too many blogs, too many Wikipedia clones, too many commercial sites, etc. And some sites are so much further on the innovation scale (meet BrainBoost [brainboost.com], an artifically intelligent Internet reference desk answering any questions asked in natural English, with amazing quality and accuracy in a very friendly and usable interface) that they put Google to shame.

Share
twitter facebook
More Video Links (Score:2)

by aallan ( 68633 ) writes:

I found this video back in February, isn't this a dupe? Anyway my blog post [babilim.co.uk] about it also has a link to good paper on the Google File System written up for the 19th ACM Symposium on Operating Systems Principles, along with video of the talk the Google guys gave at the symposium.
You might also want to have a look at my post on Eric Schmidt talking about Google [babilim.co.uk] to the Stanford Business School. The post also has a link to a video of Urs Hölzle talking to the University of Washington about clustering at
The Easy Solution (Score:2)

by Nom du Keyboard ( 633989 ) writes:

poses some of the most interesting challenges in computer science.
Just throw some more hardware at it.
- Re:want real dirt? go to www.fuckedgoogle.com (Score:2, Insightful)
  
  by LegionX ( 691099 ) writes:
  
  This page strikes me as dumb and deliberately one sided.. and surprise: nothing everyone hasn't heard before! (except for the cheesy bad humour). Everyone their taste, but show me some real dirt please.
  - Re:want real dirt? go to www.fuckedgoogle.com (Score:3, Insightful)
    
    by drsquare ( 530038 ) writes:
    
    This page strikes me as dumb and deliberately one sided..
    
    Just like Slashdot then? Except this fuckedgoogle site has the opposite viewpoint. How is it OK to be biased in one direction, but not the other? Why is it that some people on this site seem to have a vested interest in quashing any criticism of their favourite giant corporation? What have you got to hide?
- Dirt? That more like modelling clay (Score:4, Insightful)
  
  by TheLink ( 130905 ) writes: on Sunday April 03, 2005 @11:46AM (#12126531) Journal
  
  Given the bias of the site if that's all the dirt they can dig up, Google must be a pretty good company, and/or the people at that site are just crap at digging up dirt.
  
  Think about it, if someone really hated any of the Fortune 500 companies and bothered to dig up some dirt, there'd be tons more dirt.
  
  I suppose Google is a young company. Give it a few more years and more parasites would have found their way into Google. Then you'd have a lot more dirt.
  
  Parent Share
  twitter facebook
  - Re:Dirt? That more like modelling clay (Score:2, Interesting)
    
    by Tibe ( 444675 ) writes:
    
    You think Google make their money from AdSense? AdWords? etc.?
    
    Google has resorces and expertise beyond most companies, possibly including Redmond.
    
    They have at their fingers the most up-to-date information, opinions, numbers, rantings of most of the world. Do they use this to make income? I bet.
    
    Banks already analize thier data and invest accordingly, Google are bound to do the same. (A la Google news.) With their expertise it is likely to be far more advanced and therefore more profitable.
    
    They don't need
    - Re:Dirt? That more like modelling clay (Score:3, Insightful)
      
      by Mac Degger ( 576336 ) writes:
      
      I always love these rants: my brother thinks the same thing. But there is one thing you forget: Google is now a public company; a corporation. Expecially at the time just before IPO, their whole business was public...you wanted to know how Google got it's money? You shoulda read the prospectus and assorted extra materials. You read anything about a 'pre-emptive investment department' operating on webbased intel? No, you didn't, nor anything even slightly similar.
      
      So either put up (evidence) or shut up.
- Re:want real dirt? go to www.fuckedgoogle.com (Score:1)
  
  by Frankie70 ( 803801 ) writes:
  
  This page strikes me as dumb and deliberately one sided
  
  That's why he called it the other side of the story - as compared to /. /. with to google is dumb & deliberately one side - just the other side.
- Re:the video is slashed someone post a bittorrent (Score:2, Funny)
  
  by Anonymous Coward writes:
  
  Here's a summary of the most interesting part [photobucket.com].

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Google's dirty secret revealed (Score:5, Funny)

Re:Google's dirty secret revealed (Score:2, Funny)

Re:Google's dirty secret revealed (Score:3, Funny)

Re:Google's dirty secret revealed (Score:2)

Re:Google's dirty secret revealed (Score:2)

Network everybody together, eh? (Score:5, Funny)

Re:Google's dirty secret revealed (Score:5, Funny)

Re:Google's dirty secret revealed (Score:1)

What -- I Have To Watch TV Now? (Score:5, Funny)

Fsking video format. (Score:2, Insightful)

Re:Fsking video format. (Score:2)

Re:Fsking video format. (Score:1, Redundant)

Re:Fsking video format. (Score:4, Insightful)

Re:Fsking video format. (Score:1)

Re:Fsking video format. (Score:1)

Re:Fsking video format. (Score:2)

Re:Fsking video format. (Score:2)

Re:Fsking video format. (Score:2)

Re:Fsking video format. (Score:1)

Re:Fsking video format. (Score:2, Insightful)

http://justfuckinggoogleit.com/ ... (Score:2, Funny)

Re:Fsking video format. (Score:4, Informative)

Re:Fsking video format. (Score:1)

UW mirror (Score:4, Informative)

Re:UW mirror (Score:1)

OK then where the hell is (Score:2, Interesting)

G4/TechTV (Score:2, Insightful)

Re:G4/TechTV (Score:1)

Re:G4/TechTV (Score:5, Insightful)

5.6 Mbps? (Score:2, Funny)

I use Google at work (Score:2, Interesting)

Re:I use Google at work (Score:5, Informative)

Re:I use Google at work (Score:2)

Re:I use Google at work (Score:4, Funny)

Re:I use Google at work (Score:2, Funny)

Re:I use Google at work (Score:1, Funny)

Re:I use Google at work (Score:2)

Re:I use Google at work (Score:5, Insightful)

Re:I use Google at work (Score:3, Funny)

Few women in CS. (Score:3, Interesting)

50% female is the goal (Score:5, Interesting)

Re:50% female is the goal (Score:3, Informative)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Re:50% female is the goal (Score:2)

Content-based search (Score:1)

Re:Content-based search (Score:1)

Google & Backup (Score:3, Interesting)

Backups are for pussies. (Score:2, Funny)

Re:Google & Backup (Score:1)

Re:Google & Backup (Score:2, Insightful)

Images of clowns (Score:2, Interesting)

Re:Images of clowns (Score:2, Funny)

GFS (Score:2, Insightful)

Re:GFS (Score:1, Funny)

Re:GFS (Score:2, Informative)

Re:GFS (Score:4, Interesting)

Re:GFS (Score:5, Informative)

Re:GFS (Score:4, Insightful)

Re:GFS (Score:1)

Re:GFS (Score:1)

mediocre or no Linux support! (Score:1, Flamebait)

Re:mediocre or no Linux support! (Score:3, Informative)

Re:mediocre or no Linux support! (Score:3, Insightful)

Re:mediocre or no Linux support! (Score:2)

WTFV? (Score:5, Funny)

MiMMS (Score:3, Informative)

jeez, more goog fluff? (Score:1, Funny)

here is a transcript of the first 12 minutes (Score:3, Informative)

doesn't do it justice (Score:2)

the director... (Score:4, Funny)

Pfffft. (Score:3, Funny)

Re:Pfffft. (Score:2)

Finally... (Score:1)

Behind the scenes? (Score:5, Interesting)