Digital Future of the Library of Congress 141

Posted by Zonk on Friday March 25, 2005 @12:15PM from the yay-for-learning dept.

lesinator writes "On Monday the 28th the US Library of Congress is holding the eighth lecture in its series on Managing Knowledge and Creativity in a Digital Context. Previous speakers include David Weinberger on blogging, Brewster Kahle - founding member of archive.org and the wayback machine, and Lawrence Lessig on intellectual property and the creative commons. After the lecture questions will be taken from the audience and the internet. C-Span will be broadcasting the lecture live at 6:30 PM EST, and also has archives of previous lectures. Audio archives of previous lecture are available at Audible.com in the Selected Free Media section."

This discussion has been archived. No new comments can be posted.

Digital Future of the Library of Congress

Load All Comments

Search 141 Comments Log In/Create an Account

Comments Filter:

At last! (Score:3, Funny)

by Shadow Wrought ( 586631 ) writes: <shadow.wrought@NOSPam.gmail.com> on Friday March 25, 2005 @12:18PM (#12046615) Homepage Journal

We'll know just how much storage really is required to hold the Library of Congress.

Share
twitter facebook
- Re:At last! (Score:5, Insightful)
  
  by cmburns69 ( 169686 ) writes: on Friday March 25, 2005 @12:28PM (#12046708) Homepage Journal
  
  While it's an interesting question, it really depends on how you want to store the contents of each book.
  
  Would you store each page of each book as an image? As flat ASCII text (except of pictures and diagrams, of course!)? What kind of indexing would you do? Basic indexing of book names? Full-text indexing of the contents? All that storage adds up!
  
  In summary, the library of congress (depending on the method used) could probably fit into something ranging from a couple of gigabytes to a couple of petabytes.
  
  Parent Share
  twitter facebook
  - Re:At last! (Score:5, Interesting)
    
    by Shadow Wrought ( 586631 ) writes: <shadow.wrought@NOSPam.gmail.com> on Friday March 25, 2005 @12:34PM (#12046764) Homepage Journal
    
    Well I owuld think that they would have to start with an image first. Once they OCR'd it and generated ascii text files, they could save a tremendous amoutn of space by simply deleting the images. However, after that much effort in imaging all those pages, I just can't see them doing that. The best bet is probably two databases, one of ascii text and one of images.
    They might even be able to generate revenue by having the ascii text freely available and searchable, while the images would cost money. That way folks just interested in the text can find it easily, while scholars and others who need to see the source material can have access at a moderate price.
    
    Parent Share
    twitter facebook
    - Yes, and yet...no. (Score:3, Insightful)
      
      by oneiros27 ( 46144 ) writes:
      
      You're making a large number of assumptions in your first paragraph:
      
      The OCR is always correct.
      
      The documents could be represented in ASCII
      The text is the only part of the document with any value
      Of course, your second paragraph shows that clearly those assumptions can't be true -- why would someone pay more for something without an additional benefit?
      
      And you wouldn't maintain seperate databases -- pictures aren't searchable. You'd want to use any OCRd (preferably vetted afterwards) as the basis for inde
      - Re:Yes, and yet...no. (Score:2)
        
        by Shadow Wrought ( 586631 ) writes:
        
        I did make quite a few assumptions, but it is, after all, a thought process. The actual image would have greater value for some people than for others. If you want to read Moby Dick, you can do that in just a text format, you don't have to view the actual images (which are significantly larger files). If, on the other had, you are doing your doctorate thesis on Moby Dick than you will likely want to view the actual images. It is also more likely that you, since you require more than just the text, would
    - Re:At last! (Score:2)
      
      by cduffy ( 652 ) writes:
      
      Mmm. I've seen document archival formats available (patented, I think) taylored for printed documents -- using one of these, it should be possible to get your typical page well below 100K, and stay in that general range even with drawn illustrations (though anything w/ color or photo-style images is no longer suitable). DjVu is a prime example of these, though others exist.
      
      So keeping the scanned images shouldn't really require such a tremendous amount of space.
  - Re:At last! (Score:2)
    
    by NoMoreNicksLeft ( 516230 ) writes:
    
    Uncompressed text, maybe markup, and you're looking at about 20 terabytes I believe. Adding in the works with either illustrations or photographs, in some decent but lossy compressed format, and you're easily quadrupling that (just a guess).
    
    Indexing, by what, subject, author, and title? 1% overhead at most. Fancier googlesque searching though, could be a big hit.
    
    And correct me if I'm wrong, but there are quite a few videos too.
    
    Not to mention some historical stuff that can't even be digitized all that wel
    - Re:At last! (Score:3, Interesting)
      
      by caseydk ( 203763 ) writes:
      
      I was working on this project just a few years back (2001-2002).
      
      Our estimates projected that by 2005, it would be take about 4 TB of digitization EACH day to keep pace.
      
      The first storage phase called for 180TB server.
  - Small representations. (Score:3, Interesting)
    
    by Grendel Drago ( 41496 ) writes:
    
    Have you ever seen someone's hundred and fifty page thesis, diagrams and all, fit onto a 3.5" floppy? People who wrote their theses in TeX or LaTeX, with a few postscript diagrams. I was impressed by how tiny the code for a real, well-produced book could be.
    
    'Course, the problem is that these representations work if you're entering in the content with that method in the first place.
    
    --grendel drago
  - Re:At last! (Score:1)
    
    by FlopEJoe ( 784551 ) writes:
    
    I can't stress this enough... whatever they do, for the love of God, DON'T USE .LIT!
  - Re:At last! (Score:2, Insightful)
    
    by aboyko ( 16319 ) writes:
    
    A couple of gigabytes?! Only if you burn it first. There's something like 10^8 books, nevermind the other stuff. How do you compress any given book into 100 bytes?
    
    The "20 TB" figure comes from the smallest possible measure, treating the flat books as ASCII text. Even just considering current digital content, it's also inaccurately small by >1 order of magnitude.
    
    It's a really really really big library.
- Re:At last! (Score:3, Interesting)
  
  by WillAdams ( 45638 ) writes:
  
  There's a cue for a question I've been wondering about for a while.
  
  What was the first reference / usage of ``LoC'' as a unit of knowledge measurement?
  
  The first time I recall seeing it was in Michael Gear's novels, _The Artifact_ if memory serves, ~1976.
  
  Anyone have an earlier instance?
  
  William
  - Re:At last! (Score:2)
    
    by lelitsch ( 31136 ) writes:
    
    Well, you could always google for this kind of information.
    
    10 Terabytes: Printed collection of the U. S. Library of Congress
    - Re:At last! (Score:1, Flamebait)
      
      by punkass ( 70637 ) writes:
      
      Go back and read his question. He was asking when was the term first used, not what it meant. Ass.
    - Re:At last! (Score:1)
      
      by DustMagnet ( 453493 ) writes:
      
      I just tried Google and it's not supported by Google math [google.com].
- Re:At last! (Score:1)
  
  by spectasaurus ( 415658 ) writes:
  
  I'm betting it takes about one Library of Congress to store the Library of Congress. Any takers?
Here's an idea related to audio archiving (Score:5, Insightful)

by filmmaker ( 850359 ) * writes: on Friday March 25, 2005 @12:19PM (#12046627) Homepage

Maybe the fine folks at audio.com might consider making their audio clips available by means other than the Real or MS media players?

Share
twitter facebook
- - Re:Here's an idea related to audio archiving (Score:2)
    
    by boarder8925 ( 714555 ) writes:
    
    How about MP3 or Ogg?
    - Re:Here's an idea related to audio archiving (Score:2)
      
      by zotz ( 3951 ) writes:
      
      OGG - theora and vorbis perhaps. Yes, I agree.
      
      "I am your father's brother's nephew's cousin's former roommate."
      
      Dude, I bremembah you now. Why didn't ya say so sooner?
      
      all the best,
      
      drew
      
      http://www.archive.org/search.php?query=creator%3A %22drew%20Roberts%22 [archive.org]
- That's the right idea .. carry it further (Score:5, Insightful)
  
  by Anonymous Coward writes: on Friday March 25, 2005 @12:30PM (#12046728)
  
  It is amusing that this story follows directly after a story about Microsoft proprietary file formats.
  
  The Library of Congress should insist that all 'publications' be submitted to it in open formats. What good is it if they have something on file that nobody can read! The extreme is that they have to have a licensed copy of every piece of software that ever created a file. If all the formats have to be open then at least historians can cobble together something that can read a file of interest.
  
  With the ip laws as stupid as they are now, we run the real risk of losing the record of our age.
  
  Parent Share
  twitter facebook
  - Re:That's the right idea .. carry it further (Score:1, Insightful)
    
    by Anonymous Coward writes:
    
    "...What good is it if they have something on file that nobody can read!..."
    
    I wouldn't say nobody. The paying members of a private club would be able to read it.
    - Re:That's the right idea .. carry it further (Score:2)
      
      by zotz ( 3951 ) writes:
      
      Not if the only people with the rights to make readers stop doing so.
      
      The government would then have to get into some emminent (SP?) domain type takings. Right?
      
      all the best,
      
      drew
  - Re:That's the right idea .. carry it further (Score:3, Insightful)
    
    by John Seminal ( 698722 ) writes:
    
    It is amusing that this story follows directly after a story about Microsoft proprietary file formats. The Library of Congress should insist that all 'publications' be submitted to it in open formats. What good is it if they have something on file that nobody can read!
    Why even have it on any digital media. I want the original records. Screw having computerized copies. This is the nations library, where a copy of everything in its' original form must be.
    I have no problem with the card catalogue system
- DRM and archiving are so diametrically opposed... (Score:4, Insightful)
  
  by PornMaster ( 749461 ) writes: on Friday March 25, 2005 @01:15PM (#12047137) Homepage
  
  DRM and archiving are quite conflicting. But then again, how do you make available information on which you want to retain technical methods of copyright protection?
  
  I think the obvious solution is to archive it in a non-DRM, non-proprietary format, but transcode to a DRM/proprietary format when retrieved, if the content is not in the public domain.
  
  Parent Share
  twitter facebook
- Re:Here's an idea related to audio archiving (Score:2)
  
  by zotz ( 3951 ) writes:
  
  I will second that request. If you are really trying to benefit the public, try a format that is Free please.
  
  all the best,
  
  drew
  
  I was indeed taught that "beggers can't be choosers," but I am not begging, just giving "a word to the wise."
  
  http://www.archive.org/audio/audio-details-db.php ? collection=opensource_audio&collectionid=JohnConst antakisdrewRobertsRainwaterBlues [archive.org]
Dammit! (Score:2, Insightful)

by dteichman2 ( 841599 ) writes:

What are they thinking! Airing this at 6:30 PM EST! CSpan has just ensured that nobody on the west coast will see this. Or, is that what they are aiming for?
- Re:Dammit! (Score:1)
  
  by BroadwayBlue ( 811404 ) writes:
  
  No VCRs, Tivo, mythTV, or tv-tuner cards out there? Or maybe just wait for the torrent when you get home?
  - Re:Dammit! (Score:1)
    
    by dteichman2 ( 841599 ) writes:
    
    Yes. Maybe the torrent. I've got the VCR, but no tape for it.
- Re:Dammit! (Score:5, Funny)
  
  by lukewarmfusion ( 726141 ) writes: on Friday March 25, 2005 @12:31PM (#12046734) Homepage Journal
  
  C-SPAN is clearly concerned with ratings. Didn't you see the stuff they pulled out for Sweeps week? I think it was something like "old guy reading boring text to empty room."
  
  Parent Share
  twitter facebook
  - Re:Dammit! (Score:1, Funny)
    
    by 0x461FAB0BD7D2 ( 812236 ) writes:
    
    Sounds like college to me.
- Re:Dammit! (Score:1)
  
  by Hachey ( 809077 ) writes:
  
  I live on the west coast (California) and I am going to watch it.
  
  -----
  Check out the Uncyclopedia.org [uncyclopedia.org]:
  The only wiki source for politically incorrect non-information about things like Kitten Huffing [uncyclopedia.org] and Pong! the Movie [uncyclopedia.org]!
- Re:Dammit! (Score:2)
  
  by gregorio ( 520049 ) writes:
  
  What are they thinking! Airing this at 6:30 PM EST! CSpan has just ensured that nobody on the west coast will see this. Or, is that what they are aiming for?
  
  From the submitter: C-Span will be broadcasting the lecture live at 6:30 PM EST, and also has archives of previous lectures. .
  
  Well, if it's a LIVE broadcasting, I'm pretty sure that C-Span will have to air it at whatever time the lecture will be happening. =]
  
  Chill out, they archive their boardcasts.
- Re:Dammit! (Score:2)
  
  by Scott7477 ( 785439 ) writes:
  
  I think parent was aiming for "Funny" and not "Insightful".
Nice, but how long? (Score:2, Funny)

by Anonymous Coward writes:

How long is it going to take to digitize the entire library?

Anyone have a good approximation? I'd like to know in Burning Libraries of Congress (BLC) please.

I'm guessing somewhere around 10-200 BLC.
- Re:Nice, but how long? (Score:4, Interesting)
  
  by yuriismaster ( 776296 ) writes: <(tubaswimmer) (at) (gmail.com)> on Friday March 25, 2005 @12:37PM (#12046808) Homepage
  
  Well, I would imagine that unless they have a massive staff and many OCR scanners or automation with REALLY good OCR, this may take a LOONNNG time.
  
  I'm not quite sure about the length of a BLOC, but this is a job for not-quite-manual labor. Each book requires a simple task: Scan page 1, flip page, scan page 2, page 3, flip, ad infinitum.
  
  One way to save on time would be to contact the publshers of any book made after 1985-ish, where you can get electronic copies from the author. Some older books may have been already digitized, but it's still going to take more than 25 years unless there's a massive army working on this.
  
  Parent Share
  twitter facebook
  - Re:Nice, but how long? (Score:4, Informative)
    
    by Blue-Footed Boobie ( 799209 ) writes: on Friday March 25, 2005 @12:55PM (#12046973)
    
    Nonsense. I put together solutions with high-speed scanners all the time. Some of our highest-end average 118ipm (Duplex) and have 1000pg ADFs.
    Also, you would generally split the load between 4-6 of these scanners for a job this big. The software is automated, and will OCR/Convert/Archive the file is one step.
    As a general rule, you can fit 10,000 b/w text pages in 1GB of storage.
    
    Parent Share
    twitter facebook
    - Re:Nice, but how long? (Score:1)
      
      by erick99 ( 743982 ) writes:
      
      That's great for loose sheets but what about scanning bound books? Aren't you then back to scanning a page, flip a page, scan a page, etc.?
      - Re:Nice, but how long? (Score:2, Informative)
        
        by Blue-Footed Boobie ( 799209 ) writes:
        
        Nope, Canon (and others) make Book Scanners with actually flip and scan each page automatically. They can handle all sizes too.
        They are very expensive, but cool as hell.
      - Re:Nice, but how long? (Score:1)
        
        by melandy ( 803088 ) writes:
        
        I used to work in that industry too. Typically, bound material would be cut into loose sheets... you basically sacrifice a book to get the images electronically. Also, any decent high volume scanner can scan both sides of a sheet at once, so there's no flipping.
        
        As an unrelated asidem some even scan in color, but your storage requirements go way up if you do anything other than bitonal (even greyscale eats up the bytes pretty quick).
      - Re:Nice, but how long? (Score:1)
        
        by AnFraX ( 809909 ) writes:
        
        That's great for loose sheets but what about scanning bound books? Aren't you then back to scanning a page, flip a page, scan a page, etc.?
        
        Cut the binding off?
  - Re:Nice, but how long? (Score:2)
    
    by sribe ( 304414 ) writes:
    
    I'm not quite sure about the length of a BLOC, but this is a job for not-quite-manual labor. Each book requires a simple task: Scan page 1, flip page, scan page 2, page 3, flip, ad infinitum.
    
    Uhmm, no. You cut the binding off and run the pages through a document feeder, then rebind the book, using these things that some people refer to as "machines" ;-)
Some ideas (Score:5, Insightful)

by gowen ( 141411 ) writes: <gwowen@gmail.com> on Friday March 25, 2005 @12:24PM (#12046669) Homepage Journal

Here an interesting talks they might give:

i) What if the Apostles had had technological means to prevent the reproduction of the New Testament?

ii) Would our culture be diminished if the people who rediscovered Beowulf had been unable to decrypt the manuscript?

iii) Is the continual repitition and reworking of myth and fable through the Oral Tradition disrespectful of the content creators who first recorded these stories?

Share
twitter facebook
- Re:Some ideas (Score:4, Insightful)
  
  by Scrameustache ( 459504 ) writes: on Friday March 25, 2005 @12:29PM (#12046719) Homepage Journal
  
  i) What if the Apostles had had technological means to prevent the reproduction of the New Testament?
  
  Main Entry: apostle
  Pronunciation: &-'pä-s&l
  Function: noun
  Etymology: Middle English, from Old French & Old English; Old French apostle & Old English apostol, both from Late Latin apostolus, from Greek apostolos, from apostellein to send away, from apo- + stellein to send
  1 : one sent on a mission: as a : one of an authoritative New Testament group sent out to preach the gospel and made up especially of Christ's 12 original disciples and Paul b : the first prominent Christian missionary to a region or group
  
  They wouldn't have prevented the distribution of the story their mission it was to distribute, that's for sure.
  
  Parent Share
  twitter facebook
  - Re:Some ideas (Score:1)
    
    by gowen ( 141411 ) writes:
    
    Ooops my bad. What's the collective noun for the dudes that wrote the Gospels?
    - Re:Some ideas (Score:4, Interesting)
      
      by Anonymous Coward writes: on Friday March 25, 2005 @01:25PM (#12047227)
      
      It's been continually re-written. For example, until 1954 Jesus never actually said "I am the Son of God"; when Pontius Pilate accused him of claiming to be the Jewish Messiah, he cryptically responded "It is you who said it." The fact Jesus didn't claim to be the Son of God but was surrounded by intense believers was one the essential "mysteries" of Christianity that you were supposed to accept as a Christian.
      
      In 1954, the American "New International" edition just editted the trial dialog and "re-interpreted" "it is you who said it" into "I am the Son of God." I don't think the European and Catholic churches have editted that part yet.
      
      Parent Share
      twitter facebook
      - Re:Some ideas (Score:1)
        
        by doctorcisco ( 815096 ) writes:
        
        If you actually go read Mark 15:2 [biblegateway.com] and John 18:37 [biblegateway.com] in the New International Version, which was not first published until the late 1970's, you'll find out that the previous poster is misinformed.
  - Publication of New Testament (Score:3, Interesting)
    
    by dpilot ( 134227 ) writes:
    
    Authorship of the New Testament is not a simple question at all. First off, the Apostles didn't sit down and start collecting the New Testament. That was done hundreds of years later by some chaps in Rome or Turkey who also had political axes to grind. Every few decades or centuries, there's also Yet Another Translation, and in the forward they talk about the prayer, consideration, and attempts to divine the True Word of God that went into it. Common belief is that over the centuries there has been so much
    - - Re:Publication of [The New Balancing Act] (Score:1)
        
        by dpilot ( 134227 ) writes:
        
        Name a few more, and I'll add them to future rants.
- God is on the side of DRM (Score:2)
  
  by Ohreally_factor ( 593551 ) writes:
  
  You'd better read this [bbspot.com].
- Re:Some ideas (Score:2)
  
  by zotz ( 3951 ) writes:
  
  "iii) Is the continual repitition and reworking of myth and fable through the Oral Tradition disrespectful of the content creators who first recorded these stories?"
  
  iv) Why do people of oral traditions get no legal protections for their work? (From those outside their tradition who would fix it and lock them out from their own work?) Why must it be fixed?
  
  I know that is at least halfway to zany, but please try to give a halfway to reasonable answer.
  
  all the best,
  
  drew
- Re:Some ideas (Score:2)
  
  by Mazem ( 789015 ) writes:
  
  This, folks, is why I read Slashdot. Despite all the dupes, trolls, groupthink and pseudoscience, occasionally I read a gem of a post. That is one of the most scathing, concise attacks on DRM and IP ridiculousness that I have ever read. Parent poster, I salute you!
Next series (Score:4, Funny)

by E IS mC(Square) ( 721736 ) writes: on Friday March 25, 2005 @12:34PM (#12046766) Journal

"Managing Knowledge and Creativity with DRM"...

Sponsored by Apple and Microsoft!

Share
twitter facebook
- Re:Next series (Score:1)
  
  by dteichman2 ( 841599 ) writes:
  
  No, not Apple. Apple has gone out of their way to ensure that you have somewhat generous rights with the music you purchase from their store. This is keeping the RIAA people happy.
  
  More like: Sponsored by the RIAA and Microsoft!
  
  If you aren't happy with the DRM on the iTMS songs, I suggest the HYMN project [hymn-project.org].
Hello, Project Gutenberg?!? (Score:5, Interesting)

by Infosquawk ( 131022 ) writes: on Friday March 25, 2005 @12:40PM (#12046843)

I can never understand why there isn't more acknowledgment of our debt to Project Gutenberg on these issues.

Michael Hart was digitizing books before digitizing books was cool, as far back as 1971, and the Project's efforts have been hugely successful on very little money. Nevertheless, I rarely see any official or media acknowledgment of the Project's efforts. If anyone should be on that panel for their ability to give advice from practical experience and performance in this field, while on a shoestring budget, it would be Hart!

Share
twitter facebook
- No money is precisely Why (Score:1)
  
  by Baldur_of_Asgard ( 854321 ) writes:
  
  The fact that Project Gutenberg has not consumed huge amounts of money to produce a great amount of value is PRECISELY WHY it does not get more recognition.
  
  The business of charity does not want competition from groups that create better products for less money, as that would put pressure on them to create a reasonable amount of value themselves, without the benefits of cushy offices and hefty salaries.
  
  The business of education also does not want competition from organizations that produce greater value at
  - Conspiracy much? (Score:2)
    
    by Grendel Drago ( 41496 ) writes:
    
    Man, you're appealing to malice a lot more than laziness and stupidity, when the latter is a much, much more likely culprit.
    
    --grendel drago
    - Re:Conspiracy not so much? (Score:1)
      
      by Baldur_of_Asgard ( 854321 ) writes:
      
      Let's just say that I have seen so many examples that I can only conclude that:
      
      (1) People in many "charitable" organizations and "educational" establishments are quite corrupt; or
      
      (2) People in many "charitable" organizations "educational" establishments are amazingly, astoundingly stupid.
      
      Neither bodes well, but only corruption seems to explain all the facts, especially in the case of the "education" establishment.
      
      Baldur of Asgard
      - Re:Conspiracy not so much? (Score:1)
        
        by dpilot ( 134227 ) writes:
        
        Maybe:
        (3) People in many charitable organizations are out DOING charity, not talking about it. Kind of like Project Gutenberg.
        
        I suspect it's the (3)s that make charity work, and make people want to keep it alive, but it's the (1)s that make the most noise and draw the most money.
        
        IMHO there's an unfortunately large class of people who specialize in smelling the flow of money, and inserting themselves into that flow. The world would be for the most part better off without them.
- Re:Hello, Project Gutenberg?!? (Score:1)
  
  by Quiet_Desperation ( 858215 ) writes:
  
  It's just the way the media works. They are lazy, and point their sensor arrays at the noisiest targets. Look at the Terry Schaivo case. I've heard the televised opinion of eighty seven million doctors *EXCEPT* the ones that have actually examined her.
  - - Re:Hello, Project Gutenberg?!? (Score:1)
      
      by Quiet_Desperation ( 858215 ) writes:
      
      I know. My point was more the pointlessness of hearing from anyone else.
- - Re:The problem I see with Project Gutenberg... (Score:2, Informative)
    
    by Baldur_of_Asgard ( 854321 ) writes:
    
    (1) Under the old US law, content had to be marked "Copyright" to be copyrighted. Under the present US law, all work is automatically copyrighted the moment it is created, UNLESS the author specifies otherwise. I think this holds true for works since, was it 1987? I forget exactly - but it's been a little while now.
    
    (2) A person who transcribes a book that is in the public domain can CLAIM a copyright on it, but this is not enforceable unless they have changed the text significantly enough for it to be
Outsource parts of LOC to Google or Amazon? (Score:5, Insightful)

by G4from128k ( 686170 ) writes: on Friday March 25, 2005 @12:44PM (#12046888)

With the current wave of outsourcing, privatization, and government use of commercial contractors, I wonder if Amazon or Google don't have a major role to play in the process of cataloging/archiving/serving digital content in the future.

Although LOC could never be replaced by a Google or Amazon, these private companies could provide services that augment or reduce the cost of LOC-like services. For example, if Amazon scans a book, why should LOC scan it too?

Share
twitter facebook
- Re:Outsource parts of LOC to Google or Amazon? (Score:2, Interesting)
  
  by HeedlessYouth ( 685980 ) writes:
  
  You mean like this [infotoday.com]?
- Re:Outsource parts of LOC to Google or Amazon? (Score:1)
  
  by SirGarlon ( 845873 ) writes:
  
  Recent IP law allows copyright on aggregations of data even if the data itself is public domain. So if Google were to digitally archive a bunch of public domain books (copyright expired on each book) then the searchable database could still be copyrighted and owned by Google.
  
  In order to outsource the digitization of the collection to a private company, the LOC would have to license its own collection back from that company!
- Re:Profit (Score:2)
  
  by zotz ( 3951 ) writes:
  
  " 1.)Steal LOC Carmen Sandiego style 2.)???? 3.)Profit 4.)???? 5.)???? 6.)Jail time 7.)???? 8.)President of US"
  
  Dude, that is some business plan/method! Did you try to patent it yet?
  
  all the best,
  
  drew
  
  I would have given you +1 Funny
What about a backup copy? (Score:4, Interesting)

by voss ( 52565 ) writes: on Friday March 25, 2005 @01:10PM (#12047101)

It would seem if the LOC is going to have X number of Petabytes on computers...why not have a second copy stored AWAY from DC. If something were to happen to DC at least we would have backup copies of everything...and we probably should have a separate backup location at a third site.

Share
twitter facebook
- Re:What about a backup copy? (Score:1)
  
  by aboyko ( 16319 ) writes:
  
  Say! that's brilliant! I'm going to go down the hall and mention that to the LC IT guys!
  .
  .
  .
  Oh. Apparently it had occurred to them. Well, thanks, just the same. You think of anything else, please, drop us a line!
- Re:What about a backup copy? (Score:1)
  
  by karlrado ( 604410 ) writes:
  
  Brewster Kahle said on a podcast (IT Conversations) that they are working on an agreement with the library at Alexandria Egypt to back up each other's archives. Sounds like a good deal, since Alexandria doesn't have most of the LOC content and the LOC has little of what Alexandria is archiving.
This just in... (Score:5, Funny)

by SmokeHalo ( 783772 ) writes: on Friday March 25, 2005 @01:54PM (#12047455)

The LOC has announced that they are accepting volunteers to digitize texts. Their first volunteer is Earl the night janitor, who has been busily keying in the last 20 years of New York City phone books. He hopes to move on to Chicago soon.

Share
twitter facebook
- Re:This just in... (Score:3, Funny)
  
  by superpulpsicle ( 533373 ) writes:
  
  Don't worry Earl will soon have the assistance of hundreds of non-English speaking Iraqi prisoners to help him.
Merger with the CIA? (Score:2)

by DJCF ( 805487 ) writes:

I wonder how long before they merge with the CIA and become the Central Intelligence Corporation...

(It's a joke.)
Are they requiring publishers to submit PDF files? (Score:5, Interesting)

by melted ( 227442 ) writes: on Friday March 25, 2005 @02:05PM (#12047537) Homepage

Are they requiring publishers to submit PDF files for new entries yet? Or files in another open format? Man, I'd hate to see taxpayer's money wasted on doing work that they could avoid doing by simply mandating PDF submissions from publishers.

I can see that some publishers may just say, "oh, my book isn't gonna be in libraries if I don't submit PDF, so much the better, I'll sell more copies". I hope these fellas realize how badly they're shooting themselves in the foot.

Share
twitter facebook
- Or even the sources. (Score:2)
  
  by pavon ( 30274 ) writes:
  
  Since nearly all typesetting is done electronically these days, I wonder if they shouldn't just have publishers send them the raw typesetting documents in addition to a hardcopy. It wouldn't be much work for the LOC to write (or buy) software to convert all the common typesetting formats into whatever standard format(s) they would like to use internally, and for dispersion to the public.
  
  It would certainly be smarter than scanning them in themselves, or demanding extra work on the publishers part to to conv
  - - That's the beauty of PDF (Score:2)
      
      by melted ( 227442 ) writes:
      
      It's an open format. Adobe does not control it, just like they don't control TIFF or PostScript (despite having invented both).
DRM? (Score:2)

by 192939495969798999 ( 58312 ) writes:

Isn't the Library of Congress' digital collection, especially with respect to music, going to totally screw iTunes and any other online DRM stuff, in order to bring us our library materials?
I'd say... (Score:1)

by Liquid Len ( 739188 ) writes:

... that as a universal unit of measurement, it's gonna be around for a while.
it might be cool if... (Score:1)

by montygreen ( 870808 ) writes:

sometime before the "finale" Enterprise gets destroyed and they rebuilt it as Enterprise B heh
- Re:How many.... (Score:1, Troll)
  
  by tquinlan ( 868483 ) writes:
  
  Ya know, when you do that, you ruin the opportunity to actually make the jokes for the rest of us! ;)
  - Re:How many.... (Score:1, Troll)
    
    by punkass ( 70637 ) writes:
    
    That would be the point... I've been a user here since 1999, and while quips and jokes are amusing, memes like "hot grits" and such are really just noise.
    - Re:How many.... (Score:2)
      
      by punkass ( 70637 ) writes:
      
      Troll? That's cool. I could comment on how the moderation system has completely derailed since then too, but I wouldn't want to upset anybody :).
- Re:it has to be said... (Score:2)
  
  by TheGavster ( 774657 ) writes:
  
  Assuming 10% overhead for indexing, 1.1LOC.
  - Re:it has to be said... (Score:2, Funny)
    
    by Clay Pigeon -TPF-VS- ( 624050 ) writes:
    
    But how are we going to measure asteroids and meteors now that the larger imperial unit (Libraries of Congress) is going to get smallers? Will we have to fall back to the smaller unit (VW Beetles) for all of them now?
    - Re:it has to be said... (Score:2)
      
      by mog007 ( 677810 ) writes:
      
      Just assume they'll store all that data on the old type punch cards, or those big drums from before I was a fetus. Then, the LoC can retain its position as the largest unit of mass, data, AND volume.
- Re:it has to be said... (Score:1)
  
  by kaens ( 639772 ) writes:
  
  No such thing as Left Over Crack
- Re:Riiiight.. (Score:1)
  
  by MynockGuano ( 164259 ) writes:
  
  Excuse me, sir. I think you are lost. I believe the thread you are looking for is here [slashdot.org].
  
  Have a nice day, and good karma to you!

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

At last! (Score:3, Funny)

Re:At last! (Score:5, Insightful)

Re:At last! (Score:5, Interesting)

Yes, and yet...no. (Score:3, Insightful)

Re:Yes, and yet...no. (Score:2)

Re:At last! (Score:2)

Re:At last! (Score:2)

Re:At last! (Score:3, Interesting)

Small representations. (Score:3, Interesting)

Re:At last! (Score:1)

Re:At last! (Score:2, Insightful)

Re:At last! (Score:3, Interesting)

Re:At last! (Score:2)

Re:At last! (Score:1, Flamebait)

Re:At last! (Score:1)

Re:At last! (Score:1)

Here's an idea related to audio archiving (Score:5, Insightful)

Re:Here's an idea related to audio archiving (Score:2)

Re:Here's an idea related to audio archiving (Score:2)

That's the right idea .. carry it further (Score:5, Insightful)

Re:That's the right idea .. carry it further (Score:1, Insightful)

Re:That's the right idea .. carry it further (Score:2)

Re:That's the right idea .. carry it further (Score:3, Insightful)

DRM and archiving are so diametrically opposed... (Score:4, Insightful)

Re:Here's an idea related to audio archiving (Score:2)

Dammit! (Score:2, Insightful)

Re:Dammit! (Score:1)

Re:Dammit! (Score:1)

Re:Dammit! (Score:5, Funny)

Re:Dammit! (Score:1, Funny)

Re:Dammit! (Score:1)

Re:Dammit! (Score:2)

Re:Dammit! (Score:2)

Nice, but how long? (Score:2, Funny)

Re:Nice, but how long? (Score:4, Interesting)

Re:Nice, but how long? (Score:4, Informative)

Re:Nice, but how long? (Score:1)

Re:Nice, but how long? (Score:2, Informative)

Re:Nice, but how long? (Score:1)

Re:Nice, but how long? (Score:1)

Re:Nice, but how long? (Score:2)

Some ideas (Score:5, Insightful)

Re:Some ideas (Score:4, Insightful)

Re:Some ideas (Score:1)

Re:Some ideas (Score:4, Interesting)

Re:Some ideas (Score:1)

Publication of New Testament (Score:3, Interesting)

Re:Publication of [The New Balancing Act] (Score:1)

God is on the side of DRM (Score:2)

Re:Some ideas (Score:2)

Re:Some ideas (Score:2)

Next series (Score:4, Funny)

Re:Next series (Score:1)

Hello, Project Gutenberg?!? (Score:5, Interesting)

No money is precisely Why (Score:1)

Conspiracy much? (Score:2)

Re:Conspiracy not so much? (Score:1)

Re:Conspiracy not so much? (Score:1)

Re:Hello, Project Gutenberg?!? (Score:1)

Re:Hello, Project Gutenberg?!? (Score:1)

Re:The problem I see with Project Gutenberg... (Score:2, Informative)

Outsource parts of LOC to Google or Amazon? (Score:5, Insightful)

Re:Outsource parts of LOC to Google or Amazon? (Score:2, Interesting)

Re:Outsource parts of LOC to Google or Amazon? (Score:1)

Re:Profit (Score:2)

What about a backup copy? (Score:4, Interesting)

Re:What about a backup copy? (Score:1)

Re:What about a backup copy? (Score:1)

This just in... (Score:5, Funny)

Re:This just in... (Score:3, Funny)

Merger with the CIA? (Score:2)

Are they requiring publishers to submit PDF files? (Score:5, Interesting)

Or even the sources. (Score:2)

That's the beauty of PDF (Score:2)

DRM? (Score:2)

I'd say... (Score:1)

it might be cool if... (Score:1)

Re:How many.... (Score:1, Troll)

Re:How many.... (Score:1, Troll)

Re:How many.... (Score:2)