Large File Problems in Modern Unices

Posted by CmdrTaco on Sunday January 26, 2003 @10:53AM from the stuff-to-deal-with dept.

david-currie writes "Freshmeat is running an article that talks about the problems with the support for large files under some operating systems, and possible ways of dealing with these problems. It's an interesting look into some of the kinds of less obvious problems that distro-compilers have to face."

This discussion has been archived. No new comments can be posted.

Not really that groundbreaking... (Score:5, Interesting)

by CoolVibe ( 11466 ) writes: on Sunday January 26, 2003 @10:59AM (#5161560) Journal

The problem is nonexistent in the BSDs, which use the large file (64 bit) versions anyway. And you just have to use a certain -D flag if your OS (like Linux) doesn't use the 64 bit versions by default. Whoopdiedoo. Not so hard. Recompile and be happy.

- - Re:Not really that groundbreaking... (Score:3, Funny)
    
    by statusbar ( 314703 ) writes:
    
    2^64 bytes = 17,179,869,184 gigabytes!
    
    17,179,869,184 gigabytes ought to be enough for ANYBODY!
    
    --jeff++
Its funny how some lamers dont listen... (Score:3, Insightful)

by cheekyboy ( 598084 ) writes: on Sunday January 26, 2003 @11:05AM (#5161596) Homepage Journal

I said this to some Unix 'so-called experts' in '95, and they said, "Oh, why do you need >2gig?"

I can just laugh at them now...

- Re:Its funny how some lamers dont listen... (Score:2)
  
  by FooBarWidget ( 556006 ) writes:
  
  No you can't, both Linux and FreeBSD support files > 2 GB. Apparently you've laughed for nothing.
640 K ought to be enough for anybody (Score:3, Funny)

by cyber_rigger ( 527103 ) writes: on Sunday January 26, 2003 @11:10AM (#5161622) Homepage Journal

--Bill Gates

It will happen with time_t, too (Score:5, Informative)

by wowbagger ( 69688 ) writes: on Sunday January 26, 2003 @11:11AM (#5161629) Homepage Journal

We are seeing problems with off_t growing from 32 to 64 bits. We are also going to see this when we start going to a 64 bit time_t, as well (albeit not as badly - off_t is probably used more than time_t is.)

However, the pain is coming - remember we have only about 35 years before a 64 bit time_t is a MUST.

I'd like to see the major distro vendors just "suck it up" and say "off_t and time_t are 64 bits. Get over it."

Sure, it will cause a great deal of disruption. So did the move from aout to elf, the move from libc to glibc, etc.

Let's just get it over with.
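
The "about 35 years" figure can be checked with a little arithmetic. The following is a minimal sketch, assuming a signed 32-bit time_t counting seconds from the 1970 epoch; the helper name rollover_32bit is made up for illustration.

```python
from datetime import datetime, timedelta, timezone

# Unix epoch: time_t counts seconds from this instant.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)

def rollover_32bit(signed: bool = True) -> datetime:
    """Latest UTC instant a 32-bit time_t can represent (hypothetical helper)."""
    max_seconds = 2 ** 31 - 1 if signed else 2 ** 32 - 1
    return EPOCH + timedelta(seconds=max_seconds)

rollover = rollover_32bit()
print(rollover.isoformat())   # 2038-01-19T03:14:07+00:00
print(rollover.year - 2003)   # 35 years after this thread was posted
```

One second after that instant a signed 32-bit counter wraps to a negative value, which is why the deadline is fixed no matter how rarely a program touches time_t.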

- Only 35 years... (Score:2)
  
  by Kjella ( 173770 ) writes:
  
  And that big y2k problem that was supposed to bring down mankind? How many years did it take to fix that? I very much doubt we started in 1965 ;)
  
  Prediction: First distro to "suck it up" will be around 2035 or so. Personally, I think this is about as far down the priority list as you can get. Besides, with open source, is it really that problematic to grep the source for "time_t" and fix it? I don't think so.
  
  Kjella
  - Re:Only 35 years... (Score:3, Informative)
    
    by Dan Ost ( 415913 ) writes:
    
    For most programs, it would require little more
    than to change the typedef that defines __time_t
    in bits/types.h.
    
    For stupidly written programs that assume the
    size of __time_t or that use __time_t in unions,
    each will need to be addressed individually to
    make sure things still work correctly.
  - Re:Only 35 years... (Score:2)
    
    by edhall ( 10025 ) writes:
    
    The FreeBSD folks have already done a considerable amount of work on this, even to the point of making time_t 64 bits for both kernel and userland and testing for issues. Enough is known that the main worry now is how to handle the change in ports, some of which need a fair amount of work to move away from 32-bit time_t. But at the rate things are going, I'd expect that they will make the transition to 64-bit time_t for FreeBSD 6.0. I've no idea how they will handle the legacy issues (ports and pre-6.0 binaries) though.
    
    -Ed
- Re:It will happen with time_t, too (Score:2)
  
  by koreth ( 409849 ) writes:
  
  First of all, it's a Y2038 problem rather than a Y2106 problem because time_t is signed in many places. Simply switching to an unsigned time_t (who uses time_t to represent pre-1970 values?) will buy us an extra 68 years with minimal application grief, but the underlying problem will still be there.
  It boggles my mind that Sun, for example, went to the trouble of building a whole host of interfaces and a porting process for 64-bit file offsets (see the lf64 and lfcompile64 manpages on Solaris) and yet they didn't bother to increase the size of time_t at the same time. If everyone is going to be recompiling their apps anyway, why not fix it all in one go?
  On the application side, it should be noted that this isn't a problem for code written in Java, whose equivalent of time_t is already 64-bit (in milliseconds, granted, but that only eats about 10 of the extra 32 bits.) Obviously the Java VM won't be able to make up for the underlying OS not supporting large time values, but at least the applications won't have to change.
  First one to start whining about Java's year-584544016 problem gets whacked with a wet noodle.
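  
  The Y2038 versus Y2106 figures and the "about 10 bits" estimate above can be reproduced as follows. This is only a sketch under the assumptions stated in the comment (32-bit seconds since 1970, and a millisecond counter on the Java side); the variable names are illustrative.
  
  ```python
  import math
  from datetime import datetime, timedelta, timezone
  
  EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
  
  # Last instant representable with a signed and an unsigned 32-bit counter.
  signed_limit = EPOCH + timedelta(seconds=2 ** 31 - 1)
  unsigned_limit = EPOCH + timedelta(seconds=2 ** 32 - 1)
  extra_years = unsigned_limit.year - signed_limit.year
  
  # Bits consumed by counting milliseconds instead of seconds.
  ms_bits = math.log2(1000)
  
  print(signed_limit.year, unsigned_limit.year)  # 2038 2106
  print(extra_years)                             # 68
  print(round(ms_bits, 2))                       # 9.97
  ```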
  - - - Re:Needs to be signed... (Score:3, Informative)
        
        by Ben Hutchings ( 4651 ) writes:
        
        No, the type of time_t - time_t must be signed. That doesn't imply that time_t must be signed. For example, (unsigned int) - (unsigned int) is int, not unsigned int.
        
        Wrong. The C99 standard says in section 6.3.1.8 paragraph 1:
        
        Many operators that expect operands of arithmetic type cause conversions and yield result types in a similar way. The purpose is to determine a common real type for the operands and result. For the specified operands, each operand is converted, without change of type domain, to a type whose corresponding real type is the common real type. Unless explicitly stated otherwise, the common real type is also the corresponding real type of the result, whose type domain is the type domain of the operands if they are the same, and complex otherwise.
        
        Here, the common real type is unsigned int, and the description of the addition and subtraction operators (section 6.5.6) does not specify a different type for the result when both operands have arithmetic type.
        
        If you disagree, please cite relevant parts of the standard to support your case.
A woman's perspective . . . (Score:5, Funny)

by pariahdecss ( 534450 ) writes: on Sunday January 26, 2003 @11:12AM (#5161633)

So my wife says to me, "Honey, do I look fat in this filesystem ?"
I replied, "Sweetie, I married you for your trust fund not your cluster size."

Funny...in AIX... (Score:4, Informative)

by cshuttle ( 613776 ) writes: on Sunday January 26, 2003 @11:18AM (#5161665)

We don't have this problem: 4 petabyte maximum file size, 1 terabyte tested at present. http://www-1.ibm.com/servers/aix/os/51spec.html

- Re:Funny...in AIX... (Score:3, Insightful)
  
  by n3m6 ( 101260 ) writes:
  
  whenever something like this comes up, somebody just has to say "we don't have a problem, we use X"
  
  that's just so lame. we have XFS and JFS. you can keep your AIX and your expensive hardware with you.
  
  thanks.
Have you ever seen some people's email? (Score:5, Insightful)

by alen ( 225700 ) writes: on Sunday January 26, 2003 @11:19AM (#5161672)

On the Windows side many people like to save every message they send or receive to cover their ass just in case. This is very popular among US Government employees. Some people who get a lot of email can have their personal folders file grow to 2GB in a year or less. At this level MS recommends breaking it up since corruption can occur.

- Re:Have you ever seen some people's email? (Score:5, Funny)
  
  by nentwined ( 626268 ) writes: on Sunday January 26, 2003 @11:45AM (#5161801) Homepage
  
  I agree with MS on this one. government employees shouldn't be allowed to hold their positions for longer than a year. DOWN WITH GOVERNMENTAL CORRUPTION! ... :)
  
- Re:Have you ever seen some people's email? (Score:2, Informative)
  
  by sqrlbait5 ( 67782 ) writes:
  
  Yeah, but even if you're using NTFS, where there doesn't appear to be a max file size, you still get the 2GB limit on Outlook files. Every damn version of Outlook has had this 2GB limit, and Outlook XP doesn't actually fix the problem, it just warns the user at 1.87GB. We have people hitting their limit all the time at work, but that's because they like to send artwork and whatnot and not clear out their folders.
  - Re:Have you ever seen some people's email? (Score:2)
    
    by spongman ( 182339 ) writes:
    
    ouch. you're not using exchange, i take it?
- Re:Have you ever seen some people's email? (Score:3, Insightful)
  
  by kasperd ( 592156 ) writes:
  
  2GB in a year or less.
  
  They probably don't write emails but instead write Word documents and attach them to empty emails.
- Re:Have you ever seen some people's email? (Score:2)
  
  by sean23007 ( 143364 ) writes:
  
  Don't call that just a Windows phenomenon. There are many cases where it is a good idea to save every email you get. Then again, there are others where it is a good idea to destroy all the evidence. Either situation can happen to you regardless of what OS you use.
  
  Just saying, is all.
- - Re:MOD UP (Score:2, Insightful)
    
    by DAldredge ( 2353 ) writes:
    
    That's not why he wanted you to call him. If he answered your questions via email there would have been a record of what he had said.
    - - Re:RTFPP (Score:2)
        
        by DAldredge ( 2353 ) writes:
        
        They have to save it for the same reason they do not like sending it. Open Records laws. It is much easier to take 2 or 3 different stands on an issue if those you talk to have no record...
Switch to gnu/hurd (Score:3, Funny)

by Anonymous Coward writes: on Sunday January 26, 2003 @11:22AM (#5161690)

It has a nice small 1gb filesystem limit. I have partitioned my hard disk into 64 little chunks and it runs very slowly, and unstably, but it's completely open source and I'm happy.

- Re:Switch to gnu/hurd (Score:2)
  
  by /dev/trash ( 182850 ) writes:
  
  Is this true? If so I don't think I'll ever try it out.
Why not to learn from past? (Score:2)

by Libor Vanek ( 248963 ) writes:

I just wonder why we don't learn from past limits and remove these limits "forever". E.g. a month ago I received a question about the possibility of building a 10 TB Linux cluster (physicists are crazy ;-)).

There surely MUST be some way to do this - I imagine some file (e.g. defined in LSB) which would define these limits for the COMPLETE system (from kernel, filesystems, and utils to network daemons). I know there are efforts toward things like this, but if we said (for example) that a distribution in 2004 won't be marked "LSB compatible" if ANY of its programs uses any other limits, I think it would create enough pressure on Linux vendors.

Just a crazy idea ;-)
- Re:Why not to learn from past? (Score:2)
  
  by n3m6 ( 101260 ) writes:
  
  there is no spoon and there is always a limit.
  
  the problem is where it's sticking. ;)
- It's all about efficiency. (Score:3, Insightful)
  
  by OS24Ever ( 245667 ) writes:
  
  There is something innate in the education, learning, and daily working of a programmer that makes them not want to use 'too big' of a number for a certain task.
  
  it either
  
  A) Wastes Memory Space
  B) Wastes Code Space
  C) Wastes Pointer Space
  D) Or Violates some other tenet the programmer believes
  
  So, when they go out and create a file structure, or something similar, they don't feel like exceeding some 'built-in' restriction to their way of thinking.
  
  And usually, at the time, it's such a big number that the programmer can't think of an application to exceed it.
  
  Then, one comes along and blows right through it.
  
  I've been amused by all the people jumping on the 'it don't need to be that big' bandwagon. I can think of many applications that ext3 or whatever would need to use to make big files. they include:
  
  A) Database Servers
  B) Video Streaming Servers
  C) Video Editing Workstations
  D) Photo Editing Workstations
  E) Next Big Thing (tm) that hasn't come out yet.
  - Re:It's all about efficiency. (Score:3, Insightful)
    
    by dvdeug ( 5033 ) writes:
    
    There is something innate in the education, learning, and daily working of a programmer that makes them not want to use 'too big' of a number for a certain task.
    
    We have code for infinite precision integers. The problem is, if it were used for filesystem code, you still couldn't do real-time video or DVD burning, because the computer would be spending too long handling infinite precision integers.
    
    As long as you're careful with it, setting a "really huge" number, and fixing it when you reach that limit is usually good enough.
The O/S should do it and do it well. (Score:3, Interesting)

by tjstork ( 137384 ) writes: <todd@bandrowsky.gmail@com> on Sunday January 26, 2003 @11:41AM (#5161769) Homepage Journal

1) Splitting up a big file turns an elegant solution into an inelegant nightmare.

2) Instead of 10 different applications writing code to support splitting up an otherwise sound model, why not have 1 operating system with provisions for dealing with large files?

3) You are going to need the bigger files with all those 32 bit wchar_ts and 64 bit time_ts you got!

BeOS Filesystem (Score:2)

by SixArmedJesus ( 513025 ) writes:

I remember reading in the BeOS Bible that the BeOS filesystem could contain files as large as 18 petabytes. Makes you wonder two things: What's the biggest filesystem that you could use with a BeOS machine? And why don't other OSs have filesystems like this? Especially with those awesome extended attributes. I weep for the loss of the BeOS filesystem...
- Re:BeOS Filesystem (Score:5, Informative)
  
  by Yokaze ( 70883 ) writes: on Sunday January 26, 2003 @12:06PM (#5161912)
  
  Mine is bigger than yours :)
  
  Linux XFS [sgi.com]: 9 exabytes
  
  Also supports extended attributes [bestbits.at].
  
Somewhat cumbersome, even on Linux (Score:2, Informative)

by topologist ( 644470 ) writes:

To enable LFS (Large File Support) in glibc (which not all filesystems support), you need to recompile your application with
-D_FILE_OFFSET_BITS=64 and -D_LARGEFILE_SOURCE

This forces all file access calls to their 64-bit variants, and you'll explicitly need to use types like off64_t instead of off_t where needed. And I believe most large file support is really available only past glibc 2.2

Additionally you need to use O_LARGEFILE with open etc. So legacy applications that use glibc fs calls have to be recompiled to take advantage of this, and may need source level changes. Won't work on older kernels either.
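
Why the 2 GB line exists at all can be shown without any C: a signed 32-bit offset simply cannot hold the value 2^31. The sketch below uses Python's struct module as a stand-in for a 32-bit and a 64-bit off_t; it illustrates the range limit only, not what glibc does internally, and fits_in_offset is a hypothetical helper. As far as the glibc manual describes it, defining _FILE_OFFSET_BITS=64 makes off_t itself 64 bits wide, while off64_t and O_LARGEFILE belong to the transitional interface enabled by _LARGEFILE64_SOURCE.

```python
import struct

TWO_GIB = 2 ** 31

def fits_in_offset(offset: int, bits: int) -> bool:
    """True if offset fits in a signed integer of the given width (hypothetical helper)."""
    fmt = {32: "<i", 64: "<q"}[bits]   # little-endian signed 32-bit / 64-bit
    try:
        struct.pack(fmt, offset)
        return True
    except struct.error:
        return False

print(fits_in_offset(TWO_GIB - 1, 32))  # True: last byte below 2 GiB
print(fits_in_offset(TWO_GIB, 32))      # False: one past the signed 32-bit range
print(fits_in_offset(TWO_GIB, 64))      # True: a 64-bit offset has room to spare
```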
Error Prevention (Score:3, Interesting)

by Veteran ( 203989 ) writes: on Sunday January 26, 2003 @12:13PM (#5161941)

One of the ways to keep errors from creeping into programs is to put limits on things so high that you can never reach them in the practical world.

The 31 bit limit on time_t overflows in this century - 63 bits outlasts the probable life of the Universe so it is unlikely to run into trouble.

That is the best argument I know for a 64 bit file size; in the long run it is one less thing to worry about.
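
The 31-bit versus 63-bit comparison is easy to put into numbers. This is a rough sketch, assuming one tick per second and a 365.25-day year; the figure of roughly 13.8 billion years for the current age of the Universe is a commonly quoted estimate, not something stated in this thread.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60   # Julian year, 31,557,600 s
UNIVERSE_AGE_YEARS = 13.8e9                # assumed estimate for comparison only

def span_in_years(bits: int) -> float:
    """Years covered by a counter with the given number of value bits."""
    return 2 ** bits / SECONDS_PER_YEAR

print(f"{span_in_years(31):.1f} years")    # about 68 years, hence 1970 + 68 = 2038
print(f"{span_in_years(63):.3e} years")    # on the order of 10^11 years
print(span_in_years(63) / UNIVERSE_AGE_YEARS > 20)  # True
```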

- Re:Error Prevention (Score:2)
  
  by jhines ( 82154 ) writes:
  
  The next significant problem with time will come in the year 9999, when the four digit field that lazy programmers have used for thousands of years overflows. Didn't they learn their lessons the first time around?
  
  Digital took a bug report on this for Vax/VMS and promised a fix, some time in a future release.
- Re:Error Prevention (Score:3, Interesting)
  
  by Thing 1 ( 178996 ) writes:
  
  One of the ways to keep errors from creeping into programs is to put limits on things so high that you can never reach them in the practical world.
  
  Anyone ever thought of a variable-bit filesystem?
  Start with 64-bit, but make it 63-bit. If the 64th bit is on, then there's another 64-bit value following which is prepended to the value (making it a 126-bit address -- again, reserve one bit for another 64-bit descriptor).
  Chances are it won't ever need the additional descriptors since 64-bits is a lot, but it would solve the problem once-and-for-all.
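  
  The scheme described above is a variable-length integer with a continuation flag. Here is a minimal sketch of one way to read it, assuming the low 63 bits of each 64-bit word carry payload, the top bit means "another word follows", and each later word holds higher-order bits; the names encode and decode are illustrative, not taken from any real filesystem.
  
  ```python
  from typing import List
  
  PAYLOAD_BITS = 63
  PAYLOAD_MASK = (1 << PAYLOAD_BITS) - 1
  MORE_FLAG = 1 << PAYLOAD_BITS          # the "64th bit"
  
  def encode(value: int) -> List[int]:
      """Split a non-negative integer into 64-bit words, lowest 63 bits first."""
      if value < 0:
          raise ValueError("value must be non-negative")
      words = []
      while True:
          chunk = value & PAYLOAD_MASK
          value >>= PAYLOAD_BITS
          if value:
              words.append(chunk | MORE_FLAG)   # flag set: another word follows
          else:
              words.append(chunk)               # flag clear: last word
              return words
  
  def decode(words: List[int]) -> int:
      """Rebuild the integer; each later word is prepended as higher-order bits."""
      value = 0
      for position, word in enumerate(words):
          value |= (word & PAYLOAD_MASK) << (position * PAYLOAD_BITS)
          if not word & MORE_FLAG:
              break
      return value
  
  print(len(encode(2 ** 63 - 1)))   # 1: still fits in a single word
  print(len(encode(2 ** 63)))       # 2: spills into a 126-bit value
  print(decode(encode(2 ** 100)) == 2 ** 100)  # True
  ```
  
  The cost of this design is that every reader of the field has to loop and branch instead of loading one fixed-width integer, which is the same trade-off raised elsewhere in the thread about arbitrary precision in filesystem code.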
I can't believe this...superSynchronicity??? (Score:3, Interesting)

by haggar ( 72771 ) writes: on Sunday January 26, 2003 @12:23PM (#5162003) Homepage Journal

I had a problem with HP-UX apparently not wanting to transfer via NFS (when the NFS server is on HP-UX 11.0) files larger than 2GB. I had to backup a Solaris computer's hard disk using DD across NFS. This usually worked when the NFS server is Solaris. However, last friday it failed, when the server was setup on HP-UX. I had to resort to my little Blade 100 as the NFS server, and I had no problems with it.

I have noticed that on the SAME DAY some folks asked questions about the 2 GB filesize limit in HP-UX on comp.sys.hp.hpux !! Apparently, HP-UX's default tar and cpio don't support files over 2 GB, either. Not even in HP-UX 11i. I never thought HP-UX stank this bad...

How does Linux on x86 stack up? I decided not to use it for this backup, since I had my Blade 100, but would it have worked? Oh, btw, is there finally a command on Linux like "share" (exists in Solaris) to share directories via NFS, or do I still need to edit /etc/exports and then restart the NFS daemon (or send SIGHUP)?

- Re:I can't believe this...superSynchronicity??? (Score:2)
  
  by Arethan ( 223197 ) writes:
  
  the command that is equivalent to 'share' is 'exportfs', it can usually be found in /usr/sbin/.
  
  It allows you to push NFS exports to the kernel and nfsd without having to edit /etc/exports. Thus, they do not persist across reboots. However, you cannot use exportfs until nfsd is running, and nfsd will auto kill itself if /etc/exports is completely empty. So you must share at least 1 directory tree in /etc/exports before you can use exportfs.
  
  I believe Solaris has this same problem with share though. I don't remember these days, it's been a while since my SCSA cert. (Heh, i guess that's what man pages are for :)
  - Re:I can't believe this...superSynchronicity??? (Score:2)
    
    by haggar ( 72771 ) writes:
    
    Thanks.
    And no, Solaris doesn't have this kind of problem. In Solaris, you have (a more general) /etc/dfs/* for sharing filesystems. Even if there is no fs shared in /etc/dfs/dfstab, nfsd and mountd will happily run. This autokill thing is really stupid.
  - Re:I can't believe this...superSynchronicity??? (Score:2)
    
    by haggar ( 72771 ) writes:
    
    Oh yeah, so how does Linux cope with > 2 GB files transferred via NFS TO a Linux server? So far, only Solaris seems to support our solution. I have not tried Linux because the test takes a considerable amount of time, and if transferring large files via NFS isn't supported, I'd better not even try.
Admittedly, I had problems with the need for... (Score:2)

by constantnormal ( 512494 ) writes:

... 64-bit addressing before thinking this through. I couldn't see the significant advantage for more than a very tiny fraction of apps in being able to address more than a few gigabytes.

Now I can't wait for OS X to have 64-bit support for the IBM 970 processors (I do realize that it will take several releases before default 64-bit operation is practical).

When compared to clustered 32-bit filesystems, I would think that a "pure" 64-bit filesystem would have a number of very practical advantages.

I could easily see the journalled filesystem becoming one of the first 64-bit subsystems in OS X, right after VM.
Large filesystem lack more of a problem (Score:3, Interesting)

by mauriceh ( 3721 ) writes: <mhilariusNO@SPAMgmail.com> on Sunday January 26, 2003 @01:45PM (#5162431)

A much bigger problem is that Linux filesystems have a capacity limit of 2TB.
Many servers now have the physical capacity of over 2TB on a filesystem storage device.
Unfortunately this is still a very significant limitation.
This problem is much more commonly encountered than file size limitations.

I miss BeFS... (Score:2)

by jonr ( 1130 ) writes:

18 EXAbyte file sizes, real journals, live queries...
*SOB*
J.
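
The round numbers quoted in this thread line up with powers of two. The sketch below assumes, and this is an inference rather than anything stated by the posters, that the 18 exabyte figure corresponds to a 64-bit byte offset and the 9 exabyte XFS figure to a signed 64-bit (63 value bits) offset.

```python
GIB = 2 ** 30          # binary gigabyte, as used in the 2^64 joke above
EB = 10 ** 18          # decimal exabyte

print(2 ** 64 // GIB)          # 17179869184 gigabytes
print(round(2 ** 64 / EB, 1))  # 18.4 exabytes
print(round(2 ** 63 / EB, 1))  # 9.2 exabytes
```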
The "l" in lseek() (Score:4, Informative)

by edhall ( 10025 ) writes: <slashdot@weirdnoise.com> on Sunday January 26, 2003 @03:26PM (#5163004) Homepage

Once upon a time (prior to 1978) there was no lseek() call in Unix. The value for the offset was 16 bits. Larger seeks were handled by using a different value for "whence" (the third argument to seek()), which caused seeks to occur in 512-byte increments. This resulted in a maximum seek of 16,777,216 bytes, with an arbitrary seek() often requiring two calls, one to get to the right 512-byte block and a second to get to the right byte within the block. (Thank goodness they haven't done any such silliness to break the 2GB barrier.)

When Research Edition 7 Unix came out, it introduced lseek() with a 32-bit offset. 2,147,483,648 bytes should be enough for anyone, hmmm? :-).

-Ed
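
The two-call seek described above amounts to a division with remainder. The following is a sketch of the arithmetic only, not of the historical system call; it assumes the 16,777,216-byte maximum comes from 2^15 blocks of 512 bytes (a signed 16-bit offset), and split_seek is a made-up helper name.

```python
BLOCK = 512
MAX_BLOCKS_16 = 2 ** 15   # assumed magnitude limit of a signed 16-bit offset

def split_seek(target: int) -> tuple:
    """Return (blocks, remainder): one block-granular seek, then one byte seek."""
    return divmod(target, BLOCK)

print(MAX_BLOCKS_16 * BLOCK)   # 16777216, the old maximum seek
print(split_seek(1_000_000))   # (1953, 64)
print(2 ** 31)                 # 2147483648, the limit of a signed 32-bit lseek() offset
```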

- Re:Why large files (Score:3, Funny)
  
  by mr.henry ( 618818 ) writes:
  
  Who needs more than 512k of RAM??
  - Re:Why large files (Score:3, Funny)
    
    by Big Mark ( 575945 ) writes:
    
    Come on. Even Bill Gates admitted that half a meg ain't enough.
    
    640K, on the other hand, should be enough for anyone...
    
    -Mark
    - Re:Why large files (Score:2)
      
      by perfects ( 598301 ) writes:
      
      Bill Gates now claims that he was misquoted. What he really said was that "640K should be more than enough memory for anybody's toaster."
  - data warehouse, and any database for that matter (Score:5, Insightful)
    
    by CrudPuppy ( 33870 ) writes: on Sunday January 26, 2003 @11:04AM (#5161591) Homepage
    
    my data warehouse at work is 600GB and grows at a rate of 4GB per day.
    
    the production database that drives the sites is like 100GB
    
    welcome to last week. 2GB is tiny.
    
    - Re:data warehouse, and any database for that matte (Score:2, Insightful)
      
      by hector13 ( 628823 ) writes:
      
      my data warehouse at work is 600GB and grows at a rate of 4GB per day. the production database that drives the sites is like 100GB welcome to last week. 2GB is tiny.
      
      And you store this "production database" as one file? didn't think so (or atleast I hope you don't).
      I am not agreeing (or disagreeing) with the original post, but having a database > 2 GB has nothing to do with having a single file over 2 GB. A db != a file system (except for MySQL perhaps).
      - Re:data warehouse, and any database for that matte (Score:2, Informative)
        
        by CrudPuppy ( 33870 ) writes:
        
        the datafile size averages 8GB in the warehouse.
    - raw partitions (Score:2)
      
      by axxackall ( 579006 ) writes:
      
      I agree that 2 GB limit is obsolete today, especially for projects with large databases and with video editing tasks.
      However, I would recommend staying away from > 2GB files in a database environment. Even if your FS supports large files, you still lose performance to the "double driver": first your kernel provides a partition, then it provides a file-system over it. But if you need such big files, why would you need a file-system? Just use raw partitions!
      Of course you still need large files for video, but massive concurrent performance overhead is not a typical problem in that case.
      - Re:raw partitions (Score:2)
        
        by iamacat ( 583406 ) writes:
        
        I don't think most database programmers can write better space allocation, I/O buffering or virtual memory code than good OS programmers. Did any of you guys write database buffering code and use something better than a simple LRU list? Like taking physical disk layout into account? If you did, and it performed better than the OS on realistic benchmarks, why not write a reusable device driver that will improve performance of everything, not just the database?
        Now it's possible that somehow you have a very good knowledge of your application-specific disk usage pattern and can get a speed up that outweighs user-mode overhead, the system swapping your buffers in and out of memory and so on. In this case, you'd better use a dedicated disk rather than just a partition. Otherwise, your I/O scheduling code will have interesting interactions with the system's swapfile and other normal filesystem activity.
        Even then you run a risk that OS code will one day improve and outperform your homegrown changes. Most programmers are better off just tuning their code to work well with OS native filesystem, virtual memory and so on.
- video, mp3's, even dvds are beyond 2gb (Score:2, Informative)
  
  by xintegerx ( 557455 ) writes:
  
  Question answered, move along, nothing to see here :)
- Re:Why large files (Score:3, Informative)
  
  by voodoopriestess ( 569912 ) writes:
  
  Databases, Movie files, Backup files (think dumps to tapes). Animations, 3D modelling.... Lots of things need a > 2GB file size. Iain
- Re:Why large files (Score:5, Insightful)
  
  by Big Mark ( 575945 ) writes: on Sunday January 26, 2003 @11:01AM (#5161571)
  
  Video. Raw, uncompressed, high-quality video with a sound channel is fucking HUGE. Look how big DivX files are, and they're compressed many, many times over.
  
  And compressing video on-the-fly isn't feasible if you're going to be tweaking with it, so that's why people use raw video.
  
  -Mark
  
  - Yep... (Score:3, Informative)
    
    by Kjella ( 173770 ) writes:
    
    Some numbers for *uncompressed* video:
    
    NTSC/YUV2/stereo: ~111gb for a cinema movie (1hr 45min)
    PAL/YUV2/stereo: ~125gb for same
    
    HDTV/surround: ~908gb for same
    
    With huffyuv (very low CPU usage, lossless) you should be able to cut that by a factor of 2-3. But it's still *huge*
    
    Kjella
    - - PAL & NTSC (Score:3, Informative)
        
        by Kjella ( 173770 ) writes:
        
        PAL: Max 720x576x25fps interlaced (50 Hz)
        NTSC: Max 640x480x29.97fps interlaced (60 Hz)
        
        No, they don't have the same frequency, nor scanlines. Some European TVs will take PAL-60, which is like PAL only at 60Hz, though. Also I don't think the color space works in the same way, but not sure about that one. That was why I used YUV2 (16bit) for both.
        
        Kjella
- Re:Why large files (Score:2, Insightful)
  
  by Ogion ( 541941 ) writes:
  
  Ever heard of something like movie-editing? You can get huge files really fast.
- Re:Why large files (Score:5, Interesting)
  
  by Anonymous Coward writes: on Sunday January 26, 2003 @11:02AM (#5161574)
  
  Real analytical work can easily produce files this large. Output for analyses of structures with more than half a million elements and several million degrees of freedom can EASILY produce output of over two gigs. Yes, these results can and should be split, but sometimes it makes sense to keep them together as a matter of convenience. Plus, there IS a small performance hit when dealing with multiple files on most of the major FEA packages.
  
  - Re:Why large files (Score:3, Interesting)
    
    by bunratty ( 545641 ) writes:
    
    Over Christmas and New Year's, I helped my wife run a simulation of 1000 different patients for an academic pharmacokinetics paper. The run took ten days and had an input file of about 1.5 GB. If her computer was faster, or she had access to more computers, she would have wanted to simulate more patients and would easily have needed support for files larger than 4 GB. As CPUs get faster and hard disks get larger, there will be much more demand for these large files as well as more than 4 GB per process.
- Re:Why large files (Score:4, Informative)
  
  by hbackert ( 45117 ) writes: on Sunday January 26, 2003 @11:02AM (#5161586) Homepage
  
  vmware uses files as virtual disks. 2GB would be a really, really small disk. UML does the same, using the loop device feature of Linux. Again, a filesystem in a file. Again, 2GB is not much. Simulating 20GB would need 10 files.
  
  Feels like 64kbyte segments somehow...and I really don't want to have those back.
  
  - 64KB memory segments (Score:2)
    
    by KDan ( 90353 ) writes:
    
    Oh come on, those were fun, when you had to load into memory and uncompress a file larger than that :-)
    
    Oh the fond memories :-)
    
    Daniel
  - Re:Why large files (Score:2)
    
    by drinkypoo ( 153816 ) writes:
    
    There is a need for a virtualizing filesystem which supports multiple volumes, offline and not, and files stored in segmented form to fit. It would be insanely handy in a clustering environment; The whole cluster could store the file (with some redundancy) and access it in a shared fashion. This would substantially improve the ease of working with inanely large data sets in a clustered scenario.
- Re:Why large files (Score:3, Insightful)
  
  by Idaho ( 12907 ) writes:
  Can anyone give a good reason for needing files larger than 2gb?
  I can think of some:
  A/V streaming/timeshifting
  
  Backups of large filesystems (since there exist 320 GB harddisks now, I don't think I should create 160 .tgz files just to back one up, should I?)
  
  Large databases. E.g. the slashdot posts table will be easily >2 GB, or so I'd guess. Should the DB cut it in two (or more) files, just...because the OS doesn't understand files >2 GB? I don't think so...
  
  And that's just without thinking twice...there are probably many more reasons why people would want files >2 GB.
- Q: Why large files? A: Disk images too (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  While almost all the examples given are good, I don't think anyone has mentioned complete disk images. I have recently had to do this in order to recover from a hardware issue (drive cable failure resulted in loss of the MBR, nasty) and on a TiVo unit that had a bad drive.
  I have most all of my older system images available to inspect. The loopback devices under Linux are tailor made for this type of thing.
  
  I am puzzled as to why you mention the seek times. Surely you would agree that the seek time should be only inversely geometrically related to size, the particular factors depending on the filesystem. Any deviation from the theoretical ideal is the fault of a particular OS's implementation. My experience is that this is not significant.
  
  (user dmanny on wife's machine, ergo posting as AC)
- Re:Why large files (Score:3, Interesting)
  
  by bourne ( 539955 ) writes:
  
  Can anyone give a good reason for needing files larger than 2gb?
  
  Forensic analysis of disk images. And yes, from experience I can tell you that half the file tools on RedHat (like, say, Perl) aren't compiled to support >2GB files.
- Re:Why large files (Score:2, Insightful)
  
  by benevold ( 589793 ) writes:
  
  We use a Unidata database here for an ERP system. Each database is more than 2 GB apiece (more like 20 GB) of relatively small files, and when the directories are tarred for backup they are usually over 2 GB, which means that gzip won't compress them. Unless I'm missing something, I don't see an alternative to files larger than 2 GB in this case. Sure, on the personal computing level the closest thing you probably get is ripping DVDs, but there are other things out there, and I realize this is tiny in comparison to some places.
- Re:Why large files (Score:2)
  
  by Veteran ( 203989 ) writes:
  
  I have run into problems trying to compress a tar archive of my home directory which has been around since 1995 when I switched to Linux. The two gig limit runs into trouble here.
- Re:Why large files (Score:4, Insightful)
  
  by kasperd ( 592156 ) writes: on Sunday January 26, 2003 @11:35AM (#5161742) Homepage Journal
  
  The seek times alone within these files must be huge
  
  Who modded that as Insightful? Sure, if you are using a filesystem designed for floppy disks, it might not work well with 2GB files. In the old days, when the metadata could fit in 5KB, a linked list of disk blocks could be acceptable. But any modern filesystem uses tree structures, which make a seek faster than opening another file would be. Such a tree isn't complicated; even the minix filesystem has it.
  
  If you are still using FAT... bad luck for you. AFAIK Microsoft was stupid enough to keep using linked lists in FAT32, which certainly did not improve the seek time.
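  
  A rough sketch of the linked-list versus tree point, counting how many pointers must be followed to reach a byte offset. The 4 KiB block size, 4-byte pointers and 12 direct pointers are assumptions chosen to resemble a classic Unix-style inode, not the layout of any particular filesystem, and the function names are made up for illustration.

```python
def fat_chain_hops(offset, cluster_size=4096):
    """Links followed from the first cluster to the one holding `offset`
    in a FAT-style linked chain."""
    return offset // cluster_size


def indirect_levels(offset, block_size=4096, ptr_size=4, direct=12):
    """Indirect blocks read to reach `offset` through a Unix-style inode
    with direct, single, double and triple indirect pointers (0 to 3)."""
    ptrs = block_size // ptr_size
    block = offset // block_size
    if block < direct:
        return 0
    block -= direct
    span = ptrs
    for level in (1, 2, 3):
        if block < span:
            return level
        block -= span
        span *= ptrs
    raise ValueError("offset beyond triple-indirect range")


offset = 2 * 1024**3 - 1            # last byte of a 2 GiB file
print(fat_chain_hops(offset))       # chain links to walk from the start
print(indirect_levels(offset))      # indirect blocks to read
```

  The chain walk grows linearly with the offset while the tree depth stays at three or less. A driver that caches the allocation table in memory turns those hops into memory reads rather than disk seeks, so the gap on real hardware may be smaller than the raw counts suggest.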
  
  - Re:Why large files (Score:2)
    
    by drinkypoo ( 153816 ) writes:
    
    Ostensibly your filesystem driver will be caching much of the list information in memory, thus for the uses to which fat32 is applied, it is still a reasonable method. There's a reason it's called fat32, it's a direct descendant.
    Anyway those using a M$ OS which does not support NTFS are fooling themselves. If you are using some form of windows prior to Windows 2000, then you are getting a terrible experience which is nothing like the real OS -- NT. NTFS is a pretty good filesystem with journaling, ACLs, and implicit support for encryption and compression. Fat32 is shite.
    - - Re:Why large files (Score:2)
        
        by drinkypoo ( 153816 ) writes:
        
        Sure, but that's good enough to save people in almost all cases. I've never, EVER lost data on NTFS5 due to a crash (which has happened plenty) or a power failure (only twice since I started using it.) FAT32, on the other hand... Or ext2 for that matter, it doesn't matter. A partially journaling filesystem gets the job done well enough for basically any purpose. If it's not good enough for you, perhaps a filesystem is not the best place to store your data in the first place; I'd consider a clustered replicating RDBMS :P
- Re:Why large files (Score:2)
  
  by joto ( 134244 ) writes:
  
  Can anyone give a good reason for needing files larger than 2gb?
  Yes. Sometimes you need to store a lot of data. Even DVDs hold 4.3 GB of data these days. But that's not even much compared to the amount of data we handle in seismic research. I would believe astronomers, particle physicists and lots of other people also routinely handle ridiculous amounts of data.
  By the way, in producing the DVD, you would naturally work with uncompressed data. How would you handle that?
  The seek times alone within these files must be huge, and it smacks a bit of inefficiency
  And because it is inefficient, we should not support it? As a matter of fact, any file larger than one disk-block is inefficient. Maybe we should stop supporting that as well?
  sure its just as bad to have an app use hundreds of say 4kb files or so, but two GIGABYTES???
  As I've said, it's not really that much, depending on the application.
- Re:Why large files (Score:3, Interesting)
  
  by Zathrus ( 232140 ) writes:
  
  In my previous job we regularly processed credit data files >2 GB. All the data is processed serially (as someone else mentioned), so seek time is not an issue (nor is it an issue in a binary data file - seek to 1.4GB. Done. Next.).
  
  The real issue we ran up against was compression... we wanted to have the original and interim data files available on-disk for a while in case of reprocessing. The processing would generally take up 10x as much space as the original data file, so you compressed everything. Except that gzip can't handle files >2GB (at the time an alpha could, but we didn't want to touch it). Nor can zip. So we had to use compress. Yay. (bzip could handle it, but was decided against by the powers that be).
  
  Compression of large files is still an issue, unless you want to split them up. Unless you download a beta version gzip still can't handle it. As I understand it zip won't ever be able to do it. There are some fringe compressors that can handle large files, but, well, they're fringe.
  - - Re:gzip handles large files fine (Score:2)
      
      by Whelkman ( 58482 ) writes:
      
      gzip works over 4 GB but loses the ability to accurately report uncompressed file sizes (minor).
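      
      The size-reporting limit comes from the gzip trailer, which stores the uncompressed length in a 32-bit field (ISIZE), so the stored value is the true size modulo 2^32. A small sketch, with helper names invented for illustration:

```python
import gzip
import struct


def gzip_reported_size(gz_bytes):
    """Read ISIZE: the last four bytes of a single-member gzip stream,
    stored little-endian."""
    return struct.unpack("<I", gz_bytes[-4:])[0]


def isize_for(true_size):
    """Value the 32-bit ISIZE field ends up holding for a given
    uncompressed size."""
    return true_size % 2**32


small = gzip.compress(b"x" * 1000)
print(gzip_reported_size(small))    # size read back from the trailer
print(isize_for(5 * 1024**3))       # what a 5 GiB input would be listed as
```

      The data itself still round-trips correctly; only the size listing is affected once the input passes 4 GiB.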
- Re:Why large files (Score:2)
  
  by Markus Landgren ( 50350 ) writes:
  
  Last time I wrote a 7 gig file it was an image of a hard disk. Lots of other stuff (video) can get large too. Anyway, there is an error in the headline. 2 gigs is not a limit in modern unices, only in ancient or otherwise really crappy unices.
- Re:Why large files (Score:2)
  
  by wideBlueSkies ( 618979 ) writes:
  
  That tarball of 2002 stock quotes used to feed your stock research system.
  
  The database files themselves, in the system.
- Re:Why large files (Score:2)
  
  by Sayjack ( 181286 ) writes:
  
  Backup files, exporting a huge Oracle database to a file. And, when I record divx quality video through my ATI card I can go through the GB like crazy.
  
  A better question is, Who doesn't need largefile support?
  
  As for the seek time...not everything is accessed like a random access file. I imagine that the backup data will be read in sequentially. The video file would mostly be handled sequentially, other than when jumping to a chapter, fast forwarding or reversing.
- Re:Why large files (Score:2)
  
  by AJWM ( 19027 ) writes:
  
  Can anyone give a good reason for needing files larger than 2gb?
  
  Video/movie files, for one thing. Even compressed (eg DV or MPEG) those things are huge. A 2 GB file at professional DV compression (50 Mb/sec) is about 4 minutes worth. (DV is similar to MJPEG, so it's still lossy.) Uncompressed or losslessly compressed video (critical for machine vision or image analysis apps) chews even more space.
  
  I know I've wanted to be able to just dump a mini-DV tape (about 13 GB) directly to a single disk file for later editing.
  
  Other fields also use huge data sets - seismic data analysis for example. Filesystems designed for supercomputer clusters (eg PVFS) have unlimited size on the total filesystem (tens of terabytes is not unusual) although the individual file size may still be limited by the underlying OS or hardware word size.
  
  Then there's creating a .zip or .tgz of a collection of big files. Or creating the equivalent of an ISO image of a DVD. And so on.
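  
  The arithmetic behind these video figures can be sketched as follows. The 3.6 MB/s rate for consumer DV (video plus audio) is an assumed, commonly quoted figure, and the 50 Mb/s case counts video payload only, so real captures with audio and container overhead would hit the limit somewhat sooner.

```python
LIMIT = 2**31 - 1   # largest offset a signed 32-bit file offset can hold


def minutes_until_limit(bytes_per_second, limit=LIMIT):
    """Minutes of footage that fit before a file reaches `limit` bytes."""
    return limit / bytes_per_second / 60


def capture_size(bytes_per_second, minutes):
    """Bytes needed to store `minutes` of footage at a constant rate."""
    return bytes_per_second * minutes * 60


DV25_RATE = 3.6e6        # assumed: ~3.6 MB/s for consumer DV including audio
DV50_VIDEO = 50e6 / 8    # 50 Mb/s of video payload only

print(round(minutes_until_limit(DV25_RATE), 1))    # consumer DV
print(round(minutes_until_limit(DV50_VIDEO), 1))   # 50 Mb/s, video only
print(capture_size(DV25_RATE, 60) / 1e9)           # one-hour tape, in GB
```

  Under these assumptions consumer DV reaches the 2 GB mark in roughly ten minutes, and a one-hour tape comes out at about 13 GB, which matches the mini-DV figure above.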
- Re:Why large files (Score:2)
  
  by AJWM ( 19027 ) writes:
  
  The seek times alone within these files must be huge,
  
  Depends on how your inodes are laid out, how big you have to get for triple indirect blocks, etc.
  
  Shouldn't be any worse (and maybe better) than trying to seek through an equivalent collection of smaller files -- you've got to do all those directory searches, etc. (Exact comparisons will depend greatly on the filesystem and parameters chosen when the FS was created.)
- - - Re:Why large files (Score:2)
      
      by AvitarX ( 172628 ) writes:
      
      Maybe high quality audio+video for, say, making a movie will be larger than that.
      
      I guess a lot of the editing would probably be done scene by scene, and then you could merge and compress them on the fly so that at no point you use more than 2gb, but it seems that if you make a 2 hour dvd it would be nice to keep the 4gb image file on your hard drive if you planned to reburn it.
      
      Not a scattering of scenes from which it would recreate the image on the fly.
      
      It is kind of a dumb question, when we have computers being marketed as home dvd makers, why we would need that big of a file.
    - Re:Why large files (Score:5, Interesting)
      
      by CoolVibe ( 11466 ) writes: on Sunday January 26, 2003 @11:13AM (#5161636) Journal
      
      raw video can easily exceed 2 GB in size. Why raw video? Because (like others said) it's easier to edit. Then you encode to MPEG2, which will shrink the size somewhat (usually still bigger than 2 GB, ever dumped a DVD to disk?), so it'll be "small" enough to burn onto a DVD or somesuch. Oh, editing 3 hours of raw wave data also chews away at the disk size. Also, since you need to READ the data from the media to see if it looks nice, you need to have support for those big files as well. Right, now why don't we need files bigger than 2 GB again? Well?
      Oh, you're still not convinced, well see it this way: when in the future will you ever need to burn a DVD?
      Well? A typical one sided DVD-R holds around 4 GB of data (somewhat more), if you use both sides, you can get more than 8 GB of data on it. That's way bigger than 2 GB, no? Now, how big must your image be before you burn it on there? well?
      Right...
      
    - Wrong. (Score:2)
      
      by I Am The Owl ( 531076 ) writes:
      
      You obviously have never done any work with video before. Most DV will eat up 2GB easy with 15min of footage or less.
- Re:Unices? (Score:3, Informative)
  
  by moonbender ( 547943 ) writes:
  
  Yes. Just like "matrices" is the plural of "matrix". Not that the words have a similar etymology - according to dictionary.com [reference.com] it's, in the authors' words, "A weak pun on Multics".
  - Re:Unices? (Score:2)
    
    by bunratty ( 545641 ) writes:
    
    Oh, that brings up a pet peeve of mine -- when people call a matrix a "matricee"! When I hear someone say that word, I roll my eyes and think "this guy has no idea what he's talking about!"
    Getting back on topic, maybe the plural for Unix should be Unixen, like the plural for Vax is Vaxen?
- Re:Wrong point of view. (Score:5, Insightful)
  
  by KDan ( 90353 ) writes: on Sunday January 26, 2003 @11:17AM (#5161660) Homepage
  
  Two words:
  
  Video Editing
  
  Daniel
  
  - Cripes! (Score:2)
    
    by Hubert_Shrump ( 256081 ) writes:
    
    That's three words.
    
    I didn't realize Daniel was so big, though.
    
    Has he considered going lossy?
  - three words (Score:2)
    
    by Nick Mitchell ( 1011 ) writes:
    
    hate jar jar
- Re:Wrong point of view. (Score:5, Funny)
  
  by heby ( 256691 ) writes: on Sunday January 26, 2003 @11:22AM (#5161692) Homepage
  
  "oh yes, those were the days." - misty eyed smile - "when i was young and filesizes were small. you should have seen it. today's youth is so spoiled that they don't even learn assembly language any more. i tell you, you're all going to die because of your large files, yes, die!" - madly waves his cane in the air - "2gb, that's more than anybody will ever need and you are greedy for even more! the holy bit will punish you for this, it will!" - dies of a heart attack.
  
- Re:Wrong point of view. (Score:5, Insightful)
  
  by cvande ( 150483 ) writes: <craig DOT vandeputte AT gmail DOT com> on Sunday January 26, 2003 @11:30AM (#5161722)
  
  In a perfect world everything is small and manageable. Unfortunately, some databases need tables BIGGER than 2gb. Even splitting that table into multiple files still leaves you with files larger than two gb. Try adding more tables? OK. Now they've grown to over 2gb, and the more tables there are, the more complicated everything gets. I still need to back these suckers up, and a backup vendor that I won't name can't help me because their software wasn't large file (for Linux) ready. So let's get into the game with this and make it the default so we don't need to worry about these problems in the future. Linux IS an enterprise solution.....(my $.02)
  
- Re:Wrong point of view. (Score:5, Insightful)
  
  by costas ( 38724 ) writes: on Sunday January 26, 2003 @11:42AM (#5161774) Homepage
  
  Maybe in your problem domain that's true. I work with retailer data mines and we've hit the 2GB file limit, oh, 4-5 yrs ago? We've been forced to partition databases, causing maintenance issues, scalability issues, and the like, just because of the size of a B-tree index.
  
  True, it looks like the optimal solution is lower-level partitioning, rather than expanding the index to 64bits (tests showed that the latter is slower), but that still means that the practical limit of 1.5-1.7 GB per file (because you have to have some safety margin) is far too constraining. I know installations who could have 200GB files tomorrow if the tech was there (which it isn't, even with large file support).
  
  I am also guessing that numerical simulations and bioinformatics apps can probably produce output files (which would then need to be crunched down to something more meaningful to mere humans) in the TB range.
  
  Computing power will never be enough: there will always be problems that will be just feasible with today's tech that will only improve with better, faster technology.
  
  - Please mod parent up. (Score:2)
    
    by wideBlueSkies ( 618979 ) writes:
    
    Please mod this guy up as interesting or informative.
- Re:Wrong point of view. (Score:5, Interesting)
  
  by Yokaze ( 70883 ) writes: on Sunday January 26, 2003 @11:52AM (#5161843)
  
  I'm not a specialist on this matter, so maybe you can enlighten me, where I am wrong or misunderstood you.
  
  > fragmentation: large files increase the fragmentation of most file systems
  What kind of fragmentation?
  
  Small files lead to more internal fragmentation.
  Large files are more likely to consist of more fragments, but when splitting this data into small files, those files are fragments of the same data.
  
  >entropy pollution
  What kind of entropy? Are you speaking of compression algorithms?
  
  Compression ratios are actually better with large files than small files, because similarities between files across file-boundaries can be found. Therefore, gzip (or bzip2) compresses a single large tar-file. (Simple test: try zip on many files, and then zip without compression followed by compression of the resulting file.)
  
  >data pollution
  How should limiting file size improve that situation? Then, people tend to store data in lots of small files. What a success. People will waste space, whether there is a file size limit or not.
  
  >These limits are there for very good reasons and in my opinion they are even much too big.
  
  Actually, they are there for historical reasons.
  And should a DB spread all its tables over thousands of files instead of having only one table in one file and mmapping this single file into memory? Should a raw video stream be fragmented into several files to circumvent a file limit?
  
  >[...] original K&R Unix [...] was much faster than modern systems
  
  Faster? In what respect?
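  
  The compression test suggested above can be approximated in a few lines. This is only a sketch: the log-like records are made-up sample data, and zlib stands in for zip or gzip.

```python
import zlib

# Hypothetical sample data: 500 small, similar records, one per "file".
records = [
    f"2003-01-26 11:{i % 60:02d} user{i:04d} GET /article.pl?sid={i} 200\n".encode()
    for i in range(500)
]

# Each record compressed on its own, as with many small files.
separately = sum(len(zlib.compress(r)) for r in records)

# All records concatenated first, as with a single tar-file.
together = len(zlib.compress(b"".join(records)))

print(separately, together)
```

  Compressing each record separately pays the per-stream overhead every time and cannot exploit repetition between records, so the combined stream comes out smaller.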
  
- Re:Wrong point of view. (Score:3, Interesting)
  
  by kasperd ( 592156 ) writes:
  
  I sure hope that was a joke. Because otherwise it would be one of the most clueless comments I have seen.
  
  Sure, splitting data into a lot of smaller files is going to reduce the fragmentation slightly, but it is not going to improve your performance, because the price of accessing different files is going to be higher than the price of the fragmentation.
  
  In the next two arguments you managed to make two opposite statements both incorrect. That is actually quite impressive.
  
  First you say large files increase the entropy of the data stored on the disk. Which is wrong as long as you compare to the same data stored in different files. Of course if the number of files on the disk is constant smaller files will lead to less entropy, but most people actually want to store some data on their disks.
  
  Then you say large files are highly redundant, which is the opposite of having a large entropy as claimed in your previous argument. And in reality the redundancy does not tend to increase with filesize, but might of course depend on the format of the file.
  
  All in all you are saying that people shouldn't store much data on their disks, and the little data they do store should be as compact as possible, while still allowing it to be compressed even further when doing backups. You might as well have said people shouldn't use their disks at all.
  
  Finally, claiming older Unix versions were faster is ridiculous. First of all, they ran on different hardware, and surely on that hardware they were slower than today's systems. And even if you managed to port an ancient Unix version to modern hardware, I'm sure it wouldn't beat modern systems in today's tasks. Which DVD player would you suggest for K&R Unix?
- Re:Wrong point of view. (Score:2)
  
  by smoondog ( 85133 ) writes:
  
  There is not a problem with support of large files in Unix system, there is a problem with incompetent people using too large files in Unix systems.
  
  You are a troll. It is not up to administrators to decide how big a file needs to be. I do scientific research and deal regularly with datasets larger than 300GB. Single files often in the range of 2GB-10GB. For me to split up my data would create an enormous headache, and would be very slow.
  
  -Sean
- - Re:Wrong point of view. (Score:2)
    
    by mickwd ( 196449 ) writes:
    
    And the amazing thing is, everyone else seems to be taking it seriously.
    
    Is it just me, or is Slashdot getting much less informed as the user count continues to increase ?
    - Re:Wrong point of view. (Score:2)
      
      by Simon Brooke ( 45012 ) writes:
      
      Is it just me, or is Slashdot getting much less informed as the user count continues to increase ?
      
      It's not just you.
- - Re:Wrong point of view. (Score:2)
    
    by orangesquid ( 79734 ) writes:
    
    At least 2GB is better than the Multics [multicians.org] large file support [multicians.org] situation! Files were limited to the size of segments, which were at most 255K 36-bit words, which is equivalent to roughly one megabyte! The Multics designers didn't expect that most users would ever need larger files than this. The first database product (ever!), MRDS, was severely limited, so Multics programmers created a (kludgy) workaround. Modern operating systems are designed differently and thus aren't limited to such (small) file sizes.
    
    We have conquered this problem before, by redesigning filesystems to allow files bigger than segments, and we can conquer it again by allowing files bigger than the addressable range of a 32-bit processor's full word.
- Re:huh? (Score:2, Informative)
  
  by JanneM ( 7445 ) writes:
  
  Because the sentences mean different things.
  
  "It is an interesting problem that some distro-compilers have to face."
  
  talks about the problem facing distro compilers, whereas
  
  "It's an interesting look into some of the kinds of less obvious problems that distro-compilers have to face."
  
  talks about the article addressing these problems. /Janne
- Re:huh? (Score:2, Interesting)
  
  by RumpRoast ( 635348 ) writes:
  
  Actually you changed the meaning of that sentence. I think really we object to:
  "It's an interesting look into some
  
  of the kinds of less obvious problems that distro-compilers have to face."
  
  "of the kinds" really adds nothing to the meaning here, nor does "have to"
  Thus we have:
  "It's an interesting look into some of the less obvious problems that distro-compilers face."
  
  The same sentence, but much cleaner!
  Thanks! I'll be here all week.
- - - Re:How large are we talking? (Score:2)
      
      by NoOneInParticular ( 221808 ) writes:
      
      Ah, this 2^31 brings back memories of the time I had a box for scientific work with approximately 4 GB of addressable memory (most of it RAM, but also some swap space), and wanted to view some kind of lame proprietary video format with a proprietary viewer. When starting up, the application would complain I had less than 4 MB of memory (while in fact I had a thousandfold of that).
      Hmm, the programmers seemed to store the information in an int, so by allocating 2 GB of memory (through Matlab, zeros(10000,10000) is quite a chunk), I could finally convince the application that I did not have negative memory, but actually enough to display the movie.
      But then the video was lame.
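      
      A minimal sketch of the wraparound described here, assuming the viewer kept the free-memory figure in a signed 32-bit int. The byte counts are hypothetical, picked only to show the sign flip.

```python
import ctypes


def as_int32(value):
    """Reinterpret the low 32 bits of `value` as a signed 32-bit integer."""
    return ctypes.c_int32(value & 0xFFFFFFFF).value


GiB = 1024**3
MiB = 1024**2

free_before = 3 * GiB + 512 * MiB     # hypothetical free memory on the box
free_after = free_before - 2 * GiB    # after a large allocation elsewhere

print(as_int32(free_before))   # wraps to a negative number
print(as_int32(free_after))    # fits, so it stays positive
```

      Anything at or above 2^31 bytes reads as negative through a signed 32-bit int, which is the same boundary that produces the 2 GB file size limit.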
