Comment Backup vs. Archive (Score 1) 356
Although I have by no means figured out the file problem, I've learned what I believe to be the fundamental formula of data management:
Backups != Archives
When I first got a computer with a CD/DVD burner, I was thrilled, as I would finally have limitless and cheap data storage. When I filled up my disk, instead of having to delete files to create new ones, I could "back up" the files that were taking a lot of space.
It came to pass that I literally had hundreds of CDs and DVDs with a variety of backups using multiple indexing schemes. AVI_SET_1 through AVI_SET_5. LS000 through LS038. DVD±RWs 0 through 8. Many of my discs remain unlabeled. Most of them contain a hodgepodge of file types, many of which are outdated.
I have hundreds of gigabytes of data, but most of it is unaccessible. When I want that project file from five years ago, I can't just search my hard drive for it. I have to dig through disc after disc, insert it, look at the contents, and occasionally write a memo of some sort concerning the contents of the disc (often losing track of the memo in mere months). It would be a lot easier if it were all on one monster hard drive (better yet, two hard drives in mirrored RAID configuration). If I actually knew how to organize the files as well, they might not even take that much space, and I wouldn't have needed today's hard drive prices to replace compact discs for the type of storage I was doing.
I failed to understand what a backup is for and what an archive is for. A backup is for disaster recovery. An archive is for shoving files out of sight until I need them in the future. I was using backup technology (compact discs) for archiving.
Let's say that, of all the data one has, 99% of it is junk that you hang on to "just in case", and 1% of it is used routinely (these figures vary wildly in real life). Backups target the 1%, while archives target the 99%. Fortunately, for most people, a DVD is more than sufficient to store the 1% at any given moment in time (operating system files aside).
The bottom line is, an ideal (and cheap) arrangement to keep archives and backups in their respective places is:
- Buy two or more big hard drives (e.g. 1TB each), set them up in RAID-1 (mirrored) configuration, and use them to archive junk. When you put a file in the archive, you can delete it from your system. You shouldn't have to worry about losing your junk files, as they are stored redundantly. This assumption is the whole purpose of the junk drive: you can delete files from your system without blotting them out of existence.
- Once a month or so, burn a DVD backing up all of your most important files, as if you were grabbing your valuables out of a burning building. However, once you have done so, proceed as though this backup does not exist (i.e. don't delete the corresponding files from your hard drive). Backups are for disaster recovery only. I'll say it again, backups are for disaster recovery only