Comment Re:Bad... (Score 1) 239
The problem with the raw data (DNA sequence included) is that a lot of it is simply wrong. When you are confronted with a large data set that contains errors, it is often better to reduce it to the portion that is more likely to be correct than to treat all of it as correct in later analysis. For some sorts of analysis, such as genome assemblies, this may be the only realistic way to proceed.
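To make that concrete, here is a rough sketch of what "reduce it to the portion that is more correct" can look like for sequence reads. It assumes each read comes with a FASTQ-style quality string in Phred+33 encoding, and it keeps only reads whose mean quality clears a cutoff. The function names, the example reads, and the cutoff of 20 are all made up for illustration; real pipelines use dedicated trimming and filtering tools and pick thresholds to suit the data.

```python
def mean_phred(quality, offset=33):
    """Mean Phred score of a quality string (assumes Phred+33 encoding)."""
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def filter_reads(reads, min_mean_q=20):
    """Keep only (sequence, quality) pairs whose mean quality is high enough.

    min_mean_q=20 is an illustrative cutoff, not a recommendation.
    Reads with an empty quality string are dropped.
    """
    return [
        (seq, qual)
        for seq, qual in reads
        if qual and mean_phred(qual) >= min_mean_q
    ]


# Hypothetical reads: 'I' encodes Phred 40, '#' encodes Phred 2.
reads = [
    ("ACGTACGT", "IIIIIIII"),  # uniformly high quality
    ("ACGTTGCA", "########"),  # uniformly low quality
    ("GGCATCGA", "II#II#II"),  # mostly high, two bad bases
]

kept = filter_reads(reads)
for seq, qual in kept:
    print(seq, round(mean_phred(qual), 1))
```

The low-quality read is thrown away before anything downstream sees it, which is the whole point: you trade some data for a smaller set you can trust more.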