Comment Re:So we aren't sequenced yet? (Score 5, Informative) 145
The issue was that the Y chromosome contains very long sequences of repeating junk DNA and the old sequencing methods were producing very small pieces.
Here is an analogy with all letters instead of just the four ATCG found in DNA.
Let's imagine that sequencing gives you a lot of small pieces of no more than 10 characters such as
(1) eThatIsT
(2) ToBeThatI
(3) eOrNotToB
(4) ToBeOr
(5) hatIsThe
(6) OrNotToBe
(7) BeThatIs
(8) Question
(9) IsTheQuest
Overlapping of (7), (1) , (5), (9) and (8) produces the longer sequence BeThatIsTheQuestion
Similarly, (4), (3) and (6) produces ToBeOrNoToBe
And finally (2) connects those 2 sequences to form the full genome ToBeOrNotToBeThatIsTheQuestion
That this technique does not work well if you have a lot of repeating sequences in the genome.
Imagine that the genome contains ToBeOrNoToBe repeated a few hundred times.
Sequencing would produce a lot of fragments such as ToBeOr, eOrNotT, ToBeOrNo, rNotToBeToB, ToBeToBe, eOrNotToB, oBeToBeOr, eToBeXrNotT, oBeOrN, oBeOr, BeOrNotT ,
It would be easy to figure out that ToBeOrNoToBe is repeating itself but not how many times.
Also, some fragments may contain single mutations like the X in one of the fragments above there would be no way to figure out where that X occurs in the repetition.
The new sequencing methods produce longer fragments and so make decoding a lot easier.