Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Perl

Submission + - How Perl Saved the Human Genome Project (dobbscodetalk.com)

viyh writes: "The human genome project was inaugurated at the beginning of the decade as an ambitious international effort to determine the complete DNA sequence of human beings and several experimental animals. The justification for this undertaking is both scientific and medical. By understanding the genetic makeup of an organism in excruciating detail, we hope to better understand how organisms develop from single eggs into complex multicellular beings, how food is metabolized and transformed into the constituents of the body, and how the nervous system assembles itself into a smoothly functioning ensemble. From the medical point of view, the wealth of knowledge that will come from knowing the complete DNA sequence will greatly accelerate the process of finding the causes of, and potential cures for, human diseases.

From the beginning, researchers realized that informatics would have to play a large role in the genome project. An informatics core formed an integral part of every genome center that was created. The mission of this core was twofold: to provide computer support and database services for their affiliated laboratories, and to develop data analysis and management software for use by the genome community as a whole.

Consider the steps that may be performed on a bit of newly sequenced DNA. First, there's a basic quality check on the sequence: Is it long enough, and are the number of ambiguous letters below the maximum limit? Then, there's the "vector check." For technical reasons, the human DNA must be passed through a bacterium before it can be sequenced (this is the process of "cloning"). Not infrequently, the human DNA gets lost somewhere in the process, and the sequence that's read consists entirely of the bacterial vector. The vector check ensures that only human DNA gets into the database.

Next, there's a check for repetitive sequences. Human DNA is full of repetitive elements that make fitting the sequencing jigsaw puzzle together challenging. The repetitive-sequence check tries to match the new sequence against a library of known repetitive elements. The penultimate step is to attempt to match the new sequence against other sequences in a large community database of DNA sequences. Often, a match at this point will provide a clue to the function of the new DNA sequence. After performing all these checks, the sequence (along with the information that's been gathered about it along the way) is loaded into the local laboratory database.

The process of passing a DNA sequence through these independent, analytic steps looks like a pipeline, and we realized that a UNIX pipe could handle the job. We developed a simple Perl-based data-exchange format called "boulderio" that allowed loosely coupled programs to add information to a pipe-based I/O stream. Boulderio is based on tag/value pairs. A Perl module makes it easy for programs to reach into the input stream, pull out only the tags it is interested in, do something with them, and drop new tags into the output stream. Tags that the program isn't interested in are just passed through to standard output so that other programs in the pipeline can get to them."

Space

Submission + - Voyager Clue Points to Origin of the Axis of Evil (technologyreview.com)

KentuckyFC writes: "Cosmologists have been scratching their heads over the discovery of a pattern imprinted on the cosmic microwave background, the radiation left over from the Big Bang. This pattern, the so-called Axis of Evil, just shouldn't be there. Now an independent researcher from Canada says the pattern may be caused by the boundary between the Solar System and interstellar space where there is a sharp change in pressure, temperature and density of ions in space. Known as the termination shock, astronomers had thought this boundary was spherical. But last year, data from the Voyager spacecraft which have crossed the boundary, showed it was asymmetric. The new thinking is that the termination shock acts like a giant lens, refracting light that passes through it. Any distortion of the lens ought to show up as a kind of imprinted pattern on an otherwise random image. But the real eye-opener is that as the shape of the termination shock changes (as the Solar Wind varies, for example), so too should the pattern in the microwave background. And there is tentative evidence that this is happening too (abstract)."

Comment Re:phone (Score 2, Informative) 107

Not a phone. That thing you can see in some pictures sticking out on the left is the foldable stand, not an antenna.

You can use a BlueTooth keyboard, at the expense of battery life. If they made the USB controller act as a "host" (it does not in the 770), you could use a USB keyboard. None included in the package, anyway.

Slashdot Top Deals

This restaurant was advertising breakfast any time. So I ordered french toast in the renaissance. - Steven Wright, comedian

Working...