Comment Re:NetTalk (Score 1) 170
The big difference here is that a backprop model needs to essentially be told what the correct phonemic boundaries are, etc., either in the way it represents the phonemes (possibly as a localist representation) or as target outputs. The model discussed in PNAS is simply given raw speech sounds and learns the categories itself (and how many there are) with no given targets. It's the difference between an unsupervised and supervised model that makes the difference here. In the earliest stages of language learning, I don't think parents correct their children's babbling!