Bell Labs, Preserving Delicate Sensibilities 101
LuserOnFire writes: "There is a PigDog article talking about the Bell Labs Text-to-Speech Synthesis. The amazing thing is not the technology itself, but that fact that Bell-Labs has a checkbox next to it that says 'If you plan to enter text which our system might consider to be obscene, check here to certify that you are old enough to hear the resulting output.'?!?! Like if you are old enough to spell a swear, you don't know what the word sounds like?" More fun than a TI-99/4A with speech-synthesis card. Those wouldn't say the bad words at all.
better text to speech (Score:1)
Re:ti99/4a (Score:1)
There was also the "Terminal Emulator II" cart that would allow you to say anything from a BASIC program. The quality of speech was much lower, since there was no phonetic mapping going on.
It wasn't that TI censored the speech synth cart, they just didn't include a phonetic version for swear words.
On the Amiga... (Score:1)
...speech-synthesis (and I mean it, no digitalized vocabulary or the like, no expansion modules, it was part of the OS) was implemented as a filesystem.
So you could do
CLI:> copy textfile SPEAK:or
CLI:> type textfile >SPEAK:or use the SAY command
CLI:> say -p120 "Hello, old sucker !"You would do that in 1986 [and it would sound a lot worse ;-)]
The Amiga sythesized all on the fly.
However, I found that Bell Stuff nice. It is possible to hear the human voice behind that. I trapped myself whishing this would have gone, so to hear a native computer voice while sounding like a human rather than a robot.
Seems I got the Digitanoia...
Gee, I still love my Amiga
Actually, the TI 99/4A would swear (Score:1)
What you really needed was the cartridge "Terminal Emulator II"; then it was possible to write programs in TI basic that just fed the speech synthesizer bits of phonetics out of which you got speech. Actually, the phonetics were fed to the TE2 cartridge, which handed you back a huge list of numbers to feed to the synthesizer, and then your program had to feed the result to the synthesizer. I suppose that it would be possible to take the list of numbers, save them off somewhere, and then have basic programs that could swear even without the TE2 cartridge, but that never came up. Really, getting the TI to say "you asshole" once was enough.
The cover of the TE2 book even showed a little caption ballon next to the computer that read "I can say anything". And it could, sort of, in that stilted way that it had.
Actually... (Score:1)
Ahh, the memories.
Re:The checkbox makes sense... (Score:1)
--
The checkbox makes sense... (Score:1)
slashdotted (Score:1)
Linux (Score:1)
A *real* "take-care" notice ! (Score:2)
The best warning ever, must be this:
Don't use nuclear weapons to troubleshoot faults.
It comes from SAFETY RULES FOR US STRATEGIC BOMBERS [cryptome.org]2 C3AF4F2snlbxq'|dc
--
echo '[q]sa[ln0=aln80~Psnlbx]16isb15CB32EF3AF9C0E5D727
Slashdotted already. Sigh. (Score:1)
Jon Acheson
Re:Daisy, Daisy... (Score:2)
The reference is wrong. According to this page, [mu.oz.au] both British and Australian computers had been used for playing music at least six years previously.
The music of CSIRAC, the first Australian computer, has recently been recreated using an emulator and rebuilt hardware, and a CD has been released.
Go you big red fire engine!
I've had TextToSpeech on my Mac for years ... (Score:2)
All my dialog boxes eventually catch my attention by speaking the message if I'm not paying attention. Even more important with OS X since I can be concentrating on some other task.
Text to speech is cute, not very difficult and not computationally demanding.
Speech to text is a very different kettle of fish.
DeCSS (Score:4)
Quick! Someone pipe the DeCSS source through there so the RIAA can go after Bell Labs!
grubTI-99/4A speech (Score:1)
Re:this could be Tipper Gore's dream come true (Score:1)
You could always OCR it, and get all the profanity with none of that annoying music getting in the way :)
Caution: contents may be quarrelsome and meticulous!
Re:nostalgia (Score:2)
MS Bookshelf was one of the first "shovelware" CD apps out there. Back when buying a single-speed CD-rom (with controller card!) for $900 was a good deal...
---------------------------------------------
Re:The options are inadequate (Score:3)
-- Barbara Billingsley
--
Re:this could be Tipper Gore's dream come true (Score:1)
And if your post was meant as a joke, well, it wasn't that funny.
nostalgia (Score:3)
i remember when i was 14 or so, and i discovered that ms bookshelf had an audio file for the sample pronunciation of "motherfucker" i thought it was absolutely hilarious to have that monotone voice demonstrate to me how to swear.
ah, the memories...
How these text-to-speech synths work (Score:1)
On to how text-to-voice works. This was an excellent situation for using an artificial neural network. I don't recall the exact topology of the net, but the lower (input) neurons contained different letter combinations often found in the english language. The output neurons were, obviously, the sound that needed to be produced.
Neural Networks, being the excellent statistical analysis tools they are, would train on many words and determine their "error" by comparing the network output to the actual pronunciation of said words. The network would then adjust the weights through many iterations to reduce this error to zero. Then, new words are fed to the input neurons, and then we have the voice you hear on the Bell Lab site.
This technique for text-to-voice lent a lot of weight to neural network programming. The people demonstrating the algorithm were particularly clever, in that as the neural network was training (i.e. adjusting internal weight values to reduce error to zero), they would test different words on the network in a child's voice. This basically gave the effect of sounding like a child slowly learning to speak. One can imagine how impressive this would be for someone who doesn't really know how the algorithm works.
"Bad Words" (Score:1)
Fa Queue Ice hole
Talk sec stew me beach
etc. Needless to say, I was much younger in those days. Today I'd rather have them say things like "Thanks for calling RetroAxle Inc. For a list of the ways technology has made your life better, please press 1..."
bukra fil mish mish
-
Monitor the Web, or Track your site!
Re:Of Trash-80s and profanity (Score:2)
Re:Zis the best we can do? (Score:4)
It, too, filtered profanities, but foh-net-ik spelling solved that problem.
That's bugged me for years. In Kubrick's 2001, they could talk to HAL and HAL would respond in a pleasant human sounding voice. Okay, it didn't do a great job singing, but not only do we in the real 2001 not have a computer you can converse with (Eliza notwithstanding), but we can't even do the speech! Of course, Clarke never envisioned marvels like http://www.amiallyourbaseornot.com, so I guess we have other advantages in the real world vs. science fiction.
Great for email alerts. (Score:1)
spoonz
---
Foreign Language Version (Score:1)
Interesting, what that says about the maturity of english speaking people.
Confused
Before you dopes post... (Score:2)
should take a look at some of the examples to see
just what it is capable of. This think is not the
mere equivalent of something you ran on a TI-99 or
a MAC once, it's really quite a sophisticated thing.
mefus
--
um, er... eh -- *click*
Re:Harsh Language (Score:1)
Re:Daisy, Daisy... (Score:2)
Here is a link [c2i.net] to a reference: The first time a computer played music was in 1957, at Bell Labs in the United States. The song was called Daisy, which is the same piece that the intelligent computer HAL (in Stanley Kubrick's film version of Arthur C. Clarke's science fiction novel 2001) starts humming as it is being disassembled. Naturally, this is not a coincidence, but rather the intention of the director to return the computer to its "childhood state" (in a double sense) as it loses its advanced electronic identity
I wonder how this comes out... (Score:1)
http://cm.bell-labs.com/cm/cs/who/dmr/odd.html
Re:Harsh Language (Score:2)
My favorite is all the conservative websites out there that designate movies to be "good" and "bad" simply by counting all the times they have the word fuck....
just a little fucking rediculous if you ask me...
Doug
Re:Harsh Language (Score:2)
They're the same voice actor, and I was
just going off of what I remembered
(I have a very audio oriented memory).
Thanks for the catch...
Doug
Copyright 1997 (Score:1)
What I'm curious to know is how much their Text-to-Speech synthesis has improved in the past few years. Heck, even the local Weather Channel has more realistic synthesis than this.
Re: Apocrophal fokker story (Score:5)
I've heard the original audio on a BBC request show. Quite hilarious when you hear the whole thing. This is the closest I've found in a web search to what I remember the interview was like.
BBC INTERVIEWS AN RAF PILOT...
Gerry Wills, the famous BBC commentator, was interviewing Gerherd "Zibby" Zebdrehah the equally famous Polish WWI fighter ace who flew for the British. The interview went like this...
BBC: So please tell us Captain Zebdrehah about your most intimidating foe from those years.
Zebdrehah: I remember being jumped by 4 or 5 Fokkers. My God, the sky was thick with those Fokkers and every where I turned they were on me instantly...
BBC: I should just inform the radio audience that Captain Zebrehah is talking about the aircraft his opponent usually flew, the "Fokker" fighter plane.
Zebdrehah: Ya, maybe usually but these Fokkers were all flying Messerschmits!
the AC
Send the process a SIGHIP... (Score:3)
Somewhere I still have the Usenet article where somebody was trying to get an Apache server to "speak Java" (some JVM problem I think), and was told that he had forgotten to signal with SIGHUP after making his changes. Except that U is right next to I on a qwerty keyboard, and so the poster mispelled it SIGHIP.
One of the followups said, no, if you send it a SIGHUP it'll speak Java, but if you send it a SIGHIP it'll start speaking Jive, and then proceeded to list a bunch of HTTP response codes all in "jive".
The only one I remember is "404 that file is NOT in da house."
Or things which simply /sound/ like obscenities (Score:4)
In high school, I did some work for a local TV station, and I still remember filming an interview with a "hometown celebrity" who flew in some of the older warplanes.
Fokker (sp?) makes a lot of warplanes. Fokker also sounds a bit, um, obscene when spoken by somebody with a gutteral voice.
"...an' then we hit some ack-ack from the border guards an' that ol' Fokker started twisting so bad..."
Had to do quite a bit of trimming on that interview before running it.
Re:TI-99/4A (Score:2)
It was more fun for me to type in this small program from MICROpendium for TI Extended basic, which made the Speech Synthesizer make all sorts of strange-ass sounds. I believe one game I had for it (in Extended Basic) made a frog croaking sound from the routine.
I stil have no idea how they got the female voices from the game Parsec in it though. Looknig from an old TI magazine, they DID digitze a woman's voice to do it.
Ahh, the speech synthesizer, an underexploited piece of hardware for the TI, as well as the MBX expansion system which had speech recognition!
Re:Copyright 1997 (Score:1)
I don't think it has changed since then. Note that the page says that for the 'Advanced Interface' you need a javascript enabled browser 'such as Netscape3'
What I'm curious to know is how much their Text-to-Speech synthesis has improved in the past few years.
I don't know if this site will be any help as there is no way of knowing whether they have updated the backend.
Re:Harsh Language (Score:1)
Man, I can remember my mom getting pissed that they had it in the movie and it got the PG rating. Yeah, my virgin ears.
Bell Labs, is this the best you can do?????? (Score:1)
It seems as if this technology is following a strange inverse of Moore's law where it gets better by only half as much each year.
Re:ti99/4a (how it really was) (Score:2)
As other people said, it was the TE2 Cart that provided good text-to-speech. Someone else posted that you had get/feed numbers for that to work with TE2. I think he is confusing it with the "Extended Basic" cart. There was a hackish workaround of getting small samples of speech published in some magazine (Home Computer Mag I think), but was a bitch to use. The TE2 cart let you do this in it's modified version of BASIC(from my memory):
10 open #1,speech (I think thats what you use)
20 print #1, "Fart Fart Yams Hobo in my room"
Or it was something simular to that. It's text to speech was VERY good for the time, I only needed to modify some words to better speech rarely.
you could also modify the speech tone/speed by doing:
30 print #1,"//30 59"
or something close. If you used values outside valid ranges (It never checked), you could get some crazy hissing and growling sounds.
It was the easy version of basic that the TI had that got me interested in programming. God Bless 'em.
Daisy, Daisy... (Score:2)
The folks at Bell Labs have a sense of humor, anyway.
On their pre-generated samples [bell-labs.com] page, the English sample is a computer-sung rendition of "Bicycle Built for Two." This is the song which the murderous HAL 9000 in Clark/Kubrick's 2001: A Space Odyssey sang as he was being put to sleep.
They're just trying to protect youth literacy (Score:3)
last thing i heard (Score:2)
Intuity Messaging (Score:1)
actually I guess that's not all that interesting.
Re:this could be Tipper Gore's dream come true (Score:1)
Re:Harsh Language (Score:2)
The best of these would be capalert.com [capalert.com]. He hasn't rated "Freddy Got Fingered", but he has nothin' good to say about the South Park movie [capalert.com]. "*South Park* is another movie straight from the smoking pits of Hell."
Fade To Black [fadetoblack.com] somehow got Cap from CapAlert to do a Q&A column [fadetoblack.com]. It's a hoot.
--
The funny thing about this is... (Score:1)
http://www.commondreams.com/
Re:Tried it, didn't work (Score:2)
Hey Dennis Miller called he wants his bad stand up back.
Festival (Score:4)
The automatic voice pitch is pretty neat; I built a hardware text-to-speech converter around 10 years ago, and it only produced a monotone voice that got pretty annoying after a while. Don't feed Festival raw HTML documents, though - it can cause the voice to get deeper and deeper until it has to reset the pitch.
cheap sampling => bad synthesis?? (Score:2)
But I was also really disappointed with the results. It doesn't sound much better than the old Apple II setup my chem teacher would wheel into class when his voice went.
I blame cheap sampling over the past decade or so. The ability to record a voice actor's voice and use that has precluded any real advancement on the synthesis front.
People talked about Jar Jar as the first virtual main character (and he wasn't even first) but they seem to forget-- his voice was nothing near virtual.
For better or worse, it seems like it will be a numberof years before we have an artificially generated voice as annoying as Jar Jar's.
--
Re:Or things which simply /sound/ like obscenities (Score:1)
Buddy, I got your Messerschmidts right here [web-birds.com], and here [bell-labs.com].
The TI-99/4A WOULD swear! (Score:1)
Damn, I miss the good ol' days sometimes...
Re:The TI-99/4A WOULD swear! (Score:1)
I think the reason they added the text-to-speech to TE-II was to give some sort of accessibility of the online world (such as it was at the time) to the visually impaired.
I remember quite fondly, however, using the text-to-speech for less lofty goals. Prank calls come to mind. A friend and I wrote a "is your refrigerator running" program complete with 4 (count 'em, 4) menu options to select from when the person on the other end answered. I can still remember exactly how it sounded, especially with the "har-har, bye-bye" at the end. Heh...
I was going through some old stuff the other day and came across a copy of the Texas Instruments Terminal Emulator II Protocol Manual. I was obsessed with owning it even though I had no RS-232 stuff, as I said before. TI offered a free (or dirt cheap, I forget which) copy if you wrote and requested it. I begged my mother to do it, and she did. She was swell (still is, actually). Now I'm starting to wish I hadn't sold all my TI stuff at a tag sale years ago.
No... (Score:2)
My mom is not a Karma whore!
Slashdotted (Score:5)
Tried it, didn't work (Score:1)
The fact that we are to a point where we have to declare that we are old enough to hear what we typed, things have gone way too far.
Now I don't wan to get off on a rant here, but it seems as technology increases, so does the desire of everyone to make sure they have no reason to get sued. Personally I don't think the two are related. I think it has more to do with the fact that we have constantly been making more and more laws without enforcing the laws in existance. This leads to more and more lawyers there to interpret and fight the laws while at the same time, twist the laws so that they can make an optimum dollar. Combine that with the fact that most parents in today's society never take responsibility for their children while expecting the greater public to do so and you have a manifistaion such as the one written about here at Bell Labs. If we expect these things to change, we need to take responsibility for ourselves and our loved ones and stop putting the burden on the schools and the rest of society. After all, I want to be able to control what my under-aged children see/read/whatever, not those who think they know what's best for me and mine. But as usual...
Re:Tried it, didn't work (Score:1)
By the way, openly admitting that the opening (I don't want to get off on a rant here...) and closing (Of course, that's just my opinion, I could be wrong) are Dennis Miller's, I don't remember the last time he went off on a topic so trivial as this.
Maybe I'm wrong, and you are some unheard of genius who is just waiting too be noticed. I'll hold my breath so you don't have to hold yours.
Re:Or things which simply /sound/ like obscenities (Score:1)
http://www.bootyproject.org [bootyproject.org]
Re:Or things which simply /sound/ like obscenities (Score:2)
Perhaps your "hometown hero" was a double-agent who secretly flew missions for the Germans as well, and had covered up his secret so well all these years, until accidentally letting it slip during your interview with him. Too bad you let such an awesome scoop wind up on the cutting-room floor. Nice move, Cronkite.
Erm... eh... or I could always just put down the crack pipe. Yeah, maybe that would be better.
http://www.bootyproject.org [bootyproject.org]
Re:this could be Tipper Gore's dream come true (Score:1)
Of Trash-80s and profanity (Score:1)
This stuff sounds only marginally better than speech card add-on a college friend had for his TRS-80 and that was in 1982.
It, too, filtered profanities, but foh-net-ik spelling solved that problem.
We've got one of those at home. To clarify, I believe it's the software that does the profanity filtering rather that the speech synthesiser device itself.
On another note, I seem to remember an old game called "688 Attack Sub" having an amusing profanity filter. You could send messages when playing the game multi-player, but it would alter any of the swear-words it knew about - so while you might type out something rather threatening to your opponent, it would come out as "I'm going to kick your donkey, Mother Theresa!!", or something similar :-)
Re:TI-99/4A (Score:2)
Zis the best we can do? (Score:2)
(Of course, all I get is silence, and a 500 error.)
Harsh Language (Score:2)
The earlier the better, in my opinion. There's nothing wrong with a little harsh language.
how abouts... speak and spell!!! (Score:1)
E.
ti99/4a (Score:2)
The ti99/4a WOULD say the bad words, and it wasn't a speech synthesis card, it was a module that plugged into a port on the right side of the keyboard/console.
I liked my ti99/4a, my mom ran a ti education center, and had a network of 14 of them with the BIG expansion cases.
A host is a host from coast to coast, but no one uses a host that's close
Re:Daisy, Daisy... (Score:2)
Re:I can't wait for MS to get a hold of this (Score:1)
this could be Tipper Gore's dream come true (Score:2)
Re:Send the process a SIGHIP... (Score:1)
the GNU solution to this (Score:2)
Bryguy
Speak-n-Spell was better (Score:1)
Re:The TI-99/4A WOULD swear! (Score:1)
TI-99/4A (Score:1)
Ask Jamie about that (Score:2)
I dunno -- remember Jamie's article about the kids who screwed up the output of fortune to HTML [slashdot.org] and put the words "I put the shotgun in an Adidas bag and padded" on their web site? Where Jamie was irate that the police asked them some questions before realizing it was a misunderstanding and dropping the matter? Where Jamie and one boy's father were claiming that the kids now have a "police record" (absolutely false, according to the usual understanding of that term)?
In the course of showing what obviously angelic youngsters these are, Jamie writes:
..whose domain name contained the school's name and the Fword. This is a word, by the way, which G. obviously typed in to register the domain but which he was too polite to use over the phone. By the time we hung up, he had me embarrassed for saying it.
Uh, yeah. The owner of fuckwestbeverlyhigh.com is a candidate to replace Miss Manners.
Unsettling MOTD at my ISP.
Re:Or things which simply /sound/ like obscenities (Score:2)
The Fokker F-100 etc. are airliners seen around the US fairly frequently. American Airlines, for one, have a bunch. I think ATC get tired of hearing:
Approach: Cessna 123, make a left base for 22, traffic ahead is a Fokker, caution wake turbulence
Cessna 123: I've got the Fokker in sight.
Simpsons, Futurama and Hawking (Score:1)
"I call it a Hawking chamber."
Eerie.
hm... (Score:1)
Some
ha det godt,
Platy
Re:Or things which simply /sound/ like obscenities (Score:1)
So you explain that, in fact, "Fokker" isn't a bad word after all, but the name of an aircraft.
To which the old pilot replies, "Yeah, but these fokkers were Messerschmidts."
Re:Bell Labs, is this the best you can do?????? (Score:2)
Human speech (at least in English) is very complicated - it's not just a matter of translating symbols one-to-one into sounds. English, according to my copy of the Big Book of Linguistic Facts (tm), has ~40 vowel sounds (including regional variants, but not including diphthongs). And 5, maybe 6, vowel letters. So the computer has to perform some fairly elaborate calculations that most humans can do "automatically" in order to decide which sound snippet to use, how to tweak it and blend it with surrounding sounds, etc, etc - personally, I thought that the output from the Bell program was quite impressive.
Look at http://www.bell-labs.com/project/tts/tts-overview. html [bell-labs.com] for more information on how they built this system:
etc, etc, etc
Re:Or things which simply /sound/ like obscenities (Score:1)
Don't snicker at the 'non-prude' checkbox (Score:1)
Come to think of it, I'm surprised I haven't heard from her attorneys yet since I still rip her publicly for being a litigious moron every chance I get.
~Philly
slashdotted - Mirror (Score:5)
I can do the male voice, the female voice, in French, Dutch and English.
TI-99/4A + a pack of 14-yr-old boys = (Score:1)
More fun than a TI-99/4A with speech-synthesis card. Those wouldn't say the bad words at all.
Yeah, the TI-99/4A voice synth was quite limited, but we ended up with...
Re:Zis the best we can do? (Score:1)
Delicate Sensibilites... (Score:2)
Anyways, the thing is that it's like TV. They don't want it that something this widely used might have obscenities. There is enough parents that would raise hell over something as small as that, and likely, they just don't want to deal with it. That's all.
breakup 2 (Score:1)
Re:The checkbox makes sense... (Score:1)
You've got to hope the legal dept knows that wouldn't acheive anything. Are the engineers that silly?
---
Re:DeCSS (Score:1)
Cool! (Score:2)
Dancin Santa
A notable example... (Score:1)
Re:Zis the best we can do? (Score:1)
Advanced TTS systems are becoming more and more like giant databases of prerecorded words. There are only 600,000 or so English words. Record them all and add a few decent continuous-izers, inflectors, and crazy/baby/big man-izers, and you're done. In 10 years, this will fit in a tiny corner of your hard drive.
The options are inadequate (Score:3)
Re:TI-99/4A (Score:1)
Re:I can't wait for MS to get a hold of this (Score:1)
Re:How these text-to-speech synths work (Score:1)
Re:I can't wait for MS to get a hold of this (Score:2)
Instead of what you said, you'll get a receipt printed out
Or:
**computer voice** you are fined one-half credit for violation of the verbal parameter
As in "Demolition Man."
Along with a receipt of demerit from M$, which would be a fun alternative to toilet paper!
Much better TTS: IBM's ViaVoice (Score:1)
Re:I can't wait for MS to get a hold of this (Score:1)