Opera Promises Voice-Operated Web Browser 352
unassimilatible writes "Opera's latest browser talks and listens, according to AP.
The new browser incorporates IBM's ViaVoice technology, enabling the computer to ask what the user wants and "listen" to the request. "Hi. I am your browser. What can I do for you?" asked a laptop with the demonstration versions of the browser. The message can be personalized, such as greeting users by name. The computer learns to recognize users' voices, accents and inflections by having them read a list of words into a microphone. Opera plans to first launch an English version of the voice browser for computers running the Windows operating system. Versions for other systems, including handhelds, will follow. Opera's press release has more details, including Opera's hopes that people will adopt this technology for presentations - and to replace PowerPoint."
i can hear see it now (Score:5, Funny)
Re:i can hear see it now (Score:3, Funny)
Re:i can hear see it now (Score:2)
man, i wrote a voice-command processor for explorer on mac in applescript back in, uh, 1998.
Re:i can hear see it now (Score:2, Funny)
Re: Thermonuclear War (Score:3, Funny)
User: I want to play thermonuclear war.
Computer: Wouldn't you rather play a nice game of chess?
User: No goddammit, I want to nuke, not puke!
Re:i can hear see it now (Score:5, Funny)
And, since pr0n is not an English word, you'll get this [google.ca].
Re:i can hear see it now (Score:5, Funny)
Re:i can hear see it now (Score:5, Funny)
Re:i can hear see it now (Score:3, Funny)
Hmm.. hands free surfing for pr0n. Imagine the possibilities.
Re:i can hear see it now (Score:5, Funny)
page loads automated audio file
browser: "close all other tabs"
user: "what the hell!"Re:i can hear see it now (Score:4, Informative)
It's okay. Opera lets you suppress those annoying automated audio clips. Hit F12 (opens the Quick Preferences menu) and uncheck 'Enable embedded audio'.
The same menu also contains all the popup killing settings ('Open requested popups only' works quite well) and cripple some other annoyances of the web (uncheck 'Enable plugins' and possibly 'Enable Javascript'.)
Cheers.
a few things to say... (Score:5, Interesting)
as for their statement about it being a replacement for powerpoint, I don't think that this will fly either unless they either: a) find a company to make a powerpoint alternative which saves to html files b) make the aforementioned software themselves. Even if they accomplished that, people's stupidity and ignorance has proven time and time again that whether microsoft's software is better, worse, or just as good as its competitors- people will buy microsoft's software instead of others. Look at openoffice.org, mozilla (most people use ie)/opera/konquer/galeon/netscape/etc, linux, amd a bunch of other superior software. Maybe a couple could be explained (linux often involves use of the command line interface, netscape is slower to load (even though ie cheats by loading some of the program at startup time)), but most of it is due to a problem which exists somewhere between the keyboard and the chair. Besides, I would find a remote control a better option than speech, since a remote control wouldn't force me to scream "NEXT SLIDE" across the room like an idiot before it recognizes what I'm saying. It would also be much smoother to just press a button on a remote control.
Re:a few things to say... (Score:5, Insightful)
Re:a few things to say... (Score:3, Interesting)
So many times while putzing around the house or driving I've wanted to bark out a command a la Star Trek and having Google answer me. Very cool.
Although if it chimes in with - "It sounds like you are trying to browse th
Re:a few things to say... (Score:5, Insightful)
That being said, this will likely make life better for people with severe spinal injuries or others with limited use of their hands. Kudos to Opera.
I had a parapalegic teacher in college.. (Score:5, Insightful)
PDAs? (Score:5, Insightful)
Imagine a PDA that you can actually talk to instead of having to struggle with "Graffiti" or the little thumb keyboards. Hell, if it's good enough, you could even get rid of the need for a screen and just interact entirely through voice - here's how we could finally get a useable web browser/email client/schedule program in a watch!
One step closer to some of the concepts explored in Snowcrash, maybe?
Re:a few things to say... (Score:2, Insightful)
Well, while you probably have the option to pick your keyboard, there are many handcapped people in the world that would find amazing just surf the Web all by themselves. This will be much more than a toy for them.
Re:a few things to say... (Score:5, Interesting)
And as for presentations, who says you have to stop your speech to scream "NEXT SLIDE". Imagine a presentation package capable of picking up from your presentation exactly when you'd like the next "slide" (useless word since you could now do much more than you are constrained with using Powerpoint).
Imagine, during a presentation, being able to say "If you look at the sales figures for the year..." and have your presentation automatically display those figures.
Re:a few things to say... (Score:2)
If successful, actually doing what it promises, this application by Opera could make his browsing life easier, and much more interactive.
Re:a few things to say... (Score:2)
Re:a few things to say... (Score:2)
Options include:
a) Save PowerPoint presentation to HTML using PowerPoint itself.
b) Save PowerPoint presentation to HTML using OpenOffice
Of course, you're going to lose all your fancy wipes and fades, but to those I say "good riddance!"
Re:a few things to say... (Score:5, Interesting)
OpenOffice [openoffice.org] can save to HTML and Flash files from Presentations.
Even if they accomplished that, people's stupidity and ignorance has proven time and time again that whether microsoft's software is better, worse, or just as good as its competitors- people will buy microsoft's software instead of others. Look at openoffice.org, mozilla (most people use ie)/opera/konquer/galeon/netscape/etc, linux, amd a bunch of other superior software.
People buy Microsoft software because they are
a.) not familiar with the competitors
b.) worried about compatibility with the rest of their microsoft software
c.) do not want to retrain staff
d.) need feature X which competition lacks
e.) work for Microsoft or are otherwise affiliated with them.
f.) do not trust an unproven product (in their eyes) and don't want to be the guinea pigs
Point being, as other software matures it will be harder and harder for Microsoft to release sub par software and expect a solid buy in. If you look at Mozilla it's growing speed very fast now, I know a number of Windows users that aren't even very technical that use FireFox and/or Mozilla. Look at OpenOffice, Microsoft is killing themselves with their own Doc standard. They can't move future iteratios of Office to abandon or morph the compatiblity of
As far as Opera's voice operated browser goes I think this is great, especially for disabled and handicapped people. I also think there's a certain appeal to be in front of a board and say Next slide to your openoffice html/flash presentation and have it progress. I mean what a way to impress.
Re:a few things to say... (Score:3, Informative)
http://wp.netscape.com/newsref/pr/newsrelease22
The voice recognition was OK, and it was quite easy to navigate from website to website using bookmarks and links in the page.
-Jim
But I don't wanna talk to my computer (Score:4, Funny)
What could I possibly have to say to my browser?
Re:But I don't wanna talk to my computer (Score:2)
Re:But I don't wanna talk to my computer (Score:3, Insightful)
Agreed. While there are some cases where voice-activated technology has its uses (I very much doubt people would be thrilled with typing into their onboard navigation systems while driving) a web browser or other common features on your computer simply don't need speech recognition.
For Joe User, I doubt we'll ever see widespread use of speech recognition technology. Who wants to go hoarse telling a computer what to do when it only takes a flick of the wris
Voice activated (Score:5, Funny)
Now the jerk in the cubicle next to me will talk both with himself, "the fairies" and his browser.
Re:Voice activated (Score:5, Funny)
Re:Voice activated (Score:3, Funny)
....Step into his cube...
hey bob, HOT GRITS...hows that spreadsheet comming? I got some time off coming going to head down to my brothers GOAT farm in middleSEX, could you water the plant in my cube? thanks you're the BREAST.
You know, now that I think about this - I am going to love it.
Word Processing is clunky, will this be better? (Score:5, Insightful)
I've worked with and supported both ViaVoice and DragonNaturallySpeaking solutions for voice-based typing in word processors, and neither of them felt natural. Perhaps because I'm a geek, or just because I've been doing it so long, I'd rather manually key in exactly what I want and let myself make the mistakes, not the interpretation.
With corrections, it always took longer to do the alleged "easier way" than manually keying in. Even with 99% accuracy, Word Processing was always clunky at best.
That, and every time I scream out "litigious bastards", I don't need it pulling up litigious bastards [thescogroup.com].
Re:Word Processing is clunky, will this be better? (Score:2, Funny)
Psht. And wrong, too. The most natural and effective way we communicate is through body language.
Give me a ring when they invent a web browser that scrolls down when it sees my eyes get to a certain part of the page, or clicks "back" when it sees my jaw slack in boredom.
Or, better yet, automatically browses to another, non-porn, page while the girlfriend/boss is still walking down the hall...
Slash dot (Score:5, Funny)
Re:Slash dot (Score:5, Funny)
"Cannot connect to http:///..org"
http:///..org (Score:5, Funny)
Re:Slash dot (Score:5, Funny)
wow. sounds almost obscene.
Re:Slash dot (Score:2, Funny)
It will sound like you are stuttering.
Re:Slash dot (Score:2, Funny)
voice operated? (Score:2, Insightful)
Re:voice operated? (Score:2, Interesting)
Re:voice operated? (Score:2)
ScanSoft's Dragon Naturally Speaking is amazing, does some outstanding voice recognition. Of course, you're shelling out $600 (plus $75 on a good mike) so it damn better work well, and it does.
I didn't see an price listed in the Opera press release. If it incorporates the full ViaVoice software, then it may work well. Of course, you still need a good mic, that makes a huge difference in the quality of voice recognition.
Re:voice operated? (Score:2, Insightful)
It was pretty effective as far as it went, but not a total solution. I used commands to launch all my common programs, and common File: and Edit: commands.
(Photoshop. Select all. Copy This. Quark Expre
Re:voice operated? (Score:4, Funny)
I have had some success with "hardware", though. The other night I called home and asked my daughter to tell me the address of a shopping mall I was looking for. She googled it, clicked around, and a few seconds I had the address. That's the kind of thing I wish voice recognition apps could do!
We've seen this before.. (Score:2)
Hard to manage tech (Score:5, Funny)
Voice input and output.. that'll make it a lot harder to discreetly search for pr0n whilst at work.
Computer: "Hi. I am your browser. What can I do for you?"
User: [whispering]Find me "porn"...
Computer: "The band KoRn was formed in 1993 by Jonathan Davis and..."
User: NO! [whispering] Not "KoRn"; "porn".
Computer: "Clogged pores are the major cause of adolescent acne. Starting at puber..."
User: NOT "PORE", DAMMIT!!! [coughs, lowers voice] find me "porn"..
Computer: "Iron Ore is the primary ingredient in steel. Metalurgists will add other elements and compounds to give the steel certain proper..."
User: NOT "ORE", YOU PIECE OF SHIT! [office mates look over cubes] [whispers] Look.. I want to look at naked people..
Computer: "The goatse.cx lawyer has informed us that we need a warning! So.. if you are under the age of 18 or find this photograph offensive, please don't look at it. Thank you!"
The catch... (Score:5, Funny)
"Is this the real life, is this just fantasy..."
Re:The catch... (Score:4, Funny)
no escape from reality. (Ms Windows)
Open your eyes,
look up to the skies and see.... (Mozilla)
I'm just a fool boy,
I don't need sympathy (Linux user)
Cause I'm easy come, easy go
Little high, little low (Mac OSX User)
Any way the wind blows,
doesn't really matter to me... (Windows BSOD)
Now I've got this song stuck in my head. =)
-Cyc
Voice activated Powerpoint? Uhm, no... (Score:3, Insightful)
Come on, this technology has existed for the TV weatherman for years. Why hasn't anybody gotten it right for PowerPoint users yet?
Re:Voice activated Powerpoint? Uhm, no... (Score:2)
Re:Voice activated Powerpoint? Uhm, no... (Score:3, Interesting)
But personally, I think this has great potential for presentations, without disrupting them - especially if you could control the commands used to advance each slide. For example, if you could program the transition to a sales figures slide to be triggered by the words "sales figures for 2002", then it would automatically pull up the right slide when you say "Now let's look at the s
Please disable the... (Score:2)
Regards,
---
"Two sure ways to tell a sexy male; the first is, he has a bad memory. I forget the second."
Will there be a Majel Barret plug-in? (Score:2, Funny)
Let's push the sedimentary lifestyle more.. (Score:3, Funny)
Re:Let's push the sedimentary lifestyle more.. (Score:2, Funny)
Re:Let's push the sedimentary lifestyle more.. (Score:4, Funny)
Depends. How long has he been sitting there?
And then the browser said: (Score:4, Funny)
Through some careful configuring... (Score:5, Informative)
What would be nice is if 'Speech' could recognize the commands for a particular application without switching focus. So I could be coding on one screen while browsing on another.
security risk (Score:4, Funny)
Homophones... (Score:5, Insightful)
English is the problem. (Score:2, Interesting)
Re:Homophones... (Score:5, Funny)
Absolutely - using Dragon Dictate I once asked my browser to go to hotmail.com... I ended up at hotmale.com and that phrase has now become my test for dictation software!!
Re:Homophones... (Score:5, Insightful)
Re:Homophones... (Score:3, Insightful)
Re:Homophones... (Score:3, Interesting)
One potential workaround is to have a short period of 'sensitivity' after common homophones.
For examaple the speaker says 'Final 4' but the browser types 'Final for'. The software recognizes that 'for' is a common homophone and waits a *very* short time ( a second or two ) after the uttering of 'for' for *another* occurance of 'for', which would imply a correction. Also an occurance of a special word eg. 'no', followed
Re:Homophones... (Score:3, Funny)
There are many words in the English language that have homophones.
Eye for won due knot sea awl aught of miss takes re: salting from hoe mow phones. Inn fact, eye am you sing the pro gram write now.
Gimmicky blah blah (Score:4, Funny)
I mean, tabbed browsing is cool, I've gotten used to it. But stuff like mouse gestures, voice recognition, etc, all just seems like fluff.
One could have mapped spoken keywords to mouse/keyboard actions already if this is what they wanted.
It's a hard arena to innovate in. This just seems kind of silly.
What's next, support for force feedback chairs that scroll the browser based on which ass cheek I'm clenching?
Accessibility, accessibility, accessibility... (Score:4, Insightful)
To you, it might be a gimmick. To someone with a disability, this could make life a lot easier.
Try mouse gestures. You might be suprised. (Score:3, Interesting)
Personally I did it because I didn't like how much space the icon toolbar was taking. My use of opera also opens most pages in other windows.
So for instance to reply to you I rightclicked the link and moved the mouse down a bit opening
Re:Mouse gestures... (Score:3, Informative)
Ah, but I use mouse gestures with Firefox every day! There are extensions that add this funcionality. Go to http://texturizer.net/firefox/extensions/ [texturizer.net] and look at the "Mouse gestures" section. I personally use Radial Context -- it's basically mouse gestures w/ a GUI that helps you remember little-used commands.
Browsing with people is a pain (Score:5, Interesting)
The point here is when it's hard to instruct intelligent people how to browse the web, how well can a computer do it? I have my doubts.
Nahhh.. (Score:2)
An important step in computer interaction (Score:2, Insightful)
For a user such as myself a keyboard and mouse is presently more intuitive, but eventually this sort of software should prove very useful, especially as computers become more fully integrated into our lives.
This technology might also be useful with a couple of modifications, for the blind if it was coupled with one of those applications that reads the text from the screen for you.
I hope the next step would be interf
other browser too (Score:2)
link [pocketpcthoughts.com]
It is a multimodal browser, which means it supports VoiceXML basically.
Opera is making one, and so is NetFront (a PPC browser)
speech recongnition... (Score:5, Funny)
"Now gentlemen, pleese-a turn your ettenteeon to-a sleede-a twelve-a. bork!bork!bork!"
It may have a niche. (Score:5, Interesting)
I can't see this having wide acceptance in the corporate world. Cube farms are noisy enough. I can't imagine what it must sound like for everyone to be browsing by voice.
I also can't imagine some of my co-workers saying the addresses of what they browse out loud. *shudder*
Great concept for people with Diabilities (Score:5, Insightful)
the Prez is gonna love this... (Score:5, Funny)
(note to dems, i'm not a troll, i'm canadian)
-
The first thing that popped into my head... (Score:2)
Our browsero, isa beterro, then de Internet Exploraho
(on close...)
OperaMAAANNNN, bye BYE!
Maybe when AI is done (Score:4, Interesting)
Once we have a computer that can do this, we'll have great interfaces - it will be like robo-butler. But we're not there yet, and robo-idiot-child - "I thought you said Quick Bananas, so I googled and we're at the Dole website" - is only going to make things annoying.
It will be a boon to those who can't use point and click for whatever reason, and ignored by everyone else.
In Oslo, Norway (Score:3, Funny)
I'm sorry, but I had to do it just once.
Digital living (Score:2)
Can it then also recognize my behavior patterns and crack open a Pabst and adjust my USBchair eveytime I tell it to launch Quake 3?
AAaaaah....the digital life.
This will be great! (Score:2)
oh god i hope it reconizes hotmail.com and not (Score:2, Funny)
I've tried this with Dragon (Score:5, Interesting)
I got a free copy of dragon dictate once so I trained it as much as possible.
I got mozilla working quite happily, 'down' 'up' 'slow' (that was a good one, it slowly scrolled down), 'back' etc.etc.
the thing I found after weeks of training that it was just so tiring talking all the time
Clippy's back... (Score:2)
SHUT UP!
"Are you sure you want to close the window?"
good, because some of us are wishing for this... (Score:2)
Looks nice on paper.. (Score:2)
It would be annoying as SHIT to hear someone constantly talking giving commands to their computer. It brings to mind that came coming out for PS2.. how annoying. "Move left, move right. Check the drawer. Fire. Fire fire, move left, fire."
It seems more like a gimmick.. like "Hey, look what we can do!" more than something that would actually be beneficial or innovatice. It might be fun for the fi
Now we need filters more than ever! (Score:2)
What about misunderstandings? (Score:2)
Already got it (Score:5, Informative)
Also as the article notes one can buy more extensive add-on products like IBM's Mac/PC ViaVoice & Dragon's family of products as well as numerous other lesser-known and more specialized ones.
That's today, already on millions of desktops, ready and capable of driving web browsers, sitting there ignored.
Why?
Only a Gimmick (Score:2)
I had OS/2 with voice built into the OS, allowing all the software to operate by voice.
Unfortunately, while it was an interesting feature, it was more of a gimmick. I can type far faster and more accurately than I can talk.
The only real market for this product IMHO is for those who are unable to type, but I'm not at all certain that a large portion of this market - technophobes - can be sold.
What will it do when you close the browser? (Score:2)
Voice internet... (Score:4, Interesting)
Imagine a simple voice interface for limited internet functionality. Place microphones and speakers around the house. Now, when I'm sitting on the couch reading a book and I come across I word I haven't seen before, I can say "Hey Frank, lookup the word '...'." Need the weather? "Hey Frank, what's the weather report?".. Etc, etc..
It should be fairly simple to tie a speech recognition engine to some python scripts to perform simple queries and return a parsed result ready for text-to-speech conversion. One big problem the dictionary feature brings out is how the speech recognition would handle unfamiliar words. Even leaving that feature out, it would be nice to have a limited set of features I could use anywhere in the house.
Use some sort of unique gating phrase('Hey Frank!') and look for the nouns and verbs to give it some flexibility.
Re:Voice internet... (Score:3, Interesting)
As seen in Futurama! (Score:3, Funny)
3ACV04 - Luck of the fryish
Presentations? (Score:5, Funny)
Next generation (Score:3, Interesting)
That system was simpler, since it couldn't rely on special voice-HTML markup tags. It took advantage of the fact that any UI element (menu item, button, etc.) in the system can be activated by speaking its text. So they added a quick Hack to Netscape so that every link's text (or ALT text) visible on a screeen would be present on a "Links" menu - thus turning the links into speakable keywords.
It worked very well for browsing. Much less well when you want to enter new URLs. The dictation mode left a bit to be desired. But that was to be expected from the hardware of the time. Voice recognition on OS/2 required a minimum of a 150MHz Pentium, IIRC. (It would work - with much latency - on my 80MHz 486, however.)
This and NASA subvocalization. (Score:3, Interesting)
After NASA announced their subvocalization project (I'm too lazy to find the slashdot URL... someone earn karma for it!) I became excited again.
The problem is if you're in an office you can't just start talking. Right now there are 10 people around me and most people are silently working on there computers. If they all started barking commands it would be loud as hell in here. It just doesn't scale.
If you add the subvocalization work this totally changes the equation. Now I can silently tell my computer to do things while my hands type away.
This is going to ROCK. Talk about multitasking... I can be typing out this slashdot post and without stopping I could launch gaim, ymessenger, make sure I'm on IRC... startup Emacs in the background , etc.
w00t!
Gimme gimme! $100 says the Mac has this next year and Linux has it sometime around 2015.
Here's Hoping... (Score:5, Interesting)
That said, I think the most crippling thing about PowerPoint is its linearity. Not all presentations "want" to be laid out into a preset order of points. If a college professor or a businessperson gets asked a question during a presentation, all too often it is diverted by saying "well, that's coming up in a few slides", or the presentation is interrupted as tangential data is introduced.
Using voice recognition instead of click-through navigation opens up some great possibilities for non-linear presentations, though. Imagine that, instead of organizing your presentation into a linear timeline, you group slides and other media into "points", each of which represents a different idea relevant to your talk. You can arrange these points into a web, indicating what information depends on prior knowledge from other slides, etc. You then assosciate each point with an audio "cue", say a phrase like "projected profit margins" or "the three kingdoms period". You'll note that these phrases are things you're likely to naturally utter in your presentation anyway. This has the advantage of enabling you to speak totally naturally without interrupting your presentation. To avoid accidental jumping, we would have, say, a little translucent blue arrow fade into being every time a cue is recognized, disappearing a few seconds later. If you actually want to jump to a new point, it's just a quick click of a button when you see the blue arrow.
So, imagine you're giving a sales presentation to a group of executives. You notice this particular group is getting bored with your standard sales pitch. No problem, as you just drop a key phrase into your speech, and instantly change your presentation to include information you think will appeal to the business interests of your audience, or simply to their personality. Or, imagine a professor is giving a lecture on a peice of literature. A student asks a question about the author's background, and the professor can easily insert some information on their country, their historical circumstances, and their life.
Of course, organizing this type of presentation requires a greater investment in planning, and certainly requires a little more cognitive ability than your standard PowerPoint fare. However, those who work with these new presentation systems will be giving themselves an undeniable competitive advantage over presenters using linear methods. And those in the audience will be grateful, I'm sure.
Re:OS/2 v4 sort of had this... (Score:3, Interesting)
The WorkplaceShell was, and still is, the most incredible desktop I've ever seen or d