VoIP Calls Double In Quality 116

Posted by Hemos on Monday July 17, 2006 @12:03PM from the sounding-better-and-better dept.

anthm writes "From Newsforge and LinuxPR
FreeSWITCH, an open source soft-switch and IVR platform, have announced that they can support 16khz audio calls thus doubling the potential voice quality. They have had successful tests with a conference bridge, a pass-through SIP call and an IVR that reads RSS news feeds with the Cepstral Text-To-Speech Engine."

Voip-Info.org has a good list of business VoIP providers.

This discussion has been archived. No new comments can be posted.

VoIP Calls Double In Quality

Load All Comments

Search 116 Comments Log In/Create an Account

Comments Filter:

Please get the rest of the telcomms to follow. (Score:4, Informative)

by rob_squared ( 821479 ) writes: <.rob. .at. .rob-squared.com.> on Monday July 17, 2006 @12:06PM (#15731917)

Everything else is stuck at 8khz, so unless your call uses this service end-to-end, there's going to be a downconversion if you're calling someone on a land line. And you'll be stuck with 8khz if you get any calls from someone not on this service.

Still, its a good piece of news, onward and upwards.

*crosses fingers* Please nobody mention video phones. *crosses fingers*

Share
twitter facebook
- Re:Please get the rest of the telcomms to follow. (Score:4, Funny)
  
  by joe 155 ( 937621 ) writes: on Monday July 17, 2006 @12:11PM (#15731966) Journal
  
  I'll agree to some extent that this is good news, but my friend, 16khz is a lot of packages which will have to be squeezed through the current pipes which I recieve my internets through; so this will make the speed of internets go down to no faster than the standard post. Did you know the other day I got an internet which had been sent on Friday! *mubbles to self*...
  
  Parent Share
  twitter facebook
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by a_nonamiss ( 743253 ) writes:
    
    It's just a bunch of pipes...
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by SanityInAnarchy ( 655584 ) writes:
    
    Stop hitting the pipes, Ted. You want to be sober for this legislation thing!
  - - Re:Please get the rest of the telcomms to follow. (Score:2)
      
      by SanityInAnarchy ( 655584 ) writes:
      
      While we're originally making fun of him for the word "tubes", that's only because it's easily memorable/recognizable, compared to the near-incomprehensible comments about a staffer sending an Internet to him...
      
      The real reason we're making fun of him? In the words of John Stewart, "Maybe it's because you don't know jack-shit about the Internet."
- Re:Please get the rest of the telcomms to follow. (Score:2, Informative)
  
  by anthm ( 894202 ) writes:
  
  Yes, you are correct. The benefit comes when both ends of the call are using a 16khz device. Situations where you are connecting to the PSTN would obviously be better suited at 8khz. The media description on the pstn gateway would advertise only 8k so the client would know better than to operate at a higher frequency.
- Re:Please get the rest of the telcomms to follow. (Score:2)
  
  by Jeff DeMaagd ( 2015 ) writes:
  
  Is there a particular reason why it needs to be 16kHz? It's just vocals, what are the upper limit of typical vocals? 8kHz would seem to be plenty unless you are trying to play a violin or something.
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by CastrTroy ( 595695 ) writes:
    
    This way once they take away all the other options we can still use the VOIP system to pirate music.
    - Re:Please get the rest of the telcomms to follow. (Score:1)
      
      by cb0nd ( 893473 ) writes:
      
      Yes, but then there would be DRM to stop those VOIP using thieves and pirates. Not to mention that, even worse, you can actually use VOIP to share lyrics with the songs.
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by Detritus ( 11846 ) writes:
    
    It's also lower distortion and improved signal-to-noise ratio.
  - Re:Please get the rest of the telcomms to follow. (Score:1)
    
    by warsql ( 878659 ) writes:
    
    Oh, just like 640k ought to be enough for anybody
    I know, I know, he never said it.
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by amorsen ( 7485 ) writes:
    
    The real limit is 4kHz, you need 8kHz sample rate to reproduce that. 4kHz is low enough that it's dissicult to diftinguifh between s and f.
  - Re:Please get the rest of the telcomms to follow. (Score:2)
    
    by timeOday ( 582209 ) writes:
    
    Is there a particular reason why it needs to be 16kHz?
    Maybe it will help me understand consonants better.
    Personally I'm disappointed that this is considered impressive. Since it's limited to pure VOIP calls, it should be as simple as selecting a bitrate when you encode an MP3 or record a show on your PVR.
  - Let me be the first to say... (Score:2)
    
    by jheath314 ( 916607 ) writes:
    
    16 kHz should be enough for anyone ;)
  - Speex Wideband Codec (Score:2)
    
    by billstewart ( 78916 ) writes:
    
    Newsforge has no technical information, and Freeswitch is largely Slashdotted, but there's one sentence that says that they're using the Speex Wideband Codec [nyud.net] as their 16kbps codec. One reason Speex is using 16khz sampling is because it's relatively available on PC sound cards, but another reason is that they do a cute sub-band coding technique - instead of representing the 8kHz analog waveform by directly encoding the 16k samples/second, they split the information into two bands - 0-4kHz which they encode
- Re:Please get the rest of the telcomms to follow. (Score:1)
  
  by mattavian ( 988882 ) writes:
  
  how many kHz does a videophone need to work?
  - Re:Please get the rest of the telcomms to follow. (Score:1)
    
    by denttford ( 579202 ) writes:
    
    The kHz here refers to the sampling rate of the audio [wikipedia.org] (and thus an aspect of its quality and does not directly affect the size of the tube needed - only in analogue communications is a direct (and literal) measure of bandwitdh.
    Nobody does videeo conferencing over analogue lines [wikipedia.org] anymore - if they aren't sent over the internet, then they use some sort of leased line or ISDN connection, etc. Incidentally, and this probably will confuse the issue, but TV broadcasts use ~6mHz over analogue connections for one
    - Re:Please get the rest of the telcomms to follow. (Score:1)
      
      by mattavian ( 988882 ) writes:
      
      (j/k) re: parent.
      - Re:Please get the rest of the telcomms to follow. (Score:1)
        
        by Andrew Kismet ( 955764 ) writes:
        
        your sarcasm/humor was lost in the internets. Perhaps you need to put your posts at a higher encoding to make sure you can tell the difference between serious questions and jokes?
      - Re:Please get the rest of the telcomms to follow. (Score:1)
        
        by denttford ( 579202 ) writes:
        
        heh.
        
        sorry, seen such dumb things of late here (there are eight bits in a byte! [slashdot.org]) it's hard to tell anymore.
- Re:Please get the rest of the telcomms to follow. (Score:2)
  
  by riflemann ( 190895 ) writes:
  
  Everything else is stuck at 8khz, so unless your call uses this service end-to-end, there's going to be a downconversion if you're calling someone on a land line. And you'll be stuck with 8khz if you get any calls from someone not on this service.
  
  Even more worrying is that you will get progressively worse audio quality through the telephony chain as the audio undergoes several up and down conversions in sample rate.
  
  One of the really neat things about the common exiting audio codecs in use for telephony (G
  - Multiple codec conversions *are* really bad (Score:2)
    
    by billstewart ( 78916 ) writes:
    
    Don't know where you got your information, but it's unfortunately incorrect. There are some of the early ADPCM codecs (e.g. 32kbps) that do the same transformations every time, so you can convert from 64 to 32 to 64 to 32 to 64 again without additional damage, but most of the newer high-density codecs take a significant hit if you do them two or more times, e.g. 64kbps to 8 to 64 to 8 to 64. I've forgotten the precise MOS scores, but the standard G.729 codec family goes from "better than a cellphone with
Good Work (Score:3, Insightful)

by kasgoku ( 988652 ) writes: on Monday July 17, 2006 @12:14PM (#15731981) Journal

good work there, but all you need is to get the message across. its not like u r singing on the phone and need good voice quality. just do what's needed.

Share
twitter facebook
- Re:Good Work (Score:2, Insightful)
  
  by Tychon ( 771855 ) writes:
  
  But for those of us with a bit of trouble hearing, or when speaking with a person that has a thick and or foreign accent, that extra quality is the difference between a conversation and a stream of "What'd you say?"
- I need it (Score:2)
  
  by r00t ( 33219 ) writes:
  
  Think of the hold music.
  
  Now imagine that it responds to button presses so you can change songs.
  
  "Operator... oh won't you help me make this call..."
So what? (Score:5, Insightful)

by Spazmania ( 174582 ) writes: on Monday July 17, 2006 @12:16PM (#15731996) Homepage

So what? If you're going to up the sampling rate why not go directly to 44khz stereo (CD quality audio) and be done with it? Jumping from the telephony industry standard 8khz to 16 khz is thoroughly uninspired.

Share
twitter facebook
- Re: (Score:1)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
- Re:So what? (Score:3, Interesting)
  
  by xachen ( 967588 ) * writes:
  
  If you find a codec that does 44kHz stereo, FreeSWITCH will do this. It has no hard limit in it and is variable to any rate! This is just awesome!
  - Re:So what? (Score:2)
    
    by SirDaShadow ( 603846 ) writes:
    
    If you find a codec that does 44kHz stereo, FreeSWITCH will do this. It has no hard limit in it and is variable to any rate! This is just awesome!
    
    Get foobar 2K and grab the free mp4+SBR codec from nero. You can turn a cd quality stereo signal (mp3, whatever) into a svelte, 16kBIT/s 44khzs/stereo (!) signal without much quality loss (well at least compared to current telephony anyway...)
- Re:So what? (Score:1)
  
  by frequenicity ( 989392 ) writes:
  
  What would be the point of going to 44khz stereo for a mono signal?
  - Re:So what? (Score:1)
    
    by zenslug ( 542549 ) * writes:
    
    Make it stereo then. Makes video conferencing even better.
    - Re:So what? (Score:1)
      
      by frequenicity ( 989392 ) writes:
      
      I understand that. I guess I am asking what the advantage for this would be. The audio that you are recieving is coming from a single (mono) source...so it seems to me that boosting the quality to 44khz stereo would really be wasted bandwidth.
      - Re:So what? (Score:1)
        
        by zenslug ( 542549 ) * writes:
        
        use two mics
- Re:So what? (Score:1)
  
  by treeves ( 963993 ) writes:
  
  . . .and I wouldn't call it a doubling of quality. The improvement of adding one octave of high frequency is, subjectively, less than a doubling. A human voice is quite intelligible with no frequencies above 4 kHz or so present.
  - Re:So what? (Score:1)
    
    by billcopc ( 196330 ) writes:
    
    Dude.. have you never worked in a call center ? I would kill to have a phone system that runs at 16khz, better yet 32khz. Don't double the bitrate, maybe a 30% increase would be enough, just move the filter cutoff freq higher because not everyone's voice has intelligible transients in the low-khz range, often times those voices get wrecked by the filtering and all you can hear is mumbling, as if the caller were talking with the mouthpiece in their armpit :P
    
    Higher frequency from the source, then less aggre
    - Re:So what? (Score:1)
      
      by treeves ( 963993 ) writes:
      
      No, I haven't. Maybe that's my problem, I don't like talking on the phone to begin with, so I don't do it anymore than I have to, but I don't think it's because of the audio quality. ;-)
      Thanks for your comment - whoever designs and buys phones and voice networks should obviously give more weight to your opinion than mine.
  - - Re:So what? (Score:1)
      
      by treeves ( 963993 ) writes:
      
      I certainly did not say "no data above 4 kHz".
      No doubt there is a significant perceptible difference, it's just that in terms of *intelligibility* (what really matters for a voice phone call) there isn't THAT big a difference between 8kHz and 16kHz, certainly not a doubling.
      Put it another way, if 95% of listeners can understand a sentence uttered by a speaker at the other end at 8kHz, maybe 96% can understand at 16 kHz. And yes, I just pulled those numbers out of my posterior, but hopefully you get the
- Re:So what? (Score:1)
  
  by anthm ( 894202 ) writes:
  
  It's ok if you are not interested but there are some who are. Here is an article about some of the benefits. http://www.analogzone.com/nett0307.pdf [analogzone.com] as well as a wiki entry from http://www.voip-info.org./ [www.voip-info.org] http://www.voip-info.org/wiki/view/Wideband+VoIP [voip-info.org]
- Re:So what? (Score:2)
  
  by Vellmont ( 569020 ) writes:
  
  So what? If you're going to up the sampling rate why not go directly to 44khz stereo
  
  Because stereo would be a complete waste of bandwidth and processing power (one microphone, one speaker), and the human voice doesn't get anything near 22khz in frequency. Normal speaking voices have an even lower cutoff frequency. The CD standard is great for music, but complete overkill for sending voice.
  - Re:So what? (Score:3, Insightful)
    
    by cdrudge ( 68377 ) writes:
    
    Yeah, but the on hold music sounds great!
- Re:So what? (Score:1)
  
  by slyvren ( 989423 ) writes:
  
  YES! While we're at it why don't we all invest in stereo microphones and headsets. That way when we talk to our friends through the internets we can talk around the mic and make it sound like we're swirling in their heads... GENIUS!
- Because it covers almost all of the human voice (Score:3, Informative)
  
  by Sycraft-fu ( 314770 ) writes:
  
  Our voices don't have that wide a frequency range, there's little up in the high frequencies. A voice sample recorded at 22kHz (11kHz frequency range) is very hard to distinguish from one recorded at 44kHz (22kHz frequency range). In fact you'd need to be using a fairly good mic to really get much of the higher frequencies anyhow. 8kHz works since F1 and F2 (the frequencies of the first two peaks in the harmonic curve) fall under 4kHz for essentially all speakers. F1 and F2 are what we primarly use to deter
- Re:So what? (Score:1)
  
  by Brickwall ( 985910 ) writes:
  
  I wish some people here actually KNEW something about the telephone network. First off, there are still hundreds of thousands of miles of copper wire in the network. Much of it is connected to 'loading coils', which are essentially low-pass filters. Any frequencies over 4kHz are attenuated, so your 44kHz is just a dream. Telephone engineers knew that; that's why they picked the 8kHz sampling rate (Nyquist theory). Second, as someone else pointed out, there remains the question of getting every single telc
  - Re:So what? (Score:2)
    
    by Spazmania ( 174582 ) writes:
    
    You missed the point: with internet connections rapidly reaching video speeds and the telephone network very much tied to 8khz there is no value in having a 16 khz VoIP. If you're going to up the sampling rate only for VoIP, go straight to 44khz and be done with it. Don't brag because you were dumb enough to select a median value.
- Re:So what? (Score:2)
  
  by Antony T Curtis ( 89990 ) writes:
  
  So what? If you're going to up the sampling rate why not go directly to 44khz stereo (CD quality audio) and be done with it? Jumping from the telephony industry standard 8khz to 16 khz is thoroughly uninspired.
  
  16kHz is pretty similar to analog FM radio transmissions and people have been listening to music on that medium for a long time quite satisfactorily. Besides, if you want to have high fidility transmission of music over the internet, there is already pretty decent solutions with streaming ogg/mp3.
  
  IM
Define: IVR (Score:4, Informative)

by theGreater ( 596196 ) writes: on Monday July 17, 2006 @12:18PM (#15732019) Homepage

Google gives the definition of IVR [google.com] as Interactive Voice Response.

So I knew what one was, I just didn't know there was a TLA for them. This inane personal revelation brought to you by the captcha "accuse".

-theGreater.

Share
twitter facebook
Doubling? hardly (Score:4, Insightful)

by MacBoy ( 30701 ) writes: on Monday July 17, 2006 @12:23PM (#15732060)

I fail to see how adding one additional octave of frequency response to the 6 or 7 currently available, can be called "doubling" the quality.

Share
twitter facebook
- - Re:Doubling? hardly (Score:5, Informative)
    
    by jdmicklos ( 865404 ) writes: on Monday July 17, 2006 @12:33PM (#15732130) Homepage
    
    The only real advantage to adding in "unused" octaves is in order to transmit overtones. Overtones shape the sound you can hear even though they may not be hear directly. Think about it as if you were to have a G note at 120 dB playing in an octave that you couldn't hear. It would still cause all things around with a fundamental frequency that is a "G" to vibrate as well as color certain audible noises.
    
    Parent Share
    twitter facebook
    - Re:Doubling? hardly (Score:2)
      
      by timeOday ( 582209 ) writes:
      
      What? No. They're only talking about 16 KHz sampling here, which would capture sounds up to 8 KHz. You can hear 8 KHz directly, these are not "unused" octaves.
    - Re:Doubling? hardly (Score:2)
      
      by Jerry Coffin ( 824726 ) writes:
      
      The only real advantage to adding in "unused" octaves is in order to transmit overtones.
      
      I'm pretty sure his point is that it's only one more octave. What the phone companies consider an "ideal" response for a telephone line is a bandwidth from about 180 Hz to 3-4 KHz or so, with a signal to noise ratio of about 45 dB. That means an ideal POTS line starts with about 4.5 octaves of bandwidth, and this increases that to about 5.5 instead. IOW, even though it doubles the maximum frequency, the perceived c
      - It's for the consonants, not the vowels (Score:2)
        
        by tepples ( 727027 ) writes:
        
        This is a recording of a woman (more or less) singing a scale.
        Of course vowels aren't going to have a lot of content in the upper frequencies. Now try saying "This is the eighth utterance" into a microphone and see what doesn't happen. I did it myself, using a crossover at 4 kHz to split the signal into low-pass left and high-pass right channels. Listen to the Ogg Vorbis file [jk0.org] and play with the balance. Notice how the phoneme /s/ comes through three times clearer when you have both speakers on (8 kHz bandw
- Re:Doubling? hardly (Score:1)
  
  by slyvren ( 989423 ) writes:
  
  Correct me if I'm wrong, but I'm assuming this means the digtal to analog conversion rate. This means it's sampling the analog audio 16,000 times per second instead of 8,000 times per second. Which in theory is double the "quality".
- Re:Doubling? hardly (Score:2)
  
  by ejdmoo ( 193585 ) writes:
  
  I thought this number referred to the sampling rate...
  
  You're thinking 8bit audio to 16bit audio.
  
  CMIIW
  - Re:Doubling? hardly (Score:2, Interesting)
    
    by slyvren ( 989423 ) writes:
    
    Actually 8 bit to 16 bit is far greater than double quality. The quality essentially doubles everytime you add a bit.
  - Re:Doubling? hardly (Score:2)
    
    by cfulmer ( 3166 ) writes:
    
    The maximum frequency that can be carried is proportional to the sampling rate -- if I recall correctly, the highest frequency that can be carried is half the sample rate. Sample 8000 times per second and you can carry up to 4 kHz. At 16000, it's 8 kHz. People can hear up to about 20 kHz, so this does increase the frequency range. Since 'going up an octave' means doubling the frequency, the previous poster was correct. The end result is only to raise the maximum frequency by an Octave.
    
    The bigger proble
- Re:Doubling? hardly (Score:2)
  
  by tverbeek ( 457094 ) writes:
  
  I've got a buddy who uses VOIP, and I can assure you: the quality of his phone calls to me has not doubled. It's all the same old "Dude, there's this chick on tv right now, I'm not sure which channel, who is like majorly hot. Turn it on!"
- Re:Doubling? hardly (Score:1)
  
  by tincho_uy ( 566438 ) writes:
  
  It won't. While there's a noticeable difference in quality when listening to 8KHz and 16KHz sampled speech, it certainly won't double the perceived quality. Even more so if it's in a VoIP context, where other factors such as the loss rate and distribution, forward error correction and the choice of codec (which tend to be of the non-PCM kind) play such big roles. Just my 2 cents...
What's wrong with the current implementation? (Score:2)

by HockeyPuck ( 141947 ) writes:

We're a Cisco VOIP shop and phone conversations sound fine. I'm not sure how going from 8->16 would make it any better.
- Re:What's wrong with the current implementation? (Score:2)
  
  by porkThreeWays ( 895269 ) writes:
  
  It does make a difference. 44KHz would be ideal, but 16 is good. The original 8KHz is a carry over from the old telecom days. That's how much uncompressed voice data they could carry over a single copper line. So in essence voice quality really hasn't improved much on telephones since the 80's.
  
  It would make understanding people who mumble, have poor english skills, lispers, etc, etc, significantly easier. 44KHz would be ideal, but 16 would be an improvement. I'm pretty sure however that many VoIP soft sw
PING Ted Stevens (Score:5, Funny)

by RobTFirefly ( 844560 ) writes: on Monday July 17, 2006 @12:30PM (#15732110) Homepage Journal

This can only mean twice as much material filling up the tubes. [wikipedia.org]

Share
twitter facebook
- Re:PING Ted Stevens (Score:1)
  
  by mr_flea ( 776124 ) writes:
  
  The horrible part is when the tubes get backed up, it makes a horrible mess all over your bathroom floor...
  
  Oh wait...
High-Def Telephony with Open Source Soft-Switch! (Score:2)

by evilviper ( 135110 ) writes:

I wasn't aware that telephones even HAVE "definition", let alone that they are in HIGH DEFINITION now.

definition 4. a. The clarity of detail in an optically produced image, such as a photograph, effected by a combination of resolution and contrast. b. The degree of clarity with which a televised image or broadcast signal is received.

Of course, what do I know... I didn't realize wireless networking equipment had fidelity, either (ie. WiFi).
- Re:High-Def Telephony with Open Source Soft-Switch (Score:1)
  
  by anthm ( 894202 ) writes:
  
  Well sure, VoIP is digital audio over the internet. It contains all of the same properties as any digital audio. It can vary from CD quality down to unintelligable static.
  - Re:High-Def Telephony with Open Source Soft-Switch (Score:2)
    
    by evilviper ( 135110 ) writes:
    
    Congratulations on being the guy who completely missed the point. Perhaps next time you'll try reading my entire post before replying.
    
    "Definition" is a video term, it has NO application at all to audio. It makes no sense.
- Re:High-Def Telephony with Open Source Soft-Switch (Score:2)
  
  by DA-MAN ( 17442 ) writes:
  
  I wasn't aware that telephones even HAVE "definition", let alone that they are in HIGH DEFINITION now.
  
  Apparently audio can have "definition"...
  
  http://en.wikipedia.org/wiki/High_Definition_Radio [wikipedia.org]
  
  Of course, that's only in the same as networking equipment has fidelity...
Only a slight improvement (Score:5, Informative)

by riflemann ( 190895 ) writes: <riflemann@NOSpAm.bb.cactii.net> on Monday July 17, 2006 @12:36PM (#15732153)

Actually, I've used Asterisk to pass through 24KHz Speex encoded audio - very impressive sound quality, but only works when the SIP channel is client to client.

In theory a SIP server doesn't need to know all of the codecs a client supports - the clients themselves negotiate any compatible protocol.

Of course, if the sip server puts itself in the path (such as when it needs to pass through to PSTN or firewalled clients), then 8KHz is the (till now) maximum supported rate.

Share
twitter facebook
- Re:Only a slight improvement (Score:2)
  
  by jmv ( 93421 ) writes:
  
  Actually, I've used Asterisk to pass through 24KHz Speex encoded audio - very impressive sound quality, but only works when the SIP channel is client to client.
  
  Care to provide more info on this. Speex is *not* optimized for 24 kHz so it would probably sound worse than 16 kHz or 32 kHz. If the devs are indeed using 24 kHz, it's probably a bad idea that would be fixed. (BTW, I know what I'm talking about -- I wrote Speex)
- - Re:Right... (Score:2)
    
    by LocalH ( 28506 ) writes:
    
    Wow, you completely missed the sarcasm.
  - Re:Right... (Score:2)
    
    by Loonacy ( 459630 ) writes:
    
    Heck, you have to buy special versions of MS OS to even get 64 bit support on your new "double the processing power" 64 bit processor.
    
    Umm... I'm running at 64 bits right now, and I don't even HAVE an MS OS (assuming you mean MS == Microsoft). Nor did I have to buy any special OS at all. I just downloaded it. Legally, even.
    
    Get off the Internets and turn off your hard drive before you hurt yourself.
The telemarketers (Score:1)

by The Relentless ( 901624 ) writes:

will be able to clearly understand me when I say, "I can't talk now, my leg is on fire."
can we say astroturf? (Score:1)

by bferrell ( 253291 ) writes:

the submitter is the author of the code.

Move along, nothing to see here yet
Big Whoopie (Score:3, Insightful)

by jmorris42 ( 1458 ) * writes: <jmorris@beau.oRABBITrg minus herbivore> on Monday July 17, 2006 @02:11PM (#15732317)

The problem isn't making a software based IVR system or even a softswitch run at a better rate. Now find me a SIP phone that runs at anything other than 8Khz. No, I'm not talking about a F/OSS softphone, but a real hardphone. They have the minimum DSP power the manufacturers can get away with to support 8Khz. Now find me a PRI that can interface with it. For now that is still an issue.

Skype has been running their softphones at higher than 8Khz/8bit so their softswitch obviously was the first widely deployed one to leave 64kbit max quality behind.

Yes, someday all telephony (except legacy telco stuff that will never change, which will be a shrinking market) will offer higher quality audio and an option for video. But not for a few more years until the saturation of next gen telephony products gets better.

Share
twitter facebook
- Ugh, Don't get me Started (Score:2)
  
  by Greyfox ( 87712 ) writes:
  
  I mean sure I can route a call through Enum or DUNDi (Well... my DUNDi peer group only has 2 nodes right now, so that's kind of pointless) and it could be pure digital. I've yet to find softphone I/O solution that doesn't suck (Maybe a bluetooth headset would be OK if it could push that sort of quality) so it's still much easier to dump the call out to an old $10 wireless RadioShak special via the digium FXS card.
  The VOIP to PSTN scene kind of sucks at the moment anyway. There are a lot of fly-by-night op
I don't think this is the real problem (Score:2)

by Sarusa ( 104047 ) writes:

8khz to 16Khz is fine, but that's not usually the problem we encounter with VOIP. It's latency and dropped packets, which this will just make worse. But if you're doing this on your own network only then I can see where this would be neat.
Nothing special here... (Score:1, Informative)

by Anonymous Coward writes:

They're just using a higher quality codec than G.711 (which is the standard for the back-end digital phone system).

The phone people (probabably AT&T) chose that standard since it gave pretty good voice quality given the limitations of current technology.

People are generally happy with the voice quality of the phone system - which is different from the voice quality of the last mile - the analog copper loop to your house, or CDMA/GSM/TDMA to your cell phone.

It's highly unlikely this new codec will catch
Marketing BS (Score:4, Insightful)

by jheath314 ( 916607 ) writes: on Monday July 17, 2006 @02:17PM (#15732390)

This "improvement" is idiotic. The thing which most limits the quality of a VoIP call is delay and jitter, NOT the sampling rate. Guaranteeing the quality of a telephone conversation over the internet is tricky because the internet was originally designed for best-effort packet delivery, with no guarantees on packet delay, sequence, or even (at the network layer) delivery.

If anything, this feature reduces end-to-end quality by doubling the amount of data being sent down the pipe, as you'd need to buffer more data at the same transmission speed to correct for jitter. Brillant!

Share
twitter facebook
- Re:Marketing BS (Score:2, Informative)
  
  by anthm ( 894202 ) writes:
  
  FYI: 20ms of 16khz audio (the typical size of 1 RTP packet) encoded with the Speex Codec http://www.speex.org/ [speex.org] is 43 bytes. 20ms of 8khz audio encoded with the Speex Codec http://www.speex.org/ [speex.org] is 29 bytes which is only 1.4 times as big as it's 8khz counterpart. 20ms of 8khz g711 is 160 bytes so with speex at 16khz, you can still fit 3 calls in the same amount of bandwidth that it takes for one 8khz call. The biggest overhead in VoIP is the various headers on each RTP packet per level of encapsulation,
  - Re:Marketing BS (Score:2)
    
    by jmv ( 93421 ) writes:
    
    Thanks. That's something a lot of people forget. Actually, the overhead of the headers is usually 16 kbps, i.e. about as much as the codec data itself. That's also why very low bit-rate ( 8kbps) codecs are (almost always) useless in VoIP.
- Re:Marketing BS (Score:2)
  
  by amorsen ( 7485 ) writes:
  
  Guaranteeing the quality of a telephone conversation over the internet is tricky because the internet was originally designed for best-effort packet delivery
  
  There's more to VoIP than the Internet, you know. Some of us work with lines which are guaranteed big enough or have QoS.
Is it.. (Score:1)

by bruno.fatia ( 989391 ) writes:

Is it even a difference human ear can notice? I mean, VoIP calls today are pretty good..
- Re:Is it.. (Score:1)
  
  by azurepalm ( 989377 ) writes:
  
  Try doing a Skype call to an international country and you'll see the difference in "reception" (quality). Probably not entirely Skype's fault, but any improvements will make a difference.
- Re:Is it.. (Score:1)
  
  by anthm ( 894202 ) writes:
  
  The more quality, the easier it is to perform detection algorithms for things like speech recognition, and yes you can notice the difference as long as the audio was generated digitally by a microphone+soundcard or with something like cepstral that defaults to 16khz for a reason.
Bits is Bits (Score:2)

by Detritus ( 11846 ) writes:

It's more complicated than doubling the sampling rate. Standard PCM telephony uses 8 kHz sampling rate, 8-bit samples, non-linear encoding. It's fairly simple, resulting in 64 kbps.
Speex is a CELP (code excited linear prediction) codec that is far more complex than the simple PCM system used by the telephone company. The resultant bit rate can be fixed or variable, and is not rigidly tied to the sampling rate used for data acquisition.
My voice bandwith runs at 80 KHz! (Score:2)

by wsanders ( 114993 ) writes:

So it's 10 times better than the Evil (tm) telcos!

And my software puts a green stripe around the edge of the data too... sucka!
Moore's Law (Score:2)

by Goody ( 23843 ) writes:

OMG, at this rate, we'll have 64 kHz calls in 6 years, and 128 kHz in 12 years!!!!

(Going from 8 kHz to 16 kHz isn't a "doubling of quality" :-P )
- Re:Moore's Law (Score:1)
  
  by anthm ( 894202 ) writes:
  
  OMG, actually, we can actually operate at any sample rate we want! 16khz was just a logical test because the phone we tested it with supported it.
First, PSTN is a 4 kilohertz bandwidth (Score:2)

by Beryllium Sphere(tm) ( 193358 ) writes:

Theoretical maximum, may be as low as 3.

Second, this is enough to capture most of a human voice. Can you hit a high "C"? That is about one kilohertz.

Everything above 1kHz is being used to carry ever-dimishing harmonics that provide resolution for fast-rising sounds like "k" and "p". There's a slight loss of detail at 4kHz and very little at 8kHz. There is no honest way to refer to a move from 8 to 16 as "doubling the quality". Sycraft-fu's post has it right. In fact, if I were designing the system I'd put i
16KHz is nothing. (Score:1)

by Darth Android ( 989471 ) writes:

For those of you who are not IRC junkies, the IRC client KVirc [kvirc.net] has built-in support for 44.1 KHz "voice chat" (not sure if it qualifies for "VoIP", but is a simple direct connection between two computers supporting real-time audio transfer). Not only does it support 44.1 KHz, but it has for at least a year (when I started using it). What's the big deal with 16KHz?
- Re:16KHz is nothing. (Score:1)
  
  by anthm ( 894202 ) writes:
  
  If you wish, you are welcome to operate at 44khz. We support any speed we just tested it with 16khz.
16 kHz (Score:2)

by jmv ( 93421 ) writes:

That's called wideband speech. It's been around for 10+ years and Speex [speex.org] supported it about 4 years ago. About time people actually use it (i.e. why people are still using narrowband in VoIP is beyond me).
poor choice of demo (Score:1)

by tartley ( 232836 ) writes:

That's a bit of a retarded demo for the technology: every techie's instincts are screaming "why not just transmit the RSS and convert to speech at the client?"
- Re:Not really free? (Score:1)
  
  by anthm ( 894202 ) writes:
  
  Dear entitlist,
  
  Please send us your name and email address and we will
  send you instructions on how to download our unquestionably
  open source code without having to provide any information.
- Re:Virtually pointless (Score:1)
  
  by anthm ( 894202 ) writes:
  
  You may have a point, we should stick to ITU codecs like perhaps, g722 http://www.umiacs.umd.edu/~desin/Speech1/node3.htm l [umd.edu] Oh waddya know! its a wideband codec! yay does that mean we can use it now???? Not exactly pointless, You can do conferencing, ivr, voicemail and media proxy calls from a SIP phone all at 16khz Mostly only client applications have been able to operate at this rate. Now we actually have a switching platform that will allow people to interact with the calls. The goal is not to make t

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Please get the rest of the telcomms to follow. (Score:4, Informative)

Re:Please get the rest of the telcomms to follow. (Score:4, Funny)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:2, Informative)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:2)

Let me be the first to say... (Score:2)

Speex Wideband Codec (Score:2)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:1)

Re:Please get the rest of the telcomms to follow. (Score:2)

Multiple codec conversions *are* really bad (Score:2)

Good Work (Score:3, Insightful)

Re:Good Work (Score:2, Insightful)

I need it (Score:2)

So what? (Score:5, Insightful)

Re: (Score:1)

Re:So what? (Score:3, Interesting)

Re:So what? (Score:2)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:1)

Re:So what? (Score:2)

Re:So what? (Score:3, Insightful)

Re:So what? (Score:1)

Because it covers almost all of the human voice (Score:3, Informative)

Re:So what? (Score:1)

Re:So what? (Score:2)

Re:So what? (Score:2)

Define: IVR (Score:4, Informative)

Doubling? hardly (Score:4, Insightful)

Re:Doubling? hardly (Score:5, Informative)

Re:Doubling? hardly (Score:2)

Re:Doubling? hardly (Score:2)

It's for the consonants, not the vowels (Score:2)

Re:Doubling? hardly (Score:1)

Re:Doubling? hardly (Score:2)

Re:Doubling? hardly (Score:2, Interesting)

Re:Doubling? hardly (Score:2)

Re:Doubling? hardly (Score:2)

Re:Doubling? hardly (Score:1)

What's wrong with the current implementation? (Score:2)

Re:What's wrong with the current implementation? (Score:2)

PING Ted Stevens (Score:5, Funny)

Re:PING Ted Stevens (Score:1)

High-Def Telephony with Open Source Soft-Switch! (Score:2)

Re:High-Def Telephony with Open Source Soft-Switch (Score:1)

Re:High-Def Telephony with Open Source Soft-Switch (Score:2)

Re:High-Def Telephony with Open Source Soft-Switch (Score:2)

Only a slight improvement (Score:5, Informative)

Re:Only a slight improvement (Score:2)

Re:Right... (Score:2)

Re:Right... (Score:2)

The telemarketers (Score:1)

can we say astroturf? (Score:1)

Big Whoopie (Score:3, Insightful)

Ugh, Don't get me Started (Score:2)

I don't think this is the real problem (Score:2)

Nothing special here... (Score:1, Informative)

Marketing BS (Score:4, Insightful)

Re:Marketing BS (Score:2, Informative)

Re:Marketing BS (Score:2)

Re:Marketing BS (Score:2)

Is it.. (Score:1)

Multiple codec conversions are really bad (Score:2)