Slashdot Mirror


Speex Joins Xiph To Bring Free VOIP To The Masses

xercist writes "Xiph.org has added a new project to their plate of goodies- Speex. Speex is an audio codec specifically for, you guessed it, voice. It has integration with Xiph's OGG container, but is mainly being used right now for VOIP. There is currently an XMMS plugin available, and is also supported by LinPhone, OpenH323, and GnomeMeeting. Asterisk PBX is working on adding support. This is not a new project -- Jean-Marc Valin has been hard at work writing the codec for quite a while now. However, Jean-Marc is now a full-fledged member or the Xiph.org team, and in celebration, Speex beta one is being released. Xiph.org has brought you (or is currently working on bringing you) Vorbis, Tremor, Theora, Tarkin, Icecast2, cdparanoia, now Speex, and, of course, the Moaning Goat Meter. This is a LOT to do, so please donate to show your support."

27 of 133 comments (clear)

  1. Great thing by PinkX · · Score: 3, Insightful

    It's good to see the xiph people to bring to the masses such advanced projects - and under a free fashion. I hope the big companies start to take seriously all the Xiph's work by supporting their projects into commercial products, such as hardware OGG players and such.

  2. Good technology for businessmen by Anonymous Coward · · Score: 3, Funny

    With VOIP I can connect to my dialup ISP from out of town without paying long distance fees.

  3. "To the masses"? by Wakko+Warner · · Score: 4, Insightful

    Not to discourage, but it won't really matter to the masses until there's a native, easy-to-use Windows client.

    - A.P.

    --
    "Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"
    1. Re:"To the masses"? by CondeZer0 · · Score: 5, Insightful

      It matters to me, and I couldn't care less about the masses or ease of use. I
      care about something that works for me, and that is free of patents and other
      traps.

      I'm sick of people that think that "masses" are all that matters, if that was
      the case we would be all running Windows, listening to boys/girls bands,
      looking TV, drinking coca-cola and living in a big city.

      Whatever the masses do, OGG is one of the most important projects out there to
      protect my freedom of using a hight quality audio format, if you don't like it,
      unlike with some other "DRM enabled" formats, you wont be obligated to use it
      any time soon.

      For all that I care you and all your masses can go use WMA and all it's DRM
      trash, browsing AOL, listening Britney(sp?), going to the cinema to see (checks
      warnerbros.com) Harry Potter, running Windows XP on your palladium enabled
      Pentium 5 and living in NewYork.

      I will continue using ogg, browsing the web with Mozilla, listening to
      Einsturzende Neubauten and Chopin(two examples of things I have been listening
      to today), looking Clockwork Orange, Cube and Totoro, running FreeBSD on my AMD
      Duron, and Plan9 in my old broken Thinkpad; and living in some lost place in
      the North of Sweden.

      Hope you are happy living your prefabricated life in your plastic world. Hurry
      or you are going to miss your daily brainwashing 4 hour sesion of TV. And don't
      forget to stay well away from any book, you may learn something from them!

      *sigh*

      \\Uriel

      --
      "When in doubt, use brute force." Ken Thompson
    2. Re:"To the masses"? by jmv · · Score: 5, Informative

      (I am the Speex author) There are already at least two Windows front-ends: here and here. There may be others I'm not aware of. Note that I haven't developed of tested any of these since I don't use Windows.

  4. Re:But... by PinkX · · Score: 4, Insightful

    does *costs* means anything to you? Seriously, long distance calls can be very expensive, even more for people who must make them frequently (think foreign students, etc). Having the voice calls being transported over TCP/IP, using an already proven technology which covers almost the whole globe (and to some extent outside of it) reduces these costs to nearly 0, except for the VoIP hardware (gateways and such) needed, now think of the advantages to be able to do phone calls from your linux box using *free* software to anywhere in the world, for just the cost of your net link, which most of the time is flat-rated.

  5. Speex sounds nice by deathcubek · · Score: 5, Informative

    I've been playing around with speex when i was working on an audio conferencing. It's a simple api, and the audio quality comes out okay for voice too. (unless you try sending music through, then it really just craps out)

    If only I could get the windows side of the cross-platform audio caputre stuff so nice.

    --

    New worlds are not born in the vacuum of abstract
    ideas, but in the fight for daily bread
    --Rudolf Rocke
    1. Re:Speex sounds nice by jmv · · Score: 4, Informative

      Probably something you've done. For normal use, it's between 5% and 30% on my PIII/1GHz, depending on the sampling rate (8kHz or 16 kHz, only mono supported) and the bit-rate. The encoder also has a "complexity" option similar to the -1 to -9 options in gzip.

  6. Re:But... by Monkelectric · · Score: 5, Funny

    Because my telco company charges me 5c/min for calling a city I can SEE from my backyard on a clear day.

    --

    Religion is a gateway psychosis. -- Dave Foley

  7. Re:But... by Quirk · · Score: 4, Funny

    It's late and I'm too stoned to be posting but isn't the migration of every electronic form of communication a good thang? Then comes mythical intuitive GUIs, and finally integration in cross-realtional dbs.

    --
    "Academicians are more likely to share each other's toothbrush than each other's nomenclature."
    Cohen
  8. Xiph is great by 0x0d0a · · Score: 3, Interesting

    I didn't even know that Monty had also done cdparanoia until recently. Great coder.

    Here's to Xiph -- singlehandledly taking on the tech-media companies (Real/Apple/MS and tons of failed companies) and steadily gaining ground.

    We've had propriatary media formats for a long time. (Incidently, propriatary file formats are one of the strongest weapons incumbents have against upstart open source projects). This is a big movement that's starting to cascade, with more companies joining the Xiph bandwagon daily (and little interest in the MPEG4 people).

  9. donations by jadavis · · Score: 3, Interesting

    I already donated US$7.50 to xiph (10 times the price of an mp3 decoder). It's really not much, but it seemed about right in proportion to the other software I use (especially since I only use xiph's products for my desktop, not servers). I know xiph is working hard, but I feel more of a debt to the creators of linux, gnu, postgres, apache, exim, debian, python, perl, openbsd (I only use their openssh, but that's important), and all the other great projects I didn't mention (and those aren't necessarily in order).

    --
    Social scientists are inspired by theories; scientists are humbled by facts.
  10. Here's my first donation by Gizzmonic · · Score: 4, Funny
    it's some advice:

    Get some better names for your projects! If a 12 year old is embarrassed to say "OGG Tarkin" aloud, then you're not going to sell it outside of the hardcore open source geek community. And I know, you're not trying to make money, but I'm betting you'd even win some mindshare if you were willing to, say:

    1. Give the "OGG" media layer a little class. Call it something the boys in marketing would like, such as "SimpleDirectQuickWebLayer."
    2. Vorbis could be known as "I Can't Believe it's not MP3!" and if you still wanted to use a geeky name, just refer to the acronym, ICBITNMP3, pronounced as "Ich-Bittin-emm pee 3."

    I'm hoping someone else will be kind enough to hire a professional web designer for Xiph, and maybe even a domain name that people could pronounce or remember. Dig deep, folks. I know it's a recession, but every little bit helps.

    --
    (-1, Raw and Uncut is the only way to read)
  11. Re:But... by PinkX · · Score: 3, Informative

    TCP/IP is not referred as just TCP, it's the name given to the suit of protocols that makes the Internet happen, namely TCP and UDP, and IP, ICMP and others on different layers.

  12. Re:But... by lpontiac · · Score: 5, Informative
    Why does everyone insist we need to do absoultely everything over TCP/IP?

    Voice over IP doesn't send voice data over TCP, it uses UDP. UDP isn't complicated at all - it just gives you a way to uniquely identify a machine and say "send this data to it." It doesn't even guarantee delivery of the data. It's probably the best, most accepted way of sending addressed, digital data over wires.

    Now, imagine you're a company that's just put an office up. Would you rather install two sets of wires to each desk (ethernet and phone network), one of which requires you to get a licensed contractor in if you need work done on it? Or a single set of wires which can be maintained by the people who run your computers?

  13. Next step for UT2k3? by SexyKellyOsbourne · · Score: 3, Interesting

    The badly dated Half-Life engine still has one feature that is more powerful than anything in UT2k3: and that's voice over ip, in-game, mostly used in Counter-Strike.

    UT2k3 already uses OGG for its music -- and I recall reading a UT2k3 developer plan file that states the wish for voice-over-ip, but basically they were waiting for someone in the open-source world to do all the work.

    Why just hit a few buttons to say, "Ownage!" when they can hear your true compressed, overly nasal-sounding voice say it -- or perhaps more insulting, filthier things?

  14. Funding standards applications? by GoRK · · Score: 3, Insightful

    This is never going to amount to a mass of anything (save vapor) if it's not applied as a standard and supported in hardware. G.721, G.729, GSM, and aLaw and uLaw are pretty established codec's that in supporting, you can communicate with probalby 99% of VoIP equipment out there....

    It's very sad that speex will never make it as a viable codec for VoIP. Perhaps it would be beneficial for an orginasation such as the FSF to support these open sourc codec's efforts to lobby and apply for standards support so that future products might actually use them one day -- epseically in an application such as VoIP where interoperability is often the number one concern in establishing large scale acceptance.

    ~GoRK

    1. Re:Funding standards applications? by parc · · Score: 3, Insightful

      G.721 and G.729 are CODECs, not VOIP protocols. g.729 came out of the cellphone arena, and 721 is the standard for logarithmic bit-encoding used for T1, etc.

      The problem is that as a company, I can't go out and buy a Ogg codec chip. I CAN with g.721/729. When drawing up circuits, I can place a black box in my diagrams and label it "coding happens here." Until that's available for Ogg, AND you can convince large telcos to switch to it, it won't take off.

  15. Re:But... by Anonymous Coward · · Score: 3, Insightful

    It doesn't even guarantee delivery of the data.

    I believe it says it all when it comes to the difference between VOIP and real telephone service, which usually has a guaranteed below 10^-4 failure rate on a fixed line.

    If my life depended on it, I wouldn't trust VOIP.

  16. Re:But... by cerenyx · · Score: 4, Insightful

    If I'm not wrong (and I may be), reliability is still an issue with VoIP: and taking the hypothetical company -- the question is whether, taking long-distance conference calls and otherwise using the telephone lines extensively for critical purposes, the company would feel more "secure" using the tried and tested analog telephone lines, as opposed to VoIP.

  17. Re:But... H323 and firewalls/gateways by gooofy · · Score: 3, Informative

    While udp certainly is the right choice for transmitting the actual audio data (low latency etc.) this alone doesn't make a complete telephony protocol.

    One standard used often today for call management (listen for incoming calls, register possible recipients etc.) is H323, the one netmeeting and gnomemeeting, among many others, use. Unfortunately H323 does a very bad job when it comes to transmitting data through firewalls, nat-gateways or proxies (typical environment in many companies today) since it contains parts which choose arbitrary high ports for connection. You can work around this by installing e.g. OpenH323 Proxy on your gateway, but usually you'll need your systems administrator to do that - and it is pretty likely that he/she will refuse to do that for security reasons or simply because it can become quite tricky to set up a stable working H323 proxy/gateway (lots of configuration work).

    BTW i've heard that some firewall constructors have basically given up on that matter and simply open all ports when they detect some client intends to do netmeeting.

    --
    time is a funny concept
  18. That title-- by MegaFur · · Score: 3, Funny

    "Speex Joins Xiph To Bring Free VOIP To The Masses"

    --is a little hard to read. It looks like a line from Jabberwocky.

    --
    Furry cows moo and decompress.
  19. Re:voice codecs by grahamsz · · Score: 4, Interesting

    Yes!

    Generally voice is sampled at 8kHz because it generally ranges from around 200Hz to 4kHz and Nyquiest suggests we need a digital sampling rate of twice the highest frequency.

    The reason music sounds crappy is that in a well designed system you will loose all the audio components above 4kHz, and in a badly designed system they will manifest themselves as other lower frequencies.

    Now this *does not* mean that this codec is a bad codec - merely that's it's one optimised for a specific task.

  20. Speex and TAPR by wowbagger · · Score: 4, Informative

    I love the fact that a good, Free Software voice codec is out there, and here are my reasons:

    1) Ham Radio. The Tucson Amateur Packet Radio organization is working on experimental digitized voice over amateur radio applications, and a couple of venders (mostly Kenwood) are offering radios that have this ability. Right now, TAPR are looking at using DVSI's IMBE vocoder, which is QUITE expensive and VERY not-Free. The availability of a Free codec would greatly improve the availabilty of this protocol.
    2) Currently, The Association of Public-Safety Officials (APCO) (the folks who define the specs for the radios used by police, fire, and government) have defined the current digital trunked radio standard, APCO Project 25 as using DVSI's IMBE vocoder. While this is licensed under a Reasonable And Non-Discrimitory license, if you want to license the IMBE vocoder for a P-25 project, you will cough up US$100,000.00 for the privilege (I know firsthand, as the company I work for has done this). Uniden, Radio Shack, and other scanner companies are looking into putting this into their scanners, so they have had to cough it up as well. A Free vocoder would allow anybody to build a product with this capability in it - you could even use a scanner and your sound card to decode the Phase 1 C4FM format signals.

    Like so many other things, a Free Software tool to do these things would greatly accelerate the industry. I hope Xiph does well.

  21. Speex Joins Xiph... by Spunk · · Score: 3, Funny

    In related news, Qwixm released Zblik today, the latest server in the Foogark series, designed to smoothly integrate with the operating systems Njiimakiwup, Ueltwvom, and GNU/Zzyzx.

  22. Other open source codecs by cullenfluffyjennings · · Score: 3, Insightful

    There is open source code for tons of the traditionally G.7xx CODECs around. The issues many of them require licensing various peoples patents. A casual look at speex would make me think that it is quite likely to infringe someone's CELP patents. Does anyone have any thoughts on this? It's really cool to see something like speex happening but there are a few other things that you might want to think about.

    Global IP Sound put out a codec for voice called iLBC. It is specifically designed to avoid infringing known patents. It's sound quality vs. packet loss is very good for IP systems. This is being standardized by the IETF. All the source code is open source and in the draft which you can find at http://www.ietf.org/internet-drafts/draft-ietf-avt -ilbc-codec-00.txt.

    Sun has a free implementation of CCITT compression types G.711, G.721 and G.723 at ftp://ftp.cwi.nl/pub/audio/ccitt-adpcm.tar.gz. This is just a free implementation - it does not give you a license to the patents.

    Various people including Cisco have been working with the license holders of G.729 IPR to make it available for "pre-commercial" systems, developers, and education. http://www.vovida.org/applications/downloads/G729A /

  23. Is it just me?...(warning - contains whining) by Dr.Dubious+DDQ · · Score: 3, Insightful

    ...or is Xiph spreading itself rather thinly these days?

    Ogg Vorbis got out the door, and then it was Tarkin/Icecast2/Theora/Helix and now Speex.

    They're committed to so many projects right now I wonder if any of them will be completed in the next 5 years...

    Theora (my particular favorite) got announced at the beginning of July. The Theora mailing lists' traffic is still made up mostly of people wanting to ask about using VP3 with Microsoft(tm) Windows(tm) Directshow(tm) and such, with only a few brief (but informative) bursts of discussion actually relevant to Ogg Theora. After nearly 3 months of near-silence (not counting the non-Theora related VP3 questions) on the mailing lists and CVS repository, the first Alpha release of Ogg Theora popped up out of nowhere (not even MENTIONED on the mailing lists!)...and quickly returned to silence again. I've actually played with the Alpha code, and it makes me very hopeful for the final product - it's currently unoptimized, but even so its current speed seems about comparable with mjpegtools mpeg2 encoder, and the quality seems quite good at e.g. 300kbps/640x480/29.97fps. With all of the other projects being collected under the Xiph umbrella right now, though, I wonder how much developer time and attention will be available to keep it going...

    (It MAY be that, with both codecs involved in Theora being essentially finished, they figure all they really need to do is finalize the specifications, and then spend a little time doing some optimization and they're done, and since there's almost 9 months to go until their projected 1.0 release date that it can wait...Judging by the quality of the first alpha [and thanks go to Monty at Xiph.org and Dan Miller of On2 for getting things that far along!], they may be right...provided there's time to come back and finish up between the other projects...)

    I'm strongly in favor of every one of the projects they've taken on so far, I just wish it didn't seem like new projects were being added faster than existing ones are being worked on...

    Okay, enough whining from me. I'll go back to quietly waiting impatiently again now...