Slashdot Mirror


Yamaha Releases Singing Synthesis Software

loopdloop writes "The world's first singing synthesis software, Vocaloid, was released by Yamaha this month at the Los Angeles NAMM show. Simply type in the lyrics and notate the vocal expressions to create a completely computer-generated singer. There are also audio demos of the product available." Update: 01/26 21:14 GMT by S : An earlier NYT-authored preview of this software has also been covered on Slashdot.

59 of 344 comments (clear)

  1. Anybody ever say to themselves.... by weston · · Score: 2, Interesting

    "I was going to do that?"

    This was something I was really interested in when choosing a college major, and thought that I'd get into EE CS and do this. Somehow, I've found myself coding web applications instead.

    I'm glad to see somebody's doing it, but man, I think I took a wrong turn somewhere.

  2. Deja Vu by RobPiano · · Score: 4, Interesting

    Vocaloid has been covered on Slashdot before. It is one of the many impressive projects to have at least in part come out of the Music Technology Group at Institut Universitari de L'Audiovisual in Barcelona.

    This is one of many impressive Music Technology groups in the world who is kind enough to provide us with open source software such as CLAM. Similarly there are some groups out there doing interesting things. Needless to say, I could link all day...

    I am a graduate student in this field

    1. Re:Deja Vu by weston · · Score: 2, Informative

      Did you know about Ingo Titze and "Pavarobbotti"?

    2. Re:Deja Vu by kiolbasa · · Score: 2, Funny

      The difference that the singing text-to-speech was just a silly hack of the speech-synthesis software. It wasn't designed for any serious musical use. And Dr. Sbaitso isn't a real psychiatrist.

      --

      Beer wants to be free
    3. Re:Deja Vu by Afrosheen · · Score: 2, Funny

      Ten years ago? BAH! Why 20 years ago my commodore 64 could do this with SID synthesis software.

    4. Re:Deja Vu by Short+Circuit · · Score: 2, Funny

      He certainly helped me set me straight.

      I've been in love with computers ever since.

      er...

    5. Re:Deja Vu by Drantin · · Score: 2, Funny

      But... but he helped me through so many of my problems with day to day life!

      Me: The kids at school are picking on me again.
      DS: Why are the kids at school picking on you again?
      Me: I dunno...
      DS: Why do you dunno?

      ...never mind...

      --
      Actio personalis moritur cum persona. (Dead men don't sue)
    6. Re:Deja Vu by bunyip · · Score: 3, Funny

      Thanks for the info...

      Has Simon Cowell (American Idol) reviewed this software?

  3. Comment removed by account_deleted · · Score: 2, Funny

    Comment removed based on user account deletion

  4. I'm impressed. by emplynx · · Score: 3, Funny

    That's quite amazing. Now we need a computer to write music and songs.

    --
    -Tim
    1. Re:I'm impressed. by aliens · · Score: 4, Funny

      Done.


      Well partly done. I'm sure you could program a computer program to make Teeny boober songs in an afternoon.

      @lyrics = "Baby, ooooo, love, ooooo, dance, oooo, uhuhuh";

      You get the idea ::)

      --
      -- taking over the world, we are.
    2. Re:I'm impressed. by weston · · Score: 2, Interesting

      That's quite amazing. Now we need a computer to write music and songs.

      It's been done. Bach was an early target. Heck, I was writing melody generators and harmonizers 8 years ago (badly, but I was doing it).

    3. Re:I'm impressed. by el-spectre · · Score: 4, Funny

      Ack, if you're gonna do something so evil, don't do it in perl... do it in some other language... VBScript for example...

      --
      "Faith: Belief without evidence in what is told by one who speaks without knowledge, of things without parallel." - A.B.
    4. Re:I'm impressed. by radish · · Score: 2, Insightful

      Jean Michel Jarre used computer based composition (I believe it was based on some kind of genetic "evolution" mechanism) for one of the tracks on his 1990 album "Waiting for Cousteau". It is, to my ears, rather dull, but there you have it :)

      --

      ---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

  5. World's first? by inaeldi · · Score: 2, Interesting

    Shpongle (trance group) used Vocal Writer in their CD that was released in 1998.

  6. So... by PakProtector · · Score: 3, Funny

    ...this is where Britney Spears' talent comes from!

    --

    Edward@Tomato - /home/Edward/ man woman
    man: no entry for woman in the manual.
    "Qua!?"

    1. Re:So... by mcpkaaos · · Score: 4, Insightful

      The irony behind your comment is that vocals from "artists" such as Britney Spears, Christina Aguilara (sp), etc, go through extreme re-engineering after recording (beyond the norm of compression, reverb, EQ, etc.) Once the audio is filtered through tools that re-pitch the parts that go off-key and time-stretch the bits that fall out of rhythm, you have an end result that really isn't all that far from a computer generated voice.

      --
      It goes from God, to Jerry, to me.
    2. Re:So... by rylin · · Score: 2, Funny

      Britney Spears' *what*?

  7. Makes Milli Vanilli look talented by ebrandsberg · · Score: 2

    man, at least Milli Vanilli had singers.

  8. I prefer SimpleText... by OriginalSpaceMan · · Score: 2, Funny

    It had multiple voices... and was fun. MC Hawking style.... "mmmmmmmMMMMMMMMM ya"

    --

    You talk better than you fool!
  9. In other news... by c0dedude · · Score: 5, Funny

    Wow, this must make the RIAA's day. An artist who needs absolutly no pay and who really is property...

    --
    Since when has this country used intellectual elite as a pejorative term?
  10. Only a first step by why-is-it · · Score: 5, Funny

    "The world's first singing synthesis software, Vocaloid, was released by Yamaha this month at the Los Angeles NAMM show.

    Feh! They might be able to program something that sings better than Britney, but until they integrate it with something like this, Ms Spears' talents will continue to be in demand...

    --
    *** Where are we going? And what's with this handbasket?
  11. Yay! No more Britney! by Grey+Ninja · · Score: 2, Funny

    I guess Britney is going to be out of a job now. We all know that any computer can sing better than her, and since neither of them play an instrument, I guess she's screwed.

    Not to mention that the computer is far sexier.

  12. fp! (mine) by psychoticmelody · · Score: 2, Funny

    So will we finally get to replace the prime-time T.V. show American Idol??

    1. Re:fp! (mine) by Libertarian_Geek · · Score: 2, Funny

      A combination of battlebots and american idol.
      BattleIdols!
      Where engineers compete to build the best computer synthesised pop singer.

      --

      www.facebook.com/DareDefendOurRights

      www.fairtax.org
  13. Close the loop... by xmark · · Score: 4, Funny

    The logical next step would be a program that would listen to, and enjoy, the music that other computers write and sing.

    Think of the time it would free up, and the money it would save - you would never have to buy CDs. *cough* of course, some people have already eliminated that expense.

  14. Re:great... by Ba3r · · Score: 2, Funny

    ...first, dancing robots and now singing computers

    sigh... dancing, singing robots?, its been done

  15. so, i hear that by DarkSkiesAhead · · Score: 3, Funny

    Steven Hawking is trying to start up a band.

    1. Re:so, i hear that by vinsci · · Score: 4, Funny

      Pay attention. And look here. :-)

      --

      Trusted Computing FAQ | Free Dawit Isaak!
  16. They now need to work on by mc6809e · · Score: 3, Funny

    Just listened to these.

    While their "Volcaloid" tech is nice, their "Lyricoid" tech needs work.

  17. Actually... by mOoZik · · Score: 5, Informative

    I had the chance to try it out at NAMM and it is VERY difficult to get it to "sing." It can probably be used adequately for backup vocals, but again, it takes a lot of work to get it to sound human. Nevertheless, a step in the right direction.

  18. This is new? by LamerX · · Score: 2, Funny

    I think my Sound Blaster Pro came with this software. It fit on a floppy disk, and you could make the computer sing whatever you typed in. In fact it also came with a psychiatrist named Dr Sbaitso. Just don't cus at him, he gets offended very easily.

    Seriously, I'm sure that this new software is much better. At least I sure hope so...

  19. Simone by Lord_Dweomer · · Score: 2, Informative
    I think they should have called it SimOne v1.0

    Interestingly enough, this made me think of something I read in William Gibsons blog a long time ago. I don't know where it is now though. It was about how in the future, people will be able to take a movie or something on their computer, and tell the computer to replace all the actors heads with dog heads for example, and change what they do and say with simple commands. Perhaps this software is the lower level beginning of making that happen, we'd just need some higher-level controls to make it easy for everybody to use.

    --
    Buy Steampunk Clothing Online!
  20. Re:One for the road... by Afrosheen · · Score: 2, Funny

    Since they're so old and obsolete, if you find one in your attic, I'll take it off your hands for a cool $20 USD.

  21. After listening the demo's, all I can say is by Snork+Asaurus · · Score: 2, Funny
    "Don't quit your day job, Mr. Hawking."

    (No disrespect intended).

    --
    Sigs are bad for your health.
  22. Mandatory Radiohead quote by Petronius · · Score: 3, Insightful

    Fitter, happier, more productive
    Comfortable
    Not drinking too much
    Regular exercise at the gym
    (3 days a week)
    Getting on better with your associate employee contemporaries
    At ease
    Eating well
    (No more microwave dinners and saturated fats)
    A patient better driver
    A safer car
    (Baby smiling in back seat)
    Sleeping well
    (No bad dreams)
    No paranoia
    Careful to all animals
    (Never washing spiders down the plughole)
    Keep in contact with old friends
    (Enjoy a drink now and then)
    Will frequently check credit at (moral) bank (hole in the wall)
    Favours for favours
    Fond but not in love
    Charity standing orders
    On Sundays ring road supermarket
    (No killing moths or putting boiling water on the ants)
    Car wash
    (Also on Sundays)
    No longer afraid of the dark or midday shadows
    Nothing so ridiculously teenage and desperate
    Nothing so childish - at a better pace
    Slower and more calculated
    No chance of escape
    Now self-employed
    Concerned (but powerless)
    An empowered and informed member of society
    (Pragmatism not idealism)
    Will not cry in public
    Less chance of illness
    Tires that grip in the wet
    (Shot of baby strapped in back seat)
    A good memory
    Still cries at a good film
    Still kisses with saliva
    No longer empty and frantic like a cat tied to a stick
    That's driven into frozen winter shit
    (The ability to laugh at weakness)
    Calm
    Fitter
    Healthier and more productive
    A pig in a cage on antibiotics

    --
    there's no place like ~
  23. Don't bring me down... Bruce! by ThePretender · · Score: 4, Funny

    I can't be the only one that thinks the background vocals sound like Electric Light Orchestra??

  24. Really Bad Synths by Entropy248 · · Score: 4, Informative

    Just like many Yamaha firsts, this one may have been overhyped a bit. This sounds like a real person singing in the way that a synth brass pad sounds like a trumpet. There is no way in hell you would ever even consider that these noises were made by a human being. Yes, I understand that most of the samples are in Japanese and might not sound normal to me anyway. But, even if you listen to the ONE in English alone, it sounds like the Bell Labs female voice, but screechy and obnoxious instead of like a drugged out cigarette smoker after a trachyotemy (sp?).

    1. Re:Really Bad Synths by King_TJ · · Score: 2, Insightful

      Your argument is no more credible than the curmudgeons who said that an electric piano sounds so unlike a real piano that it's a total waste, and nobody would ever perform a legitimate creative work using one.

      Think "new tools at an artist's disposal", and "dawn of a new type of digital instrument" instead. If you want this software to sound good in your music mix, you're going to have to invest a considerable amount of time and effort setting it up properly and recording it properly. Therefore, it takes *talent* to make it useful.

      Not only that, but the male and female "singer" Yamaha is offering are only the first 2 of what promises to be a whole slew of virtual singer software packages one can purchase. It stands to reason that the first 2 would be the most "generic" - since they're mostly there for proof of concept purposes, and to cover as wide an audience of potential customers as possible.

      As a hobbyist-musician myself, I think this is a great step forward. It's not going to *replace* real singers any more than drum machines replaced drummers. Instead, it will provide more options to people trying to create different sounds, plus help musicians work on "rough drafts" of songs without all the other musicians having to be present.

      (Actually, with drum machines, I find that real drummers do the best job programming them. I think it will be similar with this software. A real vocalist will know exactly what inflections and volume changes should be programmed in to make the virtual vocalist sound best in an application.)

  25. Re:Human voice by spun · · Score: 4, Insightful

    Machines can have emotions if we want them to. There was even an article in last month's Scientific American by an AI researcher who claims that machines will need emotions for real AI to work. There have been several robotics/AI projects that have attempted to incorporate emotions, Cynthia Breazeal's robot Kismet being the most famous.

    Emotions are an information processing system that works holistically, priming the logical parts of the brain for the kind of work they will need to do. Big orange and black stripey thing running towards you? Prime the brain for a flight or fight response rather than curiosity, i.e. "Run, it's a tiger!" not "I wonder if this orange and black stripey thing wants to play?"

    There remains the problem of qualia. That is, a robot may look for all intents and purposes as if it is having emotions, but does it feel the same things internally as we do? Unfortunately, there's no real way of knowing if even other humans feel the same thing we do.

    When the day comes that a robot belts out a blues song about someone done it wrong and broke it's heart, we will judge it in the same way we judge human singers: Does it look and sound authentic, or is it faking it? If it looks and sounds authentic, I believe that we will take it for granted that it feels the same as we do, just as we take it for granted in other humans.

    --
    - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
  26. Re:One for the road... by gmaestro · · Score: 5, Funny
    You clearly don't understand. This is the last piece in the puzzle of completely eliminating musicians. We have had a drum machine to replace you for a while, electronic instruments and MIDI.

    Now we can finally get rid of these whiny musicians, always complaining about "I need to feed my family" and "I'm a professional and should be paid like one." Now all of those unskilled morons can be sent to fill up the thousands of food preparation and customer service jobs that our public school system can't seem to find enough people for.

    Sorry about the offtopic (tongue firmly in cheek) rant. You're right. this does sound like a fun toy.

  27. first **commercial** software by stephentyrone · · Score: 2, Informative
    cool as this may be, it's definitely not the first singing synthesis software. CNMAT at berkeley had a neural-net based additive synthesis engine in the mid 90's that did a pretty good human voice (it could even reproduce the voices of specific individuals, as I recall), and did other instruments as well (a mean viola).

    I can't find a link to an actual demo of it simulating a human voice, but here's a page that documents its use to reproduce the sound of a suling (javanese wooden flute). Does a good job too. I've heard it demo'd with a human voice, and it was pretty good (though the neural net needed additional input - the syllable being sung - obviously).

    i'm sure that many of the other academic computer music labs around the world had similar software long before yamaha introduced this package. still cool, though.

  28. Sorry, but,... by El · · Score: 2, Insightful

    I'm sure Britney's "talent" has absolutely nothing to do with her ability to do vocals, and absolutely everything to do with her abilility to take off her clothes...

    --

    "Freedom means freedom for everybody" -- Dick Cheney

  29. walk the walk, or talk the talk by Doc+Ruby · · Score: 2, Insightful

    Talk is cheap, so if you can't implement your invention, punt! Explain it in detail in public, or among a relevant developer group. Then you at least have a chance of being in on the creation, along your lines of vision, and at least get your world bettered, even if you can't cash in by doing the hard part. The joy of coinvention sure beats the bitterness of "coulda, woulda, shoulda". Most of the open source process is based on that crosspollination and mutual assist. Hell, if we all did this better, maybe the docs would get written *first*, rather than never-quite.

    --

    --
    make install -not war

  30. Wow... by CptChipJew · · Score: 4, Interesting

    The one thing I expected they wouldn't get right, was what they did the best.

    When people hold notes, there are natural fluctuations in the tone, nobody can hold a perfect tone without some audible wax or wane.

    But you can hear this simulated amazingly if you listen to that one japanese song with the single male "vocalist".

    --
    Vonal Declosion
  31. Re:Quality by ackthpt · · Score: 2, Insightful
    If you get rid of the background instruments, the synthetic voice still sounds ... well... synthetic.

    So it fits right in with most of the pap on the top 40.

    The real sham is all the manufactured music that's been out there for years and increasing. Just program it a dictionary and it'll do rap, too.

    --

    A feeling of having made the same mistake before: Deja Foobar
  32. Re:Human voice by Afrosheen · · Score: 3, Funny

    I totally agree with you 100%. I mean, come on, the Terminator had a hella thick accent, and he came from like 30 years in the future.

    Ah'll be BECK.

  33. Great. by 2names · · Score: 3, Funny

    It's bad enough we have Cher and Madonna doing the fake electronic yodel dance, now every dipthong with a little cash will be creating them.

    --
    "I'm just here to regulate funkiness."
  34. Re:Human voice by cduffy · · Score: 2, Insightful

    I've done my time in a "philosophy of AI" class -- and frankly, as far as I'm concerned, the Chinese Room argument and the like are bogus. If it looks like a duck, walks like a duck, quacks like a duck -- it's a duck, at least until someone finds out that it doesn't in fact keel over when given cyanide-laced bread and has to plug itself in to charge its batteries every so often.

    Consider: For the purposes of this thread, AI is considered incomplete due to inability to simulate emotions why? Because it's argued that the desired output (emotionally-charged vocalization) is impossible without a precondition which is argued to be impossible. If the output were indeed achieved, then for the purpose of its singing, would the system not for all intents and purposes be emotional, even if one is unable to demonstrate that the system actually experiences qualia? (After all, if we're unable to make this demonstration wrt other humans, why make it a requirement for nonhuman intelligence?)

    Granted, I also think that a machine which can pass an unrestricted Turing Test over an extended period can be safely be considered capable of thought, so it's obvious which side of this debate I land on. :)

  35. Not the only person working on this... by jhhl · · Score: 2, Informative
    I'm glad someone is releasing this kind of thing - many people gave worked with voice algoritms for years doing vocal synthesis.

    • Tellus cassette recording FALSE PHONEMES came out in 1988 or so, from Harvestworks
    • plenty of good resynthesis is out there, using various vocal synth technologies like LPC.
    • MBROLA sings pretty well if you feel like coding up the data
    • FLINGER is open source. It uses the lyric tags in a MIDI stream to drive the synthesis algorithm.
    • Perry Cook's excellent physical modeler Sing1.2 for the NeXT is very, very old at this point.

      So there is prior art spewing out all over the place.

      and how could I leave off:
      eddie and eedie? (search for "Some Velvet Morning"
    --
    -- Real Stupidity is the Artificial Intelligence of the 21st century
  36. asx and webpage html by technix4beos · · Score: 2, Informative

    The "webmaster" who wrote the linked page of demos is linking to ASX files, which in turn link straight to the self-named mp3 files on the server.

    In case the direct "save/play" links do not work with your browser and OS, just replace the asx with mp3, and enjoy.

    --
    user@host$ diff /dev/urandom /dev/uspto
  37. Re:One for the road... by psychoticmelody · · Score: 2, Insightful

    why eliminate musicians? is there no room in life for art anymore? also, if those "unskilled morons" flooded the job market, where would you go? musicians put hard work into their jobs too. although, pop idols today don't seem to. sorry, i had an offtopic rant as well. I'd be glad to see less petty musicians as well, but i don't see why all musician should be eliminated.

  38. entirely computer generated music... by donutz · · Score: 2, Interesting

    You clearly don't understand. This is the last piece in the puzzle of completely eliminating musicians. We have had a drum machine to replace you for a while, electronic instruments and MIDI.

    Now we can finally get rid of these whiny musicians, always complaining about "I need to feed my family" and "I'm a professional and should be paid like one."


    Close...this is just the piece of the puzzle that gets rid of those money-grubbing vocalists. Combine this latest development with a computer composition engine and we won't need any musicians at all!

  39. prank calls... by biz0r · · Score: 2, Funny

    Now I can finally make some real sounding prank calls for once! Oh the joy!

    Although...most idiots out there fall for the midi based voices anyhow...

    --
    /* sig */
  40. Is not by filtersweep · · Score: 5, Insightful

    I have several vocoders, software and hardware, and it is obviously a very different creature if you took the time to listen to the demos. Also, a proper vocoder needs an carrier, and it does not generate the vocal qualities. It merely functions as a formant filter (where the constanants are provided by the vocalist, and the pitch by usually a synthesizer).

    Frankly, this thing just really needs a good plug-in format, like TDM or VST and it will be a gold-mine- not unlike those god-awful pitch-correction plugins that were reputed to give Cher that plastic effect to her voice (like she doesn't have enough plastic as it is). As a standalone app, it is doomed.

    --


    Those that suggest you "dance like no one is watching" really want to see you make a complete fool of yourself.
  41. Similar thing already been done on a 386 in 1998 by PsyQ · · Score: 2, Interesting

    Something similar was done by Alexei Shulgin in 1998, on a 386. Sure, he did it by writing the phonetic instructions for the speech synthesis engine by hand, but Yamaha's solution is just a much more sophisticated (and better funded) version of that.

    Check out 386DX, his band project. Which includes the 386. That only has 4 MB of RAM and also has to do visualizations and MIDI sound at the same time.

    I've had the fortune of seeing him perform live in Linz as well as chatting with him a little, and he came to our school for a lecture. He has a few brilliant projects, maybe you might like WIMP which he developed with a friend.

  42. Groupies by wpiman · · Score: 2, Funny

    This just brings us another day closer to when engineers start having groupies.......

  43. Re:One for the road... by Aqua+OS+X · · Score: 2, Interesting

    I've been playing for about 15 some odd years. I have both an acoustic set and some Roland V drums. The V drums sound very very real if you have them connected to a good PA or nice a recording set up. If you don't have good equipment, and you don't sound check this shit out of them before you play, they sound kind'a fake.

    With V drums you can virtually alter drum woods, alter cymbal metals, alter instrument sizes, switch drum heads (pin strip, coated, etc), place tape or foam on your heads, use brushes, grab and mute cymbal crashes, add a custom levels to the snare gate, tight or loosen head, change room acoustics, stick stuff in the bass drum, etc etc.

    If you know what your doing you can make them sound real and imperfect just like an acoustic set. However, you -need- a good PA, and you need to sound check the shit out of them before you play (quick set up, long sound check).

    --
    "Things are more moderner than before- bigger, and yet smaller- it's computers-- San Dimas High School football RULES!"
  44. Re:One for the road... by iapetus · · Score: 2, Funny

    No, that's alright. I'm in a band that used Fruityloops on a laptop to provide drums until we managed to find a human drummer, but it kept terrible time - it was always out of time with the rest of the band by the end of the song...

    --
    ++ Say to Elrond "Hello.".
    Elrond says "No.". Elrond gives you some lunch.