Slashdot Mirror


Baidu Open-Sources Its Deep Learning Tools (theverge.com)

An anonymous reader quotes a report from The Verge: Microsoft, Google, Facebook, and Amazon have all done it -- and now Baidu's doing it, too. The Chinese tech giant has open sourced one of its key machine learning tools, PaddlePaddle, offering the software up to the global community of AI researchers. Baidu's big claim for PaddlePaddle is that it's easier to use than rival programs. Like Amazon's DSSTNE and Microsoft's CNTK, PaddlePaddle offers a toolkit for deep learning, but Baidu says comparable software is designed to work in too many different situations, making it less approachable to newcomers. Xu Wei, the leader of Baidu's PaddlePaddle development, tells The Verge that a machine translation program written with Baidu's software needs only a quarter of the amount of code demanded by other deep learning tools. Baidu is hoping this ease of use will make PaddlePaddle more attractive to computer scientists, and draw attention away from machine learning tools released by Google and Facebook. Baidu says PaddlePaddle is already being used by more than 30 of its offline and online products and services, covering sectors from search to finance to health. Xu said that if one of its machine learning tools became too monopolistic, it would be like "trying to use one programming language to code all applications." Xu doesn't believe that any one company will dominate this area. "Different tools have different strengths," he said. "The deep learning ecosystem will end up having different tools optimized for different uses. Just like no programming language truly dominates software development."

27 comments

  1. An ecosystem of AI? by gilgongo · · Score: 1

    It's at least an interesting idea that AI might end up mimicking animal minds, with specialist centres dealing with specific aspects of cognition. Wonder what that would translate into for machine AI?

    --
    "And the meaning of words; when they cease to function; when will it start worrying you?"
  2. Meh by melted · · Score: 3, Informative

    I work in a research lab, and as far as we're concerned, game is basically over, and TensorFlow has won. Some computer vision researchers are still using Torch, but even they are considering moving to TF because it's faster, and Lua is, how can I say this diplomatically, not a good choice for many reasons.

    1. Re:Meh by Anonymous Coward · · Score: 0

      How can the game be won and over when we're just starting to scratch this science? Naivety can't possible be a virtue for someone working as a researcher, do they actually pay you for what you do?

    2. Re:Meh by Anonymous Coward · · Score: 1

      What kind of a researcher are you that you can judge a new system 15 minutes after the news came out. You obviously didn't look at it at all because you already know that the thing you use it better.

      Nice research!

    3. Re:Meh by ImdatS · · Score: 5, Interesting

      I'm not sure about that.

      I just briefly glanced at PaddlePaddle and its "QuickStart" is actually a "start" instead of TensorFlow's highly complex unusable documentation.

      PaddlePaddle seems to be directed towards the user instead of the scientific community. I know, TensorFlow has some examples for beginners (MNIST sample) but in order to get something out of TensorFlow I need weeks of reading, trying to understand how it works under the hood and try something out - and in most cases it was just really frustrating.

      Admittedly, I'm not an expert and I'm not in academia - but I want to use it in real-world applications and TensorFlow (without SyntaxNet/Parsey MacParseface) is just ... technology preview ... that I can experiment with but cannot actually use as an outsider for anything practical.

      I'm doing language analysis and working on a product for a customer to reduce the burden of some of his call center agents by applying machine learning to respond to customer's requests automatically.

      The only practical solution so far was using spaCy - TensorFlow was just a mess, including SyntaxNet.

      I'll try out PaddlePaddle, especially because their initial "Quick Start" is actually about a real-world problem.

      There is absolutely nothing about real-world problems such as "Chat" or "FAQ-type bots" using TensorFlow - what I could find so far was only mostly academic mumbo-jumbo.

      Sorry to say that - but most of Google's documentations about their technologies just suck ..

    4. Re:Meh by Anonymous Coward · · Score: 0

      Sorry, TensorFlow is poorly written, it's not something that can be built upon.

    5. Re:Meh by h33t+l4x0r · · Score: 1

      I'm doing language analysis and working on a product for a customer to reduce the burden of some of his call center agents by applying machine learning to respond to customer's requests automatically.

      So how does sentiment analysis or image classification help with that? You already know that their sentiment is negative and their face is frowning.

    6. Re:Meh by houghi · · Score: 0

      reduce the burden of some of his call center agents

      Like the burden of getting up and coming to work. Because if it is succesfull, it will be used to reduce cost and that means reduce FTE count.
      Bit like the self checkout, self driving cars and what not. It is NOT to please people, it is to increase profit at any cost.

      --
      Don't fight for your country, if your country does not fight for you.
    7. Re:Meh by lorinc · · Score: 1

      I've looked the quick start too, and I find it less friendly than Keras, which has a TF backend if you want really want to use TF.

    8. Re:Meh by ImdatS · · Score: 2

      In fact this client of mine has some highly skilled agents whose job is to respond to medical questions - normally.

      Unfortunately, they are also burdened with the normal questions such as "where is my order", "how do I do this on your website", "I forgot my password", etc - coming in via email or other textual interfaces...

      Currently, the aim is to reduce this kind of burden.

    9. Re:Meh by ImdatS · · Score: 1

      Thanks, will check it out and see what I can do with that...

    10. Re:Meh by Anonymous Coward · · Score: 0

      I work in a research lab, and as far as we're concerned, game is basically over, and TensorFlow has won.

      Between TensorFlow, Torch, Theano, Caffe, and PaddlePaddle, that may be true. On the other hand, hopefully, the game is anything but over and there will be a next generation of deep learning frameworks that is better than any of them, because TensorFlow has lots of limitations and problems.

    11. Re:Meh by melted · · Score: 1

      What do you mean "technology preview"? People use it in production. Google itself has used in production for a couple of years now. And it scales down all the way to phones, and up all the way to distributed clusters. If TF is too low level for you, use Keras or TFLearn, those are much higher level.

      As a side note, if you expect a pre-packaged, "production ready" solution to come out of academia, you're naive. This statement does not apply to TF itself: unlike many other frameworks it's a _second_ iteration of a production system that Google ran at scale for many years (Google Brain), and some of Google's best engineers are working on it. And it's just getting started. I bet sometime next year they'll release support for TPUs on Google Cloud, which will make even fairly computationally intensive inference cost effective in real-time.

    12. Re:Meh by ImdatS · · Score: 2

      Apologies for "Technology Preview" - I didn't mean it as "not mature" but rather as "a technology to play around with" - for us "Google Outsiders"...

      Yes, I know that Google has been using it for quite some time... But like anything else I have seen from Google as technology, it seems like a nice technology rather than something to build a product on it - for outsiders.

      To make it short: Google, in my view, makes technology unnecessarily complicated to use. PaddlePaddle seems a lot easier (I looked a little further into PaddlePaddle in the meantime).

      In any case, I will, of course, continue working with TF as well as with PaddlePaddle. And yes, I'll look into Keras as already suggested by someone else.

  3. Andrew Ng by Anonymous Coward · · Score: 2, Interesting

    Andrew Ng now works at Baidu. Any chance that Ng has something to do with this?
    After all, he is responsible for a lot of people understanding Machine Learning at Stanford, Coursera, and Google.
    He seems to be passionate about spreading knowledge of Machine Learning.

  4. The AI Driver by Anonymous Coward · · Score: 0

    Thirty five years ago, before printer drivers allowed software to work with many devices, we had the same interface problems as AI has today. If AI drivers became ubiquitous then every user's interface would be unique. There are several million prototypes of AI solvers out there, as sophisticated users, most of whom would like what Baidu are offering, if only the driver issue was solved between the deep learning supplier community and the user. This *userware* has grown into lopsided interfaces that only unique users know or care about. Nonetheless, all AI interface dreams should be accommodated by any AI Driver Standards Committee.

    I nominate the open source AI primitive, called Ingrid, to have its prototyping language modify and make use Baidu's deep learning toolbox. Do I hear any seconders?

    1. Re:The AI Driver by Anonymous Coward · · Score: 0

      Here, AI. If you are so intelligent, guess the size of my dick.

    2. Re:The AI Driver by Anonymous Coward · · Score: 0

      A little more about me. I have been a practitioner in AI over forty years, all the while waiting for the AI driver. It dawned on me that, if I subcontracted Baidu, or any other provider, into offering Ingrid as a service, they might yet write the AI Driver.

  5. So where's the source by Anonymous Coward · · Score: 0

    Where's the link to the source? I want to actually see it.

    1. Re:So where's the source by EzInKy · · Score: 1

      Still awaiting Chinese Government approval.

      --
      Time is what keeps everything from happening all at once.
    2. Re:So where's the source by ImdatS · · Score: 2

      In the summary, there is a link...

      Here is the link: http://www.paddlepaddle.org/

    3. Re:So where's the source by ShanghaiBill · · Score: 1

      Still awaiting Chinese Government approval.

      Unlikely. Baidu's research lab is in California.

    4. Re:So where's the source by Anonymous Coward · · Score: 0

      With a tunnel straight to Peking, passing He!! on the way!

    5. Re:So where's the source by Anonymous Coward · · Score: 0

      Silly LaoWai! That's just a building with a name on it. Research? More like espionage. What you believe is cute, though.

  6. OH NOES it's Chinese !! by Anonymous Coward · · Score: 0

    We can't trust it USA #1

  7. Baidu should apply that learning by jafiwam · · Score: 1

    Baidu should apply that "deep learning" to it's dumbass search spider that keeps looking for files deleted more than seven years ago.

    Look, dumbass, 404 means it's NOT FUCKING THERE.

  8. So the chinks stole some more tech by Anonymous Coward · · Score: 0

    As usual