Baidu's Supercomputer Beats Google At Image Recognition

Great power by Anonymous Coward · 2015-05-14 12:05 · Score: 4, Insightful

I'm not sure an improvement of .5 percent on image cataloging is really that impressive to get not one but two greats...

Re:Great power by Barny · 2015-05-14 13:16 · Score: 0

"The ImageNet Classification Challenge, as it is called, involves training software on a collection of 1.5 million labeled images in 1,000 different categories, and then asking that software to use what it learned to label 100,000 images it has not seen before."
"Wu said that Minwa had made it possible to train the system on higher-resolution images. It also permitted use of a technique that turned the original 1.2 million training images into two billion by distorting them, flipping them, and altering their colors. Using that larger training set improved accuracy by preventing the system from becoming too fixated on the exact details of the training images, said Wu. The resulting system should be better at handling real-world photos, he said."
And they sort-of cheated to do it. I am sure if Google and MS would do a similar trick with their systems their accuracy would improve too.

--
...
/me sighs
Re: Great power by Anonymous Coward · 2015-05-14 13:28 · Score: 2, Informative

That's not cheating, that's actually how everybody do it.
Re:Great power by ravenshrike · 2015-05-14 15:12 · Score: 1

Unless they ran multiple initial image sets as well as multiple 'test' sets, there's absolutely no way to know whether their win was statistically insignificant.
Re:Great power by Citizen+of+Earth · 2015-05-15 01:13 · Score: 0

The article says "error rate of just 4.58 percent ... Google's system scored a 95.2% and Microsoft's, a 95.06%". That means Google's and Microsoft's error rates are absolutely terrible and they really should just toss a coin!
Re:Great power by asliarun · 2015-05-15 01:49 · Score: 2

The article says "error rate of just 4.58 percent ... Google's system scored a 95.2% and Microsoft's, a 95.06%". That means Google's and Microsoft's error rates are absolutely terrible and they really should just toss a coin!
Err no.. that was just terrible reporting. Google's attempt had an error rate of 4.82 and Microsoft, 4.94.
I guess Baidu reported it this way to make their "win" sound more sensational.
4.58 error rate vs an error rate of 4.82 or 4.94 doesn't sound that phenomenal, I guess.
To quote:
"The system trained on Baidu’s new computer was wrong only 4.58 percent of the time. The previous best was 4.82 percent, reported by Google in March. One month before that, Microsoft had reported achieving 4.94 percent, becoming the first to better average human performance of 5.1 percent."
Re:Great power by Anonymous Coward · 2015-05-15 03:05 · Score: 0

+1. Need more data as 0.22% is not a significant margin. OTOH Baidu probably spent aeons working to get that last 0.22% which is usually what it is when you approach lower and lower classification error rates and then hope that the changes don't go the wrong way, i.e. are really only special case applicable, etc.

Note to Slashdot Editors by Malenx · 2015-05-14 12:07 · Score: 4, Interesting

This is actually News for Nerds.

I'm curious how much difference in computational power was thrown at training these by Google, Microsoft, and Baidu, though it's going to be great to watch how these continue to evolve.

Re:Note to Slashdot Editors by ShanghaiBill · 2015-05-14 12:58 · Score: 1, Insightful

I'm curious how much difference in computational power was thrown at training these
Training a NN requires a lot of cycles (usually GPU farms) but there is a limit to how much is useful. If you just continue to cycle over the same data, you end up over training your network, so that it basically just memorizes the training set, but fails to generalize to other data sets. Rather than just throwing more computational power at the problem, it is usually more productive to use more data, and improve your algorithms and configurations. Using more data wasn't an option in this case, since it was a standard benchmark, so the training set and the test set were fixed.
Anyway, we live in interesting times. I went to see Ex Machina yesterday. I want my own Ava (but with the "stab" feature disabled).
Re:Note to Slashdot Editors by Noah+Haders · 2015-05-14 13:20 · Score: 1, Funny

I went to see Ex Machina yesterday. I want my own Ava (but with the "stab" feature disabled).
Spoiler alert!
Re:Note to Slashdot Editors by camperdave · 2015-05-14 18:41 · Score: 2

Pretty soon Penny's shoe app will become a reality.

--
When our name is on the back of your car, we're behind you all the way!
Re:Note to Slashdot Editors by Anonymous Coward · 2015-05-14 19:54 · Score: 0

Good going, mate. That spoiler was just what I needed.
Re:Note to Slashdot Editors by Anonymous Coward · 2015-05-15 00:55 · Score: 0

Boo this man. BOO.
Re:Note to Slashdot Editors by Anonymous Coward · 2015-05-15 02:27 · Score: 0

Then you're a bit of a cunt for spoiling something.
Re: Note to Slashdot Editors by subh_arya · 2015-05-15 04:34 · Score: 1

Most deep learning algorithms used for image classification tasks use a data augmentation step - wherein they alter the training image through scaling, translation, etc. According to the paper published here:http://arxiv.org/abs/1501.02876, they do additional transformations in the training images to make the learned model even more robust. So the risk of using up cpu cycles on the same data again and again is reduced.

--
A computer scientist is someone who, when told to Go to Hell sees the "go to" rather than the destination, as harmful.

get one now! by jjeffries · 2015-05-14 12:07 · Score: 1

Only $7.42 including shipping! (AC adapter not included.) Estimated 21-47 days delivery time to USA.

Just Guess by Greyfox · 2015-05-14 12:07 · Score: 4, Funny

After all the news stories from the past couple of years, it seems like you could just guess "Yeah, that's a penis" and be correct about half the time. Seems like most people if you give them a camera, they're going to take a picture of a penis with it. And subsequently post that picture to the internet somewhere.

--

I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

Re:Just Guess by __1200333 · 2015-05-14 12:38 · Score: 2

picture of a penis
Or cats!
Re:Just Guess by PPH · 2015-05-14 12:39 · Score: 1

This will make searching for other porn featuring the subject of 4chan threads more efficient. Even a quarter of a percent improvement in performance is going to save companies millions of dollars as their employees browse /b/.

--
Have gnu, will travel.
Re:Just Guess by Anonymous Coward · 2015-05-14 12:53 · Score: 0

but are they 'space' dicks?
Re:Just Guess by Marginal+Coward · 2015-05-14 14:58 · Score: 1

Which reminds me...I sure got tired of all the "Anthony Wiener" jokes a while back, but just think of how much fun we'll be missing: we won't have Letterman to kick him around anymore [sniffle].
Re:Just Guess by Bing+Tsher+E · 2015-05-14 15:02 · Score: 1

But is it a piebald cat, and are the tabby areas grey or orange?

Bad summary by Anonymous Coward · 2015-05-14 12:08 · Score: 5, Insightful

The summary is written to imply that Google/MS have error rates in the 90's, while the competition only has about 5% error. The values got inverted - Google/MS also have error rates around 5%, but are behind by fractions.

Your maths is off... by ZG-Rules · 2015-05-14 12:11 · Score: 5, Insightful

As a pedant, I need to point out that the improvement is 0.24%

"The system trained on Baidu’s new computer was wrong only 4.58 percent of the time. The previous best was 4.82 percent, reported by Google in March. One month before that, Microsoft had reported achieving 4.94 percent, becoming the first to better average human performance of 5.1 percent."

Also why are the numbers reversed to quote success rates for Google and Microsoft in the summary on Slashdot - it would have been much clearer if the actual numbers in the article (which were all error rates) were quoted!

Re:Your maths is off... by rudy_wayne · 2015-05-14 12:44 · Score: 4, Insightful

Also why are the numbers reversed to quote success rates for Google and Microsoft in the summary on Slashdot - it would have been much clearer if the actual numbers in the article (which were all error rates) were quoted!
Because this is Slashdot and it is required that all stories be written as poorly as possible.

Baidu's new computer was wrong only 4.58 percent of the time. The previous best was 4.82 percent, reported by Google in March.

If Google is only wrong 4.82% of the time then why is it whenever I search for an image I get thousands of pictures that have absolutely nothing to do with what I am searching for?
Re:Your maths is off... by Anonymous Coward · 2015-05-14 12:49 · Score: 0

Also why are the numbers reversed to quote success rates for Google and Microsoft in the summary on Slashdot - it would have been much clearer if the actual numbers in the article (which were all error rates) were quoted!
I agree.
Mixing success and failure rates is a sign that the author is trying to confuse or deceive the reader.
Or possibly it is a sign that writers don't understand what they are writing.
Re:Your maths is off... by xvan · 2015-05-14 12:54 · Score: 1

Because Google can't read your mind... yet... so it needs to guess multiple contexts.
Re:Your maths is off... by Anonymous Coward · 2015-05-14 13:16 · Score: 0

My main question is how damn stupid are the humans they are using to have a >5% error rate looking at pictures. I mean, I know most people are too stupid to read, but looking at pictures???
Re:Your maths is off... by Feral+Nerd · 2015-05-14 13:53 · Score: 1

Also why are the numbers reversed to quote success rates for Google and Microsoft in the summary on Slashdot - it would have been much clearer if the actual numbers in the article (which were all error rates) were quoted!
Because this is Slashdot and it is required that all stories be written as poorly as possible.

Baidu's new computer was wrong only 4.58 percent of the time. The previous best was 4.82 percent, reported by Google in March.
If Google is only wrong 4.82% of the time then why is it whenever I search for an image I get thousands of pictures that have absolutely nothing to do with what I am searching for?
Try using Bing image search. I won't say the results are always better but there is much less noise.
Re:Your maths is off... by gl4ss · 2015-05-14 14:34 · Score: 1

different image set.
the real life image set is harder.
also the test has... well.. it has set imagesets for trainig and then as the test. would the chinese CHEAT? surely not! or perhaps they just threw more cpu and memory at the problem until they could beat the previous by a fraction(you can mess with the first dataset if you want to).
who the fuck cares anyways since half the pictures on the internet wont be baidu reachable anyways.

--
world was created 5 seconds before this post as it is.
Re:Your maths is off... by bouldin · 2015-05-14 14:55 · Score: 5, Informative

My main question is how damn stupid are the humans they are using to have a >5% error rate looking at pictures.
This was the first thing I thought after reading the summary, too. I had to dig into a paper about the 2014 ImageNet challenge, but here is the likely answer:

The most common error that an untrained annotator is susceptible to is a failure to consider a relevant class as a possible label because they are unaware of its existence.

My second question was, if humans failed to label the images correctly, how did they get a correct label in the first place?
The methodology they used just to label the images is impressively sophisticated. Briefly, they crowdsourced through Amazon Mechanical Turk. A first person would draw bounding boxes around individual items in each image, then additional people would classify the items in each box. Only when a majority of labelers agreed on a label did they consider the label correct.
Re:Your maths is off... by phantomfive · 2015-05-14 15:35 · Score: 1

Nice research. Most informative post here.

--
"First they came for the slanderers and i said nothing."
Re:Your maths is off... by rtb61 · 2015-05-14 15:57 · Score: 1

Well, I would suppose that means that image recognition is just a matter of opinion of the viewer of the image. Not so much artificial intelligence as bias in pattern recognition. Currently the best goal for artificial intelligence is accurate in context translation services. First the written word and then the spoken word.

--
Chaos - everything, everywhere, everywhen
Re:Your maths is off... by rubycodez · 2015-05-14 16:08 · Score: 2

Get a more mainstream porn fetish, you pervert
Re: Your maths is off... by Anonymous Coward · 2015-05-14 16:37 · Score: 0

I have no love for overfitting to benchmarks, but to be fair, it depends on the way you value success and failure. If every wrong answer costs you $100, and right answers don't earn you anything, then 95%->95.5% is actually a 10% improvement. Similarly, 99%->99.9% will be a 90% improvement.
Re:Your maths is off... by stephanruby · 2015-05-14 18:01 · Score: 1

why is it whenever I search for an image I get thousands of pictures that have absolutely nothing to do with what I am searching for?
Never search for hard core porn with safe-search turned on, that just doesn't work. Yes, I know. Sometimes, I also wish that Google came with an instructional manual, but sadly, it doesn't.
Another area Google fails miserably at, especially compared to Baidu, is searching for copyrighted materials. With Google, it's DMCA this and it's DMCA that, along with a number of annoying paywalls. With Baidu, as long as you're not searching for photos of Tiananmen Square, you're golden.
Re:Your maths is off... by geantvert · 2015-05-15 01:39 · Score: 1

Being even more pedant I will point out that the improvement is a lore more. What is important here is the error rate.
Simply speaking, they went from 4.82 to 4.58 so the improvement is (4.82-4.58)/4.82 = 0.0497 ~= 5%
Another way to see that is that Google made 48200 errors on the full set of 1 Million images while Baidus made only 45800 errors.
Re: Your maths is off... by Anonymous Coward · 2015-05-15 01:40 · Score: 0

Because they don't use this algorithm on Google search. They use an earlier algorithm on Google plus photo search that is quite good. Try searching for an object or person on there.
Re:Your maths is off... by Rutulian · 2015-05-15 06:25 · Score: 1

There is a TED talk about it,
https://www.ted.com/talks/fei_...
Quite interesting.
Re:Your maths is off... by Anonymous Coward · 2015-05-15 07:58 · Score: 0

Follow the money
Re:Your maths is off... by TelevisioSledgicus · 2015-05-15 19:13 · Score: 0

Because this is Slashdot and it is required that all stories be written as poorly as possible.
I believe a more scientific observation would be because our society provides roles for those not capable enough to participate in a field themselves to act as cheerleaders for it - no different than sports writers, most contemporary "news" writers/critics, reporters.... and, you know, those that observe and discuss, but do not do. (like me)

That's only 0.24% gain by Anonymous Coward · 2015-05-14 12:12 · Score: 0

congrats on the sensationalism.

Your monthly algorithm tweak brought to you by... by Idarubicin · 2015-05-14 12:22 · Score: 4, Insightful

Okay, so we have a benchmark where the bog-standard human being scores 94.9%.

Then in February (that's three months ago), Microsoft reports hitting 95.06%; the first score to edge the humans.

Then in March, Google notches 95.18%.

Now it's May, and Baidu puts up a 95.42%.

Meh. Swinging dicks with big iron are twiddling with their algorithms to squeeze out incremental, marginal improvements on an arbitrary task.

“Our company is now leading the race in computer intelligence,” said Ren Wu, a Baidu scientist working on the project. ... “We have great power in our hands—much greater than our competitors.”

I presume that next month it will be IBM boasting about "leading the race" and being "much greater than their competitors". The month after that it will be Microsoft's turn again. Google will be back on top in August or so...unless, of course, some other benchmark starts getting some press.

--
~Idarubicin

The real award goes to Baidu when they can: by Anonymous Coward · 2015-05-14 12:25 · Score: 1

Build a "Watson style" chatterbot that can win on Jeopardy and despite this miraculous achievement, have the company go under because management are dicks to the level of being able to fuck up a free lunch with an error rate of just 4.3%.

I think this is the fastest supercomputer... by Anonymous Coward · 2015-05-14 12:31 · Score: 0

Not quite...

Re:Your monthly algorithm tweak brought to you by. by Jacob+Shumway · 2015-05-14 12:36 · Score: 2

The real question, of course, is whether Google, Microsoft, and Apple will soon have to face a serious international competitor. It's true that Baidu's incremental image recognition changes might not be a game changer. But if there's any substance to these claims about speech recognition, Baidu might be on track to produce an actual competitive advantage in ways highly relevant to consumers.

ok. 0.3%... big deal by Anonymous Coward · 2015-05-14 12:37 · Score: 0

beating humans, Microsoft and Google.

Sure, but not beating several gov't intel organizations around the world... likely.

Better than Google and Microsoft by Anonymous Coward · 2015-05-14 12:44 · Score: 0

But can they claim that they are better than the NSA at image recognition? Somehow I doubt it.

Re:Better than Google and Microsoft by Anonymous Coward · 2015-05-14 21:31 · Score: 0

Why shouldn’t they be. The NSA doesn't care if they have a 100% hit rate just as long as they can throw *some one* in jail they probably don't care who it is.

Coolio by koan · 2015-05-14 12:48 · Score: 2

Baidu said "Your Kung Fu no good in my village"

--
"If any question why we died, Tell them because our fathers lied."

Re:Coolio by dave420 · 2015-05-15 02:40 · Score: 0

Hahaha! Stereotypes are funny! Isn't that right, Mr. obese geographically-challenged backwards American? :)
Re:Coolio by koan · 2015-05-15 04:16 · Score: 1

Yep, I come from a time when all my friends (mixed race as we were) had no issues with stereotypes, we routinely teased each other, I was the "cheese eater" and it's true... I love cheese.
In fact a Latino friend once told me "White folks have nothing on Mexicans when it comes to racism" apparently racism is a real problem down south of the border (and every where else), especially for the indigenous population.
And... every where I have lived in the World, light skin was favored, in arranged marriages light skin drew a higher demand than dark skin.
Funny that some how racism became the domain of white (whatever that is) Americans since I have seen the same behavior every where in the World.
Now the World is made up of cry babies, whiners, and PC fascist.
http://news.dethronethebankste...

--
"If any question why we died, Tell them because our fathers lied."
Re: Coolio by Anonymous Coward · 2015-05-15 05:30 · Score: 0

POWWWW, you hit it on the head. You know where I learned black jokes? From black people. Know where I learned Polock jokes? From polish people. Know where I learned Jewish jokes? From Jewish people. You get my drift.
Sometimes that wouldn't be the case, but it applied a lot.
my all time favorite: how do u get a one arm Polock out of a tree? You wave ;)

Correct me if Im wrong by Anonymous Coward · 2015-05-14 12:55 · Score: 2, Interesting

I only took basic AI in university but...

The power of the computers is not the important thing here if it takes 3 weeks to train the neural network or 1 day does not change the ACCURACY. The running of the NN to identify a picture is also only a fraction of the training time.

Maybe its more about the TRAINING SET here rather than CPU power. It seems extraordinary. 1 000 000 images sorted into 1000 categories must have been done by humans right? Humans sitting like dog,dog,dog,airplane,dog, house, dog OHPLEASELETMEJUSTDIE.

A bigger trainingset is usually a good thing with NN to avoid overtraining which makes it generalize worse on test examples.

Huh? This is not a very powerful computer. by Chalnoth · 2015-05-14 13:03 · Score: 4, Interesting

The computer has 72 processors and 144 GPU's. That's tiny. Seriously tiny. Sure, GPU's are powerful, especially for image processing. But the larger computers these days are running tens to hundreds of thousands of processors in parallel.

For example, assuming each shelf has 2 processors and 4 GPU's, and they can fit 12 shelves into a single rack, that's a total of 2 racks. Compare that to this image of one of Google's datacenters, where you can see dozens of racks, each containing 14 shelves by my count. And that's just one row. These are gigantic warehouses, with row upon row of racks.

The level of processing power claimed here is closer to the level of a university processing cluster. The larger scientific clusters can be ten or a hundred times larger, and it's not clear just how big private datacenters are.

So overall I'm very, very skeptical. There's a very good chance that they fudged the data somehow to make theirs appear better. But if it is better, well, there's no reason why Google and Microsoft couldn't easily outcompete them in short order.

Re: comparing high success / low failure rates by Anonymous Coward · 2015-05-14 13:04 · Score: 0

As you near 100% success, the ratio of failures is much more important than the difference in percentage.

For example, think of the case where two things have 99.75% and 99.99% success (i.e. 0.25% and 0.01% failure). It's the same difference of 0.24%, but it's much more significant because they're closer to 0% failures. In this fictional example, the one with 0.25% failure has 25x as many failures as the one with 0.01% failure!

tl;dr: 4.58% / 4.82% = 0.950, or 5% fewer failures.

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 13:15 · Score: 0

I wonder how good farce book is at this?

That's great by denkepban · 2015-05-14 13:18 · Score: 0

I was fantasic http://denkepbanpixar.blogspot...

Re:That's great by Anonymous Coward · 2015-05-14 14:54 · Score: 0

Well done. It is off topic though.
(looks like it's his website)
Re:That's great by baloseo · 2015-05-14 15:38 · Score: 0

http://thietbiyteonline.net/da...

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 13:19 · Score: 2, Insightful

It may not be processing power alone, but perhaps a better learning algorithm.

When it comes to solving problems, elegance can sometimes beat out brute force. :D

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 13:26 · Score: 0

Marginal improvements are worthwhile if the cost of failure is relatively large. c.f. branch prediction in computer architecture....

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 13:35 · Score: 1

Baidu doesn't have as many women or as much racial diversity, which is why they're going to fall behind pretty quickly.

PR, not science? by Black+Copter+Control · 2015-05-14 14:01 · Score: 1

You get a small improvement on an old benchmark --- so promote the heck out of it before somebody beats you again, in turn.

There's nothing really wrong with this announcement -- It's just not a big breakthrough of any real sort.

--
OS Software is like love: The best way to make it grow is to give it away.

Re:PR, not science? by Anonymous Coward · 2015-05-14 15:15 · Score: 0

It shows Baidu is competitive as a image search engine. Most of us don't use Baidu so it's good to know.

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 14:02 · Score: 0

The computer has 72 processors and 144 GPU's. That's tiny. Seriously tiny. Sure, GPU's are powerful, especially for image processing. But the larger computers these days are running tens to hundreds of thousands of processors in parallel.

For example, assuming each shelf has 2 processors and 4 GPU's, and they can fit 12 shelves into a single rack, that's a total of 2 racks. Compare that to this image of one of Google's datacenters, where you can see dozens of racks, each containing 14 shelves by my count. And that's just one row. These are gigantic warehouses, with row upon row of racks.

The level of processing power claimed here is closer to the level of a university processing cluster. The larger scientific clusters can be ten or a hundred times larger, and it's not clear just how big private datacenters are.

So overall I'm very, very skeptical. There's a very good chance that they fudged the data somehow to make theirs appear better. But if it is better, well, there's no reason why Google and Microsoft couldn't easily outcompete them in short order.

This is not a competition of hardware. It is a competition of software and algorithms. The hardware is somewhat irrelevent, and a smaller hardware footprint would actually be more impressive! You also seem to think that the #1 search provider in the chinese market is incapable of standing toe to toe with an American company. I think you need to get out a little more.

Re:King Frosty The First Beats All @ Being First by Anonymous Coward · 2015-05-14 14:16 · Score: 0

Nix...

Re: swinging dicks by Anonymous Coward · 2015-05-14 15:01 · Score: 0

Obligatory link to Richard Pryor's famous joke about big dicks.

If the #t=470 doesn't work, just skip to 7:50. The joke only takes 50 seconds. You won't be disappointed, even though you've probably heard it before. ;)

Mine is bigger - much bigger by Anonymous Coward · 2015-05-14 15:10 · Score: 0

“We have great power in our hands—much greater than our competitors.”

Pffff. I hope it's just a translation artifact, because it sure sounds ridiculously childish.

Re:Your monthly algorithm tweak brought to you by. by Em+Adespoton · 2015-05-14 15:13 · Score: 1

Nah; next will be Wolfram, based on crowdsourcing.

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 15:22 · Score: 1

You said (emphasis mine):

But the larger computers these days are running tens to hundreds of thousands of processors in parallel.

I don't know what GPU they're using, but if they're Nvidia GeForce GTX Titan Z (700 series; released in March 2014), then that could be 144 * 2880 * 2 = 829,440 shader cores, obviously in parallel (running at 705MHz to 876MHz).

That card can do 8.1 TFLOPs (single-precision), or 2.7 TFLOPs (double-precision). That means 144 of them could do over over 1.1 PFLOPs (single-precision). That's nothing to sneeze at.

p.s. I got my figures from here.

Re:Huh? This is not a very powerful computer. by petherfile · 2015-05-14 15:36 · Score: 1

More info on the specs of the "supercomputer" that TFA only glossed over:

The result is the custom-built supercomputer, which we call
Minwa . It is comprised of 36 server nodes, each with 2
six-core Intel Xeon E5-2620 processors. Each sever con-
tains 4 Nvidia Tesla K40m GPUs and one FDR InfiniBand
(56Gb/s) which is a high-performance low-latency inter-
connection and supports RDMA. The peak single precision
floating point performance of each GPU is 4.29TFlops and
each GPU has 12GB of memory. Thanks to the GPUDirect
RDMA, the InfiniBand network interface can access the re-
mote GPU memory without involvement from the CPU. All
the server nodes are connected to the InfiniBand switch.
Figure 1 shows the system architecture. The system runs
Linux with CUDA 6.0 and MPI MVAPICH2, which also
enables GPUDirect RDMA.

In total, Minwa has 6.9TB host memory, 1.7TB device
memory, and about 0.6PFlops theoretical single precision peak performance.

It's not that powerful overall, but seems to be well thought out for what it is doing. I do the see point about fudging data somehow, they do provide a lot of information of what they supposedly did here

I don't know how this is verifiable, it's not like they have released source code or binaries for the software as far as I can tell.

CPU humor by Anonymous Coward · 2015-05-14 15:42 · Score: 1

Marginal improvements are worthwhile if the cost of failure is relatively large. c.f. branch prediction in computer architecture....

Your Mom's relatively large.

When she gets on the bus the ALU just gives up.

Her Branch prediction is that if she climbs branches they'll break off.

And I hear she likes JZ and XOR.

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 15:51 · Score: 1

The high end of the TOP 500 super computers use tens of thousands of GPUs (at least among those that use GPUs at all); for instance the Titan at ORNL has 18,688 nVidia Tesla K20's for a total of (roughly) 46 million CUDA cores.

One generally does not count individual CUDA cores, however (nor the equivalent for AMD GPUs).

More info on the ImagNet Competition by gront · 2015-05-14 15:52 · Score: 2

http://image-net.org/challenge...

Has the 2014 competition, including test images and validation images.

Browsing the images, and the 200 or so categories, "artichoke", "strainer", "bowl", "person", "wine bottle"... the challenge is a bit strange: A drawing of a person isn't a "person" category, but a bottle of boyle's cream soda is a "wine bottle".

And why is "artichoke" something we need to identify in photographs?

Re:More info on the ImagNet Competition by gl4ss · 2015-05-14 18:35 · Score: 1

it's not strange at all.
it's just artificial ;)
I mean, the categories aren't exact or well done or even fitting. it's still a manageable recognition contest, it just means that the results aren't useful in comparison if you want to use it to make searches from human queries...
I mean, humans already score lower than google, ms or baidu's engine does. which makes honing these partial percentages pretty stupid.
they should just devise a better contest quite frankly, with combination categories or lists of "whats in the picture in relation to each other", like "wine in a glass" vs. "wine glass and a wine bottle"

--
world was created 5 seconds before this post as it is.
Re:More info on the ImagNet Competition by Beezlebub33 · 2015-05-15 06:29 · Score: 1

they should just devise a better contest quite frankly, with combination categories or lists of "whats in the picture in relation to each other", like "wine in a glass" vs. "wine glass and a wine bottle"
Yes, they should 'just' create a better contest. The issue with that is that creating a contest, identifying objects, labels, testing, error-correcting, etc. is a slow, expensive, and unglamorous process. The ILSVC is only a couple of years old. And already it is showing its age; I really don't think that they expected it to be solved for much longer.
So, what's next in terms of contests? Probably a multi-object challenge, where a picture can have many objects; alternately the task would be to label not only the main object but also the parts. The previous were limited because there was a single primary labeled object. ILSVC doesn't even using a bounding box (which Pascal VOC did). So, the next step is to create a data set with lots of objects and have them all labeled, and the computer has to draw the boundary (not just the bounding box) around the object.
Deciding the performance is a pain in many of these contests, and eventually it becomes kind of arbitrary. How do you decide that a bounding box correctly covers the ground truth bounding box? Any measurement (i.e. 50% overlap) is going to be arbitrary. Doing it for object boundaries is going to be even harder

--
The more people I meet, the better I like my dog.

Re:Huh? This is not a very powerful computer. by Chalnoth · 2015-05-14 16:01 · Score: 2

Sure, but that's why Microsoft and Google will rapidly catch up if the numbers are real. Both employ lots of extremely talented and creative people exactly for solving problems like this, and the methods they use have been published.

Anyway, if they did really manage to produce some better algorithms, that's impressive and important work. But bragging about such a tiny computer seems seriously out of place.

Trick? by Anonymous Coward · 2015-05-14 16:05 · Score: 1

And you get +4 for claiming an algorithm improvement is a 'trick'?

Re:Trick? by Anonymous Coward · 2015-05-15 00:29 · Score: 0

+10,000,000,000 for patenting it.
Re:Trick? by Anonymous Coward · 2015-05-15 03:07 · Score: 0

That is increasing the sample size, not the algorithm. There would be a separate algorithm that took the 1.2 million photos and turned them into 1.2 billion photos. The algorithm then takes 1.2 billion instead of 1.2 million photos to "Learn".
I am also not sure how this makes them the "leader" in "computer intelligence", more like the leader in playing the childhood "Match" game or game show "Concentration". Matching pixels by their value is different than learning something and changing your behavior due to what you learned.
Glorified black-jack deck card sorter.
Re:Trick? by Anonymous Coward · 2015-05-15 04:28 · Score: 0

The technique works for any image set, and is as automatic as the core of the learning. How is this *not* algorithm improvement?

Oblig by Anonymous Coward · 2015-05-14 16:12 · Score: 0

Don't be too proud of this technological terror you've constructed. The ability to recognize an image is insignificant next to the power of the Force.

Re:Oblig by Anonymous Coward · 2015-05-15 03:01 · Score: 0

HTF is this oblig?

Re:Huh? This is not a very powerful computer. by Chalnoth · 2015-05-14 16:23 · Score: 2

a) Each CPU in these clusters typically has anywhere from 4-8 cores, and may support two or more times as many threads.

b) It's far, far more difficult to make full use of GPU hardware than CPU hardware. The best application for stressing GPU hardware is 3D graphics rendering, and even there if you run through the numbers, you find that it's rare that they really push half of their theoretical processing limit. General processing is significantly less efficient on GPU hardware, in particular because it's difficult to come up with computing problems that work well with the GPU's extremely limited I/O compared to their processing power. You need to do a lot of processing on each bit of data read or written to not be limited by either PCI Express bandwidth or video RAM bandwidth. Typical best-case real world scenarios for GPGPU programming put GPU's at closer to 10x or so the performance of CPU's, not 1000x as just looking at the number of shader cores vs. CPU cores might suggest. So they're quite powerful, but not overwhelmingly so. Whether or not they're worth it is highly dependent upon the application.

c) You can bet that companies like Microsoft and Google have a significant number of GPU's in use for specialized tasks.

Queue petty comments from envious Americans by Anonymous Coward · 2015-05-14 17:07 · Score: 0

they just can't let anyone else "win".

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 17:10 · Score: 0

That just makes the real stuff, their algorithms, even more impressive.

And... by Anonymous Coward · 2015-05-14 17:44 · Score: 0

And...he's humble.

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 17:49 · Score: 0

But if there's any substance to these claims about speech recognition...

Baidu considers a single month advantage consisting of a fraction of a percent better in image recognition as "leading the race in computer intelligence" and being "much greater than our competitors". Due to that alone any claims they make should have a null hypothesis that they are completely full of it.

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 18:03 · Score: 0

Until the employ me they will never be able to catch up

Relative improvement by ziggystarsky · 2015-05-14 18:53 · Score: 2

Going from 99.5 to 100.0 percent is extremely impressive, while going from 50.0 to 50.5 is probably just noise.

I wonder if it can outdo Google by Anonymous Coward · 2015-05-14 19:20 · Score: 0

http://xkcd.com/1444/

I have a better test that proves Google is No.1 by dsmatthews9379 · 2015-05-14 19:30 · Score: 1

Do an image search for "man in a purple hat holding a watermelon" Google's results are the most intelligent followed by Bing with Baidu a long way back in third.

Once again, computers beat humans by penguinoid · 2015-05-14 19:41 · Score: 1

First they beat us at math. Then at strategy games. Now they beat us at one of the few things we still did better, visually distinguishing apples from oranges.

--
Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways

now lets consider the next dimension ... time... by swframe · 2015-05-14 19:56 · Score: 1

All of them should have their system train on several seconds of video to identify action: drinking, running, etc. Next, train on sequential actions to identify cause and effect. Next, train on movie,tv,news,youtube,cctv audio->text/dialog/script to identify the relationship between words, actions, cause and effect. Finally, they should build a chatbot that maps input words to actions, actions to cause, cause to effects, etc until they get back to output words. It wouldn't pass a turing test but it would be awesome. For example: Human: If I am in a dark room and I turn on the light, what will happen? If I kiss my wife, what will she do? If I get on one knee and give someone a ring, what is happening? It probably would also help with language translation: map english words to actions in english movies, map similar actions in russian movies to russian words.

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 21:06 · Score: 1

You do realize WHO is actually working at Baidu right? A Mr. Andrew Ng.

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 21:29 · Score: 0

Actually I would surprised if Baidu didn't have much more women at work than either Google or Microsoft since China has state policies promoting gender equality.

Re:Huh? This is not a very powerful computer. by Anonymous Coward · 2015-05-14 21:38 · Score: 0

Considering most Google data centers are designed with high redundancy in mind, it is hard to use them as a baseline for anything.

Google take "RAID" (RAIC?) to the next level.

It can also easily be something in the software side of things as well.

It will likely be beaten at some point, then they will swing back, then Microsoft will be like, "guis pls slow down", then they will fake numbers because why not it is Microsoft, they did it before for Xbox One cloud rendering.
It is always a constant race of tweaking.

Regardless, it won't matter because humans are terrible at asking for reliable search parameters most times, which is why you get about 50 other unrelated image terms in your searches because they have to assume you mean every context of the word(s) you queried.

Re:Your monthly algorithm tweak brought to you by. by Anonymous Coward · 2015-05-14 22:21 · Score: 0

But China doesn't have as many women because of the one child policy and female infanticide.

Chinese Supercomputer by Anonymous Coward · 2015-05-14 22:32 · Score: 0

It still mistook a subway carriage for a toilet.

95.42 is "much greater" than 95.2? by Anonymous Coward · 2015-05-14 23:46 · Score: 0

Aren't Asians supposed to be good at math?

LOL China by Anonymous Coward · 2015-05-14 23:59 · Score: 0

So desperate to exaggerate the size of their ePeen

Beating humans? by Anonymous Coward · 2015-05-15 01:12 · Score: 0

If they "outperformed humans", then what's the basis of the truth set? Wasn't it humans who figured out how the images SHOULD have been categorized?

it was less about having more CPU power, then .... by WindBourne · 2015-05-15 01:23 · Score: 2

the fact that the head guy at Baidu now, came from Google. Basically, he took Google's technology and then was funded by China's gov ( who is behind Baidu's funding on this ).

Hopefully, someday soon, the west will realize that hiring Chinese means simply giving your technology over to the CHinese gov.

--
I prefer the "u" in honour as it seems to be missing these days.

The improvement is about 5% by pscottdv · 2015-05-15 01:48 · Score: 1

0.24 is about 5% of 4.82

--

this signature has been removed due to a DMCA takedown notice

Skynet by ArcadeMan · 2015-05-15 02:01 · Score: 1

Do you want Skynet? Because that’s how you get Skynet.

--
Get free satoshi (Bitcoin) and Dogecoins

Re:Your monthly algorithm tweak brought to you by. by LordLimecat · 2015-05-15 02:41 · Score: 1

China has state policies on human rights and freedom of religion, as well.

Re:Your monthly algorithm tweak brought to you by. by Beezlebub33 · 2015-05-15 06:14 · Score: 2

Okay, so we have a benchmark where the bog-standard human being scores 94.9%.

Yes, and now the algorithms are better. More importantly, the 'standard human' only does that when it is paying attention, which it can't do for more than 15 minutes or so. The computer does it day in, day out, forever. And it will get better over time.

Then in February (that's three months ago), Microsoft reports hitting 95.06%; the first score to edge the humans. Then in March, Google notches 95.18%. Now it's May, and Baidu puts up a 95.42%. Meh. Swinging dicks with big iron are twiddling with their algorithms to squeeze out incremental, marginal improvements on an arbitrary task.

You denigrate their work, but that's the way science works: incrementally almost all the time. In any field, you will see tweaking, slight improvements, variations, and a couple of new ideas. And then one of the researchers will hit on the next big idea. So what? What the hell have you done? You're just being a dick.

“Our company is now leading the race in computer intelligence,” said Ren Wu, a Baidu scientist working on the project. ... “We have great power in our hands—much greater than our competitors.”

I presume that next month it will be IBM boasting about "leading the race" and being "much greater than their competitors". The month after that it will be Microsoft's turn again. Google will be back on top in August or so...unless, of course, some other benchmark starts getting some press.

First, what they are doing is very hard. So, yeah, doing 0.25% better than someone else is a big deal. Let's see you do better.

Second, look at the performance over time. There was the NIST handwriting sets, and then the Stanford data sets, then the 'standard' was the PASCAL Visual Object Challenge and people were slowly improving to the point that someone else needed to step up and provide a better standard (more categories and more examples of each). And that was the ILSVC, and now we're down to the last couple percent on those. The next set will be bigger and harder. And performance will improve on that one too. That's expected and a good thing. Image recognition is stunningly hard; thanks to the hard work by these researchers it's gotten a lot better.

here's your obligatory XKCD

--
The more people I meet, the better I like my dog.

"poorly as possible..." Re:Your maths is off... by Fubari · 2015-05-15 07:00 · Score: 1

Perhaps CXIX / CXXV (ratio of roman numerals) instead of some more common representation like like 95.2% ?

Because this is Slashdot and it is required that all stories be written as poorly as possible.

Same ballpark by DrYak · 2015-05-16 05:28 · Score: 1

Exactly what I though too:

Meh, they are all basically in the same ballpark (including humans). No breakthrough achievement. Wake me when some one achieve 10x better than humans and competition.

--
"Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]

Slashdot Mirror

Baidu's Supercomputer Beats Google At Image Recognition

115 comments