Baidu Forced To Withdraw Last Month's ImageNet Test Results
elwinc writes: Back in mid-May, Baidu, a computer research and services organization in Mainland China, announced impressive results on the ImageNet "Large Scale Visual Recognition Challenge," besting results posted by Google and Microsoft.
Turns out, Baidu gamed the system, creating 30 accounts and running far more than the 2 tests per week allowed in the contest.
Having been caught cheating, Baidu has been banned for a year from the challenge. I believe all competitors are using variations on the convolutional neural network, AKA deep network. Running the test dozens of times per week might allow a competitor to pre-tune parameters for the particular problem, thus producing results that might not generalize to other problems. All of which makes it quite ironic that a Baidu scientist crowed "Our company is now leading the race in computer intelligence!"
That's what I always say.... (/sarcasm)
"File to fit, pound to insert, paint to match" - Aircraft Maintenance 101
Chinese company caught cheating? NO WAY!
Seriously though, raise your hand if you're surprised.
...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
This reminds me of some Chinese ball screws (rotary to linear motion components) my company once ordered from Alibaba. The companies had pictures, drawings and put together quotes for parts but then delivered samples that were just totally totally useless. Some of these 'precision' parts looked like they had been made with a file. It just didn't make any sense that they would waste their and our time on such clearly incompetent products.
But when you go there you realise the problem. It is basically an economy in a state of hyper competition. There is so much competition that people will just try anything get ahead, completely oblivious to the wider problem or goal they are trying to solve. You can see that in how the government had to rationalise the solar industry because nobody could make any money. They are just really really crazy competitive.
The trouble though is that there are now many good Chinese engineers who know what they are doing but are still hyper competitive. I really don't know how us westerners with our 40hr work weeks, healthcare and pensions are going to eventually compete with that until we too are faced with the desperation of trying to escape from abject poverty along with 1 billion other people.
If their computers had come up with the cheating scheme on their own, in response to a question on how to improve their ranking in image recognition, then they WOULD be at the lead in AI research. I dont think that was true, sadly. This was just a Stupid Human Trick
People growing up under oppressive governments have much fewer problems with cheating — because cheating government is a fair game. It rubs off — and the attitude is quickly extended to non-governmental institutions large and even smaller ones.
This is not "racism" — ex-Soviets like myself often have the same problem... A cheating Western student fears (or used to fear) the shame of being exposed. A Chinese — or a Soviet — fears merely getting caught. Like a speeding ticket — there is no shame in driving fast, only in being stopped by "the bear".
China today uses drones to catch cheaters — America had not felt the need for such measures. Perhaps, it was a foolish attitude, because we the immigrants bring all our traits to the "wonderful tapestry of diversity", not just the good ones...
Anybody dealing with Chinese companies (or Russian ones, if you can find any), ought to be careful and not depend merely on trust.
In Soviet Washington the swamp drains you.
You don't understand - it wasn't the researchers who created the accounts, it was the AI! They created a sentient AI that figured the best way to win the contest was to cheat.
They'll just go in and steal the research from another competitor and call it their own. Cheating and espionage are familiar bedfellows.
Harrison's Postulate - "For every action there is an equal and opposite criticism"
Such cheat have been used for years in the field. Before ImageNet, there was the Pascal VOC challenge with about the same rules, and I'm pretty sure all winners were optimizing the hyperparameters their submission on the test dataset.
Seriously, as long as computer vision benchmark are based on a single train/test split, there will be such abuses. If there were several splits with meaningful statistics computed on it, I would be less worried by the overfitting you get by optimizing the hyperparameters.
But hey, you're never gonna make it to CVPR without tunning your method so as to fool reviewers that it performs much better than the state of the art. 0.1% for a good idea, 99.9% for engineering tricks.
Video of some good progressive thrash music
Maybe that's appropriate punishment for children, but these are professional scientists. The only reason nobody has the brass to ban them for life is because their country owns us.
Baidu isn't just "a computer research and services organization", they're the Chinese version of Google. They're a massive company with eight billion USD in revenue last year. The headline is either misleading or completely clueless.
Just because I can hook a shark from a boat, I do no offer to wrestle it in the water.
Surely you jest!
I am very small, utmostly microscopic.
Before I moved over to CS:GO, I was still holding onto CS 1.6. Lots of hackers had the alias with "Baidu" in their name...I know it's not relevant, however this does not surprise me one bit that the company "Baidu" is cheating themselves.
did you forget there are 4x as many Chinese...?
China is a fast-moving, up-and-comer nation in the modern sense, but they lead the way in air pollution. Some mock our nation's EPA, but you can thank them, and other local entities like them, who put air and water quality above corporate profits, despite the many complaints from the largest abusers and other overly friendly corporate shills and lackeys. Allowing businesses to run amok in the name of a few low-paying jobs and letting them skip out on paying a fair share in taxes is how these things happen. Also, the cheating thing. but that is not new and not unique. Just very very funny when just the other week they crowed long and hard about beating a Google at a search or whatever. Priceless! Next up; China solves the problem with nuclear fission reactors by building them out of thousands of live people who are yelled at to contain the plasma or be fired from their sweet 40 cents a day job.
This is the NSA, we're gonna geet U h@x0r5! Also, what is a h@x0r5?
Message from the team in question:
Dear ILSVRC community,
Recently the ILSVRC organizers contacted the Heterogeneous Computing team to inform us that we exceeded the allowable number of weekly submissions to the ImageNet servers (~ 200 submissions during the lifespan of our project).
We apologize for this mistake and are continuing to review the results. We have added a note to our research paper, Deep Image: Scaling up Image Recognition, and will continue to provide relevant updates as we learn more.
We are staunch supporters of fairness and transparency in the ImageNet Challenge and are committed to the integrity of the scientific process.
Ren Wu – Baidu Heterogeneous Computing Team
So, while they deserve the year ban, the apology is nice. It's a shame we can never know what results a fair competition could have yielded ... and an even bigger shame that the media misreported Baidu as overpowering Google. I suppose the damage is done and the ILSVRC has made the right choice.
...
Perhaps I'm misunderstanding the classification problem but why isn't this run like most other classification problems (like Netflix and many other data challenges) where you get ~80% for training and the remaining 20% are held back for the final testing and scoring? Is the tagged data set too small to do this? Seems like wikimedia would contain a wealth of ripe public domain images for this purpose
My work here is dung.
It's china. They probably also stole the tech used in that from a western company.
Westerners (and especially Americans) are self-righteous when cheating, and are convinced that until they admit to cheating, it did not happen.
The Chinese are not, therefore they are called cheaters.
Just watch "Game Over" about the way IBM cheated in the Kasparov vs Deep Blue match.
It will be difficult initially...probably in the next iteration.
Access to the tagged data is (supposed to be) limited so that competitors don't try, try, try again until they can fine tune their software to fit the contest standards.
The car equivalent would be Baidu submitting 500 different cars to a "one car per competitor, mystery contest" and then achieving victory because they were the only ones who submitted a monster truck for what turned out to be a demolition derby.
China will always cheat. That is why we also need to be aware of what is going on with them at all time. This is no different than how they act with their gymnasts. Or how they act in space, or on the sprately islands that belong to other nations.
I prefer the "u" in honour as it seems to be missing these days.
except that they admitted NOTHING. They are claiming that it was a mistake, not out and out lying and cheating. Good Lord, they sound like GM, Daimler, Audi, Toyota, Putin, etc.
I prefer the "u" in honour as it seems to be missing these days.