Extreme Complexity of Scientific Data Driving New Math Techniques

← Back to Stories (view on slashdot.org)

Extreme Complexity of Scientific Data Driving New Math Techniques

Posted by Soulskill on Friday October 11, 2013 @09:59AM from the how-do-you-process-twelve-billion-data-points dept.

An anonymous reader writes "According to Wired, 'Today's big data is noisy, unstructured, and dynamic rather than static. It may also be corrupted or incomplete. ... researchers need new mathematical tools in order to glean useful information from the data sets. "Either you need a more sophisticated way to translate it into vectors, or you need to come up with a more generalized way of analyzing it," [Mathematician Jesse Johnson] said. One such new math tool is described later: "... a mathematician at Stanford University, and his then-postdoc ... were fiddling with a badly mangled image on his computer ... They were trying to find a method for improving fuzzy images, such as the ones generated by MRIs when there is insufficient time to complete a scan. On a hunch, Candes applied an algorithm designed to clean up fuzzy images, expecting to see a slight improvement. What appeared on his computer screen instead was a perfectly rendered image. Candes compares the unlikeliness of the result to being given just the first three digits of a 10-digit bank account number, and correctly guessing the remaining seven digits. But it wasn't a fluke. The same thing happened when he applied the same technique to other incomplete images. The key to the technique's success is a concept known as sparsity, which usually denotes an image's complexity, or lack thereof. It's a mathematical version of Occam's razor: While there may be millions of possible reconstructions for a fuzzy, ill-defined image, the simplest (sparsest) version is probably the best fit. Out of this serendipitous discovery, compressed sensing was born.'"

107 comments

Min score:

Reason:

Sort:

Re: first post by Anonymous Coward · 2013-10-11 10:09 · Score: 0

You win the internets
I dunno about you... by Wolfrider · 2013-10-11 10:09 · Score: 1

...but I don't think I'd want my doctor working from a "fuzzy logic" MRI if I had (God forbid) a BRAIN TUMOR or something...

--
.
== WolfriderV6 == I'm willing to admit that *I just might* be wrong... Are you??
1. Re:I dunno about you... by almitydave · 2013-10-11 10:22 · Score: 4, Funny
  
  Yeah, my doctor couldn't see enough detail in my head x-ray, so he used Photoshop's "content-aware fill" to fix it, and now apparently I need surgery to remove the 3rd half of my brain. I get to keep the 2 extra eyeballs, though.
  (actually, I really really want to see that applied to medical x-rays)
  
  --
  my, your, his/her/its, our, your, their
  I'm, you're, he's/she's/it's, we're, you're, they're
2. Re:I dunno about you... by lgw · 2013-10-11 10:25 · Score: 4, Funny
  
  OF course it works. "Zoom! Enhance!" If TV hasn't taught me that "enhance" works reliably, then TV has taught me nothing.
  
  --
  Socialism: a lie told by totalitarians and believed by fools.
3. Re:I dunno about you... by Anonymous Coward · 2013-10-11 10:33 · Score: 0
  
  The specifics ITFA aside, aren't we really talking about an evolution of statistics, which has firmly established it's role in science?
4. Re:I dunno about you... by mmell · 2013-10-11 10:52 · Score: 1
  
  Hey, if that's all they have to work with (given that current imaging technology is not up to the standard found aboard a Federation starship) - unless you'd rather your surgeon used a divining rod? A surgeon should be aware of the difference between a raw and an enhanced image, and I'm pretty sure that some data is better than none.
5. Re:I dunno about you... by Artifakt · 2013-10-11 11:00 · Score: 1
  
  ... but I don't think I want a military intelligence specialist who has been ordered to find weapons of mass distruction on satellite photos working them over with this sort of software either ...
  
  --
  Who is John Cabal?
6. Re:I dunno about you... by Anonymous Coward · 2013-10-11 11:04 · Score: 1
  
  Too true. And don't forget, the zoom is infinite and the enhance takes a muddy, motion-smeared blob and gives you a professional quality portrait. "Look--the thief has dandruff!"
7. Re:I dunno about you... by timeOday · 2013-10-11 11:14 · Score: 2
  
  Have you ever played with the compression level on jpg? At some point, enough is enough. Now instead of lossy compression, imagine we're talking about how much radiation to shoot into your nads to get a clean xray. There are diminishing returns on image quality for each doubling of the radiation. Are you still so sure you want to turn it up to 11?
8. Re:I dunno about you... by timeOday · 2013-10-11 11:20 · Score: 1
  
  Let's say the analyst has to search a huge area for mobile launchers, and the imagery comes from a satellite with finite bandwidth. Would you rather he search good-quality images of the whole area, or fantastic-quality images of a tiny fraction of the area?
9. Re:I dunno about you... by Arkh89 · 2013-10-11 11:30 · Score: 2
  
  That is NOT the way to understand these sets of techniques. Candes, Tao and Donoho's works are basically about saying : what is the minimum number of measurements that I have to do to make sure that the reconstruction of the signal will be sufficient (for a given task), assuming that the signal has some known properties?
  Let's say you hear the sound of horseshoes while walking in a street, if I ask you what is the color of the coat of the animal, you won't probably start by saying "red" or "blue". This is because you know already some of the classical equine coats colors which means you technically need less information to find the real color.
  This technologies can also help for q signal corrupted by noise since the properties of the first might be, in some way, orthogonal to the last, leading to a clean removal.
10. Re:I dunno about you... by icebike · 2013-10-11 12:16 · Score: 1
  
  Have you ever played with the compression level on jpg? At some point, enough is enough. Now instead of lossy compression, imagine we're talking about how much radiation to shoot into your nads to get a clean xray. There are diminishing returns on image quality for each doubling of the radiation. Are you still so sure you want to turn it up to 11?
  Enough is not where we are at.
  The point is that THAT point hasn't been reached.
  When looking at a highly compressed image of a person's face, you still recognize the face, and there is no reason to speculate about a Nuclear Aircraft Carrier floating on the film of tears on the eyeball. Similarly, when looking at an MRI where you were only able to get a partial image, there is no reason to assume a third eye will somehow be missed in any brain scan that carries detail finer than an eye.
  Doctors aren't total idiots.
  
  --
  Sig Battery depleted. Reverting to safe mode.
11. Re:I dunno about you... by nashv · 2013-10-11 12:16 · Score: 1
  
  Yeah, because the knowledge the doctor is using to diagnose your brain tumour by eye is completely determinstic, right? Because that how human brains work huh...
  
  --
  Entia non sunt multiplicanda praeter necessitatem.
12. Re:I dunno about you... by icebike · 2013-10-11 12:24 · Score: 1
  
  Enhance 224 to 176. Enhance, stop. Move in, stop. Pull out, track right, stop. Center in, pull back. Stop. Track 45 right. Stop. Center and stop. Enhance 34 to 36. Pan right and pull back. Stop. Enhance 34 to 46. Pull back. Wait a minute, go right, stop. Enhance 57 to 19. Track 45 left. Stop. Enhance 15 to 23. Give me a hard copy right there.
  
  --
  Sig Battery depleted. Reverting to safe mode.
13. Re:I dunno about you... by icebike · 2013-10-11 12:36 · Score: 1
  
  As long as the minimal resolution was sufficient to allow the software to to deduce a launcher in spite of drop-out, clearly the most useful image would cover the area just large enough so that the software could deduce the existence of a launcher by running the above mentioned algorithm. There would be no point in having resolution sufficient to read a license plate, when what you are looking for is 30 feet tall, and has a known shape.
  The thing is, people using this imaging technique have to know what they are looking for, and have the luxury of adjusting the satellite for the most economical and useful image.
  When you only a poor MRI of a squirming child, and you don't know what exactly you are looking for, its a different matter.
  
  --
  Sig Battery depleted. Reverting to safe mode.
14. Re:I dunno about you... by icebike · 2013-10-11 12:42 · Score: 2
  
  But every once in a while, you'd be so screwed.
  Occam would surely ride in and save the day.
  
  --
  Sig Battery depleted. Reverting to safe mode.
15. Re:I dunno about you... by jrumney · 2013-10-11 14:01 · Score: 1
  
  Basically, this is an algorithm that, given a fuzzy image with some people looking shapes in it, produces an image with stick figures in their place. "Hey it produced a perfect match" you say, but actually you don't have a non-fuzzy image for comparison, so you don't realise that the one at the back chasing the others is actually Sasquatch, and as any child knows the stick figure match for Sasquatch needs more jagged lines..
16. Re:I dunno about you... by Anonymous Coward · 2013-10-11 19:36 · Score: 0
  
  I don't see how this works for any of these applications. Why would you want to see anything that isn't actually there?
17. Re:I dunno about you... by SuricouRaven · 2013-10-11 20:11 · Score: 2
  
  "Checked. Still no weapons of mass destruction."
  "Damnit... switch to a lower resolution and try again!"
18. Re:I dunno about you... by paskie · 2013-10-11 21:12 · Score: 1
  
  So what do you think the doctor works from now?
  
  --
  It's not the fall that kills you. It's the sudden stop at the end. -Douglas Adams
19. Re:I dunno about you... by cellocgw · 2013-10-12 01:28 · Score: 2
  
  but I don't think I'd want my doctor working from a "fuzzy logic" MRI if I had (God forbid) a BRAIN TUMOR or something...
  Then I got bad news for you: NMR imaging and CAT imaging depend on algorithms with names like "Maximimum A Priori Likelihood Estimation." They *all* depend on making the best bet as to what the reconstructed image should be. It just turns out (thanks to that thing called mathematical statistics) that the correct solution is overwhelmingly positive. "Fuzzy Logic" does not mean what I think you think it means, i.e. "some random drunk posting to /."
  
  --
  https://app.box.com/WitthoftResume Code: https://github.com/cellocgw
20. Re:I dunno about you... by K.+S.+Kyosuke · 2013-10-12 01:54 · Score: 1
  
  As far as X-rays are concerned, perhaps developing better contrast agents would be a better solution to get pictures sharper where you need it.
  
  --
  Ezekiel 23:20
21. Re:I dunno about you... by K.+S.+Kyosuke · 2013-10-12 01:58 · Score: 1
  
  I liked this one more. :-)
  
  --
  Ezekiel 23:20
22. Re:I dunno about you... by Anonymous Coward · 2013-10-12 02:28 · Score: 0
  
  If you're worried about radiation stick to MRIs, ultrasounds and avoid CAT scans and other X-Ray scans.
23. Re:I dunno about you... by TheLink · 2013-10-12 02:30 · Score: 1
  
  Is there a comparison between a partial scan with this processing and a full scan?
  --
  
  Too many replies beneath your current threshold
24. Re:I dunno about you... by Anonymous Coward · 2013-10-12 15:39 · Score: 0
  
  Somebody who works with MRI here...MRIs typically have a tradeoff between detail and scan time, i.e. in rough terms they could fill in much more detail/be less noisy *if* the subject stayed still long enough. When the subject is a beating heart, there just isn't enough time to acquire everything*. The point of the article is that they weren't expecting to get such good images with incomplete data, but in many cases Occam's Razor works. This also applies to CT, where the goal is to get a usable image with as little radiation as possible.
  * some of you might be wondering "what if you sample different points at different times, use multiple heartbeats?". That's an active area of research, and compressed sensing has a role to play there. Many people think of it as a general framework for filling in data gaps in a sensible way.
25. Re:I dunno about you... by RockDoctor · 2013-10-19 06:36 · Score: 1
  
  As TFS says,
  
  [images] such as the ones generated by MRIs when there is insufficient time to complete a scan.
  (my emphasis)
  So, you're in one of two situations : you've got an acute problem - suffocation, massive bleeding, something really, really time-critical - and if they don't stop the MRI now and do something else now, then you're dead meat ; or, there has been some mechanical or financial issue with the machinery and someone is trying to save money by not re-doing the scan. In the one case, you've got a choice between a perfect MRI of a corpse, or an imperfect MRI of someone who's alive. In the other case, well ,someone, somewhere is playing with your life for their financial gain. Do you suffer under some sort of capitalist healthcare system there or something equally barbaric? (I am of course writing from a handful of miles away from where MRI was developed, under a socialized healthcare system. I used to ice climb with people who worked on the development.)
  I've never heard of a brain tumour where taking an extra hour to get a better scan would be life-critical. An extra week, yes that can be an issue. But an hour ; n/a.
  
  --
  Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"
Can we apply this to Slashdot comments? by Anonymous Coward · 2013-10-11 10:09 · Score: 1

Ya, know, filter out all the noise and controversy, and get an output which is purely liberal computer geek rants all in unified, biased agreement. That would be amazing!
1. Re:Can we apply this to Slashdot comments? by AK+Marc · 2013-10-11 10:30 · Score: 1
  
  How is it that all the conservative complain it's all liberals, and the liberals claim it's all libertarians? How can it be "all" groups at the same time?
  
  --
  Learn to love Alaska
2. Re:Can we apply this to Slashdot comments? by Anonymous Coward · 2013-10-11 10:37 · Score: 0
  
  How is it that all the conservative complain it's all liberals, and the liberals claim it's all libertarians? How can it be "all" groups at the same time?
  I love the posts that start with "Everyone one Slashdot thinks" when the author of the post clearly thinks differently.
3. Re:Can we apply this to Slashdot comments? by WillKemp · 2013-10-11 10:45 · Score: 1
  
  If we applied it to tfa, there wouldn't be anything left.
4. Re:Can we apply this to Slashdot comments? by Anonymous Coward · 2013-10-11 10:51 · Score: 1
  
  Come off it. No one on Slashdot likes those posts.
5. Re:Can we apply this to Slashdot comments? by AK+Marc · 2013-10-11 11:31 · Score: 1
  
  Everyone on slashdot thinks your comment is dumb.
  
  --
  Learn to love Alaska
6. Re:Can we apply this to Slashdot comments? by Anonymous Coward · 2013-10-11 17:02 · Score: 0
  
  Only natural since all ideologies are on the same side in the memetic war.
  The side of generalization, simplification, group pressure, mob rule, memetic centralization, inherent communitarianism by structure, inescapable dogma-creation and mythos-building which aggregates unrelenting with time.
  Any free thoughts lose when such is allowed to define the context, it is their enemy and is shouted down, you will find little trace of it in current public society at large. Not in “debates”, not in “reporting”, not in “governance”. Doesn't matter what any dead encrusted words says counter-wise, what they call themselves, or what ideals they claim to represent.
  Ideologies of all sorts are for farm animals bickering about being first in line to the slaughterhouse. Politics, religion, sport, fashion, anything and everything branded. Principles are shallow concepts stripping away context from thought and rendering it mush that is neither connected to the empirical reality nor the abstract reality.
  We have to discard this soon if we want to survive our own nature. The longer we do not do so the larger the gap between our powers and our abilities becomes.
Enough with this big data bullshit by Anonymous Coward · 2013-10-11 10:09 · Score: 3, Insightful

For fuck's sake.
These techniques of dealing with incomplete and unstructured data have existed for decades.
AI researches hyping absolutely everything about their field to get some funding is starting to get on my nerves.
1. Re:Enough with this big data bullshit by WillKemp · 2013-10-11 10:47 · Score: 1
  
  Yep. Just more of the usual "big data" bullshit hype. The sort of nonsense you'd expect from Weird mag.
2. Re: Enough with this big data bullshit by Anonymous Coward · 2013-10-11 18:10 · Score: 0
  
  Sparse coding is not AI. It is a genuinely new mathematical idea.
3. Re:Enough with this big data bullshit by Anonymous Coward · 2013-10-14 13:39 · Score: 0
  
  Or Keith Alexander.
  captcha: depraved
When you don't have all the data by Anonymous Coward · 2013-10-11 10:10 · Score: 1

Make assumptions
Amazing intuition by ZeroPly · 2013-10-11 10:11 · Score: 5, Funny

"They were trying to find a method for improving fuzzy images, such as the ones generated by MRIs when there is insufficient time to complete a scan. On a hunch, Candes applied an algorithm designed to clean up fuzzy images,[...]"

Wow! That would be the last thing I thought of in that situation...

--
Support microSD: in a post 9/11 world, it is unwise to carry your data on media that you cannot comfortably swallow.
1. Re:Amazing intuition by Anonymous Coward · 2013-10-11 10:43 · Score: 1
  
  Yup, quite the intuitive leap. I hope the Nobel committee knows how to reach him.
2. Re:Amazing intuition by Anonymous Coward · 2013-10-11 10:52 · Score: 2, Funny
  
  They were trying to reach him to talk to him. Oh a hunch, the Nobel committee applied a phone designed to reach people.
3. Re:Amazing intuition by Anonymous Coward · 2013-10-11 15:28 · Score: 4, Funny
  
  But it's even more amazing than that.
  The Nobel committee only had the first three digits of his phone (the area code), so they applied the same algorithm, and bam! Turns out it works just as well for phone numbers.
  They got him on the first ring too. But that part is just coincidence.
Comprehension? by Anonymous Coward · 2013-10-11 10:11 · Score: 0

They were trying to find a method for improving fuzzy images, such as the ones generated by MRIs when there is insufficient time to complete a scan. On a hunch, Candes applied an algorithm designed to clean up fuzzy images, expecting to see a slight improvement.
I think I'm having difficulty comprehending these two sentences. This person was trying to accomplish a task, and 'on a hunch' used a tool specifically designed to accomplish that task. How is that a hunch? Isn't that basic reasoning? If I needed to drive a nail into a wall, I would pick up a claw hammer, because it's a tool designed specifically for that task. It wouldn't be a 'hunch,' and I certainly wouldn't be surprised to find that the nail went into the wall.
1. Re:Comprehension? by Anonymous Coward · 2013-10-11 11:21 · Score: 0
  
  It wouldn't be a 'hunch,' and I certainly wouldn't be surprised to find that the nail went into the wall.
  It would be, and you would be, if you were trolling for grant money.
Enhance by Anonymous Coward · 2013-10-11 10:14 · Score: 0

Fir__ P___
1. Re:Enhance by Anonymous Coward · 2013-10-11 10:15 · Score: 0
  
  Fail.
2. Re:Enhance by Anonymous Coward · 2013-10-11 11:23 · Score: 0
  
  Fail.
  No, that's only one word and you changed letters.
  It's obviously "Firm Penis".
clear, but wrong by raymorris · 2013-10-11 10:15 · Score: 2

While there may be millions of possible reconstructions for a fuzzy, ill-defined image, the simplest (sparsest) version is probably the best fit."
Of the millions of possibilities, the sparsest is MOST likely. Perhaps it's twice as likely as any other possibility. That still means it's 99.999% likely to be wrong.
As for the MRI, that fuzzy part is probably noise that can be deleted, except when it's a tumor.
"
1. Re:clear, but wrong by whit3 · 2013-10-16 10:40 · Score: 1
  
  While there may be millions of possible reconstructions for a fuzzy, ill-defined image, the simplest (sparsest) version is probably the best fit."
  Of the millions of possibilities, the sparsest is MOST likely. Perhaps it's twice as likely as any other possibility. That still means it's 99.999% likely to be wrong
  I interpreted this to be a description of maximum-entropy filtering (i.e. making an output image with least information, consistent with input image with sparse information content overlaid with full-textured noise.)
  It certainly works for (for example) starfields seen in a badly focused telescope; NASA used this for Hubble reconstructions.
  Every possible reconstruction will have some 'wrong' content, because noise is guaranteed (if only by the quantum uncertainty principle). That doesn't make the maximum-entropy filter useless, it just means the utility depends on the subject's compliance with the minimum-information assumption.
RIPE FOR FRAUD !! by Anonymous Coward · 2013-10-11 10:16 · Score: 0

Not that that is new !! Math is hard !! And not just for girls !! No one can understand GPL and you expect them to understand MATH ?? No Way Hozay !!
educated guess by themushroom · 2013-10-11 10:16 · Score: 1

Interpolate and extrapolate

--
Laughter is the Spackle of the Soul.
1. Re:educated guess by Anonymous Coward · 2013-10-11 17:07 · Score: 0
  
  But never copolate.
RBTL by tsprig · 2013-10-11 10:18 · Score: 0

R...
B......
T..
L....
We are the ones in need of a network by Vesvvi · 2013-10-11 10:20 · Score: 5, Insightful

I like some of the more subtle details in the title and summary: new math "techniques", "researchers need new mathematical tools", etc.
I find it hard to believe that our sciences are driving the math fields, as mature and well-developed as the math community is. But it is true that existing knowledge and tools from mathematics drive huge advances in the sciences when they are brought to bear. The sad truth is that scientists just don't play terribly well with others (maybe no one does): interdisciplinary work is rare and difficult, and so we end up re-inventing the wheel over and over again. The reality is that the "wheel" being created by the biologist in order to interpret their data is a poor copy of the one already understood by the physicist across campus.
What can we do about this? I'm not sure, but I think it's safe to say that our greatest scientific advances in the next few decades will be the result of novel collaborations, and not novel math or (strictly speaking) novel science.
1. Re:We are the ones in need of a network by JanneM · 2013-10-11 11:42 · Score: 3, Informative
  
  I find it hard to believe that our sciences are driving the math fields, as mature and well-developed as the math community is.
  This has actually always been the norm. Physics has long driven mathematics research for instance; many areas of calculus were created/discovered specifically to solve problems in physics.
  
  --
  Trust the Computer. The Computer is your friend.
2. Re:We are the ones in need of a network by Anonymous Coward · 2013-10-11 13:44 · Score: 1
  
  I find it hard to believe that our sciences are driving the math fields, as mature and well-developed as the math community is.
  Sir, may I introduce you to the field of partial differential equations? I think you would find it absolutely fascinating!
3. Re:We are the ones in need of a network by Anonymous Coward · 2013-10-11 15:57 · Score: 0
  
  While mathematics has done a lot of great work on its own, and things that initially seem useless end up being quite useful down the line, that does not imply that mathematicians have developed everything before it is needed. Pure mathematicians take their own directions with things, which are sometimes immediately useful and sometimes not. There are still plenty of problems that come up in fields of science (especially physics) where the math field has not addressed yet and there suddenly is an immediate need for someone to fill in the gaps.
  Academic fields in general seem to have a lot of communication barriers, and the natural tendency of many researchers is to develop their own tools. That doesn't mean all researchers are like that though, and I'm used to having a couple applied mathematicians in our collaborations for code development, who are also in contact with more pure mathematicians to keep up to date on any new developments.
4. Re:We are the ones in need of a network by fph+il+quozientatore · 2013-10-11 20:10 · Score: 2
  
  I like some of the more subtle details in the title and summary: new math "techniques", "researchers need new mathematical tools", etc.
  
  The summary isn't too inaccurate; what they are talking about is compressed sensing https://en.wikipedia.org/wiki/Compressed_sensing, i.e., the search for sparse (as in: with few nonzero elements) solutions to underdetermined systems of nonlinear equations. "Sparse" is understood in suitable basis, so for instance for a sound it could mean few different frequencies. The problem in itself is NP-hard, but it turns out that in some cases of interest you can get the solution or a reasonable approximation by solving a convex programming problem (minimizing the 1-norm rather than the sparsity).
  
  --
  My first program:
  Hell Segmentation fault
5. Re:We are the ones in need of a network by epine · 2013-10-11 22:49 · Score: 1
  
  The problem in itself is NP-hard, but it turns out that in some cases of interest
  Perfect solutions are often NP-hard in systems where pretty-good solutions are nowhere close to NP-hard in many practical circumstances.
  The declaration of NP-hard is way overrated. We use it mostly because mathematics still can't chew "pretty good" in any rigorous way.
6. Re:We are the ones in need of a network by Frequency+Domain · 2013-10-12 03:17 · Score: 1
  
  I find it hard to believe that our sciences are driving the math fields, as mature and well-developed as the math community is.
  Reminds me of the anecdote about Prof. Rota when he was asked by a reporter why MIT didn't really have an applied math department. He responded, "We do! It's all of those other departments!"
7. Re:We are the ones in need of a network by Anonymous Coward · 2013-10-12 09:40 · Score: 0
  
  as mature and well-developed as the math community is
  Does this mean the same thing as completely divorced from reality? Or were you referring to some other aspect of the math community?
  It is the nature of mathematicians to live in fantasy worlds. Sometimes those fantasy worlds can be mapped onto the real world. But that almost always implies approximation (gasp! horror!).
  There are other problems with mapping a fantasy world onto the real world. For example, much of the Theory of Computation collapses into worthless garbage once one starts to think about the consequences of Thermodynamics and Entropy (all programs will halt!).
  What makes it even harder for the math community to work with the rest of society is that many people outside the community like their exposure to theory to be balanced with applications (and not just so-called "applications" that just map into other areas of mathematical theory), something the math community finds really, really, hard. This is otherwise known as "science": we like to base things on experiments and not just on theory.
MRI with FAKE data- WTF! by Anonymous Coward · 2013-10-11 10:46 · Score: 0, Offtopic

Does Slashdot ONLY promote junk science and maths. Frauds and cons? Yes, we all know about the concept of fractals, and how fractals can be used to create FAKE data that seems semantically pleasing to the Human eye. However, there is a world of difference between producing FAKE data that matches the pattern expectations of our brains, and thus looks correct, and REAL data that science (including medicine) requires.
I am reminded of a true story. In the early days of 'image enhancement', one city in the UK was regularly getting convictions for robbery from surveillance footage that was VERY unclear. They used a so-called specialist image processing company who took the footage, 'enhanced' it (just like you see in CSI) and proved in court the boys-in-blue had caught the guilty party. One problem. It later came to light that the police were giving the enhancement company a photograph of the suspect, and the enhancement company were merging a scan of this photograph with the so-called enhanced video frame. They CLAIMED that this was a legitimate mathematical technique.
Needless to say, with yet another proven case of forensic fraud in the UK 'justice' system, all previous convictions based on such 'evidence' were overturned. The UK police and prosecution bodies have a LONG history of faking forensic evidence- and using prosecution 'experts' spouting absolute nonsense fill with sciencey sounding terms to bamboozle a naive jury (or judge).
Filling in gaps in data DOES have a legitimate place in 'art', like when an old photograph or frame of a movie has a region (damaged or empty down to something like post-conversion 3D) that needs to be filled with a 'texture' that will look to the viewer like it belongs in shot. It has NO, repeat ***NO*** place in science.
Sadly 99.9% of scientists are VERY poor scientists with an extremely limited grasp of fundamental concepts of logic and mathematics. They can quote, repeat, and remember, but they cannot understand. Being the vast majority of ALL scientists, casual sheeple hear more NOISE (fake facts) from the crappy scientists than ever they hear true facts from the good ones. The pseudo science of EUGENICS, for instance, was proven to be total garbage by first class scientists in the UK and Europe at the beginning of the 20th Century, for instance, but was universally adopted by American scientists (who then directly influenced Hitler's scientists), and accepted as proven 'fact' by the population of the USA up through into the 1960s. Even today people like Bill Gates fund the same US Eugenic organisations that operated in the early 20th century, partnered with the Nazis, and pushed for forced sterilisation and medical experimentation on 'inferior' Humans.
Bill Gates continues the work of Eugenics via far more modern methods of societal manipulation of those Gates refers to as 'inferior'. Gates is behind the 'Common Core' curriculum dumbing down the US schools where most ordinary people send their kids, and Gates partnered with RUPERT MURDOCH of Fox News to create the inBloom full surveillance database of every child in the USA
Bill Gates created MSNBC, and you sheeple are constantly told that MSNBC is the very opposite to Fox News. Yet when Bill Gates partners with Murdoch to record intimate details about your child (including sexual development) and your family, and provides that information to any pervert who wants to target children with minimum risk of detection or punishment, you sheeple are supposed to NOT notice this unexplainable arrangement?
Gates even named the company overseeing the database inBloom- which is a pedophile codeword describing a child considered ripe for abuse.
1. Re:MRI with FAKE data- WTF! by Anonymous Coward · 2013-10-11 12:00 · Score: 0
  
  That was actually a fairly good post up until you decided to jump the shark and start ranting about Gates. Either it's a poor attempt at trolling, or the LSD started kicking in...
2. Re: MRI with FAKE data- WTF! by Anonymous Coward · 2013-10-12 01:36 · Score: 0
  
  odd, I thought it was pretty boring till he got to the gates part. He never got to the al gore is an apple board member though. nobody ever gets to that part.
Did he use Photoshop by avandesande · 2013-10-11 10:50 · Score: 1

Loading a MRI in to Photoshop and using a sharpening tool- novel concept!

--
love is just extroverted narcissism
The deeper the gold and diamonds. . . by djupedal · 2013-10-11 11:00 · Score: 1

The harder you have to dig.
Re: first post by Anonymous Coward · 2013-10-11 11:12 · Score: 0

who are these moderators who object to america first?!
These liberals never want to see anyone excel. The insist on equal outcomes not equal opportunity. Everyone has the chance to be first, several times a day here on Slashdot yet the conspiritors will not be happy until they crush your dreams under a hailstorm of moderation that would make lenin blush.
"Big Data" means nothing by reve_etrange · 2013-10-11 11:21 · Score: 1

What is Big Data? They say it is when your problem grows faster than your resources.
Yet, since the 70's we have the concept of NP-Hard: again your problem grows faster than your resources. We have always had "Big Data."

--
.: Semper Absurda :.
1. Re:"Big Data" means nothing by godrik · 2013-10-11 14:09 · Score: 1
  
  "Big Data" is actually a pretty clear problem. It is not when your problems grows faster than your ressources. It is when you are faced with processing massive amount of unstructured data that flow in your system. The data might be untrustable or forged or incomplete. You might want to read on the "Vs of Big Data", it describe pretty well the type of problem it encompasses. Obvisouly not everybody faces such difficult to process data.
  Essentially it is a big word you put to describe most modern data analytics. But most NP-Hard problem do not qualify as they are usually fairly static and the input of the problem is usually quite accurate (if the input is inaccurate, you'll end up doing heuristics anyway because you can't do much better anyway).
  But even some classical polynomial problem are relevant to Big Data. For instance, pick a regular problem in data analytics such as the "most common subset in transactions" and drop 10% more data to your current problem. How do you "update" your solution to the problem without recomputing everything? (I never studied that one, so maybe it is easy.)
  I agree with you on the fact that these problems are not new. Though their reach as grown tremendously in the last 5/10 years with the democratization electronic records.
2. Re:"Big Data" means nothing by reve_etrange · 2013-10-12 11:10 · Score: 1
  
  My whole research experience has been of large, noisy data which is processed with approximation algorithms for NP-Hard problems, so perhaps I am making too broad generalizations about the problems faced by most computer scientists. The hype is still inane given, say, the three decades-long efforts to characterize all protein structures.
  And it's true that there are more, more open data repositories than previously, and many fields of science have adopted submission to central, public repositories as a normal expectation for research work.
  As for updating solutions to large problems, I believe that sometimes it is possible to store data in an intermediate form so that a final analysis can be repeated cheaply, while other times updates must be handled with a statistical sampling approach and still other times they are simply impossible. It can certainly be troublesome. Protein structure alignments are good example of the latter category I think.
  
  --
  .: Semper Absurda :.
3. Re:"Big Data" means nothing by godrik · 2013-10-12 12:52 · Score: 1
  
  I am all with you on this. I was mostly designing approxiamtion algorithm for weird-ass scheduling problem in the past. Recently I have been hired by a university to conduct research on "Big Data" and the problems faced have little to do with Combinatorial Optimization anymore.
  The problem is more on the lines of: Here are the medical record from 20 hospital on the east coast. You got everything: patient files, radios, MRI, blood test, cardios, doctor notes, nurses notes (as text files or images). Your job is to figure "stuff" out of that mess.
  Or: Here comes all the logs of my comercial website, the price of all items, how the website was laid out, where the user clicked and scrolled, the commercial displayed on the site and the log of what they bought. How can I improve my bottom line?
  But I agree with you that the term "Big Data" have been so used recently that we are drowning in it. And everybody tries to claim that they are doing it to benefit from the hype. Just look at how many conferences and workshop are doing big data (in their title) now while there was none two years ago. (Hint: I stopped counting at 5 conferences and 12 workshops.)
career in NSA anyone? by stanlyb · 2013-10-11 11:24 · Score: 1

Anyone who manage to invent a new methods for analyzing such a big data will get job...you guess where....
1. Re:career in NSA anyone? by SuricouRaven · 2013-10-11 20:14 · Score: 1
  
  Unless your college roommate's father was a former Chinese national. Two degrees from potential espionage? No job.
Fascinating! by Fubari · 2013-10-11 11:36 · Score: 2

This! is the kind of article I joined slashdot to find out about.
I wish there was a way to mod actual articles +1 or -1 instead of just modding comments; or to at least toss the submitter a karma point or something.
1. Re:Fascinating! by gl4ss · 2013-10-11 15:50 · Score: 1
  
  go to firehose.
  on the other hand, I would have liked to see actual pictures fixed by his algorithm.
  because without those, the article feels like bullshit. there's even a video in the article. but no "hey here's a csi zoom of this pic x".
  or maybe he has different definition for "perfectly rendered".
  
  --
  world was created 5 seconds before this post as it is.
pictures please? by Anonymous Coward · 2013-10-11 11:46 · Score: 1

Could someone point at some pictures he tried to clean up with these algorithms, and how they got cleaned up, and what existing picture manipulation tools use the same algorithms? And saying what the algorithms were would be good too.
1. Re:pictures please? by anubi · 2013-10-11 12:27 · Score: 1
  
  Anyone have a pointer to the algorithm? I am suspecting some matrix operator that looks at each pixel and its neighbors.
  
  To me, there seems to be plenty if information on recorded video, as it contains previous as well as future frames that should contain sufficient information to provide considerable clarification of a present image frame. Anyone have info on anyone doing this?
  
  --
  "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]
2. Re:pictures please? by GiganticLyingMouth · 2013-10-11 13:14 · Score: 1
  
  To me, there seems to be plenty if information on recorded video, as it contains previous as well as future frames that should contain sufficient information to provide considerable clarification of a present image frame. Anyone have info on anyone doing this?
  This is used already in multi-frame superresolution. TFS seems to be talking about compressive sensing, which is a completely different beast. Compressive sensing is based on assuming sparseness to solve an underdetermined system of linear equations. It doesn't always work (as it's not always a valid assumption), but when it does you can get very impressive results. That is to say, if you have some underdetermined system of equations, it'll have infinite possible solutions. This obviously doesn't lend itself well to getting a good answer from it, but by imposing the condition of sparseness, you can arrive at a (very close to correct) solution.
3. Re:pictures please? by Anonymous Coward · 2013-10-11 17:16 · Score: 0
  
  The paper from Candes is here:
  http://www-stat.stanford.edu/~candes/papers/ExactRecovery.pdf
  He was not actually looking at MRI images, but on a test image.
  The application to MRI was done a bit later:
  http://www.eecs.berkeley.edu/~mlustig/CS/SparseMRI.pdf
  http://www-mrsrl.stanford.edu/studygroup/2/Files/Block_2007_Undersampled.pdf
Old news on old news by key45 · 2013-10-11 11:52 · Score: 3, Funny

4 years ago, Slashdot ran this exact same story http://science.slashdot.org/story/10/03/02/0242224/recovering-data-from-noise about Wired running this exact same story: http://www.wired.com/magazine/2010/02/ff_algorithm/all/1
same answer as any political question by raymorris · 2013-10-11 11:59 · Score: 1, Offtopic

Given the overall percentage of libertarians (1%?) and the overall percentage of liberals (48%?), clearly it isn't anywhere near "all libertarians". This proves that:
The liberals are completely wrong.
That's the only conclusion that can be drawn by anyone who can follow simple logic. People who can follow simple logic knew that already, though.
I'm KIDDING you hyper-sensisitive liberal weenie who is furiously clicking the "reply" button. Sometimes liberals are right, even Obama. Obama was right when he said the lack of a federal budget was a sign of no leadership from the president. Obama was right when he said if the economy isn't back on track in early 2012 he shouldn't be re-elected. Obama was right when he said it would be irresponsible of him to run for president because a presidential candidate should "know what you're doing". Liberals are very often right.
1. Re:same answer as any political question by Anonymous Coward · 2013-10-12 01:24 · Score: 0
  
  Given the overall percentage of libertarians (1%?) and the overall percentage of liberals (48%?), clearly it isn't anywhere near "all libertarians". This proves that:
  Libertarians talk too much?
informercial by stenvar · 2013-10-11 12:11 · Score: 4, Insightful

The whole article is just a sales job:

That is the basis of the proprietary technology Carlsson offers through his start-up venture, Ayasdi, which produces a compressed representation of high dimensional data in smaller bits, similar to a map of London’s tube system.
The first place to look when people make such claims is at their publications, neither Gunnar Carlsson nor Simon DeDeo have significant publications that show that their approach works on real data or standard test sets. The statements in the article that these kinds of approaches are new are also bogus (I don't know whether they are deceptive or ignorant).
Lastly, from a Stanford math professor, I would expect better citation statistics overall; I don't know what's going on there.
http://scholar.google.de/citations?user=nCGwiu0AAAAJ&hl=en
http://scholar.google.de/scholar?as_ylo=2009&q=author:%22gunnar+carlsson%22&hl=en&as_sdt=0,5
1. Re:informercial by RandCraw · 2013-10-11 13:44 · Score: 1
  
  CompTop: Applied and Computational Algebraic Topology
  http://comptop.stanford.edu/
  You need to do more than, "Google: I feel lucky".
2. Re:informercial by Anonymous Coward · 2013-10-11 13:56 · Score: 3, Informative
  
  What are you smoking? 1877 citations since 2008 isn't a good citation statistic? More importantly, judging someone's research value by absolute citation statistic is quite silly; he is a full Stanford Professor for his accomplishments, intellect, and personality (I hear he is a good advisor).
  While the article is quite a promotional piece, you don't know much about the field. Gunnar Carlsson and his group have advanced computational topology moreso than any other. He came up with the concept and way to compute persistent homology, one of the actually useful and computable advancements that has come out of the topology field. It allows you to reason about clustering much better than any adhoc statistical measure.
  The current computational topology tools implemented by grad students today, like PLEX, dont scale very well. His group had some proprietary advancement that scales well, and spun off the data science company.
  If you would like two links that are actually informative about what Gunnar does:
  http://comptop.stanford.edu/
  http://www.ams.org/journals/bull/2009-46-02/S0273-0979-09-01249-X/S0273-0979-09-01249-X.pdf
3. Re:informercial by stenvar · 2013-10-11 14:01 · Score: 2
  
  How does a project web page make up for the lack of relevant peer reviewed publications or lack of citations?
  Where are the published results on real-world data sets? Or do you believe that a lot of verbiage is sufficient?
4. Re:informercial by RandCraw · 2013-10-11 14:39 · Score: 1
  
  And you're claiming the work is invalid because you're unimpressed by the lack of pubs of a new research program. At Stanford?
  In short, the Wired article is interesting while your criticism adds nothing. My advice, FWIW: if you must criticize, be specific. Don't gainsay with, "Your work is uninteresting because I'm unconvinced."
  That makes you sound like a Creationist.
5. Re:informercial by Anonymous Coward · 2013-10-11 15:13 · Score: 0
  
  And you're claiming the work is invalid because you're unimpressed by the lack of pubs of a new research program.
  Where did I say the work was "invalid"? I pointed out that there is little published, even less cited, and that there are no significant published benchmarks.
  
  FWIW: if you must criticize, be specific. Don't gainsay with, "Your work is uninteresting because I'm unconvinced."
  I can't tell whether the work is interesting or uninteresting because there is almost nothing published on it.
6. Re:informercial by Anonymous Coward · 2013-10-11 16:16 · Score: 1
  
  There's plenty of publications if you know where to look.
  Most of these papers are published in peer review journals: http://comptop.stanford.edu/preprints/
  Robert Ghrist's publications are another good source of material: http://www.math.upenn.edu/~ghrist/preprints.html
  and then there's John Harer: http://fds.duke.edu/db/aas/math/faculty/john.harer/publications.html
  and Vin de Silva: http://pages.pomona.edu/~vds04747/public/publications.html
  and Leo Guibas has done lots of computational topology: http://geometry.stanford.edu/member/guibas/
  and... the list goes on and on.
  Finally, math citations and publications have different statistics than those of other fields. I'm not sure what your benchmarks are but Gunnar Carlsson is a top mathematician by any measure.
7. Re:informercial by Anonymous Coward · 2013-10-11 18:16 · Score: 0
  
  Where are the published results on real-world data sets?
  Here.
8. Re:informercial by lorinc · 2013-10-12 03:46 · Score: 1
  
  Seriously? You are judging the work of someone solely on the shape of his citation curve on scholar? Compressed sensing is all but new, right, but selling these guys as bad because they have only 2k citations (!!!) on scholar is a bit exagerated, to say the least.
  
  --
  Video of some good progressive thrash music
9. Re:informercial by stenvar · 2013-10-12 09:20 · Score: 1
  
  Seriously? You are judging the work of someone solely on the shape of his citation curve on scholar?
  No, I'm judging the work by the absence of relevant peer reviewed, high impact publications, and the lack of experiments on large data sets.
  (My comment on the relatively low h-index was merely an aside.)
10. Re:informercial by Anonymous Coward · 2013-10-12 10:26 · Score: 0
  
  Oh, well, stop the presses! One publication in PNAS without control experiments on a problem where the correct answer isn't even known! Clearly, that proves that we have a groundbreaking new technique here!
  Get real.
11. Re:informercial by Anonymous Coward · 2013-10-12 16:04 · Score: 0
  
  Whether Carlsson is a top mathematician is not the question here. The Wired article is about a clustering algorithm and he says he has something groundbreaking new. I'd like to see some papers, code, and data showing that, in the usual way in which people in the community show that sort of thing.
  And while his citation statistics may be good for a top mathematician, for work on clustering and machine learning it is not. Since that's the field he has entered now, that's what matters, isn't it?
This explains ... by PPH · 2013-10-11 12:35 · Score: 1

... CSI Logic.

--
Have gnu, will travel.
"Topology" is not a "Methodology" by IHTFP · 2013-10-11 13:16 · Score: 1

Just plain lame. Nice marketing graphics though. What the hell is this crap doing on /.?
1. Re:"Topology" is not a "Methodology" by Anonymous Coward · 2013-10-11 14:04 · Score: 0
  
  The answer is simple. You probably weren't voting or commenting here. Get to work. And you might want to submit a story or two while you are at it.
compressed sensing is a load of utter horse shit by Anonymous Coward · 2013-10-11 14:12 · Score: 0

snakeoil salesmen
Probably right by coffbr01 · 2013-10-11 14:49 · Score: 1

I applaud the work, seriously. But in some departments of mathematics, statisticians are referred to in the same breath as politicians and liars. I'm not calling the OP a liar, but generalization can lead to incorrect conclusions. Unless it's lupus of course.
Re:compressed sensing is a load of utter horse shi by Anonymous Coward · 2013-10-11 16:22 · Score: 0

Why?
I know this one.... by maroberts · 2013-10-11 19:22 · Score: 1

what is Blade Runner

--
Donte Alistair Anderson Roberts - hi son!
Karma: Chameleon
science is over by mapkinase · 2013-10-11 22:41 · Score: 1

At the risk of repeating Lord Kelvin's folly: science is almost over, that's the root of the "data" problem. Data is so complex, because we exhausted simple systems, and we are trying to tackle irreducible systems.
It's a fallacy.

--
I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
Accuracy Of Assumptions by cstacy · 2013-10-12 00:54 · Score: 1

If you love trusting these kinds of compression algorithms, I have a Xerox machine I'd like to introduce you to...
where is this "gem picture"? by shtolcers · 2013-10-13 20:01 · Score: 1

Cmoon If you say smething like: "On a hunch, Candes applied an algorithm designed to clean up fuzzy images, expecting to see a slight improvement. .. What appeared on his computer screen instead was a perfectly rendered image...", your readers (at least me) expect to see this image and to hear about hunch-algorithm. Anyway, here (many clicks later) is original article - https://www.simonsfoundation.org/quanta/20131004-the-mathematical-shape-of-things-to-come/ (but still no pics)
examples... Re:Fascinating! by Fubari · 2013-10-15 05:57 · Score: 1

turns out on the professor's web page Emmanuel Candes, there is a link to Some old talks that shows an example of the kind of transforms / cleanup they're talking about (they're lengthy PDFs, but worth skimming if you're curious about the kidns of images). Nothing like real world pictures; synthetic examples with some shapes (almost like something you could mock up with MS Paint), but the premise is rather interesting.
And I just saw this like on the Candes web page above: this does have some interesting more real-world pictures. Fill in the Blanks: Using Math to Turn Lo-Res Datasets Into Hi-Res Samples (wired, 2010)