-- To make a pun demonstrates the highest understanding of a language
The Real Key is People....
by
airrage
·
· Score: 4, Insightful
I think every major corporation has some sort of data-mining, and I find that there is a gap between the data (even scrubbed) and the person who needs to make the decisions. Also, the article suggests that CRM is a subset of data-mining. In reality, it's the other way around, or completely unrelated, or both, unless I read that sentence wrong.
Chao
-- "This isn't a study in computer science, it's a study in human behavior"
Can't we get it over with and just call "data miners" Big Brother and
Re:.a
by
Anonymous Coward
·
· Score: 0
Hey, how's that mafia case working out? Had Superjew arrested yet?
you'd be amazed...
by
inode_buddha
·
· Score: 4, Funny
at how powerful data mining techniques can be. Why, just today I have received 3 more "Nigerian" mails, an offer to increase my bust size (I'm a guy), and an excellent credit report from 5 different, unheard-of companies...
Of course, the local supermarket cannot accept my personal check for groceries without their "discount card", never mind that it was *their* database admins who lost my account after a few weeks...
(er, yeah right, and my driver's licence and birth certificate aren't worth as much as their card ??)
>an offer to increase my bust size (I'm a guy), Yeah, but you're a slashdot reader so you probably have a man-bust. I know I do (!@#$%! New Year's Resolution)...
Re:you'd be amazed...
by
Anonymous Coward
·
· Score: 0
Actually, with the card they can do three things.
1. Develop a history of "bad" checks etc.
2. Get a whole lot more time to check your history.
3. Target their most profitable customers.
That of course doesn't mean they do this. I sadly was a consultant who helped a company develop one of these systems.
For a while the store's most profitable customers (and 20% of the customers generate the bulk of profits) were getting targeted promotions.
And the stores were targeting the needs of those customers via the data. Managers would call some of the most profitable customers if they left, to see what had happened, and we tried to listen much more to the suggestions of those customers.
However, that all got pulled within a year once another consultant came in. Now it is just a personal harassment and privacy price gouging system. And yes I am ashamed for birthing this.
" an offer to increase my bust size (I'm a guy)" Then I take it your wife is getting the emails to increase the size of her penis!
thank you, I'm here all week!
-- The Kruger Dunning explains most post on/. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
Prominent sticker
by
Anonymous Coward
·
· Score: 1, Funny
Yup, on a Dell from probably 1998-1999. Most of the other Dells in the photo look like they are of the same vintage.
Here's an example of the Microsoft Tax at work. This company most likely paid for Windows licenses on those machines even though they aren't using Windows.
Data Mining Briefly Explained
by
hdparm
·
· Score: 4, Funny
Briefly? This would be briefly:
1. Collect data
2. Do some mining
3. ???
4. Profit!
Re:Data Mining Briefly Explained
by
Lucas+Membrane
·
· Score: 2
You have hit the nail on the head. The ??? is the problem. The link or leap between knowledge and action is the hard part. Data mining can 'identify' 'profitable' and 'unprofitable' customers, but it can't tell you if your expense and profit allocations are right or if you should want to 'get rid' of 'unprofitable' customers or should want to try to turn them into profitable customers.
The classic data mining result is diapers and beer. People who buy beer at convenience stores are also likely to buy diapers. Great. Given that bit of intelligence, do we:
Put diapers and beer in close proximity so that people who buy diapers can easily pick up beer and vice versa, or
Put diapers and beer at opposite ends of the store so that people who buy both diapers and beer must travel through the store and have a chance to buy everything else?
The data seldom tell you what to do. Taking the data too seriously leads to treating customers like numbers, predictable statistical entities to be manipulated for profit's sake. This is not healthy for most businesses. Most of the important things that the data tell you, you could learn better by simply listening to customers respectfully.
Re:Data Mining Briefly Explained
by
Exantrius
·
· Score: 2
Actually, step three could be explicated as: 3. Sell derivative information to people who want it, i.e. the people you *DON'T* want to have it.
This includes, as others said, life insurance companies teaming up with grocery stores to find out what you eat, thus raising rates for people who eat "bad" stuff.
Or phone spam companies buying info from phone companies-- Consumer A contacts consumer B, and A bought our stuff, therefore you should call B.
Or, perhaps radio stations selling the numbers of people who request songs to the Wherehouse, so the Wherehouse can call you and say that you can buy the cd.
Or, maybe the police decide to track where you go by reading license plates off of each of the cameras that they have up to detect speeders or light runners.
Just some thoughts. This isn't a joke-- They know exactly how to get money from mining-- Who you can sell it to depends on what data you have. No one buys data for no reason-- And the only two reasons to buy data are to target for selling other stuff, or to "find people who don't want to be found"-- Whether it be to find terrorists, criminals, or theoretically people that make x hundred thousand/million a year, so that they can rob you.
Of course, most of this stuff happens every day, and no one realizes. /ex.
Re:Data Mining Briefly Explained
by
scubacuda
·
· Score: 2
Based on my experience, people who are likely to buy beer and liquor are much more likely to buy toilet paper than diapers... :b
Re:Data Mining Briefly Explained
by
Ed+Random
·
· Score: 1
Or, maybe the police decide to track where you go by reading license plates off of each of the cameras that they have up to detect speeders or light runners.
In fact, we have a license-plate-reading system like this in .nl
Video cameras record your license plate when you pass a portal, then record it again when you pass the next portal, say after 1 km. The images are stored and processed electronically.
Your average speed is calculated and you're fined if you were speeding.
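To make the arithmetic concrete, here is a minimal sketch of the average-speed check described above. The function names, the 1 km portal spacing default, and the 100 km/h limit are all assumptions for illustration; nothing here reflects how the real system is implemented.

```python
from datetime import datetime


def average_speed_kmh(t_enter: datetime, t_exit: datetime, distance_km: float) -> float:
    """Average speed between two portals, in km/h."""
    elapsed_h = (t_exit - t_enter).total_seconds() / 3600.0
    if elapsed_h <= 0:
        raise ValueError("exit time must be after entry time")
    return distance_km / elapsed_h


def is_speeding(t_enter: datetime, t_exit: datetime,
                distance_km: float = 1.0, limit_kmh: float = 100.0) -> bool:
    """True if the average speed over the section exceeds the (assumed) limit."""
    return average_speed_kmh(t_enter, t_exit, distance_km) > limit_kmh


# Hypothetical sighting of one plate at two portals 1 km apart, 30 seconds apart.
enter = datetime(2003, 1, 5, 12, 0, 0)
leave = datetime(2003, 1, 5, 12, 0, 30)
speed = average_speed_kmh(enter, leave, 1.0)  # 1 km in 30 s is about 120 km/h
print(round(speed, 1), is_speeding(enter, leave))
```

Note that the check only needs the two timestamps and the plate to pair them up, which is exactly the record the privacy objection is about.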
Some argue that this system is fairer than using speedtrap cameras that record only 'an incident', not 'your general behaviour'.
Others argue that "traject-controle" as the system is called here is a clear invasion of privacy (since they necessarily need to keep a record of your license plate during the 1km you're driving).
The same system can be used to check for people without valid insurance, who 'forgot' the mandatory APK car checkup or those who neglected to pay their road taxes.
The possibilities are endless... In other words, where will this end?
Interesting article, but this is something that has been happening and will continue to.
Technology being put to use to seek out enemies of the state for the world governments is nothing new.
At least it is a good thing that companies are making good money in the process. Your privacy? That was lost long ago.
It was only a matter of time before this happened. At least be glad that we've not yet reached the stage where they'd bother having your entire genome sequence to create solutions and replacements for you :-)
Perhaps the author of the article has just read Cryptonomicon or something.
Get over it, companies will track you, governments will monitor it. And there will be people who will beat both, and people who will be susceptible to both. Unfortunate, but hey, paranoia does not help either.
At least it is a good thing that companies are making good money in the process. Your privacy? That was lost long ago.
Oh, the irony.
They call themselves patriotic, and yet they're supplying the very means that are slowly turning the U.S. into a police state. Sorry, but I seriously doubt that this is what the U.S. founders had in mind, and it's certainly not the reason that U.S. war veterans both risked and sacrificed their lives. Patriots aren't sheep that blindly follow the government, they are the ones who fight to maintain the fundamental (constitutional) precepts upon which the United States was built.
Reminds me of...
by
gpinzone
·
· Score: 5, Interesting
...how the Bayesian spam filters operate (on a much smaller scale). They find predictors of "spam" like these guys find predictors of "terrorists."
If the false positives of this system finding terrorists are as low as the ones that identify spam, is it really unreasonable to consider that probable cause for an investigation? At least, until the 0.000001% slips by and causes a lawsuit for wrongful arrest.
Re:Reminds me of...
by
Anonymous Coward
·
· Score: 2, Interesting
With a spam filter, the penalty for false positive is perhaps a lost sale or an annoyed friend/coworker.
With a terrorist classification filter, the penalty for a false positive could cost some innocent person days/weeks in prison and thousands of dollars in lost wages and legal fees. And that's assuming they are a US citizen. A non-citizen could be held indefinitely, completely destroying any career they might have.
Re:Reminds me of...
by
gpinzone
·
· Score: 3, Interesting
Yes, but remember that the current methods aren't much better. I mean, right now there's lots of complaints about how the USA is racially profiling Middle Eastern men. Whether or not this profiling is justified could be based on a report of such a filter.
The issue isn't whether or not we should use data mining to profile individuals or groups. Profiling will occur no matter what. What these methods do is help find parameters that more accurately identify candidates, rather than just assuming all Middle Easterners are automatically guilty until proven otherwise.
Re:Reminds me of...
by
Anonymous Coward
·
· Score: 0
At least, until the 0.000001% slips by and causes a lawsuit for wrongful arrest.
How do you launch a lawsuit when you're in an Army "detention camp" like the 550 or so "suspected terrorists" stuck down near Cuba?
A US Judge said they didn't fall under US jurisdiction because they weren't on Mainland US soil, despite being on a US Army compound. Three English Judges overturned her decision... yet they stay confined (2 x 15 minutes exercise/week, for example)
You can start counting how long before you guys have NO rights left anymore.
Oh, and Fuck America.
Re:Reminds me of...
by
Anonymous Coward
·
· Score: 0
Data mining can be quite different. Bayesian methods used in the spam filters are supervised, which means that you show it examples of spam vs non-spam data and the system will learn to tell the difference between the two. It is "supervised" because you act as the teacher.
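As a rough illustration of the supervised case, here is a toy naive Bayes classifier with add-one smoothing. It is a sketch only: the training messages are made up, the function names are hypothetical, and real spam filters use far more data and better tokenization.

```python
import math
from collections import Counter


def train(labeled_docs):
    """labeled_docs: list of (text, label) pairs, label is "spam" or "ham".

    Assumes at least one example of each label, since the prior is log(count/total).
    """
    word_counts = {"spam": Counter(), "ham": Counter()}
    doc_counts = Counter()
    for text, label in labeled_docs:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    return word_counts, doc_counts, vocab


def log_score(text, label, model):
    """Log prior plus summed log word likelihoods for one label."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    total_words = sum(word_counts[label].values())
    score = math.log(doc_counts[label] / total_docs)
    for word in text.lower().split():
        # Add-one (Laplace) smoothing so unseen words do not zero out the score.
        score += math.log((word_counts[label][word] + 1) / (total_words + len(vocab)))
    return score


def classify(text, model):
    return max(("spam", "ham"), key=lambda label: log_score(text, label, model))


# The "teacher" part: hand-labeled examples (invented for this sketch).
examples = [
    ("increase your credit now", "spam"),
    ("free offer increase size", "spam"),
    ("meeting notes for monday", "ham"),
    ("lunch on monday", "ham"),
]
model = train(examples)
print(classify("free credit offer", model), classify("notes for lunch", model))
```

The labels in `examples` are the supervision; without them the classifier has nothing to learn from.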
Data mining methods can be unsupervised, which means no teacher exists. These methods learn to spot correlations in the data. Eg a supermarket data mining system may find that people who buy milk often buy oranges too. The supermarket relies on the data mining system to discover interesting info like this that it didn't know before. It will then use this to some advantage. Eg it could place milk and oranges next to each other to make it convenient for customers. Or it could intentionally put them far apart in the attempt to get customers to buy other items as well.
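The milk-and-oranges kind of discovery can be sketched with the usual support, confidence and lift measures over shopping baskets. The baskets below are invented and the helper names are hypothetical; a real system would use an algorithm such as Apriori rather than this brute-force pair scan.

```python
from itertools import combinations


def rule_stats(baskets, antecedent, consequent):
    """Support, confidence and lift for the rule antecedent -> consequent."""
    n = len(baskets)
    n_a = sum(1 for b in baskets if antecedent in b)
    n_c = sum(1 for b in baskets if consequent in b)
    n_both = sum(1 for b in baskets if antecedent in b and consequent in b)
    support = n_both / n
    confidence = n_both / n_a if n_a else 0.0
    lift = confidence / (n_c / n) if n_c else 0.0
    return support, confidence, lift


def frequent_pairs(baskets, min_support=0.4):
    """Scan every item pair, with no labels given, and rank by lift."""
    items = sorted(set().union(*baskets))
    results = []
    for a, b in combinations(items, 2):
        support, confidence, lift = rule_stats(baskets, a, b)
        if support >= min_support:
            results.append((a, b, support, confidence, lift))
    return sorted(results, key=lambda r: r[4], reverse=True)


# Made-up transaction data for illustration.
baskets = [
    {"milk", "oranges", "bread"},
    {"milk", "oranges"},
    {"milk", "eggs"},
    {"bread", "eggs"},
    {"milk", "oranges", "eggs"},
]
for a, b, support, confidence, lift in frequent_pairs(baskets):
    print(f"{a} -> {b}: support={support:.2f} confidence={confidence:.2f} lift={lift:.2f}")
```

A lift above 1 means the two items show up together more often than independent purchases would predict. What to do with that (shelve them together or far apart) is still a human decision.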
to what end?
by
loveandpeace
·
· Score: 2, Interesting
the more i read about data mining, the more it seems to provide a connectivity and interaction leap, a step we are really due, in a technological sense.
when the internet was new and all (shortly after Al Gore invented it), there was much talk of how Big Brother would swoop in and turn us into ones and zeros, monitor our every move, and control us through the new portal. that hasn't happened yet (though Ashcroft is trying).
does it seem that data mining is more harmful (making us all into terrorists for buying fireworks and seeing born on the fourth of july in the same day) than good (allowing better prediction of supply and demand to lower costs and raise productivity)?
profiteering?
by
SHEENmaster
·
· Score: 5, Interesting
Today, however, companies that excel in connecting the data dots are finding a lifeline in a customer whose IT ineptitude is matched only by its means: the U.S. government, which will spend $53 billion on information technology this year. The Federal Government's inability to share and analyze information became clear in the months after the 9/11 attacks.
While I won't argue against the government's inability to do anything but waste money, I do think that these "anti-terrorism" dealies are going too far. We know that they are spending $53 billion on information technology. When they spend it on a hammer or a toilet seat I know that something is getting done, but "information technology" makes me suspicious.
Granted my opinion is largely a result of window flags selling in excess of twenty dollars and not hearing the results of such spending. In fact, I haven't heard of a single terrorist act averted since 9/11. It couldn't hurt to inform us when the spending pays off; could it?
Is this information actually getting results, or is it just profiteering of the corporations that we so love to slander and libel?
-- You can't judge a book by the way it wears its hair.
Re:profiteering?
by
acidfast7
·
· Score: 2, Informative
In fact, I haven't heard of a single terrorist act averted since 9/11.
With the current sensationalized state of mass media, would one hear of a terrorist act if it was avoided?
Re:profiteering?
by
RDPIII
·
· Score: 2, Insightful
It couldn't hurt to inform us when the spending pays off; could it?
But would you believe it if your government told you "23 terrorist plots foiled this month"? They probably couldn't be more specific than that, and without any details or corroboration, who's to say. I'm all for openness and accountability, but if it's unlikely that one would get these here (there are better areas for this, like public health care), then I can do without monthly statistics that one would have to take on faith.
In Soviet Russia official statistics were made up all the time, and dismissed just as often or more.
And here I thought 'data miners' were seven really short geeks, holed up in a server closet with some hot chick that's hiding from her evil step-mother. Well, you learn something new every day! =)
What are the odds of finding out more things like this, like at the office of Total Information Awareness? Or the Transportation Security Administration's list of people who cannot fly?
-- "It is a greater offense to steal men's labor, than their clothes"
The data gnomes are stealing my data!
by
SHEENmaster
·
· Score: 2
Why doesn't anyone else see them!?
-- You can't judge a book by the way it wears its hair.
dunno 'bout any one else, but I don't care for all the ads... Print Link
-- Sometimes people just have to learn and adapt to change, it is one of the requirements of being a living thing.
Re:Print Link
by
Anonymous Coward
·
· Score: 0
Oh but with the print link you can't see the picture and if you can't see the picture you can't see the sticker, which seems to be the only reason this article was posted.
Mine This California: +1, Unpatriotic
by
Anonymous Coward
·
· Score: 0
#!/usr/bin/perl -w # 531-byte qrpff-fast, Keith Winstein and Marc Horowitz # MPEG 2 PS VOB file on stdin -> descrambled output on stdout # arguments: title key bytes in least to most-significant order $_='while(read+STDIN,$_,2048){$a=29;$b=73;$ c=142;$ t=255;@t=map{$_%16or$t^=$c^=( $m=(11,10,116,100,1 1,122,20,100)[$_/16%8])$t^=(72, @z=(64,72,$a^=12*($_%16 -2?0:$m&17)),$b^=$_%64?12 :0,@z)[$_%8]}(16..271);if ((@a=unx"C*",$_)[20]&48){$h =5;$_=unxb24,join"",@ b=map{xB8,unxb8,chr($_^$a[--$ h+84])}@ARGV;s/...$/1$&/;$ d=unxV,xb25,$_;$e=256| (ord$b[4])>8^($f=$t&($d>>12^ $d>>4^ $d^$d/8))>8^($t&($g=($q=$e>>14&7^$e)^$q*8^ $q>=8)+= $f+(~$g&$t))for@a[128..$#a]}print+x"C*",@a}';s/x/p ack+/g;eval
Already used in mineral exploration
by
core+plexus
·
· Score: 4, Informative
We've been using data mining in mineral exploration for quite some time now, and it really helps given the tremendous volumes of data generated from modern geophysical, geochemical, and geological exploration.
Before You Jeer...
by
robbyjo
·
· Score: 3, Informative
You may want to read this book and see for yourself whether data mining would make a breakthrough in the future.
--
-- Error 500: Internal sig error
Re:Before You Jeer...
by
arasinen
·
· Score: 2, Interesting
Another good book that explains the basics of data mining is Principles of Data Mining by Hand et al.
It is perhaps not the most simple book around, but it covers a lot of important issues. Furthermore it doesn't ignore the role of computer science, as two of the authors have a CS background.
You won't find explicit instructions about how to build your own Google, but it surely does wonders for your insight.
-- [ Antti Rasinen ]
Data mining plans
by
Anonymous Coward
·
· Score: 0
1. Collect data 2. ??? 3. Profit
OLD NEWS!
by
Anonymous Coward
·
· Score: 0
If you read the title, you would see that it was dated 2002-12-23! That's so last year. Oh well, at least it's not a dupe!
Data mining for consumers?
by
Anonymous Coward
·
· Score: 1, Interesting
"Throughout the '90s, data mining spread from one industry to the next, enabling companies to know more about customers' needs and to zero in on the characteristics that distinguish the customers they want from those they do not. A credit-card company using a system designed by Teradata, a division of NCR, found that customers who fill out applications in pencil rather than pen are more likely to default. A major hotel chain discovered that guests who opted for X-rated flicks spent more money and were less likely to make demands on the hotel staff, according to privacy consultant Larry Ponemon. These low-maintenance customers were rewarded with special frequent-traveler promotions. Victoria's Secret stopped uniformly stocking its stores once MicroStrategy showed that the chain sold 20 times as many size-32 bras in New York City as in other cities and that in Miami ivory was 10 times as popular as black. Aspect Communications, based in San Jose, Calif., sells a program that identifies callers by purchase history. The bigger the spender, the quicker the call gets picked up. So if you think your call is being answered in the order in which it was received, think again."
Couldn't the consumer use such information to get a better deal? Also of course there's the "abuse" aspects for the businesses and governments that use this.
After 9/11, many tech companies saw opportunities for both patriotism and profit. Oracle offered to donate the software to create a federal identity database.
Well, I suppose it's nice to know that the handbasket we're going to hell in is at least free.
You guys wanted information to be free.
by
Anonymous Coward
·
· Score: 0
Well you're getting EXACTLY what you want. Don't cry and complain, data is data. To complain is to be a hypocrite. After all everything should be Open Source, eh? The moral: beware of what you ask for, you may just get it.
Makes me think of Bowling For Columbine
by
flopsy+mopsalon
·
· Score: 2, Interesting
I couldn't help noticing the Time.com article made reference to crime and terrorism, particularly the September 11 WTC/Pentagon attacks (which happened over a year ago), and to the recent Washington Sniper killings (which ended months ago), in spite of the fact that this article would have been just as fascinating if they had simply used the business examples as illustration.
In the movie 'Bowling For Columbine' Michael Moore speculates that one of the root causes of gun violence in the US is the type of fearmongering the US media engages in in an effort to keep their sales/ratings up.
It looks like Time.com's gratuitous exploitation of US fears of crime and terrorism might be an example of this.
Re:Makes me think of Bowling For Columbine
by
BWJones
·
· Score: 2
I couldn't help noticing the Time.com article made reference to crime and terrorism,....in spite of the fact that this article would have been just as fascinating if they had simply used the business examples as illustration.
Sure, fear sells lots of stuff. MRE's, guns, ammo, radiation pills (iodine), bomb shelters etc.... The thing that people should realize with data mining software though is that its application to terrorism and consumer tracking is new but the technology is not. In fact, people have been using it in remote sensing to prospect for gold and oil among other things from space, it has been used since the late 70's to interpret satellite images for the CIA and NRO, it has been used for psychological research etc...etc...etc... and I use a form of it for retinal research. What should not happen with the fear mongering is that the technology be given a bad name from those who want to abuse the technology. Like many technologies, data mining is a tool that can be mis-used, but its application can also do tremendous good.
Open Source Data Mining!
by
cosmosis
·
· Score: 4, Interesting
Ok, I've been annoyed for years at the disparity between corporations and customers in who knows what about who. I think it's time someone came up with a P2P, open source, reputation system in which we can turn the lens of datamining back on them. Technologies like Cuejack, combined with the efforts of groups like Transparency International, can help bring about Participatory Capitalism.
Data Mining as used by Colombian Drug Cartels ...
by
Anonymous Coward
·
· Score: 4, Interesting
Here is a real life story about data mining and its potential for brutal consequences. This was a very early application. Those who were fingered were killed. Of course, they adopted our new (lack of) due process rules a decade ago...
One large (as seen in the Time photo) plus 3 smaller ones that read "Powered by Red Hat Linux."
KnowledgeMiner 5.0 software for Mac OS 9.
by
alchemist68
·
· Score: 2, Informative
can be located here:
http://www.knowledgeminer.net/
I've thought about using this software to analyze stocks to purchase, but never got around to looking at the information required for the software to give me an edge in the market. Looks promising though.
Panel One:
Dogbert Consults
My data mining software has found another message from God.
Panel Two
It says you've been stealing lunches from the refrigerator in the break room.
Panel Three
Then it says "Ha, Ha that wasn't pudding!"
btw, that was January 3rd on the Dilbert Calendar this year..
-- At least the war on the environment is going well
The important thing about Natalie Portman's grits isn't that she eat them, it's that she pour them down my pants.
If she wants to eat them after that, well, that's fine, but any pleasure derived from that would be purely auxiliary.
In conclusion, I would be delighted if Miss Portman would be so kind as to pour some hot, steamy grits down the front of my trousers. Thank you for your time, and have a pleasant day.
--
-- the strongest word is still the word "free"
Re:down my pants!
by
Anonymous Coward
·
· Score: 0
You stupid FAGG0T.
Once she pours them down your pants, then she eats them...
Real slow...
But you wouldn't know that, would you, A$$M0NK3y!
Objection to the numbers
by
rootmonkey
·
· Score: 4, Informative
The article uses NASDAQ as an example of having to process terabytes of data on a daily basis, where data mining software can help filter things out. The software may be useful, but NASDAQ does not process terabytes per day of incoming data. I work in the market data industry and we take exchange feeds from around the world, including NASDAQ, and we don't process close to that much. OPRA (options) has the most data per day, and that is only on the order of tens of GB.
--
Yes but every time I try to see it your way, I get a headache.
This article seemed to me more like a concatenation of a few press releases, especially the ones noting data mining successes, than "news." Then again, most news is simply rehashed PR (as a lecturer on NPR noted the other night).
Let our Data Mining Products make your life Better!
To save everyone time and annoying popups, consider visiting the sites of some of the products mentioned. These pages are every bit as insightful and critical as the article:
http://www.autonomy.com/
http://www.currentanalysis.com/
http://www.srdnet.com/
http://www.digimine.com/ (this didn't load for me, but I have Javascript disabled...)
http://www.unisys.co.uk/public-uk/justice/police/default.asp?cn=pa
Posting anonymously to dodge accusations of karma whoring.
Data mining companies
by
MrWa
·
· Score: 2, Interesting
So "Data-mining companies have been among the hardest hit in recent years" is claimed by Time.com, which goes on to use MicroStrategy as a prime example of a company that skyrocketed in value and plummeted in the "tech crash" later. Oh, and by the way, they also overstated earnings. What these articles about the "tech crash" need to do is normalize the comparisons, because these companies that ballooned in value so much, then crashed, probably just experienced a slight correction due to the stupid values they attained to begin with!
As for datamining itself: more power to them. The government gaining the ability to mine the data it already has should mean that we don't need more organizations, more intrusive investigations, etc. Every report or credible news item about post-9/11 studies indicates that we already had enough information, so there should be no need to create new laws that allow for more information to be collected. Just use what you have already, kthx.
What would be nice is if this data-mining allowed Muslims living in the U.S. to stop having to worry whenever they go outside. Look at the information publicly available, that may provide patterns of "nonobvious" connections, and let people live their lives in peace, regardless of background.
As a consumer, everything I do in public I consider public information. If a business uses this to better serve me, all the better. Maybe this will mean I don't have to watch feminine ads on TV, or the phone gets answered faster when I call. Maybe it just means that the customer rep knows my name and what I bought already.
question from non-american
by
Anonymous Coward
·
· Score: 0
''Victoria's Secret stopped uniformly stocking its stores once MicroStrategy showed that the chain sold 20 times as many size-32 bras in New York City as in other cities and that in Miami ivory was 10 times as popular as black.''
Ok. But WHY? is a size-32 bra an indication of something?
Re:question from non-american
by
Anonymous Coward
·
· Score: 0
is a size-32 bra an indication of something?
Small tits.
Maybe they work out more there.
Re:question from non-american
by
Anonymous Coward
·
· Score: 0
'Maybe they work out more there.'
or maybe they have less implants there.
Re:question from non-american
by
Anonymous Coward
·
· Score: 0
or maybe they have less implants there.
Can't we have a serious discussion about tits without violating the rules of grammar? Perhaps you mean "or maybe they have fewer implants there."
Digging For Autism Correlations
by
Baldrson
·
· Score: 2, Interesting
This is a case where what was "mined" was not just the raw data but various arithmetic combinations of statistical variables derived from the data. There needs to be some additional work to make the figure of merit not just correlation but statistical significance. I couldn't find Perl modules that provide a p-value (the probability of seeing a correlation at least this strong if the null hypothesis were true) for correlations.
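One way to attach a significance figure to a correlation without any special module is a permutation test. The sketch below is in Python rather than Perl, uses invented data, and swaps in a permutation test for the textbook t-distribution formula; the function names are hypothetical.

```python
import random
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def permutation_p_value(xs, ys, n_permutations=2000, seed=0):
    """Two-sided p-value: how often a shuffled pairing correlates at least as strongly."""
    rng = random.Random(seed)
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(pearson_r(xs, shuffled)) >= observed:
            hits += 1
    # Add one to both counts so the estimate is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


# Made-up data: ys tracks xs closely, so the p-value should be small.
xs = list(range(1, 11))
ys = [2 * x + (1 if x % 2 else -1) for x in xs]
print(round(pearson_r(xs, ys), 3), permutation_p_value(xs, ys))
```

When many derived variables are mined at once, some will look significant by chance, so a threshold on this p-value would need a multiple-comparisons correction as well.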
Uber Loyalty Card in the UK (Nectar)
by
Boss,+Pointy+Haired
·
· Score: 5, Insightful
Three large British retail companies have recently created a joint loyalty card.
Nectar has been set up by Sainsbury's (a supermarket), Barclays (a financial services company) and BP (a petrol filling station company).
I didn't mind Sainsbury's knowing that I eat junk, but now that they're telling Barclays what junk I eat I end up with Barclays putting my life insurance premiums up.
Interesting stuff.
Re:Uber Loyalty Card in the UK (Nectar)
by
Anonymous Coward
·
· Score: 0
Hey, at least they're being honest with the name. Nectar, the substance, is a temptation (sweet), the ulterior motive for which is the spread of pollen. Nectar, the card, is a temptation (free!), the ulterior motive for which is profiling and... yes, data mining.
Re:Uber Loyalty Card in the UK (Nectar)
by
josey_whale
·
· Score: 1
You forgot to mention that Debenhams was also a member of this alliance (http://media.guardian.co.uk/advertising/story/0,7492,798841,00.html). My question is, what else will they do with this data?
At the end of the article, it mentions data mining helping to catch the DC snipers. Whoooooooa.
The cops had profiled a white male Christian terrorist, and that's all they were looking for. You didn't catch the article, but the real perps were stopped **10** times at roadblocks, they were in custody that many times.
And they were let go, their skin color contradicted what the data mining told them. They weren't caught until a Maryland state trooper leaked the license plate, then a trucker at a rest stop made the collar.
Data mining won't solve the stupidity of leaders like Chief Moose.
Did you read (as opposed to glance over) the article? Data mining was *NOT* used during the DC sniper case, only after the fact:
The system was set up in Montgomery County, Md., only a day before the arrests were made, so it did not play a role in solving the shootings. Working through the hundreds of thousands of leads that were entered into various police computer systems, however, Coplink noted that witnesses reported seeing John Muhammad's blue Chevrolet Caprice near two of the Washington-area shootings, and local police ran computer checks on his license plate at least three times during the killing spree.
The profiling was done entirely by humans, with no computer assistance.
Plots that have been averted...
by
MyNameIsFred
·
· Score: 5, Insightful
...I haven't heard of a single terrorist act averted since 9/11...
You haven't been paying much attention to the news, have you? Let's see, we had the plot to attack ships in the Strait of Gibraltar that was averted, the possibly overblown Jose Padilla - Dirty Bomb case, and the capture of key operatives such as Abu Zubaydah, which surely put a dent in al-Qaida's plans.
Frankly the problem is attacks such as the Twin Towers are always going to stick in your mind more than a brief news report that Abu Zubaydah was captured. Also there is always more skepticism that capturing some guy actually averted a plot -- see Jose Padilla. We will never know whether he would have actually done something. There will always be second guessing on whether a plot was really averted.
In the last page, this Fayyad of digiMine claims that he doesn't want to work with the govt because the 'Bush administration' hasn't clearly enough articulated its vision of what it wants.
I hope he was misquoted. There may be some legit reasons not to work with the US Govt. on anti-terrorism technology, but Mr. Fayyad is being either overly dismissive or just immune to opportunity by saying what he's quoted as saying. It sure is nice when the client comes to you with a fully articulated vision for the solution he needs, but most just start out with stated or even just perceived needs and leave it to the, ahem, vendors to provide the solution/vision.
On another note, it would be interesting to read an article with some technical detail beyond a generic reference to XML. Maybe someone can post a link.
This guy in the photo looks like Tony Soprano. Maybe the Mob uses RedHat Linux for data mining.
I missed the episode with T in the server room.
"Hey, Jackie, whatta these computahs for?"
-- ---
Programmers do it with their digits!
Data Mining is the wrong term
by
nrobert
·
· Score: 2, Interesting
The term data mining is misleading. Mining is more a matter of sifting through lots of junk to get at the valuable material. That's not exactly what 'data mining' is about.
If you want valuable information and you know what you're looking for, you just query. Find X in pile of data. That's mining. I know it's a semantic comment, but mining's not what we're talking about doing here.
Data mining is more like what geneticists searching for a genetic cause for a cancer are doing. Finding usable correlations and meaningful precursors. We don't call cancer-fighting biologists 'gene miners'. I think the term mining belittles a more complicated activity.
A better term? Data Correlating? Mining also just sounds brutish.
-- ---
Programmers do it with their digits!
Re:Data Mining is the wrong term
by
geekoid
·
· Score: 2
No, it's mining, not correlating. If I have a cube of data, I can find things outside of how the data is organized. Data mining is not finding X in data; it's finding X in data when X isn't necessarily a hard value.
-- The Kruger Dunning explains most post on/. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
The problem with automatic identification
by
Sgs-Cruz
·
· Score: 2, Insightful
The problem with automatic identification of any specific type of person within a large group (say, the entire U.S. population, or, hey, the entire world! Why not?) is the obscenely low false positive rate you must have. I mean, to identify 100 terrorists in 270 million people, sure, a 50% false negative rate is fine (catching 50 terrorists is better than catching none, right?), but to keep those real terrorists from being swamped by innocent people who happen to match a profile, the false positive rate must be lower than about 0.000037%, which is roughly one innocent person flagged per real terrorist in the population... that's almost impossible to achieve. And that is why automated terrorist (or anything) identification is still a long way off.
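The arithmetic behind that threshold can be sketched as follows. This is only an illustration using the figures from the post (100 targets among 270 million people, a 50% false negative rate); the function name `flagged_counts` and the 1% comparison rate are made up for the example.

```python
def flagged_counts(population, targets, false_negative_rate, false_positive_rate):
    """Return (true_positives, false_positives) for a hypothetical screening system."""
    innocents = population - targets
    true_positives = targets * (1 - false_negative_rate)
    false_positives = innocents * false_positive_rate
    return true_positives, false_positives


POPULATION = 270_000_000  # figure used in the post
TARGETS = 100             # figure used in the post

# False positive rate at which false alarms equal the number of real targets.
break_even_rate = TARGETS / (POPULATION - TARGETS)
print(f"{break_even_rate * 100:.6f}%")  # 0.000037%

# For comparison, an assumed (and far more generous) 1% false positive rate.
tp, fp = flagged_counts(POPULATION, TARGETS, 0.5, 0.01)
print(f"{tp:.0f} real hits buried among {fp:,.0f} false alarms")
```

At the assumed 1% rate the 50 real hits sit among roughly 2.7 million false alarms, which is the swamping effect the post describes.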
--
Karma: pi (Mostly due to circular reasoning in posts).
Re:The problem with automatic identification
by
nrobert
·
· Score: 2, Interesting
I'm not sure the goal is to have the miner spit out names of confirmable terrorists with that kind of accuracy. Your comment is fair if you're looking for that kind of entirely automated solution, but that's not the goal. It doesn't need to be 100% accurate in order to mitigate risk and pay for itself. Neither does the J Crew web site product predictor.
The goal is definitely to help single out people that are worth further investigation. By motivated, thinking, observant humans. That's all.
I also think you might be a little bit reductionist in your estimate of 100 terrorists. It's quite possible that there are many more, though I suppose it doesn't matter because even if you're looking for just one person, it's still worth doing.
Given that you're looking for a reasonably good filter to find qualifiers for a round of investigation, a better metric to use might be the number of people you're willing to investigate as a ratio against those you hope to positively I.D. You might argue that you'd be happy to investigate 5,000 people just to find one 'terrorist'. If so, and you're looking for an estimated 100 terrorists, you can multiply to get 500,000 'persons of interest', or about 0.19% of the US population. This percentage is much more achievable, and besides, you then use a different algorithm to decide which of these you should interview or research further first.
It seems pretty manageable to me. I also think your assessment of the 50% false negative rate is too rosy. It seems to me that the risks would be serious enough of even 1 getting away (as in scanning baggage for instance) that you'd want to cast the widest net possible and then narrow those carefully. False negatives may be more costly than you are suggesting.
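The investigation-budget estimate above works out like this. It is a rough sketch using the numbers from this post and the 270 million population figure from the parent; `persons_of_interest` is a hypothetical helper name.

```python
def persons_of_interest(targets, investigations_per_target, population):
    """Size of the shortlist and its share of the population."""
    shortlist = targets * investigations_per_target
    return shortlist, shortlist / population


# 100 suspected targets, 5,000 investigations tolerated per target found.
shortlist, share = persons_of_interest(100, 5_000, 270_000_000)
print(shortlist, f"{share:.2%}")  # 500000 0.19%
```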
-- ---
Programmers do it with their digits!
an advertisement for privatization of security?
by
fermion
·
· Score: 1
This article seems to explain very little of data mining, and is far from concise. The real gist of the article seems to be that data mining companies, which may be guilty of fraud and certainly seem to lack a viable business plan, are once again suckling off the teat of mother U.S.A. instead of finding the private customers that they all would claim is the basis of capitalism. Likewise, the military contractors are desperately trying to get into the data mining game to maintain relevance.
I also take issue with the statement "a customer whose IT ineptitude is matched only by its means"
which is clearly a jab at the hard working professionals of the US government and an effort to push privatization of IT functions. I have worked with IT professionals in Academic, Industrial, Commercial, and Government settings. I will tell you that IT professionals in all these settings range from incompetent to brilliant. The difference is that, until recently, US employees have not had to live with the fear of random layoffs or arbitrary insurance reductions. I often wonder why it is unpatriotic to insult policemen, firemen, or military officers, but when it comes to the professionals that allow these people to work, no insult is severe enough.
-- "She's a scientist and a lesbian. She's not going to let it slide."
Orphan Black
Re:an advertisement for privatization of security?
by
scubacuda
·
· Score: 2
Not that I advocate insulting "policemen, firemen, or military officers"...
...but I'd say that the difference is that these people are on the "front lines", so to speak. I'd rather have an IT job where I can surf /. in my spare time rather than have to investigate shootings, put out fires, or make strategy decisions that could potentially cost millions of lives.
*how* does data mining work? (beyond "it makes connections between various data.") I don't recall it ever coming up in any of my classes. It seems like it would be an AI problem.
If everyone's going to go out and be paranoid, might as well know what we're being paranoid about.
-- If I have been able to see further than others, it is because I bought a pair of binoculars.
Re:Nice story but
by
Anonymous Coward
·
· Score: 0
According to Fayyad, "Data mining is the nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data [1]".
Basically it involves AI, machine learning and statistics among other things...
[1] Fayyad, U., G. Piatetsky-Shapiro, and P. Smyth, From Data Mining to Knowledge Discovery in Databases. AI Magazine, 1996. 17: p. 37--54.
Re:Nice story but
by
Anonymous Coward
·
· Score: 0
Data mining is a kind of umbrella term for a load of different machine learning and statistical techniques, when applied to a fuck-ton of data. Yes, there's some bits from AI in there, and neural nets do get used, but there's also statistical stuff like k-means clustering. Basically, any technique that can be used to form a model of all of your data, and then apply it to some more, can be used for data mining.
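To make "form a model of your data, then apply it to some more" concrete, here is a minimal k-means sketch in plain Python. The records, the choice of starting centroids, and the function names are all assumptions for illustration; real tools use more careful initialisation and stopping rules.

```python
def nearest(point, centroids):
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(point, centroids[i])),
    )


def kmeans(points, initial_centroids, iterations=10):
    """Fit centroids to points with plain Lloyd iterations."""
    centroids = [tuple(c) for c in initial_centroids]
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for point in points:
            clusters[nearest(point, centroids)].append(point)
        # Move each centroid to the mean of its cluster; keep it if the cluster is empty.
        centroids = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster)) if cluster else old
            for cluster, old in zip(clusters, centroids)
        ]
    return centroids


# Hypothetical customer records: (visits per month, average spend).
history = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
model = kmeans(history, initial_centroids=[history[0], history[-1]])

# Apply the fitted model to records it has not seen before.
new_records = [(1.1, 0.9), (7.9, 8.0)]
labels = [nearest(record, model) for record in new_records]
print(labels)  # [0, 1]
```

The "model" here is nothing more than the list of centroids; applying it to new data means assigning each new record to its nearest centroid.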
The Beast
by
macdaddy357
·
· Score: 3, Interesting
Does this data mining stuff remind anyone of the old urban legend about "The Beast"? A supercomputer in Antwerp or Brussels that knows everything about everyone? Is that idea still as ridiculous as it was back in the day?
-- How ya like dat?
Re:The Beast
by
Lord+Omlette
·
· Score: 1, Flamebait
Someone's signature out there still holds my question stupidly posted to usenet back in the day: Was Shub-Internet a real Lovecraft character?
-- [o]_O
Why (Re:Fayyad)
by
Anonymous Coward
·
· Score: 0
digiMine sells many different types of data mining solutions, but I believe their main focus is customer relationship management and customer segmentation.
These areas apply to business more than they apply to governments...
Good excuse for porn!
by
MadAnthony02
·
· Score: 1
A major hotel chain discovered that guests who opted for X-rated flicks spent more money and were less likely to make demands on the hotel staff, according to privacy consultant Larry Ponemon. These low-maintenance customers were rewarded with special frequent-traveler promotions.
Cool. Next time I go on a trip I can order some in room porn and justify it because I'll get better deals in the future!
White Paper How to Catch a Thief
by
Onyxviper
·
· Score: 1
I have not read all of this, but some of you with questions on how the actual Data Mining process works might get something out of it. Some of it is over my head, but that is not saying much. Check it out.
http://sales.visualanalytics.com/whitepaper/index2.cfm?Template=HowToCatchAThief
Sauron commands you
by
SHEENmaster
·
· Score: 3, Funny
to murder all Harry Potter fans!
-- You can't judge a book by the way it wears its hair.
This is why browsing at -1 is worthwhile
by
Anonymous Coward
·
· Score: 0
funny, technical comments like these make me forget all about that goatse.cx website. oh wait...
Yes, it means I will not be visiting New York
by
Anonymous Coward
·
· Score: 0
Hooters would be lying to me about New York.
That or maybe the Size-32 bras are seasonal...kinda like Hooters' beer fountain always filling up and we all suck it dry and they gotta fill it back up again.
I always think of artificial intelligence when I hear data mining, and I kind of assumed that was what would be clarified (at least) by this article. However I was wrong.
The most concrete evidence of success that is presented is that Victoria's Secret realized it sold tons of size X bras in New York and 10x as many white as black items in Miami. Um, I really hope they didn't have to hire a firm to tell them that. Don't they have spreadsheets? Does anyone look around the store and notice what sells?
Which moves me on to another point. Companies seem to have very little faith in their employees and ask very little of them these days. (Gets out his pipe and rocking chair.) I remember when my sister got her first job at an ice skating rink. They sold ice skating outfits to (mostly) mothers of young girls taking private ice skating lessons. My sister could tell you at a glance what outfits would sell first. (As I recall it was the most garish ones - she used to specifically ask for "ugly" or "anything that looks designed by the color blind").
Nowadays, when I have to ask for help finding something in a store and I suggest a different location for it (real life example: Why don't you stock the phone connectors with your phones?) I get blank stares and comments along the lines of "ya, like my manager would listen".
-- a war on terrorism? How can we end a war on a method?
The best device I know of for turning data into information is the human visual cortex. Forget AI use HI (Human Intelligence).
The trick is to reduce the vast amount of data to something that can be scanned at a glance.
Typically you produce a list of relevant items (e.g. by grabbing the doc IDs based on keywords from the source data), sorted by relevance (the scoring system). So if three keywords match in a single doc, score it high. If those three keywords appear in another doc, score both high and set the "both" flag. The sorted list, from high score to low, is then scanned. Experience soon tells you if your scoring system is working. The list you now have (electronically, hopefully) has links to the original docs; the analyst then clicks and reads. If relevant, act. If not, go to the next item.
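A bare-bones version of that keyword scoring might look like the sketch below. The document IDs, texts, and the one-point-per-keyword rule are assumptions made up for the example, and the "both" flag from the description is not modelled.

```python
import re


def score_documents(docs, keywords):
    """Rank documents by how many distinct keywords they contain.

    docs maps a document id to its text. Returns (doc_id, score) pairs,
    highest score first, leaving out documents with no matches.
    """
    wanted = {keyword.lower() for keyword in keywords}
    scored = []
    for doc_id, text in docs.items():
        words = set(re.findall(r"[a-z0-9]+", text.lower()))
        score = len(wanted & words)
        if score:
            scored.append((doc_id, score))
    # Highest score first; ties broken by id so the order is stable.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Hypothetical source documents.
docs = {
    "memo-1": "Wing spar fatigue found in the military trainer.",
    "memo-2": "Civil wing design review and rivet spacing.",
    "memo-3": "Cafeteria menu for next week.",
}
ranked = score_documents(docs, ["wing", "spar", "fatigue"])
print(ranked)  # [('memo-1', 3), ('memo-2', 1)]
```

The analyst then reads down `ranked` from the top, which is the "scan at a glance" step; whether the scoring is any good is still judged by a human.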
Software developed by Autonomy, based in Cambridge, England, connected BAE's research databases and alerted civilian aircraft engineers to the fact that the wing-construction problem they were working on was also being addressed by the company's military division.
That's not exactly a task for data miners - it's just bad communication! They could have done exactly the same thing just by making sure the directors were paying attention...there seems to be a big market for telling people the perfectly obvious.
Note the prominent sticker ;)
;)
Doesn't he mean "snicker"?
Ggrrrrrrr......
C|N>K
Yup, on a Dell from probably 1998-1999. Most of the other Dells in the photo look like they are of the same vintage.
Here's an example of the Microsoft Tax at work. This company most likely paid for Windows licenses on those machines even though they aren't using Windows.
1. Collect data
2. Do some mining
3. ???
4. Profit!
Interesting article, but this is something that has been happening and will continue to.
:-)
Technology being put to use to seek out enemies of the state for the world governments is nothing new.
At least it is a good thing that companies are making good money in the process. Your privacy? That was lost long ago.
It was only a matter of time before this happened. At least be glad that we've not yet reached the stage where they'd bother having your entire genome sequence to create solutions and replacements for you.
Perhaps the author of the article has just read Cryptonomicon or something.
Get over it, companies will track you, governments will monitor it. And there will be people who will beat both, and people who will be susceptible to both. Unfortunate, but hey, paranoia does not help either.
And oh, first post?
...how the Bayesian spam filters operate (on a much smaller scale). They find predictors of "spam" like these guys find predictors of "terrorists."
If the false positives of this system finding terrorists are as low as the ones that identify spam, is it really unreasonable to consider that probable cause for an investigation? At least, until the 0.000001% slips by and causes a lawsuit for wrongful arrest.
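For anyone who hasn't looked inside one, the core of a Bayesian spam filter fits in a few lines. This is a plain naive Bayes sketch with add-one smoothing, not the implementation of any particular filter, and the training messages are invented.

```python
import math
from collections import Counter


def train(messages):
    """messages is a list of (text, is_spam). Returns word and message counts per class."""
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for text, is_spam in messages:
        class_counts[is_spam] += 1
        word_counts[is_spam].update(text.lower().split())
    return word_counts, class_counts


def spam_probability(text, word_counts, class_counts):
    """Posterior probability that text is spam, using add-one smoothing."""
    vocabulary = set(word_counts[True]) | set(word_counts[False])
    spam_total = sum(word_counts[True].values()) + len(vocabulary)
    ham_total = sum(word_counts[False].values()) + len(vocabulary)
    log_odds = math.log(class_counts[True] / class_counts[False])
    for word in text.lower().split():
        if word not in vocabulary:
            continue  # unseen words carry no evidence either way
        p_spam = (word_counts[True][word] + 1) / spam_total
        p_ham = (word_counts[False][word] + 1) / ham_total
        log_odds += math.log(p_spam / p_ham)
    return 1 / (1 + math.exp(-log_odds))


# Invented training set: two spam messages, two legitimate ones.
training = [
    ("cheap pills buy now", True),
    ("win money now", True),
    ("meeting agenda for monday", False),
    ("lunch on monday", False),
]
word_counts, class_counts = train(training)
print(spam_probability("cheap pills", word_counts, class_counts))
print(spam_probability("monday meeting", word_counts, class_counts))
```

Each word is a "predictor" that nudges the odds one way or the other. Swapping "spam" for "terrorist" changes nothing in the math, but the base rate becomes vastly lower, which is where the false positive problem comes from.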
The more I read about data mining, the more it seems to provide a leap in connectivity and interaction, a step we are really due in a technological sense. When the internet was new and all (shortly after Al Gore invented it), there was much talk of how Big Brother would swoop in and turn us into ones and zeros, monitor our every move, and control us through the new portal. That hasn't happened yet (though Ashcroft is trying). Does it seem that data mining is more harmful (making us all into terrorists for buying fireworks and seeing Born on the Fourth of July on the same day) than good (allowing better prediction of supply and demand to lower costs and raise productivity)?
Today, however, companies that excel in connecting the data dots are finding a lifeline in a customer whose IT ineptitude is matched only by its means: the U.S. government, which will spend $53 billion on information technology this year. The Federal Government's inability to share and analyze information became clear in the months after the 9/11 attacks.
While I won't argue against the government's inability to do anything but waste money, I do think that these "anti-terrorism" dealies are going too far. We know that they are spending $53 billion on information technology. When they spend it on a hammer or a toilet seat I know that something is getting done, but "information technology" makes me suspicious.
Granted my opinion is largely a result of window flags selling in excess of twenty dollars and not hearing the results of such spending. In fact, I haven't heard of a single terrorist act averted since 9/11. It couldn't hurt to inform us when the spending pays off; could it?
Is this information actually getting results, or is it just profiteering of the corporations that we so love to slander and libel?
You can't judge a book by the way it wears its hair.
And here I thought 'data miners' were seven really short geeks, holed up in a server closet with some hot chick that's hiding from her evil step-mother. Well, you learn something new every day! =)
...oOOo..'(_)'..oOOo...
Data is useless by itself unless it can be used appropriately.
Sort of like the list on the conservative site NewsMax that finds that the vast majority of truly corrupt politicians in the past year were Democrats. What a coincidence!
What are the odds of finding out more things like this, like at the office of Total Information Awareness? Or the Transportation Security Administration's list of people who cannot fly?
"It is a greater offense to steal men's labor, than their clothes"
Why doesn't anyone else see them!?
You can't judge a book by the way it wears its hair.
dunno 'bout any one else, but I don't care for all the ads...
Print Link
Sometimes people just have to learn and adapt to change, it is one of the requirements of being a living thing.
#!/usr/bin/perl -w$ c=142;$ t=255;@t=map{$_%16or$t^=$c^=(1 1,122,20,100)[$_/16%8])$t^=(72, @z=(64,72,$a^=12*($_%162 :0,@z)[$_%8]}(16..271);if ((@a=unx"C*",$_)[20]&48){$h@ b=map{xB8,unxb8,chr($_^$a[--$ h+84])}@ARGV;s/...$/1$&/;$| (ord$b[4])>8^($f=$t&($d>>12^ $d>>4^^ $q>=8)+= $f+(~$g&$t))for@a[128..$#a]}print+x"C*",@a}';s/x/p ack+/g;eval
# 531-byte qrpff-fast, Keith Winstein and Marc Horowitz
# MPEG 2 PS VOB file on stdin -> descrambled output on stdout
# arguments: title key bytes in least to most-significant order
$_='while(read+STDIN,$_,2048){$a=29;$b=73;
$m=(11,10,116,100,
-2?0:$m&17)),$b^=$_%64?1
=5;$_=unxb24,join"",
d=unxV,xb25,$_;$e=256
$d^$d/8))>8^($t&($g=($q=$e>>14&7^$e)^$q*8
In related news: Seeking Sperm, Not Sex, Online
You may want to read this book and see it yourself whether data mining would make a breakthrough in the future.
--
Error 500: Internal sig error
1. Collect data
2. ???
3. Profit
If you read the title, you would see that it was dated 2002-12-23! That's so last year. Oh well, at least it's not a dupe!
"Throughout the '90s, data mining spread from one industry to the next, enabling companies to know more about customers' needs and to zero in on the characteristics that distinguish the customers they want from those they do not. A credit-card company using a system designed by Teradata, a division of NCR, found that customers who fill out applications in pencil rather than pen are more likely to default. A major hotel chain discovered that guests who opted for X-rated flicks spent more money and were less likely to make demands on the hotel staff, according to privacy consultant Larry Ponemon. These low-maintenance customers were rewarded with special frequent-traveler promotions. Victoria's Secret stopped uniformly stocking its stores once MicroStrategy showed that the chain sold 20 times as many size-32 bras in New York City as in other cities and that in Miami ivory was 10 times as popular as black. Aspect Communications, based in San Jose, Calif., sells a program that identifies callers by purchase history. The bigger the spender, the quicker the call gets picked up. So if you think your call is being answered in the order in which it was received, think again."
Couldn't the consumer use such information to get a better deal? Also of course there are the "abuse" aspects for the businesses and governments that use this.
Ok let's get this out of our system now:
Imagine a beowulf cluster of these things!....mining...data... yeah.
In Soviet Russia, data mines YOU!
It's official, Data Mining is DEAD. You don't have to be Kreskin to figure it out.
Hey! I just found this site all about data mining here!!!!!
Come on, really, is this News for Nerds or Stuff That Matters?
You could probably use data mining to determine how many hot grits Natalie Portman actually eats.
Alright. That should do it. Carry on with the discussion.
NO CARRIER
After 9/11, many tech companies saw opportunities for both patriotism and profit. Oracle offered to donate the software to create a federal identity database.
Well, I suppose it's nice to know that the handbasket we're going to hell in is at least free.
Well you're getting EXACTLY what you want. Don't cry and complain, data is data. To complain is to be a hypocrite. After all everything should be Open Source, eh? The moral: beware of what you ask for, you may just get it.
In the movie 'Bowling For Columbine' Michael Moore speculates that one of the root causes of gun violence in the US is the type of fearmongering the US media engages in in an effort to keep their sales/ratings up.
It looks like Time.com's gratuitous exploitation of US fears of crime and terrorism might be an example of this.
Ok, I've been annoyed for years at the disparity between corporations and customers in who knows what about whom. I think it's time someone came up with a P2P, open source reputation system in which we can turn the lens of data mining back on them. Technologies like Cuejack, combined with the efforts of groups like Transparency International, can help bring about Participatory Capitalism.
Power to the people!
Planet P Blog - Liberty with Technology.
www.enthea.org
Here is a real life story about data mining and its potential for brutal consequences. This was a very early application. Those who were fingered were killed. Of course, they adopted our new (lack of) due process rules a decade ago...
http://www.business2.com/articles/mag/0,1640,41206,00.html
There is a redhat sticker in the top-left corner of the picture.
can be located here:
http://www.knowledgeminer.net/
I've thought about using this software to analyze stocks to purchase, but never got around to looking at the information required for the software to give me an edge in the market. Looks promising though.
Panel One:
Dogbert Consults
My data mining software has found another message from God.
Panel Two
It says you've been stealing lunches from the refrigerator in the break room.
Panel Three
Then it says "Ha, Ha that wasn't pudding!"
btw, that was January 3rd on the Dilbert calendar this year..
At least the war on the environment is going well
The important thing about Natalie Portman's grits isn't that she eat them, it's that she pour them down my pants.
If she wants to eat them after that, well, that's fine, but any pleasure derived from that would be purely auxiliary.
In conclusion, I would be delighted if Miss Portman would be so kind as to pour some hot, steamy grits down the front of my trousers. Thank you for your time, and have a pleasant day.
--
the strongest word is still the word "free"
The article uses NASDAQ as an example of having to process terabytes of data on a daily basis, where data mining software can help filter things out. The software may be useful, but NASDAQ does not process terabytes per day of incoming data. I work in the market data industry and we take exchange feeds from around the world, including NASDAQ, and we don't process close to that much. OPRA (options) has the most data per day, and that is only on the order of tens of GB.
Yes but every time I try to see it your way, I get a headache.
i don't get it. what's that red hat thingy mean??
This article seemed to me more like a concatenation of a few press releases, especially the ones noting data mining successes, than "news." Then again, most news is simply rehashed PR (as a lecturer on NPR noted the other night).
Let our Data Mining Products make your life Better!
To save everyone time and annoying popups, consider visiting the sites of some of the products mentioned. These pages are every bit as insightful and critical as the article:
http://www.autonomy.com/
http://www.currentanalysis.com/
http://www.srdnet.com/
http://www.digimine.com/ (this didn't load for me, but I have Javascript disabled...)
http://www.unisys.co.uk/public-uk/justice/police/default.asp?cn=pa
Posting anonymously to dodge accusations of karma whoring.
strings: can't map file: /dev/mem ((os/kern) invalid argument)
As for data mining itself: more power to them. The government gaining the ability to mine the data it already has should mean that we don't need more organizations, more intrusive investigations, etc. Every report or credible news item about post-9/11 studies indicates that we already had enough information, so there should be no need to create new laws that allow for more information to be collected. Just use what you have already, kthx.
What would be nice is if this data mining allowed Muslims living in the U.S. to stop having to worry whenever they go outside. Look at the information publicly available, which may provide patterns of "nonobvious" connections, and let people live their lives in peace, regardless of background.
As a consumer, everything I do in public I consider public information. If a business uses this to better serve me, all the better. Maybe this will mean I don't have to watch feminine ads on TV, or the phone gets answered faster when I call. Maybe it just means that the customer rep knows my name and what I bought already.
''Victoria's Secret stopped uniformly stocking its stores once MicroStrategy showed that the chain sold 20 times as many size-32 bras in New York City as in other cities and that in Miami ivory was 10 times as popular as black.''
Ok. But WHY? Is a size-32 bra an indication of something?
So, I decided to mine almost 200 by-State demographic variables for correlates to autism by running through every combination of 2 variables via multiplication or division under a polynomial, exponential or null transformation -- then sorted them by their correlation to autism in the year 2000.
This is a case where what was "mined" was not just the raw data but various arithmetic combinations of statistical variables derived from the data. Some additional work is needed to make the figure of merit statistical significance rather than just correlation. I couldn't find Perl modules that provide a p-value (the probability of seeing a correlation at least this strong if the null hypothesis were true) for correlations.
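The approach described above, plus one way to get a significance figure without a stats library, can be sketched in Python (a swap from the Perl mentioned, purely for illustration). The variable names and numbers are made up, only products and ratios are tried, and the p-value comes from a permutation test, which is an assumed substitute for whatever test the original analysis would have used.

```python
import itertools
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient; 0.0 if either series is constant."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def derived_features(variables):
    """Yield (name, values) for the product and ratio of every pair of variables."""
    for (name_a, a), (name_b, b) in itertools.combinations(variables.items(), 2):
        yield f"{name_a}*{name_b}", [x * y for x, y in zip(a, b)]
        if all(y != 0 for y in b):
            yield f"{name_a}/{name_b}", [x / y for x, y in zip(a, b)]


def rank_by_correlation(variables, target):
    """Sort derived features by absolute correlation with the target."""
    scored = [(name, pearson(values, target)) for name, values in derived_features(variables)]
    return sorted(scored, key=lambda item: abs(item[1]), reverse=True)


def permutation_p_value(xs, ys, trials=2000, seed=0):
    """Share of random shufflings of ys that correlate with xs at least as strongly."""
    rng = random.Random(seed)
    observed = abs(pearson(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(trials):
        rng.shuffle(shuffled)
        if abs(pearson(xs, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (trials + 1)


# Made-up per-region variables and a made-up target rate.
variables = {
    "a": [1, 2, 3, 4, 5, 6],
    "b": [2, 1, 4, 3, 6, 5],
    "c": [5, 3, 6, 2, 4, 1],
}
target = [2, 2, 12, 12, 30, 30]

ranked = rank_by_correlation(variables, target)
best_name, best_r = ranked[0]
best_values = dict(derived_features(variables))[best_name]
p_value = permutation_p_value(best_values, target)
print(best_name, round(best_r, 3), p_value)
```

One caveat worth keeping in mind: with roughly 200 variables and several transformations, tens of thousands of combinations get tried, so some will look significant by chance alone. Each p-value would need a multiple-comparison correction (Bonferroni is the bluntest option) before it means much.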
Seastead this.
Three large British retail companies have recently created a joint loyalty card.
Nectar has been set-up by Sainsbury's (a supermarket), Barclays (a financial services company) and BP (a petrol filling station company).
I didn't mind Sainsbury's knowing that I eat junk, but now that they're telling Barclays what junk I eat I end up with Barclays putting my life insurance premiums up.
Interesting stuff.
At the end of the article, it mentions data mining helping to catch the DC snipers. Whoooooooa.
The cops had profiled a white male Christian terrorist, and that's all they were looking for. You didn't catch the article, but the real perps were stopped **10** times at roadblocks, they were in custody that many times.
And they were let go because their skin color contradicted what the data mining told them. They weren't caught until a Maryland state trooper leaked the license plate, then a trucker at a rest stop made the collar.
Data mining won't solve the stupidity of leaders like Chief Moose.
You can mine data to look for hidden business trends. If you mine the data really hard, you can see messages from GOD.
Yes.
KFG
that any true Christian could do this any more than I believe a true Jew or a true Muslim could have done it.
mod parent up
You can't judge a book by the way it wears its hair.
SSIA
The problem with automatic identification of any specific type of person within a large group (Say, the entire U.S. population - or , hey, the entire world! Why not? ) is the obscenely low false positive rate you must have. I mean to identify 100 terrorists in 270 million people, sure, a 50% false negative rate is fine (catching 50 terrorists is better than catching none, right?), but to not get those real terrorists swamped by innocent people who happen to match a profile, then the false positive rate must be lower than about 0.000037% ... that's almost impossible to achieve. And that is why automated terrorist (or anything) identification is still a long way off.
Karma: pi (Mostly due to circular reasoning in posts).
I also take issue with the statement
a customer whose IT ineptitude is matched only by its means
which is clearly a jab at the hard working professionals of the US government and an effort to push privatization of IT functions. I have work with IT professionals in Academic, Industrial, Commercial, and Government settings. I will tell you that IT professionals in all these setting range from incompetent to brilliant. The difference is that, until recently, US employees have not had to live with the fear of random layoffs or arbitrary insurance reductions. I often wonder why it is unpatriotic to insult policemen, firemen, or military officers, but when it comes to the professionals that allow these people to work, no insult is severe enough.
"She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
*how* does data mining work? (beyond "it makes connections between various data.") I don't recall it ever coming up in any of my classes. It seems like it would be an AI problem.
If everyone's going to go out and be paranoid, might as well know what we're being paranoid about.
If I have been able to see further than others, it is because I bought a pair of binoculars.
Does this data mining stuff remind anyone of the old urban legend about "The Beast?" A super computer in Antwerp of Brussels that knows everythin about everyone? Is that idea still as ridiculous as it was back in the day?
How ya like dat?
digiMine sells many different types of data mining solutions, but I believe their main focus is customer relationship management and customer segmentation.
These areas apply to business more than they apply to governments...
A major hotel chain discovered that guests who opted for X-rated flicks spent more money and were less likely to make demands on the hotel staff, according to privacy consultant Larry Ponemon. These low-maintenance customers were rewarded with special frequent-traveler promotions.
Cool. Next time I go on a trip I can order some in-room porn and justify it because I'll get better deals in the future!
I have blog like everyone else
I have not read all of this, but some of you with questions on how the actual data mining process works might get something out of it. Some of it is over my head, but that is not saying much. Check it out. http://sales.visualanalytics.com/whitepaper/index2.cfm?Template=HowToCatchAThief
to murder all Harry Potter fans!
You can't judge a book by the way it wears its hair.
funny, technical comments like these make me forget all about that goatse.cx website. oh wait...
Hooters would be lying to me about New York.
That or maybe the size-32 bras are seasonal...kinda like Hooters' beer fountain always filling up and we all suck it dry and they gotta fill it back up again.
ok, bad mental picture, sorry.
I always think of artificial intelligence when I hear data mining, and I kind of assumed that was what would be clarified (at least) by this article. However, I was wrong.
The most concrete evidence of success that is presented is that Victoria's Secret realized it sold tons of size X bras in New York and 10x as many white as black items in Miami. Um, I really hope they didn't have to hire a firm to tell them that. Don't they have spreadsheets? Does anyone look around the store and notice what sells?
Which moves me on to another point. Companies seem to have very little faith in their employees and ask very little of them these days. (Gets out his pipe and rocking chair.) I remember when my sister got her first job at an ice skating rink. They sold ice skating outfits to (mostly) mothers of young girls taking private ice skating lessons. My sister could tell you at a glance what outfits would sell first. (As I recall it was the most garish ones - she used to specifically ask for "ugly" or "anything that looks like it was designed by the color blind").
Nowadays, when I have to ask for help finding something in a store and I suggest a different location for it (real life example: Why don't you stock the phone connectors with your phones?) I get blank stares and comments along the lines of "ya, like my manager would listen".
a war on terrorism? How can we end a war on a method?
..and six other dwarfs grab our pickaxes, and lanterns, and go to the data mines.
those 1's and 0's can be tricky..
The Kruger Dunning explains most post on
The best device I know of for turning data into information is the human visual cortex. Forget AI, use HI (Human Intelligence).
The trick is to reduce the vast amount of data to something that can be scanned at a glance.
Typically you produce a list of relevant items (e.g. by grabbing the doc IDs based on keywords from the source data) and sort it by relevance (the scoring system). So if three keywords match in a single doc, score it high. If those three keywords appear in another doc, score both high and set the "both" flag. The sorted list, from high score to low, is then scanned. Experience soon tells you if your scoring system is working. The list you now have (electronically, hopefully) has links to the original docs; the analyst then clicks and reads. If relevant, act. If not, go to the next item.
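A minimal sketch of that kind of scoring in Python, assuming the score is just the number of distinct keywords found in each doc. The function name, doc IDs, and sample texts are invented for illustration, and the "both" flag is left out because the description above doesn't pin down how it works.

```python
def score_documents(docs, keywords):
    """Return (doc_id, score) pairs, highest score first.

    docs maps a doc ID to its text; the score is the number of
    distinct keywords that appear in that text.
    """
    wanted = {k.lower() for k in keywords}
    scored = []
    for doc_id, text in docs.items():
        words = set(text.lower().split())
        score = len(wanted & words)
        if score > 0:  # drop docs that match nothing
            scored.append((doc_id, score))
    # Highest score first; ties broken by doc ID so the order is stable.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored


# Hypothetical sample data.
docs = {
    "memo-1": "wing construction fatigue test results",
    "memo-2": "cafeteria menu for next week",
    "memo-3": "fatigue cracks found in wing spar",
}
ranked = score_documents(docs, ["wing", "fatigue", "construction"])
for doc_id, score in ranked:
    print(doc_id, score)
```

The analyst would then work down the printed list from the top, opening each doc and deciding whether to act on it.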
The Singularity is closer than you think
Quant
Software developed by Autonomy, based in Cambridge, England, connected BAE's research databases and alerted civilian aircraft engineers to the fact that the wing-construction problem they were working on was also being addressed by the company's military division.
That's not exactly a task for data miners - it's just bad communication! They could have done exactly the same thing just by making sure the directors were paying attention...there seems to be a big market for telling people the perfectly obvious.
sig:- (wit >= sarcasm)
"The possibilities are endless... In other words, where will this end?"
When people stop:
Driving without insurance.
Forgetting their timely APK car checkups.
Forgetting to pay their road taxes.
In other words. The few have spoiled it for the many, and the many stayed silent while the few did it. Welcome to the world that silence built.
In Soviet Russia, the data mines you!
That was interesting; where's the next chapter?
First of all, read what data mining is in FOLDOC (Free On-Line Dictionary Of Computing), if you don't know.
I thought that The Beast was in Tokyo and some chick named Satsuki was its sysadmin.