Slashdot Mirror


Internet Data Mining for Investment Analysis

CaroKann writes "Reuters is reporting on a Wall Street investment research company, Majestic Research, that is using web crawling techniques to track business performance. Instead of attempting to estimate business conditions by talking to company management, or pounding the pavement visiting stores, this company uses data mining systems to collect real-time sales data and other information on companies that have a web presence. Using this data, Majestic attempts to estimate company earnings more accurately than traditional research outfits."

74 comments

  1. Traditional Wall Street Research? by eldavojohn · · Score: 4, Funny
    But the New York-based research firm is winning converts among hedge funds who say its brand of Web-based quantitative analysis can be more accurate than traditional Wall Street research forecasts.
    Possibly because "traditional Wall Street research" involves reading tea leaves and throwing down chicken bones while watching Alan Greenspan do a rain dance to the gods in hopes that our economy will pick up.

    Economics and future fiscal predictions are completely theoretical. There are just too many variables involved, folks.
    --
    My work here is dung.
    1. Re:Traditional Wall Street Research? by Saevio · · Score: 1

      Would he be by any chance dancing with Ben Bernanke?

    2. Re:Traditional Wall Street Research? by Billosaur · · Score: 1
      Possibly because "traditional Wall Street research" involves reading tea leaves and throwing down chicken bones while watching Alan Greenspan do a rain dance to the gods in hopes that our economy will pick up.

      And despite plaudits and accolades from government and business, Greenspan was no "genius" in the classic sense. He made mistakes early on (ask George Bush the First!) and eventually found a technique (the now famous "Raise/Lower Interest Rates a Quarter Point Shuffle") that seemed to work most of the time. Well, it may have worked most of the time; it worked in the beginning, and eventually people began to see him as a "prophet," to where his just saying something could make the markets dance.

      Ultimately this system will be abused and become worthless, probably right after its greatest success, so that it will perpetuate itself and eventually become a market influence, even though it will have no practical value at that point.

      --
      GetOuttaMySpace - The Anti-Social Network
    3. Re:Traditional Wall Street Research? by nelsonal · · Score: 1

      Greenspan's genious was not in his handling of the Federal Reserve, he was an expert at utilizing the power he built through control of the FOMC to push politicians into deregulating the national economy. Regardless of your opinion of deregulating the national economy, I think anyone can appreciate the level of genious required for a single person to have such a substantial impact on that many decisions without publicly revealing the level of such influence until they near retirement.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    4. Re:Traditional Wall Street Research? by nelsonal · · Score: 1

      I was a traditional 'Wall Street research analyst' (on the buy side) and found this to be the source of several fruitful ideas (MS/DOJ settlement release half a day before everyone else, Apple G5, AMD/Intel competitive dynamics, and the post 9/11 Akamai vs everyone else's news sites report). You might be surprised how valuable some of the info is around here.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    5. Re:Traditional Wall Street Research? by Billosaur · · Score: 1

      But having one person wield that kind of power in an economy is anathema, and makes all economic models suspect. Do you now have to add a "Greenspan" factor to your equations? And more importantly, if you do, is this factor transferable, so that the "Greenspan" factor now becomes the "Bernacke" factor and has the same relative weight. No one person (or nation for that matter) should wield that kind of power, lest it create conditions where the misuse or abuse of that power would cause the dominoes of the world economy to topple.

      --
      GetOuttaMySpace - The Anti-Social Network
    6. Re:Traditional Wall Street Research? by Shag · · Score: 1

      Who? ;)

      Oh, wait, is that the new guy?

      He needs to get in the news some more so we can all learn to remember (and maybe even pronounce) his name already.

      --
      Village idiot in some extremely smart villages.
    7. Re:Traditional Wall Street Research? by nelsonal · · Score: 2, Interesting

      The only factor Greenspan had in most models was "the Greenspan put" which impacted growth only indirectly (because it freed an awful lot of risk capital). Macro forecasts are not worthe the paper they are printed on, my models were designed to interrpet when others were missing things like market share shifts and competitive advantages that were forming or decaying. The economy played a relativly small role in how those dynamics shifted (ie Dell was a better competitor than HP in good times 1998-2000, and bad 2001-2003) but to what extent did investors give Dell too much credit and HP too little credit for their successes and failures. Spotting that early is what most analysts are paid to do.

      --
      Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
    8. Re:Traditional Wall Street Research? by Uber+Banker · · Score: 2, Informative

      Macro forecasts are not worthe the paper they are printed on, my models were designed to interrpet when others were missing things like market share shifts and competitive advantages that...

      Good for you. And when you get your fixed interest rate mortgage, when a company takes out a loan for investment, when you have a great investment idea but need to buy a cascade of 10 year futures (invest in a gas pipeline but want to invest in the quality of the management rather than take risks on anything and everything affecting the gas price and the second order effects on your investment) to mitigate your risk and make the bucks you planned, you are relying on macro forecasts.

      They are broad-brush efforts, very rarely would a 'macro person' deny that. But they are essential to the modern running of our global economy. The point of 'analysts' [stock market analysts, rather than economic analysts] is to have ideas about individual companies, perhaps sectors vs. sector or region vs. region if pushed to it. Consensus Forecasts (which accumulate individual stock market analysts across companies, markets and countries) are laughable when looked at on a macro level, even more laughable than economist estimates. There are some excellent analysts, but they are usually the ones that accept estimation is, to varying degrees, educated guesswork, rarely will there be one sure choice; and if there is, that's because time, dedication, luck, restraint and insight have been combined.

      If any readers fancy a quick run-down of macro economic policy, I recommend today's publication of the Bank of England Inflation Report, or more specifically, the live press briefing where topics are discussed in a straight-forward way. Hopefully Bernake will implement something similar at the Federal Reserve to improve on Greenspan's smoke and daggers approach to public statements.

    9. Re:Traditional Wall Street Research? by Anonymous Coward · · Score: 0

      Spotting that early is what most analysts are paid to do.

      Then why do most "analysts" spot things soooo late? Downgrade after a company badly misses it's numbers and says that it is now the new trend. Gee, that is so tough to do. Rarely have I heard of an analyst being early to anything and when it does happen it is more the result of the law of probability.

      I think most analysts are paid to just fart out their mouths...

    10. Re:Traditional Wall Street Research? by Fnkmaster · · Score: 1

      Simple - smarter analysts long ago moved to the buy side. The people left on the sell side are either dumb or young and inexperienced and want to move to the buy side.

  2. Now that the companies know that... by drgonzo59 · · Score: 2, Insightful

    They can create bogus pages to feed to the Majestic bot like in the BMW vs. Google case.

  3. Cue the web spam... by Rob+T+Firefly · · Score: 5, Insightful

    We can expect yet another huge rise in fake blogs, fake product reviews on Amazon and such, and paid shills in chats and message boards. Swell.

  4. My data mining results by CaptainFork · · Score: 4, Funny

    based on manually mining (eg reading) Slashdot I determine a spike in Majestic's share price about now...

  5. Take that! by A+beautiful+mind · · Score: 0, Troll

    So who said you can't track business performance with a regular expression, you pointy haired noob?!

    --
    It takes a man to suffer ignorance and smile
    Be yourself no matter what they say
  6. I call Bull by spectrokid · · Score: 3, Insightful

    TFA mentions data about drug prescriptions by hundreds of physicians. Is that lying around unorganised on the net? Tell me which algorithm you are going to use to predict how many XBOX365 are going to get sold next month by webcrawling??? You think supermarkets post their sales-figures to public webpages? Wallmart is said to have more data off-line than is available on the entire public section of the net. Now give me access to that.. But on the other hand; if you work for the sales-tax administration (in Europe) and all the big companies file their invoices weekly, that is also a good starting point...

    --

    10 ?"Hello World" life was simple then

    1. Re:I call Bull by Anonymous Coward · · Score: 1, Interesting

      Having spoken to the company, I'll call your bull call. There are good proxies for retail sales data available from/over the web - how significant a channel is Amazon for some manufacturers? I don't doubt you can licence all kinds of goodness from other online properties - how about cars.com? Also TFA isn't entirely accurate. As well as data mining the internet, they also have access to a large number of proprietary (& $$) data sources - the drug prescription data you mention comes from companies like IMS Health & ImpactRX.

      Stop whoring! AC me up Scotty!

    2. Re:I call Bull by giorgosts · · Score: 1

      exept that tax statements are confidencial. Quaterly results though of listed companies are public..

    3. Re:I call Bull by nickh01uk · · Score: 1
      If I understand it correctly from reading their site, this company agrees information exchange relationships with major online retailers for point of sale data, aswell as gathering 'truly public' stats on Ebay bids, Amazon stock levels, and other stuff that you can read on the net. This idea isn't all that new, but its the first time we've seen it used across all industries. I remember an industry body for pharmaceuticals that used to ask its members for rolled-up numbers for drug sales each year, and then sell that information (suitably obscured) back to all the other drug companies. "You are number 2 in the market, and the next guy is miles behind you".

      That sort of thing.

      Theres a british company doing similar Internet data mining for the telecoms industry, with some data avaiable on their site. Nick.

  7. mining online news stories for word connotations by Cryofan · · Score: 3, Interesting

    I wrote a project in perl some years ago that would download online financial news stories and count the critical words and weigh their connotational weight, and compare that to the direction of the stock market. For example, if the words "stocks" and "down" started showing up a lot in sentences in online news stories, you might expect a downward trend.

    I posted the preliminary code online in the perl newsgroup.

    google "data mining" "news" "perl" etc

    --
    eat shiat and bark at the moon
  8. Realtime News Analysis by Anonymous Coward · · Score: 2, Interesting

    A friend of mine has developed software that goes even further. It parses streaming news stories for good/bad news and executes orders before humans even finish reading. That advantage is enough to make this company a mint.

    1. Re:Realtime News Analysis by Anonymous Coward · · Score: 0

      Oh yeah. Well, a friend of my friend wrote a book that describes how I can make millions in selling legal online pharmaceuticals - on the internet -- to the entire planet. Does anyone know where I can get V1aGra? Caveat Emptor.

    2. Re:Realtime News Analysis by instarx · · Score: 1

      A friend of mine has developed software that goes even further. It parses streaming news stories for good/bad news and executes orders before humans even finish reading. That advantage is enough to make this company a mint.

      Yeah, riiiight. Who knew it was so easy! Baloney.

      I read financial news stories, and even with my human brain focusing on the article it is difficult to tell whether the information presented is a positive or a negative indicator for the price of the stock.

      Here's an example:
      "XYZ Corp today reported a 23% increase in earnings over the same period last year due to +8% positive sales growth in regional stores."

      "Good news" right? Wrong - because if the market was expecting a 26% increase in earnngs the stock price will immediately corrected down. Or maybe 23% earnings was well below the company's peer group average of 28%. Even humans have trouble determining what is good and what is bad news, and no moronic piece of software that "reads" news streamers for good news is going to do anything other than put you in the poorhouse. (And BTW where exactly are these hypothetical super fast streaming news sources that are ahead of all the financial networks?)

      My suspicion: you are just sitting around knowing nothing about the market or investing and it seems really cool to you that a streaming news reader would make money. So to make it sound better you invent a "friend with a program" that has made a mint. Baloney.

  9. Will it work? Yes and no. by luvirini · · Score: 1
    Certainly at some level the things that happen around a company in public sources have a bearing on the stock price as many people base their investments on such info.

    But the real problem with everything like this is.. even if it works well for many things... there will be those who will try to missuse it.. and finding all those will be very hard. Further it only takes one major problem case and your nice product becomes a laughing stock.

  10. This is great by RoboSpork · · Score: 2, Interesting

    Computers should be able to give a much more unbiased assessment of the economy than any person ever could. People are essentially incapable of interpreting economic data in a straightforward way, political agendas always seem to work their way into economists opinions about the economy. By using algorithms to do the analysis (and allowing market forces to refine those algorithms), we should be able to get a much better understanding of the REAL economy.

    This is a good thing for mankind.

    1. Re:This is great by zenwrench · · Score: 1

      Yes, in fact people *are* essentially incapable of interpreting economic data in a straightforward way ... but then, it is the economic activities of those same seemingly irrational people that the market reflects. My point being that while the machines will likely come to a deeper understanding of the market than us higher-cortex skin-sacks ... it will likely come at the price of what will appear as bias. Without change to the most fundamental mechanisms upon which the economy works, it might well be that no matter how profound our collective understaning becomes the economy will always be capable of 1929's, Asian Crisis and Internet Bubbles. Otherwise, our machines are only serving us to find progressively more convaluted exploits in the market ... and those exploits are all about bias.

    2. Re:This is great by nrrd · · Score: 1

      Well, I think computers and good algorithms can help, but who chooses the algorithms?

      Instead of trying to fully analyze the economy, let's try a relatively simple example: how do you determine who is in poverty in the USA? Do you go by a set income (i.e. anyone who makes less than 18,000/year). If so, what if someone lives in an expensive city vs. an inexpensive rural area? Is it going to be the same amount in each state?

      Or is poverty determined by lifestyle? If you can't afford health insurance are you in poverty? If you can't afford heating for your home in the winter, then are you in poverty? Is the decision process for determining who's in poverty to be decided by Republicans or Democrats? You're going to get very different results depending on who's calling the shots. I could go on, but I think you get the idea.

      --
      "Eye halve a spelling chequer, It came with my pea sea, It plainly marques four my revue, Miss steaks eye kin knot sea"
    3. Re:This is great by instarx · · Score: 1

      By using algorithms to do the analysis (and allowing market forces to refine those algorithms), we should be able to get a much better understanding of the REAL economy.

      There is no "REAL" market! (given the context of the thread I think you mean "market" and not "economy"). There is no holy grail of a perfect technical stock picking algorithm.

      We could have all the information in the world and all the computing power to analyze it and we would still not be able to predict the market because it is driven by psychology, not by data. The dot.com boom was not driven by indictors, it was driven by the mob. If you had done all the analysis of the technicals of those stocks you would never have bought any of them. They were a self-fulfilling prophecy - the stocks went up because people were buying them and people bought them because they were going up. And of course they went down for exactly the same reason.

      The best we can hope to do is to use data mining to determine the factors that make the majority of the mob react in a certain way. "These 20 things have happened therefore people are more likely to think XYZ stock will go up and are therefore more likely to buy XYZ stock, therefore I should buy XYZ stock". There will never be a black box that accepts data and then spits out the "REAL" market decision to buy or sell.

      Even if there were somehow a program that could predict the market then it would immediately nullify itself and become useless. If it said that XYZ was going up for sure then no one would sell XYZ. Since no sellers also means no buyers, XYZ would remain unchanged (and was therefore unpredictable in the first place). It seems clear that as programs become more accurate at predicting stock prices the market will self-correct to make their predictions less accurate.

    4. Re:This is great by RoboSpork · · Score: 1
      Well, no, I did mean economy, and not market. I also was reffering to algorithmns to MEASURE and qauantify the economy, not predict economic trends. Economists have a very hard time measuring very basic hard economic quantifiers like GDP (ever heard of the black market?), total cash flow and the like. They also have a hard time measuring more subjective quantifiers like poverty rate and consumer satisfaction. Maybe you missed your economics class the day the professor explained this, but economics is a science that involves every aspect of all life (not just human). Stock prices are but a small facet of the "economy".

      Your post is full assertions and assumptions that are not relevant to what I was talking about, but lets just address a few of the worst:

      We could have all the information in the world and all the computing power to analyze it and we would still not be able to predict the market because it is driven by psychology, not by data.


      Wrong. The market is driven by two forces, supply and demand. Psychology has an impact on only one of those forces (demand), and the impact of that is alot less than you assert (unless, of course your talking about the hottest/latest tech stock bubble). The fact is that demand is more a function os basic life needs (food, shelter, etc) than it is related to psychology. People need shit, despite the fact that you think they are sheep.

      Even if there were somehow a program that could predict the market then it would immediately nullify itself and become useless.


      You might want to do some more research about the supposed impossibility of stock market predicition algorithmns. Many index funds are managed on a mostly automated basis by guess what, computer algorithmns. These funds have made good returns for their shareholders. Maybe the funds are not doubling value every month as clearly you dream of, but they are holding to the averages, and that is alot better than many mutual funds that are managed by very smart people.
    5. Re:This is great by instarx · · Score: 1

      Maybe you missed your economics class the day the professor explained this, but economics is a science that involves every aspect of all life (not just human).

      What a smarmy repy. I have opinions on the issues and addressed them, but you have to resort to personal attack to make yours. Every point you replied to had a personal insult attached. Since I make my living in the market my thoughts on self-delusional stock-picking algorithms may have some validity. Surely you don't think that just becaue some funds managers use some algorithms that they have any real validity. About ten years ago some hedge fund managers spent tens of millions developing now-infamous software algorithms to make a killing in the market. They tested for a year. It was a sure thing. They were flat broke and out of business in two years, having lost literally hundreds of millions. Significant academic study of what went wrong uncovered that there was an inherent flaw in the basic concept and what they were trying to do was impossible. (So no, I don't need to do more research - you might want to though). Funds managers can hire all the programmers and collect all the data they want but its still GIGO.

    6. Re:This is great by RoboSpork · · Score: 1

      Sorry, I re-read my post and it is flaming sounding, I apologize. I was a wee-bit grumpy before I had my coffee. I wish you luck researching the market and shuffling that gravy around. I have done work with technical analysis myself and I know how hard it can be. I gather from your posts that you regard technical analisys as a tool and not an end-all solution and I would certainly agree with that.

  11. the rise of the machines... by DeveloperAdvantage · · Score: 2, Insightful

    This is interesting stuff. I would like to learn more about the algorithms they use to analyze their data - the article has very few details. It is neat how systems like this are becoming favored over traditional human analysts (or at least reducing the need for people).

    I remember back in grad school in the late 90s I worked on a major project to design an intelligent agent based system including the same functionality, but, in addition to pulling information off the internet, it could also take into account whatever other information could be gathered and interfaced into it (for example, there is also a lot of content on TV which could be fed into a system, in addition to the online data). It was a design project though and not implemented, perhaps I will need to resurrect it!

    I do think the whole area of quantitative or at least semi-quantitative analysis of information, both textual and numerical, is going to explode over the next few years, driven by vast amounts of incredibly cheap computing power and bandwidth. Computer applications do amazing stuff right now, but five years from now truly "intelligent" applications will exist. The term "artificial intelligence" has fallen out of fashion, perhaps a sign of how common place these systems have now become.

    As an example, our local phone company has a voice recognition system which actually works reasonably well, much, much better than anything 5-10 years ago. We are certainly making progress.

    --
    FREE - Java, J2EE and Ajax Audiobooks for Software Developers - www.DeveloperAdvantage.com
  12. Apply bayesian filter for buy/sell advice by Colin+Smith · · Score: 1

    Train it for a particular stock automatically using the actual direction of the stock. Set the filter as one of the inputs among many others (yahoo data) to a genetic algorithm system and then give the lot away free. Bankrupt the big financial advice firms. :)

    Hmm, might be worth using it as an excuse to play with Ruby.

    --
    Deleted
  13. Wait a sec by DrSbaitso · · Score: 2, Interesting
    For example, if the words "stocks" and "down" started showing up a lot in sentences in online news stories, you might expect a downward trend.

    So you wrote a program that would read some stories that said the stock market was going down, and it told you the market was down? Did your program also see if weather news reports contained words like "rain" and "downpour" and hence "predict" rain?
    --
    beware the jabberwock, my son! the jaws that bite, the claws that catch!
    1. Re:Wait a sec by Uber+Banker · · Score: 1

      Did your program also see if weather news reports contained words like "rain" and "downpour" and hence "predict" rain?

      Infact, in most places in the world, predicting the following day's weather to be the same as the present day's weather would be reasonably accurate. It could be further refined by taking into account local weather patterns. Of course in the stock market this should be corrected for in projections using simple statistics, and is done so by most market participants (most weighted by market capitalisation). The constant guessing and counter-guessing of others is where it gets very very hard. If the GP was faster at doing what he/she said, it might work, but then raises the question "why haven't better resourced market participants done it before?". Which then turns into a bit of a circular argument around the Efficient Markets Hypothesis.

  14. Ties to Majestic 12? by Anonymous Coward · · Score: 2, Interesting

    Does anyone know whether Majestic Reasearch has any connections to Majestic 12 (http://www.majestic12.co.uk/)? For those who don't know, Majestic 12 is a distributed search engine. The distributed part is in that they have a bunch of people donate CPU cycles and bandwidth to run a web crawler in a SETI at home fashion. Now i thought this was a good thing to join, because we kind of need some independent alternatives to google. But if it turns out i'm sponsoring some marketing firm, well... i'd feel pretty stupid.

    1. Re:Ties to Majestic 12? by atw · · Score: 1

      > Does anyone know whether Majestic Reasearch has any
      > connections to Majestic 12 (http://www.majestic12.co.uk/ [majestic12.co.uk])?

      As the founder of the Majestic-12 project I can assure you that we are not related in any way, shape or form.

  15. Re:This is great??? by Skewray · · Score: 1

    Good things like the crash of 1987? Computer-run mutual funds appear to do no better than fund managers, either. I suspect that most of the larger mutual funds have relatively strict rules about when to buy and sell, in order to minimize emotional choices.

  16. Stock Memes - I posted this one year ago by broward · · Score: 2, Interesting
  17. Will it work? by Arwing · · Score: 2, Informative

    Nope, it won't, because even if it does, everyone will start using it and render it useless. There is only one trend in stock market that is backed up by statistics over long run and that is the stock market drifts upward overtime. My professor did a exmeripment using computer modeling, basically using a random number generator to decide if the stock market goes up or down, adding the 'upward drift' factor using historical data and comparing it to the actual data over last 75 yrs, and two data looks almost identical. I know it doesn't "prove" my point, but it does show that playing stock market short term is basically a flip of coin.

    1. Re:Will it work? by nickh01uk · · Score: 1

      Theres another company doing similar 'evidence based research/reporting' specifically for the Internet industry here. In particular they publish the 'inside story' on recent M&A and flameouts in the industry.

  18. Several groups do this by JimDog · · Score: 2, Interesting

    I once interviewed with a group in San Francisco that did stuff like this. They weren't clear about who they were working for, but I do remember some of the techniques they mentioned during the interview. Some of these were actually implemented, others were just ideas:

    - An eBay crawler that could estimate the number of auctions and average selling price to predict whether eBay would make their earnings target or not. eBay quickly blacklisted their IP space, so they started using a bunch of open proxies they found.

    - By analyzing client/server communication for the Sims Online, they discovered that each connection was assigned a sequentially incrementing connection ID number. By looking at the rate at which the connection ID numbers were increasing each time they logged in, they determined that the Sims Online wasn't going to be nearly as popular as Electronic Arts was forecasting.

    - They talked about placing a camera somewhere in Union Square (in SF) to monitor the entrace to Tiffany's during the holiday shopping season, and doing image analysis to determine what percentage of shoppers left the store with a Tiffany's bag in hand.

    - Monitoring wireless carriers' spectrum to determine what percentage of GSM/CDMA channels were in use for data vs. voice. The communication itself is encrypted of course, but you can still tell whether a channel is carrying voice or data. They wanted to determine if wireless carriers forecasts about revenue from data services were accurate.

    1. Re:Several groups do this by Uber+Banker · · Score: 0, Flamebait

      Nice examples, very nice. Sounds like a company with some innovative ideas, which are sadly rare in larger financial institutions. While you were unsure of their clients, do you have any published success rates, or even their name?

      The reason? I just may send my CV off!

    2. Re:Several groups do this by Anonymous Coward · · Score: 0

      i agree with you completely on the innovation of the ideas... im not sure if ideas like this just arent published, or if their rare? do you happen to have any links of communities that kinda bubble out ideas like this ?

    3. Re:Several groups do this by Uber+Banker · · Score: 1

      'Hedge funds' are the most generic group these guys would fit in, although they're not an actual hedge fund, I think they ally with this type of investor in terms of being innovative and having a good amount of discression in their concentrated approaches. A good community website for hedge funds (with a good amount of private equity, also potentially similar business tactics, and related companies) is Albourne Village. Subscribe to their daily newsletter and you will see the odd gem of an idea every few days, thought there is a lot of noise to the signal, I think this is the leading website for 'the trade'.

    4. Re:Several groups do this by berkmant · · Score: 1

      Hey Uber Banker, Why don't you send US a CV if you are interested and have something to contribute: tony@majesticresearch.com Cheers!

      --
      "When a true genius appears in this world, you may know him by this sign, that the dunces are all in confederacy against
  19. This is nothing new by atomic777 · · Score: 1

    This is old news. Data mining systems like this have been around for years. Some are even actual real data mining systems (i.e. a SELECT statement against two tables is not data mining!! argh... i digress)

    1. Re:This is nothing new by DeveloperAdvantage · · Score: 1

      Sure, and some computational tools, the early,early,early forerunners of modern computers, are even much older. For example, from http://en.wikipedia.org/wiki/Abacus

      "The origins of the abacus are disputed, suggestions including invention in Babylonia and in China, to have taken place between 2400 BC and 300 BC."

      Granted, there has been some improvement since then.

      --
      FREE - Java, J2EE and Ajax Audiobooks for Software Developers - www.DeveloperAdvantage.com
  20. Well, then they should analyze gold by argoff · · Score: 1

    Forget trying to analize companies for optimum performance. For real performance dump the whole batch and buy precious metals because the fact is that the US economy has more debt than can ever be paid off at face value. IMHO, gold is pratically guaranteed to outperform every investmant class out there.

    Seriously, just watch what happens when the fed decides to print up money to try and stall off a cascading credit collapse. They will print up some, but that will make things worse because it will drive up costs without driving up pay or driving down personal debt. So they will print up more, and that will make things more worse for the same reasons, and so on. When it is all over, costs will likely be 10 x higher while pay stas about the same. I woulnd't be supprised if the dollar stopped being a currency.

    1. Re:Well, then they should analyze gold by Anonymous Coward · · Score: 0

      LOL! I hope you aren't serious about the dollar going out. Don't forget that exchange rates are all done through the dollar. Don't listen to the aforementioned advice above...

    2. Re:Well, then they should analyze gold by Anonymous Coward · · Score: 0

      Fyi, gold prices already are doing better than the rest of the market. They've trippled over the last 5 years, and have gone up 30% in the last 7 months. (compare that to any other investment class) Lets think about this? what is driving that? Answer: a growing loss of faith in the dollar as a reserve currency is what.

      So have us debt cumulative levels decreased lately? no, has the us economy had growth of real total purchasing power lately? no. untill both of those become yes again, don't even think of buying anything other than commodities or precious metals.

    3. Re:Well, then they should analyze gold by Anonymous Coward · · Score: 0

      We laugh (LOL) at what we don't understand. The only thing that the dollar is pegged against is people's faith. In case you don't know, every fiat currency in the history of mankind has ultimately been devalued to nothing. And they each have followed the same track that the dollar is currently following, almost like the stages of a star.

    4. Re:Well, then they should analyze gold by Fnkmaster · · Score: 1

      Almost every part of your post is wrong. The US government's debt (Federal debt only) is about 40% of US GDP. Compare this to countries like Japan which are well over 100%.

      Gold, historically has underperformed any other asset class. Over the last 200 years. Over the last 50 years. And over the last 10 years. Gold is traditionally seen as a good inflation hedge, for obvious reasons, but it's not a good long term investment.

      The US government will never resort to seignorage (printing of money) as you seem to suggest. Any macroeconomist knows this is a recipe for disaster.

      Compare, by the way, the current debt load as a fraction of GDP to that during WWII - it was well over 100% by the end of WWII, vs. ~60% now (including all levels of government). Also remember that because of nominal (and real) GDP growth, this fraction changes over time.

      Anyway, what you are predicting is a complete collapse of sound management of the US economy. I sincerely doubt this will happen in our lifetimes. Stagnation, maybe, inflation, at times, boom and bust cycles, sure, but your prediction is completely alarmist and I see no justification for it at all.

    5. Re:Well, then they should analyze gold by 808140 · · Score: 1

      I've never understood what makes these crazy psycho-libertarians think that there is anything that makes gold of inherently high worth. Honestly, it's just a metal, it's not even that rare, and with the exception of being a particularly good conductor the only thing that makes it valuable is that some people think it's pretty.

      It's really bizarre. It's like they've never bothered to read any economics texts ever.

    6. Re:Well, then they should analyze gold by publius_jr · · Score: 1

      Your very ignorance is what presents my opportunity. It is true that there is little inherent value in gold. It is heavy, malleable, conductive, shiny, stable, and scarce. But what of those dollars you so tout? Their value stems from the demand for American goods and services and their scarcity. America's manufacturing capacity has been declining (e.g. GM) and the supply of dollars expands approximately at a 7% annual rate, doubling roughly every ten years. When our future obligations come due--when American baby boomers demand Social Security and healthcare payments and foreign lenders demand interest payments--the government will need to make some undesirable choices: (1) increase taxes on working Americans, (2) decrease other federal expenditures, and/or (3) print more dollars in an act of partial default. In any case, the assured scarcity of gold presents a great way to maintain purchasing power, a goal the wise investor will not underestimate. In the long term, gold is useful because it stores value, with very little risk. Buying it will not make you rich, unless others ignore this value, trusting their government will not erode their money's value. Nor will it make you poor, unless you overestimate this value, unnecessarily distrusting government. But gold is a very simple investment, for its only real value is that it is a store of value. You completely neglect gold's primary value, focusing solely on its technological uses. As your neglect is common to most people I meet, it is no surprise that gold's current value exceeds its current price. Nevertheless, there are other ways to not only maintain purchasing power but increase it, by investing in something productive, like stocks. This requires risk, although in the long term, it would be more risky not to take this risk. In any case, do not put all your eggs in one basket, as we men are fallible, and hold as few dollars as possible.

    7. Re:Well, then they should analyze gold by argoff · · Score: 1

      Almost every part of your post is wrong. The US government's debt (Federal debt only) is about 40% of US GDP. Compare this to countries like Japan which are well over 100%.

      I didn't say the US government, even though their debt is nice and high too. It's the total debt in the US economy vs what the economy puts out and economic growth. (not to mention unfunded promises)

      Gold, historically has underperformed any other asset class. Over the last 200 years. Over the last 50 years. And over the last 10 years. Gold is traditionally seen as a good inflation hedge, for obvious reasons, but it's not a good long term investment.

      It mostly has. And that is a very bullish argument for gold, because once it starts to get remonitized - the catch-up will be enormous.

      The US government will never resort to seignorage (printing of money) as you seem to suggest. Any macroeconomist knows this is a recipe for disaster.

      There are 270 trillion with a T in outstanding derivative contracts, they can not afford a cascading default at any cost. They will monatize, there is no way arround it.

      Compare, by the way, the current debt load as a fraction of GDP to that during WWII - it was well over 100% by the end of WWII, vs. ~60% now (including all levels of government). Also remember that because of nominal (and real) GDP growth, this fraction changes over time.

      With WW2 most the investment went into infrastructure, the dollar was backed with precious metals, and the US wasn't experiencing technology deflation. This "business cycle" most of the debt went into re-fi's, the dollar is backed by nothing, and the information age has created a very strong technology deflation - the worst since the invention of the cotton-gin and the railroads.

      Anyway, what you are predicting is a complete collapse of sound management of the US economy. I sincerely doubt this will happen in our lifetimes. Stagnation, maybe, inflation, at times, boom and bust cycles, sure, but your prediction is completely alarmist and I see no justification for it

      I am not calling for a collapse of the US economy, but I am calling for a collapse of the dollar, which will likely cause some suvere pain for a while in the US economy too. In fact, it is the US economy making the transition to the information age forcing the death of the dollar and the re-monitisation of gold. Of course, you will have pelnty of warning before that happens. Gold will probably go to about 680 this year, 1600 within a year or two after, stabilize for a while, then head up north of $4000 and seal the fate of the dollar.

      Well, back in 95 they said I was a kook when I said that Linux was going to take over the marketplace, and I bet my career on that (dumping SCO). They lauged and even mocked me. Implied that I was stupid and throwing away my career, but I was so right and they were so wrong. And it brought me a lot of success and opportunity while many of them fell on their faces and became career loosers. But ironcally my success did not bring me joy, because I didn't do it to prove that they were loosers and I was a winner, or to have just another stupid trivia debate - I did it to try and convey some values, understanding, to share some depth, to be sociable. My point is that I am trying to say something that is way beyond price prediction of a stupid yellow metal. The fed is controlling people and lying to people about the value of their money in a way that is unjust and can not and will not survive the information age any more than the unjustices of the slave plantation system could survive the industrial revolution. Once the concept of justice is in place, then everything else is just connecting the dots.

    8. Re:Well, then they should analyze gold by 808140 · · Score: 1

      Money is an inefficient store of value. It, like gold, tends to decline in value over time (inflation) when many other investments increase in value. Gold decreases in value relative to the dollar, and has been doing so for a long time, as the GP pointed out. So gold is, at least at this point, one of the few things out there that is an even less efficient store of value than money.

      Investing in gold is also not productive, from an economic perspective, because gold is useless. You need money to make more money, and that's why the loanable funds market exists. If you're using money as a store of value, you're silly indeed. But I don't think anyone is suggesting that.

      Since you're a libertarian and believe in the intelligence of the market, perhaps you should check out gold futures. If you're so certain the dollar is going to crash, maybe you should put your money where your mouth is and buy an asset whose value has been declining for more than a hundred years. You'll make a bundle.

      But you'll excuse me if I don't share your passion for losing money.

    9. Re:Well, then they should analyze gold by Anonymous Coward · · Score: 0

      Hate to tell you, but American manufacturing capacity isn't declining. It's just not owned by American companies anymore.

      If you look at say, the auto industry, the share of Big Three capacity has declined. But when you put in all the Japanese plants that are in the US, the share of cars made in the US hasn't changed much. When I read it in an article last year I thought it couldn't be true but it is. Go look up the numbers yourself.

  21. Re:mining online news stories for word connotation by Anonymous Coward · · Score: 0

    So, did it work?

  22. Differing Research Methods by VeryHotTopic · · Score: 1

    The firm's methods differ from traditional Wall Street research, where analysts make forecasts based on conversations with company executives, advertisers, suppliers and mall visits to forecast company results and make recommendations.

  23. Re:mining online news stories for word connotation by Cryofan · · Score: 1

    yes, over a limited time, it did seem to work, at least to some degree. I used to have a webpage up on geocities that had a java display of my data. Doubt it is still around.

    One thing that would need to be done is collect a set of the most important words. I never had time to do that (this was a senior project for my comp sci degree).

    I just came up with a set of critical words on my own (probably 30 words or so, of negative, positive and neutral connotational value).

    But if you had completed a collection and study of words and then tracked the appearance and occurence of those words to the financial index values, THEN you could build the program from there.

    --
    eat shiat and bark at the moon
  24. This is not new by GCHQAgent · · Score: 1

    IDL have been doing this for years - http://www.investor-dynamics.com/

  25. Re:mining online news stories for word connotation by Cryofan · · Score: 1

    hmm. can't find any trace of my project on the net anywhere. I know I have the code and data and my paper at home, though...

    --
    eat shiat and bark at the moon
  26. Re:mining online news stories for word connotation by metlin · · Score: 1


    So, is this code still up somewhere? :)

  27. Re:mining online news stories for word connotation by Cryofan · · Score: 1

    wow, I just searched the net and it is NOWHERE to be found. A 70 page paper and a bunch of (pretty bad) perl code. But I do have the code and data at home. Someone perl coder named clarkson fixed the code for me, but I think my own code worked better.

    But here was the url:
    http://www.geocities.com/uhdseniorproject

    now gone, and no cache.

    But I still think the idea is a great one...

    --
    eat shiat and bark at the moon
  28. Maybe they'll soon announce a deal with Google? by abdulzis · · Score: 1

    Maybe they'll soon announce a deal with Google?

    --
    Cheers!! Abdul Aziz
  29. Re:I call Bull (warrented skepticism) by wintermute42 · · Score: 1

    If I can rephrase the skepticism expressed in the parent post: most publicly available information posted on web sites will not yield any startling analysis that can't simply be gained by reading annual and quarterly reports.

    There was some comment in response to the parent post that the data mining company was licensing data. This also sounds suspicious. There is an SEC regulation called FD (for Fair Disclosure). This regulation states that you cannot preferentially provide investor information to someone. FD was put in place to stop companies from selective discloser to their pet stock market analysts who always write good things about the company, while freezing out those who write anything critical. FD says that if you provide material information about the company, you have to make it available to everyone. The only way to comply with FD while providing confidential information would be to provide the information with the understanding that it will be aggregated and that companies will not be identified. For example, oil industry companies could provide information on reserves, which could be aggregated and reported as an industry figure. Even in this case, the aggregated information would have to be available without prejudice (e.g., anyone could by it).

    There has even been some concern about violating Regulation FD if companies allow analysts to travel around and talk to their sales outlets or customer base. The result of this would be material non-public information that could be used by the analyst and was obtained through the cooperation of the company.

    This does not mean that you cannot derive advantage from information. For example, if you have a market model that finds predictive features from data that is either publicly available or that anyone can buy, then you can leverage this information.

  30. Difference between a Prediction and a Summary by B.+Pascal · · Score: 1

    Hello all:

    I like to highlight that there is a difference between a Prediction and a Summary. From what I read so far, the tool posted in the article generates a summary, which maybe used as a prediction.

    Let s(t) be the Summary of a system (in this case, the economy) at any given time, then:

    A prediction, p(S), would be a prediction based on a set of summary S, where: S == {s(t), s(t-1), s(t-2), ... s(t-n)}

    One can always make a prediction based on a very small number of summaries. |S| = 0 is a guess. |S| = 1 means that no past summaries are considered in the predication, just the most up-to-date one. Presumably, the bigger |S| is, the more information is considered in that summary.

    The usefulness of such a tool lies in the value of t. Web-crawling allows one to collect much data in a small amount of time. If one is able to collect a summary quicker than everyone else, then presumably, someone using this summary tool would be able to stay ahead of the trend.

    That being said, one of the input of s(t) is actually publicly available data. Financial reports events after the fact. Information based on actual financial transactions (ones that you can collect if you plan a spybot at the central booth of a major retailer, for example) is much better. At the end of the day, if you want to play a really cut-throat, high profit game of stock trading, I think you are better off having insider info.

    Cheers.

    B. Pascal

  31. Real-Time? by triso · · Score: 1
    ...this company uses data mining systems to collect real-time sales data and other information on companies that have a web presence....
    Can anyone name a company that keeps "real-time sales data" on the web?
    1. Re:Real-Time? by Anonymous Coward · · Score: 0

      Ebay, of course.

  32. The more things change... by randyjg2 · · Score: 1

    This is a pretty common trick, and used in one form or another on Wall Street for many years.

    I have seen ones that scanned EDGAR filings, (got canceled when the company was destroyed in the 9/11 attack), campaign contributions (works wonderfully for the telco and other highly regulated industries). patent filings (generally surprisingly well, though no one knows why), job adss, and many others.

    I even heard of one that analyzed free internet porn...(insert your favorite joke here, but it actually was a fairly good predictor. The cognitive psychology behind it was fascinating).

    Using search engines and NORA text mining is basically a form of technical investing. If you have a data store of any kind whose contents influenced or are influenced by members of the market niche of a company, it can tell you something about the future of that niche. Thats just plain marketing 101...

  33. where does research begin... by tashammer · · Score: 0

    I am curious to know where market research ends and industrial espionage begins. Data mining seems, in some respects, to go beyond mere research when specific companies are selected for "research". In essence there is no agregation of information to present industry-wide assessments and forecasts. Or am i drawing a rather long bow here?

  34. Re:I call Bull (warrented skepticism) by Anonymous Coward · · Score: 0

    I guess whether or not "public" information available on the Internet can prove useful or not depends on what the information is and how it is analyzed. If one saw online for instance that a company was slahing prices across an important cateogry of products by monitoring the major retail channels for that company and noticed this data before the company said anything at the end of the quarter then it could be argued that value was added. As for Reg. FD, the companies data is being licensed from are not the companies being analyzed the data is gathered somewhere in the channel or the food chain so to speak and is completely independent of the companies being analyzed. IMS is the perfect example - if you buy POS data from the Pharmacies about which drugs are being dispensed that is not coming from a large Phareceutical company but can give you good insight into say how Pfizer is doing in the depression market.