Software Predicts Movie Success
scheming daemons writes "TechNewsWorld has an article about software that predicts whether a movie will be successful or not by factoring in its rating by censors (e.g. G, PG, R), strength of the cast, genre, competition from other films at the time of release, special effects, whether it is a sequel, and the number of theaters in which it will show."
More explosions, chases and boobs done with a *smaller budget* == more profit. :)
Trolling is a art,
I expect it's less of an AI and more of a simple collection of linear statistical models (linear regressions and general linear models) using parameters gleaned from the performance of past films.
Unless yours was a reference to the awful film A.I.
"Place me in the company of those who seek Truth, but deliver me from those who believe to have found it."
actually, they did use excel. google cache
The main result is that the method (neural net) works a little better than other methods on the same data (Table 4 of paper). It scores 75% in a test; conventional regression scores 71%. As they say in the statistical literature, "big woop"; the fancy new thing is marginally better than the simple old thing.
As for the practical side of things, the main predictive variable is the number of screens on which the film was initially shown. The next-highest predictive variables are a variable representing the use of technical effects and a variable represengint the actors' reputation. Well, none of these indicates that this tool (or others discussed in the paper) is of any real use to the industry. The suggested use of the tool is to predict movie success. But the main predictive variables all represent things the industry already knew, when the film was being made and promoted. It's like asking a patient if they have a cold, and then charging them to tell them they have a cold.
If this were 1990, the title would read "neural network predicts movie success" and the discussion would be about the impending success of strong AI.
Reading TFA, it's impossible to know whether this study has any value without seing a proper article, as submited to a reputable stats journal.
First of all this sounds like simple statistical classification with pretty obvious variables. However making classification work is not always trivial.
Methodology is the key here. The sample of 800 movies is rather small, and the details on the chosen explanatory variables is sketchy. With enough variables, even meaningless ones, one can explain anything on a training sample. However with proper classification techniques, using for example jacknife/resubstitution/cross-validation one can find out if the classification model has any actual predictive values.
As someone said "anybody can predict the past", and someone else "prediction is rather difficult, especially about the future".