Also, virtually every "word" ends with an "E" - this suggests that perhaps the writer had a number of coding schemes which could be carried out in his head on a word-by-word basis, and he's mainly using coding scheme "E" in this case.
The smallest word I can see is "SE" which appears at least twice - could this be "a" or "I" perhaps?
True, but the Conservative party didn't form the government themselves.
They had to form a coalition with the Liberal Democrats, which was was the only party to vote against the Digital Economy Bill to strengthen copyright enforcement.
Bear in mind that Cameron's party (Conservatives) didn't form the government. They are in a coalition with the Liberal Democrats, and EVERY Lib-Dem MP voted against the Digital Economy Bill which strengthened copyright enforcement.
You are, of course, correct. I meant to say that 33 has only two prime factors. This was meant to be in reference to Carl Sagan's Arecibo Message*, which used a similar encoding scheme, but with 1679 bits forming a 2d image if plotted as rows & columns based on its prime factors (23*73).
33 has only 2 factors: 3 & 11. We can therefore plot the bits as a 2d image in only one of 2 possible ways:
11 rows of 3: --- 110 010 111 001 001 010 100 100 111 010 011 --- or 3 rows of 11: --- 11001011100 10010101001 00111010011 --- This last image looks like the letters AV (easier to view if plotted in a grid), or then again it may just be a pretty pattern. Not enough data to be sure. My next thought was that maybe it evolves into something if plotted in Conway's Game of Life, but it doesn't seem to do much.
Basically, you were asked to predict how a number of users would rate a number of movies, based on their previous ratings of other movies.
You were supplied with 100 million previous ratings (UserID, MovieID, Rating, DateOfRating), with the rating being a number beween 1 and 5 (5=best), and asked to make predictions for a seperate ("hidden") set comprising roughly 10% of the original data. You could then post a set of predictions to their website which would be automatically scored, and you'd receive a RMSE (Root Mean Squared Error) by email.
To avoid the possibility of tuning your predictions based on the RMSE, you could only post one submission per day, and the final competition-winning results would be scored against a seperate hidden set, independent of the daily scoring set.
It really was a fantastic competition, and anyone with a little coding knowledge (or SQL knowledge) could have a decent go at it. Personally, I scored an RMSE of 0.8969, or a 5.73% improvement over Netflix's benchmark Cinematch algorithm, having learnt a huge amount based on the published papers and forum postings of others in the contest, and my own incoherent theories.
In a way, everyone wins. Netflix gets a truly world-class prediction system based on the work of tens of thousands of researchers around the world hammering away for years at a time. Machine learning research moves a big step forward. BellKor et al get a big juicy cheque, and enthusiastic amateurs like myself get access to a huge amount of real-world research and data.
Also, virtually every "word" ends with an "E" - this suggests that perhaps the writer had a number of coding schemes which could be carried out in his head on a word-by-word basis, and he's mainly using coding scheme "E" in this case.
The smallest word I can see is "SE" which appears at least twice - could this be "a" or "I" perhaps?
True, but the Conservative party didn't form the government themselves.
They had to form a coalition with the Liberal Democrats, which was was the only party to vote against the Digital Economy Bill to strengthen copyright enforcement.
Bear in mind that Cameron's party (Conservatives) didn't form the government. They are in a coalition with the Liberal Democrats, and EVERY Lib-Dem MP voted against the Digital Economy Bill which strengthened copyright enforcement.
You are, of course, correct. I meant to say that 33 has only two prime factors. This was meant to be in reference to Carl Sagan's Arecibo Message*, which used a similar encoding scheme, but with 1679 bits forming a 2d image if plotted as rows & columns based on its prime factors (23*73).
* http://en.wikipedia.org/wiki/Arecibo_message
33 has only 2 factors: 3 & 11. We can therefore plot the bits as a 2d image in only one of 2 possible ways:
11 rows of 3:
---
110
010
111
001
001
010
100
100
111
010
011
---
or 3 rows of 11:
---
11001011100
10010101001
00111010011
---
This last image looks like the letters AV (easier to view if plotted in a grid), or then again it may just be a pretty pattern. Not enough data to be sure. My next thought was that maybe it evolves into something if plotted in Conway's Game of Life, but it doesn't seem to do much.
Could you clarify where you got that statistic from? According to my research, the relative murder rates* are:
US: 0.042802 per 1,000 people
UK: 0.014063 per 1,000 people
i.e. you are more than 3 times as likely to be murdered in the US.
*Source http://www.nationmaster.com/graph/cri_mur_percap-crime-murders-per-capita
>That doesn't mean "you need less people" - it means the same person can crank out more stuff.
Unfortunately, if the demand for said "stuff" is finite, and each person can crank out more, then "you need less people" is *exactly* what it means.
Basically, you were asked to predict how a number of users would rate a number of movies, based on their previous ratings of other movies.
You were supplied with 100 million previous ratings (UserID, MovieID, Rating, DateOfRating), with the rating being a number beween 1 and 5 (5=best), and asked to make predictions for a seperate ("hidden") set comprising roughly 10% of the original data. You could then post a set of predictions to their website which would be automatically scored, and you'd receive a RMSE (Root Mean Squared Error) by email.
To avoid the possibility of tuning your predictions based on the RMSE, you could only post one submission per day, and the final competition-winning results would be scored against a seperate hidden set, independent of the daily scoring set.
It really was a fantastic competition, and anyone with a little coding knowledge (or SQL knowledge) could have a decent go at it. Personally, I scored an RMSE of 0.8969, or a 5.73% improvement over Netflix's benchmark Cinematch algorithm, having learnt a huge amount based on the published papers and forum postings of others in the contest, and my own incoherent theories.
In a way, everyone wins. Netflix gets a truly world-class prediction system based on the work of tens of thousands of researchers around the world hammering away for years at a time. Machine learning research moves a big step forward. BellKor et al get a big juicy cheque, and enthusiastic amateurs like myself get access to a huge amount of real-world research and data.