Slashdot Mirror


Crunching the Math On iTunes

markmcb writes "OmniNerd has posted an interesting article about the statistical math behind iTunes. The author makes some interesting observations concerning the same song playing twice in a row during party shuffle play, the impact that star ratings have on playback, and comparisons with plain old random play (star ratings not considered)." From the article: "To test the option's preference for 5-stars, I created a short playlist of six songs: one from each different star rating and a song left un-rated. The songs were from the same genre and artist and were changed to be only one second in duration. After resetting the play count to zero, I hit play and left my desk for the weekend. To satisfy a little more curiosity, I ran the same songs once more on a different weekend without selecting the option to play higher rated songs more often. Monday morning the play counts were as shown in Table 1."

14 of 276 comments (clear)

  1. I am not sure I see what he sees by ReformedExCon · · Score: 3, Interesting

    I'm looking at this data and it seems that iTunes does seem to pick out favorite songs more often than not-so-favorite songs. Which, I suppose, is the whole idea behind the Party Shuffle concept.

    So after analyzing all that data, how does Brian Hansen come to the conclusion that "it's simply the mind's tendency to find a pattern that makes you think iTunes has a preference". Uh, no. It's the software learning that you have a certain type of genre or style that you strongly favor and will selectively pick songs that are related, thus giving you a better-selected playlist.

    And it seems that the program has a bug in that it will play a song twice in a row. That's a real bug (if you don't like that type of thing).

    --
    Jesus saved me from my past. He can save you as well.
  2. Interesting by hattig · · Score: 4, Interesting

    I wish iTunes would get ratings from some online source much like it gets tracknames from Gracenote. Can you imagine a server of user-submitted ratings? You could opt to use an average rating from all users, or a rating from users with particular tasks (i.e., if you are a metaller, then you'll probably not want raver's musical opinions affecting your ratings!).

    Why? Because I haven't got the time to go around rating my entire music library. Judging from that article, it is dangerous to only do a few because of the weighting algorithm used - surely it would be more sensible to assume that 'not rated' meant 3 stars rather than 0 stars? That way you could rate down shitty songs, and rate up excellent songs, but ignore rating the vast majority of songs.

    1. Re:Interesting by the_unknown_soldier · · Score: 3, Interesting

      I only rate good songs... Indifference is the worst rating you can give to a song, so i think "0" fits in pretty well

      As for Gracenote: perhaps sales on the ITMS could act as a gauge of this. e.g. "This is this artist's most downloaded song and this artist compared to similar ones is bought 5x as much, so our algorithms suggest it should be rated 5" Then once you have downloaded it you can change it if you get the time.

    2. Re:Interesting by Stuart+Gibson · · Score: 4, Interesting

      Select All -> Get Info -> My Rating -> Three Stars.

      Rate up and down others as necessary. OK, not the point that default should be doing this for you, but a quick fix if you want it to work that way.

      If you already have songs rated then create a 0 star smart playlist and repeat.

      Stuart

      --
      It's all fun and games until a 200' robot dinosaur shows up and trashes Neo-Tokyo... Again
  3. Re:Reminds me of... by hattig · · Score: 4, Interesting

    Maybe songs need more than one rating.

    Rating For Morning Listening (* for Aphex Twin, Slayer, etc)
    Rating For Afternoon Listening (**)
    Rating For Evening Listening (****)
    Rating For Party Listening (**)
    Rating For ${mood} Listening

    Then instead of getting work done we can spend out entire lives rating music.

  4. Re: Try last.fm by P!Alexander · · Score: 4, Interesting

    That's exactly why I love last.fm (formerly Audioscrobbler & Last.fm). It automatically tracks what you listen to and then allows that information to be used to give you neighbors in the music world based on what interests you have in common. You can add friends, join groups, and even tag your music. All of this is extremely useful in finding new stuff. They've got plugins for all the major media players (and even some minor ones).

    Add on top of that the ability to play a custom-built radio station, set it to play only new music or listen only to music from a particular user profile.

    Linux and BSD supported! Open source plugins and radio station player! Could it get better? ;)

    ---
    but make sure that the last line
    Generated by SlashdotRndSig via GreaseMonkey

  5. Re:Underlying formula by ciroknight · · Score: 5, Interesting

    Your decimals look more like the pricing model than the weights for playing songs..

    5 star - .285 -- $299, iPod (full?) 20gb
    4 star - .238 -- $249, iPod mini 6gb
    3 star - .190 -- $199, iPod mini 4gb
    2 star - .143 -- $149, iPod shuffle 1gb
    1 star - .095 -- $99, iPod shuffle 512mb

    --
    "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
  6. Re:You wonder why the music industry is mad by hattig · · Score: 3, Interesting

    Music has turned from something that you collect and treasure into something you have and listen to practically all the time. It is very rare you decide you want to listen to just song X these days (in comparison to how much music is listened to overall), and actively put it and actively spend the time solely listening to it. Large mp3 collections have replaced radios at many places, great for getting rid of the music you really dislike and the DJ.

    Would I pay to have my music rated by an external algorithm? No. Would I pay to have my music peer rated? No - I'd also be contributing back to it like I contribute back to Gracenote and FreeDB.

    I suppose it is easiest to just rate everything *** and apply ****/***** and **/* to the tracks I really notice as standing out.

  7. Re:Reminds me of... by Gorath99 · · Score: 3, Interesting

    I was thinking about something like this myself. Basically, what I'd like to have are two flags:

    1: Never play unless I explicitly say so.
    2: Don't include in shuffle.

    The first one I'd use to flag interviews etc. that are sometimes included on albums. Is not necessarily bad content, just something that you don't generally need to hear multiple times.

    The second one is for flagging things like Beethoven's 9th. It's really good music, but you don't want 67 minute long pieces in a random playlist.

    I currently just use the 1 and 2 star ratings for this, but it's not really ideal. It's too bad (but understandable) that iTunes has no option for looking at TXX frames or I could implement it in a better way.

  8. Modal Music by Johnny+Mozzarella · · Score: 5, Interesting

    A friend of mine who worked at a radio station that played a very diverse range of music told me how they select music.

    She said that research had shown that listeners would rate the same song higher if it followed other song of a similar genre. If they play songs of different genres randomly the listener does not enjoy the music as much.

    So their tendency is to play "blocks" of music.
    For example....
    4 Classic Rock songs
    3 Blues Songs
    3 Folk songs
    4 Female Rockers
    3 Grunge
    etc.

    This is common knowledge in the radio world. I wonder if Apple has incorporated this type of logic into it's iTunes algorithms?

    The radio station in question is WXPN and can be found under iTunes > Radio > Public > WXPN

  9. Re:Reminds me of... by Lars+T. · · Score: 3, Interesting

    Use unique strings (as many as you like) in the Comments tag. Like "-don't play unless I say so-" or "-don't include in shuffle-". Then build intelligent playlists accordingly (Comments doesn't include "-don't play unless I say so-"). Errm, better use something shorter like "-DPlay-".

    --

    Lars T.

    To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

  10. Re:Reminds me of... by Dixie_Flatline · · Score: 4, Interesting

    I use the stars to indicate how often things should be played.

    * - Never play. It's only in the list for the sake of completeness (I hate having partial albums)

    ** - Play very rarely. If I'm in the mood, I might listen to it.

    *** - I'll listen to it at least once a week. If it comes up randomly on the shuffle, I won't take it out of the list.

    **** - I can listen to this several times in a day.

    ***** - I'll listen to this song anytime, anywhere. If it comes up twice in a row, no problem. If my playlist only has this song on it, I can cope with that for at least a few hours.

    This means that I have to periodically re-rate the songs. That seems only reasonable, though. Why would songs stay at the same rating forever? As the novelty wears off, I can relegate a song to 4 or 3 stars.

    I also keep extensive smart playlists that make sure that songs that are 3 stars or less only get played once every few days.

  11. the REAL underlying formula by Bert690 · · Score: 5, Interesting
    OK, after a bit more thinking, you were indeed very close. It appears the actual formula is:

    points(0 stars)=1
    points(1 stars)=3
    points(2 stars)=4
    points(3 stars)=5
    points(4 stars)=6
    points(5 stars)=7

    probability(X stars) = points(X stars) / 26

    This yields the following probabilities, listed along side the observed values from the article along with 95% condience intervals.

    p(5 star)=.2692 [.270 +- .0038]
    p(4 star)=.2308 [.230 +- .0036]
    p(3 star)=.1923 [.189 +- .0033]
    p(2 star)=.1538 [.154 +- .0031]
    p(1 star)=.1154 [.118 +- .0027]
    p(0 star)=.0385 [.039 +- .0016]

    As you can see each computed probability falls within the 95% confidence interval, so there's a good chance this is the correct forumla.

    Boy do I have too much time on my hands today.

  12. Crunching the math on Slashdot by AlpineR · · Score: 4, Interesting
    Nice analysis of iTunes. I'm somewhat surprised at the small difference in play frequency between 3, 4, and 5 stars; and disappointed that unrated songs are almost never played. In my collection, unrated means that the music is new to my collection. I think 1 star should be the kiss of death, not a blessing upon a previously unrated song.

    But all this talk of 0, 1, 2, 3, 4, 5 has me thinking of another rating system. Would anybody care to do an analysis of the ratings in Slashdot comments? What are the relative populations (I expect a ton of 2's but how about the rest)? Do comments made in the first hour after a story is posted stand a better chance of reaching +5 than comments made later in the day?

    One of my gripes about the Slashdot comment system is that it discourages contemplation and discussion. Comments made more than 24 hours after a story is posted are rarely read and almost never moderated. This is in contrast with comments system like Usenet or other bulletin boards, where threads can remain lively for weeks.

    AlpineR