Slashdot Mirror


Crunching the Math On iTunes

markmcb writes "OmniNerd has posted an interesting article about the statistical math behind iTunes. The author makes some interesting observations concerning the same song playing twice in a row during party shuffle play, the impact that star ratings have on playback, and comparisons with plain old random play (star ratings not considered)." From the article: "To test the option's preference for 5-stars, I created a short playlist of six songs: one from each different star rating and a song left un-rated. The songs were from the same genre and artist and were changed to be only one second in duration. After resetting the play count to zero, I hit play and left my desk for the weekend. To satisfy a little more curiosity, I ran the same songs once more on a different weekend without selecting the option to play higher rated songs more often. Monday morning the play counts were as shown in Table 1."

62 of 276 comments (clear)

  1. iTunes is a monopoly by Anonymous Coward · · Score: 3, Funny

    As a record store owner, My business faces ruin. CD sales have dropped through the floor. People aren't buying half as many CDs as they did just a year ago. Revenue is down and costs are up. My store has survived for years, but I now face the prospect of bankruptcy. Every day I ask myself why this is happening.

    I bought the store about 12 years ago. It was one of those boutique record stores that sell obscure, independent releases that no-one listens to, not even the people that buy them. I decided that to grow the business I'd need to aim for a different demographic, the family market. My store specialised in family music - stuff that the whole family could listen to. I don't sell sick stuff like Marilyn Manson or cop-killer rap, and I'm proud to have one of the most extensive Christian rock sections that I know of.

    The business strategy worked. People flocked to my store, knowing that they (and their children) could safely purchase records without profanity or violent lyrics. Over the years I expanded the business and took on more clean-cut and friendly employees. It took hard work and long hours but I had achieved my dream - owning a profitable business that I had built with my own hands, from the ground up. But now, this dream is turning into a nightmare.

    Every day, fewer and fewer customers enter my store to buy fewer and fewer CDs. Why is no one buying CDs? Are people not interested in music? Do people prefer to watch TV, see films, read books? I don't know. But there is one, inescapable truth - Internet piracy is mostly to blame. The statistics speak for themselves - one in three discs world wide is a pirate. On The Internet, you can find and download hundreds of dollars worth of music in just minutes. It has the potential to destroy the music industry, from artists, to record companies to stores like my own. Before you point to the supposed "economic downturn", I'll note that the book store just across from my store is doing great business. Unlike CDs, it's harder to copy books over The Internet.

    A week ago, an unpleasant experience with pirates gave me an idea. In my store, I overheard a teenage patron talking to his friend.

    "Dude, I'm going to put this CD on the Internet right away."

    "Yeah, dude, that's really lete [sic], you'll get lots of respect."

    I was fuming. So they were out to destroy the record industry from right under my nose? Fat chance. When they came to the counter to make their purchase, I grabbed the little shit by his shirt. "So...you're going to copy this to your friends over The Internet, punk?" I asked him in my best Clint Eastwood/Dirty Harry voice.

    "Uh y-yeh." He mumbled, shocked.

    "That's it. What's your name? You're blacklisted. Now take yourself and your little bitch friend out of my store - and don't come back." I barked. Cravenly, they complied and scampered off.

    So that's my idea - a national blacklist of pirates. If somebody cannot obey the basic rules of society, then they should be excluded from society. If pirates want to steal from the music industry, then the music industry should exclude them. It's that simple. One strike, and you're out - no reputable record store will allow you to buy another CD. If the pirates can't buy the CDS to begin with, then they won't be able to copy them over The Internet, will they? It's no different to doctors blacklisting drug dealers from buying prescription medicine.

    I have just written a letter to the RIAA outlining my proposal. Suing pirates one by one isn't going far enough. Not to mention pirates use the fact that they're being sued to unfairly portray themselves as victims. A national register of pirates would make the problem far easier to deal with. People would be encouraged to give the names of suspected pirates to a hotline, similar to TIPS. Once we know the size of the problem, the police and other law enforcement agencies will be forced to take piracy seriously. They have fought the War on Drugs with skill, so why not the War on Piracy?

    This evening, m

    1. Re:iTunes is a monopoly by the_unknown_soldier · · Score: 2, Informative

      Before any idiots post saying "You're lying! don't be an idiot" - This post is a satirical post critizing the music industry, it has been posted before on slashdot, it is a joke. It is not serious!

    2. Re:iTunes is a monopoly by shish · · Score: 2, Insightful
      This post is a satirical post critizing the music industry

      ... how? All I see is a tired old troll :/ Where's the funny / insightful that "satire" implies?

      --
      I mod down anyone who says "I will be modded down for this", regardless of the rest of their comment
    3. Re:iTunes is a monopoly by the_unknown_soldier · · Score: 2, Insightful

      "Dude, I'm going to put this CD on the Internet right away."

      "Yeah, dude, that's really lete [sic], you'll get lots of respect."

      I was fuming. So they were out to destroy the record industry from right under my nose? Fat chance. When they came to the counter to make their purchase, I grabbed the little shit by his shirt. "So...you're going to copy this to your friends over The Internet, punk?" I asked him in my best Clint Eastwood/Dirty Harry voice.

       
      You don't find that hilarious? Something is wrong with you!

    4. Re:iTunes is a monopoly by RollingThunder · · Score: 2, Insightful

      Anyone who doesn't catch the inherent parody in

      They have fought the War on Drugs with skill

      clearly shouldn't be allowed online without a minder.

    5. Re:iTunes is a monopoly by PaulusMagnus · · Score: 5, Insightful

      Reads just like one of those anti piracy adverts the MPAA forces us to watch at the movies, or that FACT in the UK put on their DVDs.

      Piracy happens because technology happens. We pirate music because it's easy to copy and considerably less than buying it. We don't pirate books because it's frankly too expensive in photocopying charges but there's a whole collection of pirated PDFs out there, if you care to look.

      Technology changes the world we live in. I don't recall the Horse & Cart Association of America (HCAA) suing people that moved to cars which put them out of business. I also don't recall the MPAA or RIAA suing Intel, IBM or Microsoft for giving us these tools that enable us to pirate music.

      If piracy destroys the music business, so be it. Technology often destroys antiquated business models whether it's children cleaning chimneys, horse drawn carriages, coal mining or farming by hand. These people need to find a business model that works. An artist only makes around 5% from every track sold, the label and distributors cream off the rest. That's unfair, IMO.

      Why do we also need to have movie distributors for every corner of the world bidding for the distribution rights? Are we not one global market?

      I think it's about time that the movie and music industries were overhauled as they've had way too much power and too much of a monopoly for too long. After all, we're not killing people here with this technology, we're just changing lives. We're just hurting the profit margins, I thought this is what happened in a capitalist and democratic society. Why do we in the Western world create these societies with freedom to innovate and freedom to make money but then try to shackle them when it starts to backfire?

      Bring on the technology, lets keep changing the world!!!

    6. Re:iTunes is a monopoly by atldavidg · · Score: 3, Insightful

      From the article: "I grabbed the little shit by his shirt." Technically--yes, this is physical assault because he laid hands on someone in an aggressive manner. And as far as the religious right not fighting back....what rock have you been living under? I do not use the word "christian" because there's very little that is "Christ-like" in the way that most of today's religious leaders and their followers behave. ...but that's another topic entirely...

    7. Re:iTunes is a monopoly by ndtechnologies · · Score: 2, Informative

      How about we give the artist that right to choose whether they want to give their music away for free or not...that is what we do with our website. The artist will get more freedom with us, than with the RIAA...they really need to revamp their business model.

      --
      I have nothing clever to put here...
  2. Ok... by jim_v2000 · · Score: 4, Funny

    So from this we learn that the random play on iTunes really is random, and that rating a song really does have an effect. Who'd a thunk?

    Next, "iTunes really does play tunes!"

    --
    Don't take life so seriously. No one makes it out alive.
    1. Re:Ok... by QuantumG · · Score: 5, Insightful

      Confirming something instead of just assuming it is the case. What's that called again.. oh yeah, Science! Clearly this article is in the wrong section.

      --
      How we know is more important than what we know.
    2. Re:Ok... by Jamu · · Score: 2, Informative

      Like most random numbers generated by computer they aren't really random. They're pseudo-random, generated by a deterministic algorithm and having the appearance of randomness. The less processing power you dedicate to the algorithm, the less random the numbers will appear on inspection. The iTune statistics can only be as good as the algorithm they used to generate their (pseudo-)random numbers.

      You can find true random numbers here and also some more information on pseudo-random and random numbers.

      --
      Who ordered that?
    3. Re:Ok... by TheGavster · · Score: 5, Funny

      All lies! The play order is too complex to occur naturally; there must be some intelligence that selects the order of the songs!

      And, of course, looking into the origins of said intelligence is blasphemy.

      --
      "Because Science" is one step from "Because old book". Try "Because of my experiment testing my falsifiable assertion".
    4. Re:Ok... by Mozk · · Score: 2, Insightful

      Being an almost firm believer in determinism, I would say that there are no such things as (true) random numbers. If we were able to monitor the atmospheric noise (movements of atoms), and predict it perfectly, the numbers would be just as random as pseudo-random numbers generated by computers. However we are nowhere near that kind of technology, like the Googleplex Star Thinker in H2G2, so for the time, yeah, it basically is truly random.

      --
      No existe.
    5. Re:Ok... by kalidasa · · Score: 2

      Damn that's funny. Disturbingly accurate, but funny.

  3. I am not sure I see what he sees by ReformedExCon · · Score: 3, Interesting

    I'm looking at this data and it seems that iTunes does seem to pick out favorite songs more often than not-so-favorite songs. Which, I suppose, is the whole idea behind the Party Shuffle concept.

    So after analyzing all that data, how does Brian Hansen come to the conclusion that "it's simply the mind's tendency to find a pattern that makes you think iTunes has a preference". Uh, no. It's the software learning that you have a certain type of genre or style that you strongly favor and will selectively pick songs that are related, thus giving you a better-selected playlist.

    And it seems that the program has a bug in that it will play a song twice in a row. That's a real bug (if you don't like that type of thing).

    --
    Jesus saved me from my past. He can save you as well.
    1. Re:I am not sure I see what he sees by jim_v2000 · · Score: 4, Informative

      I think his point was that with a random order, it is possible for the same song to play twice in row. Not likely, but possible. He then goes on to say that people sometimes try to find patterns where there are none...which is correct. iTunes just happened to play the same song twice randomly.

      --
      Don't take life so seriously. No one makes it out alive.
    2. Re:I am not sure I see what he sees by hattig · · Score: 2, Interesting

      You can only play the same song twice in a row if the algorithm reshuffles the songlist after every song played.

      If you do a static shuffling, i.e., a shuffle at the beginning of playback, and then trudge through the playlist that was generated then you will certainly get each song played the same number of times, and you won't get repeats. The only chance of getting a repeated song is if the last song of a shuffled playlist is the same as the first song of the next shuffled list, which is 1/n^2.

      You can combine the two however. Have 6 queues, one for *****, another for ****, and so on. Each queue would have its own last-played pointer. Each queue would be randomly shuffled once, until all songs in that queue have been played. Then have your weighting algorithm merely choose which queue to play from, and then play the next song in that queue.

  4. Reminds me of... by Anonymous Coward · · Score: 5, Funny

    the time my 2G iPod seemed to have a liking for the Aphex Twin's Selected Ambient Works Vol. 2. It was playing a track off it pretty much every other song. Those of you who know the album can appreciate that it's not the kind of music that you'd maybe choose as everyday listening material.

    It became so annoying that I ended up removing the album from iTunes, at which point my iPod promptly died. The replacement was big on Roxy Music IIRC...

    1. Re:Reminds me of... by hattig · · Score: 4, Interesting

      Maybe songs need more than one rating.

      Rating For Morning Listening (* for Aphex Twin, Slayer, etc)
      Rating For Afternoon Listening (**)
      Rating For Evening Listening (****)
      Rating For Party Listening (**)
      Rating For ${mood} Listening

      Then instead of getting work done we can spend out entire lives rating music.

    2. Re:Reminds me of... by Gorath99 · · Score: 3, Interesting

      I was thinking about something like this myself. Basically, what I'd like to have are two flags:

      1: Never play unless I explicitly say so.
      2: Don't include in shuffle.

      The first one I'd use to flag interviews etc. that are sometimes included on albums. Is not necessarily bad content, just something that you don't generally need to hear multiple times.

      The second one is for flagging things like Beethoven's 9th. It's really good music, but you don't want 67 minute long pieces in a random playlist.

      I currently just use the 1 and 2 star ratings for this, but it's not really ideal. It's too bad (but understandable) that iTunes has no option for looking at TXX frames or I could implement it in a better way.

    3. Re:Reminds me of... by Lars+T. · · Score: 3, Interesting

      Use unique strings (as many as you like) in the Comments tag. Like "-don't play unless I say so-" or "-don't include in shuffle-". Then build intelligent playlists accordingly (Comments doesn't include "-don't play unless I say so-"). Errm, better use something shorter like "-DPlay-".

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    4. Re:Reminds me of... by Dixie_Flatline · · Score: 4, Interesting

      I use the stars to indicate how often things should be played.

      * - Never play. It's only in the list for the sake of completeness (I hate having partial albums)

      ** - Play very rarely. If I'm in the mood, I might listen to it.

      *** - I'll listen to it at least once a week. If it comes up randomly on the shuffle, I won't take it out of the list.

      **** - I can listen to this several times in a day.

      ***** - I'll listen to this song anytime, anywhere. If it comes up twice in a row, no problem. If my playlist only has this song on it, I can cope with that for at least a few hours.

      This means that I have to periodically re-rate the songs. That seems only reasonable, though. Why would songs stay at the same rating forever? As the novelty wears off, I can relegate a song to 4 or 3 stars.

      I also keep extensive smart playlists that make sure that songs that are 3 stars or less only get played once every few days.

    5. Re:Reminds me of... by Salvo · · Score: 2, Interesting

      I have three playlists on my iPod, which I use to sort and rate Music.
      'Unrated [infinity]' restricts the selection to Songs with a Rating of -----.
      'Unrated 50' also restricts it to the 50 least recently played songs.
      Finally, '***+' selects songs with a rating of ***--;, ****-, or *****, and then only the 100 least recently played songs.
      All these playlists are Live Updating.

      When I need to rate songs, I play 'Unrated [infinity]' or 'Unrated 50', and rate songs as I play, using the same rules as the Parent. When I just want to listen to good music, I listen to '***+'.

      Also remember that unlike Slashdot, iTunes and the iPod can display Unicode characters, so you can use Real stars and Infinity instead of Asterisks and whatever.

  5. Interesting by hattig · · Score: 4, Interesting

    I wish iTunes would get ratings from some online source much like it gets tracknames from Gracenote. Can you imagine a server of user-submitted ratings? You could opt to use an average rating from all users, or a rating from users with particular tasks (i.e., if you are a metaller, then you'll probably not want raver's musical opinions affecting your ratings!).

    Why? Because I haven't got the time to go around rating my entire music library. Judging from that article, it is dangerous to only do a few because of the weighting algorithm used - surely it would be more sensible to assume that 'not rated' meant 3 stars rather than 0 stars? That way you could rate down shitty songs, and rate up excellent songs, but ignore rating the vast majority of songs.

    1. Re:Interesting by the_unknown_soldier · · Score: 3, Interesting

      I only rate good songs... Indifference is the worst rating you can give to a song, so i think "0" fits in pretty well

      As for Gracenote: perhaps sales on the ITMS could act as a gauge of this. e.g. "This is this artist's most downloaded song and this artist compared to similar ones is bought 5x as much, so our algorithms suggest it should be rated 5" Then once you have downloaded it you can change it if you get the time.

    2. Re:Interesting by Anonymous Coward · · Score: 2, Insightful

      Problem is, then you'd end up playing what ITMS wanted you to play. And then they'd get paid by record companies to fix ratings slightly. And... well, you can probably work it out from there.

    3. Re:Interesting by RAMMS+EIN · · Score: 2, Informative

      Some open-source music players have a system where music is automatically rated based on how often you select the song to play (and possibly on how often you interrupt it once playing). This always seemed like a good way to make the program learn what you like.

      I can't really say how well this works in practice, or which programs support it, because I don't use the feature myself. However, I suspect it would work better than an explicit rating system, much like bayesian spam filters work better than explicit ones.

      --
      Please correct me if I got my facts wrong.
    4. Re:Interesting by Stuart+Gibson · · Score: 4, Interesting

      Select All -> Get Info -> My Rating -> Three Stars.

      Rate up and down others as necessary. OK, not the point that default should be doing this for you, but a quick fix if you want it to work that way.

      If you already have songs rated then create a 0 star smart playlist and repeat.

      Stuart

      --
      It's all fun and games until a 200' robot dinosaur shows up and trashes Neo-Tokyo... Again
    5. Re:Interesting by Feanturi · · Score: 5, Funny

      A public moderation system, cool. That never gets abused anywhere that I know of.

    6. Re:Interesting by rimbaldi · · Score: 2, Informative

      The Music Recommendation System for iTunes from the University of Illinois doesn't solve your problem because it requires you to rate most of your own library. However, it might be an interesting step on the way to solving the problem you raise.

  6. that sucks by Anonymous Coward · · Score: 5, Funny

    I can't tell you how many Christian record stores I'm permanently banned from.

    1. Re:that sucks by peculiarmethod · · Score: 2

      that has to be the shortest troll ever.

      he said _I_ can't tell you.. not _tell me_.

      but good job on the two character troll.

      --
      ** "It's not my job to stand between the people talking to me, and the ones listening to me." -- Pego the Jerk
  7. Finally by Ed+Thomson · · Score: 5, Funny

    Someone to show how cool mathematics is

  8. Re: Try last.fm by P!Alexander · · Score: 4, Interesting

    That's exactly why I love last.fm (formerly Audioscrobbler & Last.fm). It automatically tracks what you listen to and then allows that information to be used to give you neighbors in the music world based on what interests you have in common. You can add friends, join groups, and even tag your music. All of this is extremely useful in finding new stuff. They've got plugins for all the major media players (and even some minor ones).

    Add on top of that the ability to play a custom-built radio station, set it to play only new music or listen only to music from a particular user profile.

    Linux and BSD supported! Open source plugins and radio station player! Could it get better? ;)

    ---
    but make sure that the last line
    Generated by SlashdotRndSig via GreaseMonkey

  9. Underlying formula by Pemdas · · Score: 5, Informative

    From their results, I'd venture a guess as to the underlying algorithm:

    Each song is given a number of points equal to (rating + 1). Then the probability of the song being played is (song rating)/(total points).

    Or, to put more succinctly:

    prob(song) = (rating)/(n + sum(i=1..n)(rating(i)))

    That yields probabilities in the given test case of:

    5 star - .285
    4 star - .238
    3 star - .190
    2 star - .143
    1 star - .095
    0 star - .048

    Which is reasonably close to what the author found. Heck, if I were implementing that feature, it's what I'd try first...

    1. Re:Underlying formula by ciroknight · · Score: 5, Interesting

      Your decimals look more like the pricing model than the weights for playing songs..

      5 star - .285 -- $299, iPod (full?) 20gb
      4 star - .238 -- $249, iPod mini 6gb
      3 star - .190 -- $199, iPod mini 4gb
      2 star - .143 -- $149, iPod shuffle 1gb
      1 star - .095 -- $99, iPod shuffle 512mb

      --
      "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
  10. Why Assume a Bell Curve? by TubeSteak · · Score: 4, Insightful
    Seriously, on what basis is he assuming a bell curve of preferences?

    You'd think, with iTunes, that people would be buying music they like (a four or five rating) in a much higher proportion than music they'd rate as a three.

    Then there's music added from your own collection. Maybe its just me, but my preferences tend to be stronger than -, 1, 2, 3, 4, 5.

    I usually go through my music collection on a regular basis and delete crap that I don't listen to, which is usually anything less than a three, and definitely a - or a one.

    And is 4334 just a random arbitrary # of songs to use?
    (when you add up X0 through X5)

    --
    [Fuck Beta]
    o0t!
    1. Re:Why Assume a Bell Curve? by hey! · · Score: 2, Informative
      Seriously, on what basis is he assuming a bell curve of preferences?

      When you look at data, particularly from modest numbers of samples, it seldom fits the bell curve nicely. Often it is multimodal.

      However, the Guassian (bell curve) distribution has one important property: it is the distribution that (given a number of stipulations like having a finite integral over any interval, and there being no set uppler or lower limit to the variable) has the highest entropy. It is, in a sense, the most random of random variable distributions.

      So, the reason you tend to fall back on the Gaussian distribution as an assumption is that it is the distribution that, should you assume it, means you are makign the fewest possible assumptions. Any other distribution chosen must be chosen with a reason.

      Now, some judgement has to be exercised still. If I am choosing a distribution to represent a throw of a die, I'll choose the Uniform distribution as being the one that assumes the least. That is because it fails to meet one of the criteria for choosing the Gaussian: there IS a lower limit(1) and an upper limit (6) to the variable. Superficially, this would seem to resemble the iTunes rating system of no rating = 0 through best = 5 (I think). However, note that the rating is actually just a truncated record of an underlying, continuous variable: how much you like or dislike a song. There's no actual limit on that. Sometimes, it is better in situations like this to assume log-normal (geometric) distributions. In a sense we're talking about artifacts of our scale specification rather than anything fundamental in the underlying situation.

      In any case the fact that the user has selected the songs has no actual bearing on the usefulness of Gaussian assumption. I'm dredging this up from a course I took many years ago at MIT called "Stochastic Processes" taught by Giancarlo Rota (18.313 I think it was). If I had administered an IQ test to my fellow students, they would have scored very high indeed. Not only were they selected by the MIT admission process, but they were mostly people (unlike me) who were pursuing degrees in mathematics. But still, the rules apply: there is for practical purposes no upper or lower limit on the IQ of a 18.313 student; but sicne there was a finite supply of 'em, the integral of the distribution function is bound to be finite.

      Of course, since we know there are at least two populations in the class (math majors and assorted dilletentes and mathematical hangers-on like me), you probably could get more precise results by modeling the class that way. But if you're going with a single mean and std dev to describe the population, Guassian's still the best way.

      By the way, if anybody else out there remembers GC Rota, I just googled him and found out that he passed away in 1999 at the young age of 67. A damnable shame too, he was a great mathematician and, if possible, an even greater teacher. Here is a quote:

            Despite the lecturer's best efforts, however, it becomes more difficult to hold the attention of the students as the term wears on, and they start falling asleep in class under those circumstances should be a source of satisfaction for a teacher, since it confirms that they have been doing their jobs.
      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
    2. Re:Why Assume a Bell Curve? by Moofie · · Score: 2, Funny

      Did Geddy Lee touch you in your bathing suit place?

      --
      Why yes, I AM a rocket scientist!
  11. Some calculations errors in my opinion.. by Fr4ncis · · Score: 5, Informative
    If you have 2000 songs and 40 of them are from the same artist, there is always a 2% chance of hearing them next with random play. So right after one of their songs finishes, odds almost guarantee they will be played again within the next 50 songs and show a 50% chance they will play again within the next 25 songs. It's simply the mind's tendency to find a pattern that makes you think iTunes has a preference.

    A way to calculate the odds that 2% will be played in the next 50 songs doesn't work 50* (2/100) = 100% as the author does, and neither 25*(2*100) = 50% is correct.

    The correct calculations are: 1-(98/100)^50 = 63% and 1-(98/100)^25 = 39%.
    This way you calculate the odds a song will be played at least once in the next 50 or 25 songs.

    If you want to calculate the odds the song will be played exactly once in the next 50 or 25 songs:

    50 * (2/100) * ((98/100)^49) = 37% or 25 * (2/100) * ((98/100)^24) = 31%.

    I guess that's all..
  12. From the article...trick of the mind by djupedal · · Score: 3, Informative

    For those too lazy to go read for themselves...

    "Many claim to still see patterns as iTunes rambles through their music collection, but the majority of these patterns are simply multiple songs from the same artist. Think of it this way: If you have 2000 songs and 40 of them are from the same artist, there is always a 2% chance of hearing them next with random play. So right after one of their songs finishes, odds almost guarantee they will be played again within the next 50 songs and show a 50% chance they will play again within the next 25 songs. It's simply the mind's tendency to find a pattern that makes you think iTunes has a preference."

    1. Re:From the article...trick of the mind by Soul-Burn666 · · Score: 2, Interesting

      Moreover, following the "Birthday Paradox", if you have N songs and the selection is completely random, then in a list of sqrt(N), there's a 50% chance a song will appear twice.
      For 4000 songs, that's around 64~ songs. So if your player chooses tracks completely randomly then 50% of the times you'll listen to 64 songs, you'll hear the same song twice from those 64.

      Even if your player doesn't play the same song twice, if you have 8000 songs from 4000 artists, 2 songs per artist, then you get a similar calculation.

      --
      ^_^
  13. Rating songs by ptimmons · · Score: 2, Insightful
    I take a small bit of exception to the author's statement:
    "Although the higher rated songs are given preference, you will not definitively hear more 5-star rated songs than all other ratings. Most people follow a bell shaped curve for their ratings, with the 3-star rating being the most common."

    In fact, I find that I tend to rate only the songs I *really* like and *really* dislike, and leave the average songs alone. I suspect that I am not alone here. It's akin to the trend of many online forums to attract polarized opinions; i.e., few people take the time to log into forums and post comments that are middle-of-the-road -- typically they're full of "THIS SUCKS" or "HELL YEAH" posts.
  14. You wonder why the music industry is mad by Anonymous Coward · · Score: 2, Insightful

    "I haven't got the time to go around rating my entire music library"

    Much less pay for it, I assume. I'm a certifiable geezer by slashdot standards (>40), but I do know a thing or two about music. I was a really hardcore music collector in college, and one core credo was to know everything in your collection inside-out. Practically speaking, it meant that any new addition to the library (vinyl or cassette, if you've heard of either) stayed in a "just-in" quarantine section until I had listened to it enough times to consider it "known".

    It is really sad that people amass immense music collections and really don't have any familiarity with the music they have. Well, maybe it isn't actually sad, but I have to wonder what the point is.

    1. Re:You wonder why the music industry is mad by hattig · · Score: 3, Interesting

      Music has turned from something that you collect and treasure into something you have and listen to practically all the time. It is very rare you decide you want to listen to just song X these days (in comparison to how much music is listened to overall), and actively put it and actively spend the time solely listening to it. Large mp3 collections have replaced radios at many places, great for getting rid of the music you really dislike and the DJ.

      Would I pay to have my music rated by an external algorithm? No. Would I pay to have my music peer rated? No - I'd also be contributing back to it like I contribute back to Gracenote and FreeDB.

      I suppose it is easiest to just rate everything *** and apply ****/***** and **/* to the tracks I really notice as standing out.

  15. I found it an odd statement too by wackybrit · · Score: 2, Interesting

    Most people follow a bell shaped curve for their ratings, with the 3-star rating being the most common.

    I mean, where is this statistic coming from?

    In my case the majority of rated songs are 5's, almost the same number of 4's, then some 3's, and hardly any 2's or 1's.. with perhaps 50% left unrated. I use iTunes at least several hours a day. Those of my friends who use iTunes seem to have a similar distribution.

  16. Modal Music by Johnny+Mozzarella · · Score: 5, Interesting

    A friend of mine who worked at a radio station that played a very diverse range of music told me how they select music.

    She said that research had shown that listeners would rate the same song higher if it followed other song of a similar genre. If they play songs of different genres randomly the listener does not enjoy the music as much.

    So their tendency is to play "blocks" of music.
    For example....
    4 Classic Rock songs
    3 Blues Songs
    3 Folk songs
    4 Female Rockers
    3 Grunge
    etc.

    This is common knowledge in the radio world. I wonder if Apple has incorporated this type of logic into it's iTunes algorithms?

    The radio station in question is WXPN and can be found under iTunes > Radio > Public > WXPN

    1. Re:Modal Music by WebGangsta · · Score: 2, Informative
      So their tendency is to play "blocks" of music. For example....
      4 Classic Rock songs
      3 Blues Songs
      3 Folk songs
      4 Female Rockers
      3 Grunge

      Not entirely true, and it depends on the station (as you stated). Some stations make it a point *not* to put songs with female lead singers together; *not* to put songs from the same R&B/Dance/whatever genre together; *not* to put songs from solo artists next to each other. And so on. And don't forget issues with playing more than one old song after another.

      Again, it all depends on the station and format and what they're trying to do at the time, but as a general rule of thumb, there's a lot more analysis and thought that goes into creating a radio playlist than what iTunes automatically does, or by building "blocks" of music (Twofer Tuesdays, Rocktober, and Deep Cut Thursdays aside). It's all about how to best reach the desired target audience for the advertisers.

      In other words, I can't imagine any station that's trying to make money that would rely on solely using an iTunes algorithm (current or proposed) for scheduling music.

      Besides, what would the stations do with all the music scheduling software they already use?

  17. The World Changes by xh3g · · Score: 2, Insightful

    Everything changes, even how we make a living. I myself paint cars. If someone tomorrow came up with a cheap method for producing car parts that never need to be painted, I would be out of a job while the entire world heralded the new technology as a breakthrough.

    We all want to think that once we have our lives in order, they're going to stay that way. Nothing is guaranteed, not even whether yesterdays market will be here tomorrow.

    It's no one's fault but your own if you're selling something no one wants to buy.

    --
    - When you do things right, no one will be sure you've done anything at all.
  18. Strange math crunching by mpiktas · · Score: 2, Interesting

    After reading the article, I still do not understand the iPod's shuffling algorithm.

    The first half of the article is devoted to describing how the writer got the probabilities of rated songs and properties of these probabilities. Although these probabilities give some insight to the shuffling algorithm, they are pretty useless, since they are observed from unrealistic list of songs, i.e. 6 songs with different ratings.

    Then cames the formula in Figure 2. How it is calculated and where from the author takes it, is not explained in the article. Also this formula is not backed up by empirical observation. The rest of the article is devoted to analyzing the effects of this formula, which are interesting, yet could have no importance if the actual formula is different. So this article does not really explore the iPod shuffling algorithm, but explores how would iPod shuffling algorithm work if the probability of the next song is calculated according to the formula provided by author. That is pretty useless, since we all can provide our own formulas and write the articles.

    Now concerning this formula. To me it seems a litlle strange. Consider hypothetical situation of song list with 1000 unrated songs, and one with 5 star rating. The the probability (according to the formula provided by author) that the song with 5 star rating would come up is
    0.27/(1000*0.039+0.27) = 0.006875477
    which is pretty miserable odds. If I rated it so highly, that means I want to hear it a lot, now with such shuffling algorithm, I would hear it slightly more, yet not a lot. Of course, then I could create a playlist, with this song only, but then why one needs rating system, if it does not perform.

    So it would be really interesting to know iPod's shuffling algorithm, to see if it saves the hassle of creating your own playlists. (Or even the possibility to provide your own algorithm), yet this article does not provide any insights.

  19. something i've noticed by macsox · · Score: 4, Funny

    there definitely seems to be some time-based randomness in the selection of tunes. often, i'll hear a song pop up randomly on my ipod in the car on the way into work, and then the song come up again, randomly, while being played on itunes at my desk.

  20. Re:Maybe we should think different by Yahweh+Doesn't+Exist · · Score: 2, Informative

    I mostly use the stars to allow smart playlist generation.

    1 star - bad
    2 stars - background music
    3 stars - good

    4 and 5 stars mean good and very good but also that they'll be included in several smart playlists. so a 4 star isn't necessarily better than a 3 star but I want it on my ipod etc.

  21. Re:Database server that can handle the load? by Unknown+Lamer · · Score: 2, Informative

    The DB server can keep up now, and the protocol is stable. The radio even works fairly well, but during peak hours it sometimes has trouble connecting (I've only ever used it at my Grandmother's house, and her cable net connection is kind of shitty so it may have been that).

    A year ago last.fm/as weren't really usable. Now they are fairly solid. It's worth another shot.

    --

    HAL 7000, fewer features than the HAL 9000, but just as homicidal!
  22. Re:Car has a "random" bug by bullitB · · Score: 3, Funny

    (from bugzilla.audi.com)

    Product: Audi S4
    Component: CD Player
    Status: ASSIGNED
    Severity: Normal
    Hardware: All
    OS: All
    Resolution: Not a bug
    Summary: Car has a "random" bug

    Description:
    I have a certain CD that causes my Audi S4 (when set to random mode) to play the same track over and over and over. Guess somebody didn't prove their recurrence actually worked.

    Solution:
    CD contains only one track. Random mode functioning properly.

  23. Whisky Tango Foxtrot? by Bug-Y2K · · Score: 3, Insightful

    I read the headline and imagine a story about the costs and revenues from Apple iTMS. Analysis of running the datacenters: costs of electricity, bandwidth, storage, etc.

    Instead I read about some geek with way too much time on his hands. Yawn.

  24. the REAL underlying formula by Bert690 · · Score: 5, Interesting
    OK, after a bit more thinking, you were indeed very close. It appears the actual formula is:

    points(0 stars)=1
    points(1 stars)=3
    points(2 stars)=4
    points(3 stars)=5
    points(4 stars)=6
    points(5 stars)=7

    probability(X stars) = points(X stars) / 26

    This yields the following probabilities, listed along side the observed values from the article along with 95% condience intervals.

    p(5 star)=.2692 [.270 +- .0038]
    p(4 star)=.2308 [.230 +- .0036]
    p(3 star)=.1923 [.189 +- .0033]
    p(2 star)=.1538 [.154 +- .0031]
    p(1 star)=.1154 [.118 +- .0027]
    p(0 star)=.0385 [.039 +- .0016]

    As you can see each computed probability falls within the 95% confidence interval, so there's a good chance this is the correct forumla.

    Boy do I have too much time on my hands today.

  25. Dear Apple by NeMon'ess · · Score: 2, Interesting

    Let me, an advanced user who knows a damn thing about computers and interfaces, change the weighting of the stars. I don't want my 5-star songs to play just twice as often as 2-star songs. I want them to 6 times as often. I want 4-stars to play 4 times, 3-stars to play 3 times, and no-stars to play 2 times.

    Why no-stars? Because that way the majority of the collection is unrated. Stared songs really stand out in a playlist. 1 and 2 star songs play less often than no-stars, while 3, 4, and 5 play more often. But I want my favorites to play much more often than your arbitrary algorithm.

  26. least often of each rating by Quevar · · Score: 2, Interesting

    I dislike hearing a song too often and I also like to hear my favorite songs more often, but also not repetitevly. So, I have set up 6 playlists that gives me what I think is a good mix of my songs. They are as follows:
    1: rated 2 and 25 rated by least recently
    2: rated 3 and 73 rated by least recently
    3: rated 4 and 70 rated by least recently
    4: rated 5 and 32 rated by least recently
    playlist is any of playlists 1, 2, 3, or 4 with live updating

    The numbers (25, 73, 70, and 32) come from multiplying the number of songs in each category by the rating-1, so it is essentially the same as the "play higher rated songs" in PartyShuffle. I leave 1 rated songs for ones that I don't listen to very often. This way, I get a random selection of my music that does not repeat a song until I have more-or-less gone through the rest of them in that rating. And, it generally plays the 5 rated songs about 4 times more than the 1 rated songs.

    I found that I do not like the random feature since it often will play one song significantly more than another song. Eventually, it would even out, but in the range of 20 times playing a song, there can be a large discrepancy and I haven't heard some songs in longer than I'd like.

  27. Crunching the math on Slashdot by AlpineR · · Score: 4, Interesting
    Nice analysis of iTunes. I'm somewhat surprised at the small difference in play frequency between 3, 4, and 5 stars; and disappointed that unrated songs are almost never played. In my collection, unrated means that the music is new to my collection. I think 1 star should be the kiss of death, not a blessing upon a previously unrated song.

    But all this talk of 0, 1, 2, 3, 4, 5 has me thinking of another rating system. Would anybody care to do an analysis of the ratings in Slashdot comments? What are the relative populations (I expect a ton of 2's but how about the rest)? Do comments made in the first hour after a story is posted stand a better chance of reaching +5 than comments made later in the day?

    One of my gripes about the Slashdot comment system is that it discourages contemplation and discussion. Comments made more than 24 hours after a story is posted are rarely read and almost never moderated. This is in contrast with comments system like Usenet or other bulletin boards, where threads can remain lively for weeks.

    AlpineR

  28. I like the fact... by battamer · · Score: 2, Insightful

    that this entire project seems to be done while at work. Kudos, you are an inspiration to slackers everywhere.

  29. Re:Car has a "random" bug by R3d+M3rcury · · Score: 2, Interesting

    Actually, with my TT, I've noticed a similar situation.

    First, certain burned CDs (I have yet to see this on a commercial CD) when played on random will only play the first track. However, if it is not played on random, it will play all the tracks. Interestingly enough, though, if I spin the CD in the holder, that will sometimes allow it work correctly.

    The other interesting thing is that the CD player will not repeat tracks until all tracks of the CD have been played (Duh), at which point all bets are off. While I have never seen it play the same track back-to-back, I have seen it play a track, another track, and then the first track again. Note that this only happens once the CD has randomly played all tracks.

    I told the Audi dealer about this. They pretty much said, "Yeah, so?" I sort of agree with them. Of course, as soon as I have the cash, I'm getting an ice>Link and an iPod and I'll toss the CD player.

  30. Car mp3 CD player by suitti · · Score: 2, Funny

    Two years ago, i replaced my car's tape player, which died after only 15 years of service, with an MP3 CD player. I cut a CD with 14 hours of my favorite stuff. I put the CD in the car, set it up to play in shuffle mode, and set out on a cross country trip. It was great. Just before i arrived, i heard a repeat. I was so disgusted i hit the eject button. Fortunately, i had another disc with me. Feh, i said. Can't even go 750 miles without having to change the CD.

    --
    -- Stephen.