Managing Last.FM's "Mountain of Data"
Rob Spengler writes "Last.FM co-founder Richard Jones says the biggest asset the company owns is 'hundreds of terabytes of user data.' Jones adds, '... playing with that data is one of the most fun things about working at the company.' Last.FM, for those who have been living on Mars for the last two years, is the largest online radio outlet, with millions of listeners per day. The company surpassed Pandora and others largely due to its unique datamining features: 'Audioscrobbler,' the company's song/artist naming algorithm, can correctly determine a track even with tens of thousands of false entries. Jones says sitting on that much data has even helped police: 'thieves listening to music on an Audioscrobbler-powered media player have helped police in the US, UK, and other countries track down users' stolen laptops.' Does sitting on a mountain of data make Last.FM powerful enough to start making a stand against the record industry? CBS certainly thinks so — they bought the company for £140 (~$200) million last year."
A buddy of mine used to run this matching website for teachers & students. Free for teachers, and the students had to pay a nominal amount to get the teachers' contact info, and after that, it was up to them to arrange for lessons. The site was popular, and he made decent money at it. I bugged him and bugged him to organize parties, and eventually he came around to my way of thinking (he wanted to make some money without his parasite partner getting it). He used the list of emails from his website to send party invitations for a monthly get-together. He made more money from the parties than he did from the website.
Shutting down free speech with violence isn't fighting fascism. It IS fascism!
what i find most interesting is the order certain songs "go together", like listening to a song from Slayer, followed by, say, "someday i suppose" from the bosstones. when composing songlists, i appreciate how similar songs and moods can flow, but also how the contrast of dissimilar songs can SOMETIMES compliment each other.
a large database could ferret out such instances that might occur frequently in multiple playlists.
I have a similar site that I wrote (pre-audioscrobbler). Granted it's crap, but I have mountains of data also. Closer to 1 tb than hundreds of tb. The question is, how do you monetize the data?
I just don't see how this data is "worth" 200 million bucks. I have some amazing algorithms to do similar cleaning, caching, and recommendations, but still what is that worth?
This is a fairly legit question. If you can figure it out, I can explain to my wife why I have 3 servers in my closet.
The summary wasn't insulting enough, so I think I'll just add a bit extra.
Last.FM is so popular that if you aren't familiar with the service, you must be a drooling, knuckle dragging luddite.
Apparently I'm not one of the cool kids. I'm sad now, and my feelings are hurt.
Last.fm Has all this data and yet so much gets missed. For instance: why doesn't last.fm have a feature to email you when a band you like comes out with a new album?
CBS certainly thinks so -- they bought the company for £140 (~$200) million last year.
Which is why whatever comes of them, at best it will be evolutionary. CBS is part of the old guard RIAA corps, they are just one of the faces of Viacom - all controlled by Summer Redstone. They may have brought some money to the table, but they brought a whole ton of baggage with them too. Enough baggage to make this privacy freak decide they couldn't be trusted with all that data they've been collecting (for example, if they can track down a stolen laptop, they can track down someone playing an MP3 from an illegally leaked pre-release album).
When information is power, privacy is freedom.
Last.FM, for those who have been living on Mars for the last two years, is the largest online radio outlet, with millions of listeners per day.
You know, I'm not exactly what you'd call a Luddite, yet I've never heard of Last.FM. Am I the only one? I kind of doubt it.
I have a general gripe about anyone who writes "for those who have been living on Mars" anytime they reference some moderately popular company, service, or product. It smacks of arrogance, as if to say, if you don't have the same interests as I do, you're obviously disconnected from the mainstream.
Or perhaps I'm just annoyed for being called out on being a bit older and out of touch? Bah!
>>goes back to guarding lawn with a shotgun from an old rocking chair...
Irony: Agile development has too much intertia to be abandoned now.
The company surpassed Pandora and others largely due to its unique datamining features
I would think that being available outside of the USA may have helped quite a bit as well.
with last.fm is how it feeds my OCD issues regarding song playcounts. I nearly lost it when the stupid scrobbler started randomly recording excess playcounts on one album. It screwed with my numbers. Then it stopped counting that album's plays all together.
Seriously though, I have found using the site to be pretty enjoyable. And the advertisements are actually worth keeping AdBlock turned off for. I found a few new artists, some unsigned, that way. I like all the various widgets and things that can crunch my data. Songbird has a last.fm plugin/addon that makes for very easy integration. It's just really useful. I've also found concerts on the site.
I rarely use the social side of it, except with friends I already know. But that's me.
http://transformativeworks.org/
Comment removed based on user account deletion
The company surpassed Pandora and others largely due to its unique datamining features: 'Audioscrobbler,'
I'd say they surpassed Pandora only because Pandora locked out all non-US users a while back. For people who just wanted to listen to music and find out about new artists, Pandora was so much better IMO, last.fm has a clunky, overloaded UI and is too much like myspace ...
"I love my job, but I hate talking to people like you" (Freddie Mercury)
Then why the hell is it that when I run the "Recommendations" stream the algorithm occasionally freaks out and starts pushing one unlistenable noise attack after another at me with tags like brutal death metal, cybergrind, czech, death metal, deathgrind, goregrind, grind, grindcore, noisecore, porngrind, pornogrind, etc. No matter how many times I click the "Do Not Want" button the stuff just keeps coming. It's like a neighbour from hell. And then there's the days when I get nothing but lesbian deathcore vegan grind.
The Last.FM brainfarts seem to persist no matter how many times yoy try to train the recommendation engine using the like/ban buttons and the only way to get them to "reset" to something vaguely approximating normality is to log out, log back in, and run the Library stream for a while.
Still, even with this weirdness it's still better than Pandora at finding new music I actually like.
Da Blog
Comment removed based on user account deletion