Slashdot Mirror


Fixing Steam's User Rating Charts

lars_doucet writes: Steam's new search page lets you sort by "user rating," but the algorithm they're using is broken. For instance, a DLC pack with a single positive review appears above a major game with a 74% score and 15,000+ ratings.

The current "user rating" ranking system seems to divide everything into big semantic buckets ("Overwhelmingly Positive", "Positive", "Mixed", etc.), stack those in order, then sort each bucket's contents by the total number of reviews per game. Given that Steam reviews skew massively positive, (about half are "very positive" or higher), this is virtually indistinguishable from a standard "most popular" chart.

Luckily, there's a known solution to this problem — use statistical sampling to account for disparate numbers of user reviews, which gives "hidden gems" with statistically significant high positive ratings, but less popularity, a fighting chance against games that are already dominating the charts.

49 of 93 comments (clear)

  1. Be sure to click evan miller link by Karganeth · · Score: 2

    http://www.evanmiller.org/how-... This was linked before on slasdot IIRC. This is basically a blog post saying "hey this can be used on steam too, not just amazon!"

  2. Valve Time by UnknownSoldier · · Score: 4, Insightful

    Like all things Valve does on Valve Time, Steam is _slowly_ getting better so I'd imagine this will get fixed ... eventually.

    At least we can give a thumbs up or down to games. The ability to write reviews takes advantage of the best kind of marketing:

    Word of Mouth.

    1. Re:Valve Time by Nemyst · · Score: 3, Interesting

      Valve have not shown a particular tendency towards using algorithms to fix things though - they generally just throw the work to their users, from tagging to rating to curating. While I agree that a fairer system to sum up reviews would be good for the consumer and good for smaller developers, I don't think it's what Valve are looking for. It's quite likely that showing more of the top sellers sells more copies of those games, which have dramatically higher sales volumes and prices than indies. That's a bigger cut for Valve.

      As much as people forget, Valve are not in this to make gaming a better place. They're there to make gobs of money, and have been rather successful at doing so thus far. Considering the sort of talent they hire and have hired in the past, if they truly wanted to fix things, they'd be fixed. If they're not, they either don't consider it important or have a reason for not fixing it.

    2. Re:Valve Time by hairyfeet · · Score: 2

      Uhhh...what is there to fix really? Those "positive" reviews? Well you look closer and its stuff like "This is a bad game but its worth a dollar for the cheese factor" or "If you want to MST3K a game with your friends this is good for that". If you want real reviews go to Metacritic or YouTube, Steam reviews are filled with snarky PC gamers that can enjoy some "so bad its good" cheese.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    3. Re:Valve Time by kubajz · · Score: 2

      As a matter of fact, does anyone know why Steam does not prominently feature Metacritic ratings anymore? Those really helped me choose games that I wanted...

    4. Re:Valve Time by Jupix · · Score: 4, Interesting

      What Steam reviews are actually filled with is information about the games... exactly what you should be interested in, as opposed to a score or a conclusion of some kind.

      The aggregate score in the style of "very positive" etc. can be useful in filtering out the genuinely terrible games, but outside of that, not so much. What's needed for decisionmaking is a lot of information, a search engine, and your own thinking. Steam provides descriptions, tags, and now reviews, and for me anyway it's been incredibly easy lately to figure out whether I want to buy a particular game, or at least investigate it closer elsewhere.

      Scores are almost completely worthless. Doesn't matter what kind they are (Metacritic, user review average, magazine review score). Steam has already done enough for the scoring system. What is there to fix? IMHO they should concentrate on important things like search, GUI and customer service, all of which are pretty terrible for 2014.

    5. Re:Valve Time by hairyfeet · · Score: 4, Informative

      Exactly. I have enjoyed some games that many considered truly terrible, "You Are Empty" springs to mind because while it wasn't anything new or revolutionary the Russian writers created a story that was as batshit as David Lynch movies and it had TWENTY FOOT TALL MUTANT ATTACK CHICKENS...seriously how could I not love a game that considered those a serious enemy type?

      But when I score a game like that I make it clear it is not getting a good review because its some amazing game, instead its strictly for the cheesy goodness. Take a game like "Two Worlds", sure the gameplay is just a generic "hack and slash" just like every other Diablo clone...but the dialog, that dialog was fricking great! It was like a fifth grader had all these "old English" words like "pray" and "forsooth" and had NO fucking idea what they meant but said "Yeah that sounds all cool and hamlet and shit, lets use 'em!" and when combined with actors that sound like they come from a dinner theater in SD doing Macbeth? Priceless!

      On the flipside you have GTA 4 which reviewers tripped over themselves to praise...Dafuq? That game is irritating as shit! Too damned many of the missions can only be gotten by kissing the correct ass and that involves hauling their stupid asses around and doing stupid mini-games to make their worthless ass happy...Why in the fuck am I taking a dope dealer out for dinner like I'm trying to bang him? Just give me the damned mission and STFU! Hell it was so bad there are fricking parody songs making fun of it, yet what did the majority of critics give it? 9s and 10s!

      So give me real gamers ANY day of the week, I'd much rather hear "Its decent, just don't buy it for more than $5 because X/Y/Z makes it not worth more" than listen to the critics which seem more and more to just kiss the industry booty.

      --
      ACs don't waste your time replying, your posts are never seen by me.
    6. Re:Valve Time by Anonymous Coward · · Score: 5, Insightful

      As much as people forget, Valve are not in this to make gaming a better place. They're there to make gobs of money, and have been rather successful at doing so thus far.

      Valve makes gobs of money *because* they make gaming a better place.
      Valve is the sole reason why I'm probably not buying a console this generation, or maybe ever again. Gaming is so much better on PC these days that it just doesn't make sense to lock yourself into the console market anymore. And that's all on Valve.

      So you might be cynical and say Valve only cares about money, but the fact of the matter is that in order to generate that money they need a healthy market. Their interests are aligned with ours.

      Considering the sort of talent they hire and have hired in the past, if they truly wanted to fix things, they'd be fixed. If they're not, they either don't consider it important or have a reason for not fixing it.

      Maybe they've just got better things to do. Valve has their hands on so many pies that I'm sure they could double their workforce tomorrow and still keep everyone occupied.
      The new recommendations system is still new, I'm sure it's under close scrutiny and updates will be coming. But as someone else said, it'll happen on Valve Time.

    7. Re:Valve Time by Marc_Hawke · · Score: 3, Insightful

      Nobody liked Steam when it came out either. There were a lot of things that kept most people away from it:

      1. Always on. This was a problem both in internet connections (which were much more flaky back then) but also PC memory usage. Background processes were a gamer's worst nightmare before RAM sizes gained a few extra digits.

      2. "Vaulted Access." People still wanted physical copies. They didn't trust Steam to be around in 5 years and figured they wouldn't have access to their games anymore.

      3. Other things.

      So, Steam was ignored by a lot of people, except for the games that 'forced' them to use it (Valve games:...CounterStrike and HL2 mostly.) However, (and this is the magic Microsoft needs to find) Valve made steam not suck. People learned to trust it. "Yes" it will be available. "Yes" it will be convenient. "No" it won't hose your experience. And most of all..."Yes" it will be economical.

      Steam was considered draconian, until it proved not to be. And...importantly...it was 'optional' during that testing phase.

      --
      --Welcome to the Realm of the Hawke--
    8. Re:Valve Time by UnknownSoldier · · Score: 1

      I boycotted Steam for the first few years when it came out. Precisely because it sucked for the reasons you said.

      10 years, 334 games, and many gamer friends later Steam works surprisingly well. EA has their piece of crap Origin, Microsoft aborted its Games For Windows Live; I think I'll stick with Steam.

    9. Re:Valve Time by Darinbob · · Score: 1

      Goat Simulator has some great ratings...

    10. Re:Valve Time by PJ6 · · Score: 1

      As a matter of fact, does anyone know why Steam does not prominently feature Metacritic ratings anymore? Those really helped me choose games that I wanted...

      Maybe because games are given very high ratings that completely ignore the PC, even when these ratings are supposed to be for the PC versions?

      I don't know about you, but when I see a AAA PC game also has a console version, I just stop right there and don't buy it, no matter what the ratings are.

    11. Re:Valve Time by cyn1c77 · · Score: 1

      Steam was considered draconian, until it proved not to be. And...importantly...it was 'optional' during that testing phase.

      No, no, no... Steam still IS draconian; you've just have to deal with it and have gotten used to it.

      It's like the TSA-equivalent of gaming. It sucks, but is unavoidable, and eventually you just get used to periodically having your genitals fondled "for the good of the community."

    12. Re:Valve Time by Agripa · · Score: 1

      It's like the TSA-equivalent of gaming. It sucks, but is unavoidable, and eventually you just get used to periodically having your genitals fondled "for the good of the community."

      Except that it is unlike the other TSA-equivalents of gaming produced by EA (*), Microsoft, and others, it is actually not aggravating to use and is without significant restrictions. Valve earned their goodwill and continues to do so. EA and Microsoft squandered it.

      And the Steam TSA-equivalent actually has some benefits like being able to reinstall at any time on any machine without a hassle although I guess there are some exceptions to that . . . from other developers.

      (*) I have not bought a EA game since Archon. I think that highly of them.

    13. Re:Valve Time by Agripa · · Score: 1

      But sooner or later, we'll end up with yet another monopoly that needs to be brought down a few pegs.

      And when that day comes, you can tell us so. And you can have my pitchfork, feathers, and tar.

    14. Re:Valve Time by Agripa · · Score: 1

      I don't know about you, but when I see a AAA PC game also has a console version, I just stop right there and don't buy it, no matter what the ratings are.

      I give them enough benefit of my doubt long enough to watch some gameplay videos. Then I do not buy them.

    15. Re:Valve Time by Agripa · · Score: 1

      The aggregate score in the style of "very positive" etc. can be useful in filtering out the genuinely terrible games, but outside of that, not so much. What's needed for decisionmaking is a lot of information, a search engine, and your own thinking. Steam provides descriptions, tags, and now reviews, and for me anyway it's been incredibly easy lately to figure out whether I want to buy a particular game, or at least investigate it closer elsewhere.

      I usually at least check out games I see other people on my friends list playing regularly but checking them out includes watching some game-play videos and perusing their forums.

  3. Weighted averages by rebelwarlock · · Score: 1

    Why are they suggesting skipping straight to this hot mess instead of using a simple and well tested algorithm?

    1. Re:Weighted averages by lars_doucet · · Score: 1

      Why are they suggesting skipping straight to this hot mess instead of using a simple and well tested algorithm?

      Nor sure I understand? The article explicitly argues for using this simple and well tested algorithm.

    2. Re:Weighted averages by hey! · · Score: 2

      Because this method returns a ranking that seems more intuitively "right"?

      It's worth asking why that should be. Think about a rating scale of one to four stars. What does *averaging* those ratings mean? Yes, I know the *formula* for computing an average, but being able to *compute* an average isn't the same thing being able to *interpret* that average. Why? Because a two star rated item isn't really "twice as good" as one star, a four star rating isn't really "four times as good" as a one star or "twice as good" as a two star. In other words we're not measuring something like height, or number of widgets sold. We're *characterizing* something as shit, meh, OK, or awesome.

      The method proposed is more mathematically defensible. It converts each 1 to 4 score into a coin flip (Bernoulli process for you probability geeks): positive or negative. This actually captures what the rating system does better than treating the ratings as meaningful numbers.

      This doesn't automatically mean you'll get more intuitively pleasing results than using weighted averages; YMMV. But I think this system is bound to give more *consistently* intuitive results across applications and datasets. Weighting the average is an easy, but extremely arbitrary operation. As a compromise you might come up with a formula to tweak the score *toward the mean* depending on how few reviews there were; by adjusting the parameters you use you could probably get that formula to yield an intuitively satisfying ranking. The advantage of using a confidence interval as suggested is that approach is highly likely to yield a reasonable result out of the box, across a wide variety of datasets.

      And after all, even if it looks like a "hot mess" to people who haven't taken advanced statistics, how hard is it to copy and paste a bit of code?

      --
      Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  4. It's too bad... by Charliemopps · · Score: 4, Interesting

    It's really too bad the way Valve has screwed the pooch with Steam over the last few years. They literally had The gaming platform for PC all locked up. There was a time where I was desperately hoping they'd have an IPO so I could invest. But they tried to make the store so user friendly to Game controllers... a use case that may very well never become popular... that it's almost useless. Now, the only reason I think they are still relevant is because no one has bothered to try and challenge them. But I think they are 1 clever startup away from losing their position for good.

    There are games on steam to this day, that I cannot find... even using Google searches with the site:steampowered.com modifier. I have to go to the damned games external website and use their link to get to the thing I want to buy. I want to buy it and Steams own search doesn't bring it up because their search algorithm is so broken. I try to browse games and it limits what I can browse to a few dozen. Yet, when I go back 2 days later, its the same few dozen... why doesn't it just show me game after game until I've seen them all? There are over 4000 games on Steam!!

    And you know... I know what people are going to reply to me with... "You didn't click X!" or "You moron, you have to go to the blah page!" or whatever... I'm sure it's entirely my fault for not knowing how to do it right. But let me tell you something... the biggest moron on the planet can walk into Walmart and leave with less money. That's the key to their success. You cannot enter a Walmart and avoid seeing something you need to buy today. You don't have enough money? No worries there's a god damned bank on premises to give you a loan! It's easy to find something you like, it's easy to get it to the register and its easy to get it to your car.

    Why Valve? Why is it so god damned hard to give you my money?!?! I can go on the google play store and spend money with one damned finger! My 4yr old spent $20 on Angry birds slingshots before my wife locked her phone. He couldn't even figure out how to launch a game from your damned app!

    1. Re:It's too bad... by Anonymous Coward · · Score: 3, Interesting

      This I entirely agree with. The funny part of their whole new Discovery design? 99% of that could have been resolved with an Amazon-like system where you could simply get a list of "other people bought this" on all their store pages for games and allow one to readily go to the store page from near every game-related page (art, forums, etc). Of course to do that well would require having multiple tabs and perhaps even a like/dislike system per user to help sort things down--and there's Netflix to inspire on that front.

      Honestly, the only reason I have any Steam games at all is I bought them through bundles. Trying to use Steam to buy games directly is too much of a hassle that almost guarantees that Valve and developers who put games on Steam will never get full listed price from me (unless, like you said, I find the game outside of Steam and Steam's store page the only way to buy it).

    2. Re:It's too bad... by rsmith-mac · · Score: 1

      Just out of curiosity, what is it you can't find? I agree that finding stuff within Steam is hard, but I've yet to have Google fail me. Is it a really common word, or something along those lines?

    3. Re:It's too bad... by Anonymous Coward · · Score: 1

      As funny as it sounds, you just cannot force a company to make more money sometimes.

    4. Re:It's too bad... by Ghjnut · · Score: 1

      I think you're exaggerating the severity of shortcomings Steam currently has. While you may have trouble finding specific games and disagree with their approach to bubbling suggestions to the top, I'm still overwhelmingly satisfied with what they do provide. -On-demand game -Support for a plethora of indie titles -Emphasis on multiplayer/friends and communities -Novel monetization approaches (DOTA) -Strong push for cross-platform support with Valve heading the charge -Investment in bridging the PC/console divide I understand where you're coming from but while you may have a strong aversion to the direction they're going with some of the console-related adjustments, I really don't see the extent of detractions from their core product you vehemently push.

      --
      MouseClass extends ScrollClass, which extends TabClass, which extends SidebarClass, which extends PowerClass, w
  5. Ranked Pairs by Ichijo · · Score: 1

    Or use something like the Condorcet method to put all the games in order from most to least liked, and then assign each game a percentile ranking based on its position on the list.

    --
    Any sufficiently unpopular but cohesive argument is indistinguishable from trolling.
  6. Ratings is just one thing. by Anonymous Coward · · Score: 5, Informative

    As a developer with a title on Steam, I can say ratings are just 1 of many issues with the store.

    I can't tell you how many support tickets I have to go through because steam can't properly launch or update the main binaries that are covered under their drm. Most of the time it's AV issues, but once in a blue moon it's steam borkin' out. (0.25-0.5% of sales)

    In the backend specific game information can't be shown to specific users. So I'm locked out of my real time statistics because my publisher has other titles from other developers...

    But the worst, the god awful worst, are games on Early Access that are pure shit or have been abandoned... They drag the entire system down and majorly screw over legitimate titles that are in Early Access. IMO Steam should have a purging system for these titles. Perhaps even give coupon codes to users who bought games that have been purged.

    As for the rating system. It definitely needs to be weighted. But there should also be incentive to give a ratings (even if they give a review anything) and there should be at least a "maybe" option. The thumbs up/thumbs down system doesn't really do it for me. Specially since I have negative ratings on my project such as "Game needs a German translation. 5/10" :\

    1. Re:Ratings is just one thing. by Darinbob · · Score: 1

      I ended up buying Wasteland 2 on Steam which had a good price, without realizing it was still early access. Gah. Now that it's out I could have gotten it on GOG instead.

  7. A much simpler method by PacoSuarez · · Score: 5, Interesting

    If the only two choices are positive/negative (or thumbs up/thumbs down or some other equivalent 0/1 scheme), here's a formula that should work fairly well:

    (n_positive + 1) / (n_positive + n_negative + 2)

    So a single positive review gives you a score of .6667, and a single negative review gives you .3333. For large numbers of reviews, the score quickly converges to the actual fraction. If you don't have any reviews, you are at .5000.

    The mathematical justification for this formula is that if you try to use a Bayesian approach to estimating the true probability of getting a positive review, and you start with a flat prior, this formula gives you the average of the posterior probability after observing the given number of positive and negative reviews. The full posterior distribution is a beta distribution with parameters alpha=n_positive+1 and beta=n_negative+1.

    This formula is often used when applying Monte Carlo techniques to the game of go. I believe a lot of programmers simply start the counters of wins and losses at 1 to avoid corner cases (like division by 0), and they accidentally use the correct formula.

    1. Re:A much simpler method by Anonymous Coward · · Score: 1

      here's a formula that should work fairly well

      It should, but it doesn't. It converges too quickly to allow useful comparison between things with 100 votes and things with 50k+ votes.

      Think about what happens if there are a 98 upvotes and 2 downvotes; your method rates it as 99 / 102 = 97.06%; meanwhile the error of the mean is still a whopping 3.75%. Ooops. Not good.

      TFA's suggests using Wilson's score. Using 95% confidence, Wilson's score is 93.0% for the 98 up : 2 down case.

    2. Re:A much simpler method by joke_dst · · Score: 1

      You're using the lower bound of Wilson score. But the Wilson score is an interval, in this case 93.6% - 98.9%. The center of that interval is 96.2% which is pretty close to this algorithms 97,1%.

      I agree that you probably should use the lower bound of the Wilson score for these kinds of ratings, but you can't claim he's algorithm is badly wrong.

  8. Uncertainties by Roger+W+Moore · · Score: 3, Informative

    Actually the problem they are wrestling with here is one that has science has had to deal with for a long time: the uncertainty on a measurement. The star ratings are a measure of the popularity of a game so what you are really asking is "given the ratings it received which game is best?".

    Unfortunately with a finite statistical sample you always have some degree of uncertainty and, within this uncertainty your data does not provide any ranking at all: you simply do not know which game is best to any sensible degree of certainty. However while correct this would lead to really confusing rankings since to be fair you would need to randomize the order within the uncertainty of each game's score. This would be complex and confusing to users!

    Instead what they suggest is using a confidence level limit: what score can I be confident that 95% of people would rate the game higher than? We do this all the time in particle physics when we put limits on some new physics which we looked for an did not see. For example the precursor to the LHC, LEP had a result that it was 95% confident that the Higgs boson had a mass higher than 116 GeV/c2 (IIRC). There are better ways to do this than the method they quote but since this is just a game rating and not science it's a fine method to use.

    1. Re:Uncertainties by stewbee · · Score: 1

      I have thought about another similar way to approach from a book I read to approach the above problem for continuous random variables, but I am sure that it could be adopted for what amounts to a categorical* problem here. The approach was a Bayesian estimator that supposed a previous expected value. So in this case, Steam might assign an initial rating of 50% as the baseline mean. As more real samples came in, the estimator was shifted toward the real mean and away from the Baysian prior estimator. In basic terms, it is a weighted average of the prior mean and the actual mean. I found this method in this book. In a nutshell, I think this is in general how Nate Silver does most of his analysis too (ie, with Bayesian estimation). Admittedly I did not RTFA yet, but it looks bit dense and I am pre-coffee, so it could be dangerous to read right now :-)

      *For those not familiar with the terms, user ratings have only a finite number of values they can achieve. Amazon for example have 5 star rating, so the outputs are limited to one of those 5 values. Bernoulli is for only two possible outcome, whereas categorical are for multiple discrete outputs. Continuous on the other hand can achieve any number within the range like 4.234....

  9. Re:Steam is bullshit... by Blaskowicz · · Score: 1

    I remember the very same kind of game percent-ratings on paper magazines in 1992.

  10. Re:Ethics can't be patched in by Anonymous Coward · · Score: 1

    That whole article is about how TotalBiscuit didn't sell out while others did. WTF?

  11. Re:Ethics can't be patched in by Luckyo · · Score: 4, Interesting

    The said comment is complete and utter bullshit. When he did his Guns of Icarus thing with other people, he always disclosed it in video description. After some desperate gamasutra folks (who notably have massive vested interest in sinking Bain, NerdCubed and a couple of other big youtube gaming commentators, because they have been massively eating into their audience numbers) started whining that disclosure in video description was not enough (according to the laws, yes it is), he added a short message to the beginning of every such video where he states any potential sources of interest he may have, down to having received a review copy.

    Notably, gamasutra itself does not do this, and has been central to the whole gamergate scandal which revolves exactly around this kind of acting, only with gamasutra folks not disclosing their conflicts of interest anywhere. Not in descriptions, not in topics, not in articles. Nowhere.

    Frankly, Bain goes to ridiculous lengths to disclose any potential conflict of interest he has nowadays, to the point of holding 3-hour talk marathons with developers (including CEO of the company that bought the sponsored content you talk about) and games media journalists partially on this topic and then puts those on his channel:
    https://www.youtube.com/watch?...

    If you really care about the issue, join #gamergate and go after people who actually do this crap.

  12. Discretionary XKCD by Blaskowicz · · Score: 3, Insightful

    Yep there's one about it!
    It made me not get very enthusiastic about app stores and such.

  13. Re:Ratings and Time Served by RogueyWon · · Score: 1

    I'm not at my home PC right now, so I can't check, but I think this is one of those sim-games that gets annual-ish auto-updates for everybody who already bought it. If it's in any way related to Railworks, then it certainly is.

    What occasionally happens in cases like that is that a version "upgrade" turns out to be a less than positive experience, as long-established features break or are removed. But Steam doesn't reset your "time played" count in those cases. So it's quite possible that these are people who have been playing the game for years and are complaining about the latest forced-update.

    On the other hand, we're talking about rail enthusiasts here... so maybe it does just take 800 hours for them to start getting bored.

  14. Professional vs User reviews by RogueyWon · · Score: 4, Informative

    I've noticed one big difference between the "professional" reviews on major sites and user reviews on Steam/Amazon etc.

    By and large, the professional game reviews tend to cluster their scores in the 6/10 - 8/10 range. You have to be exceptionally good to get above that level or exceptionally bad to fall below it. You also - in most cases - get relatively little variation between professional review scores. A game might be 8/10 on one site and 9/10 on another, but it is rare to see a gap larger than 2 or at most 3 points. It does happen - Alien Isolation has had professional reviews ranging from 4/10 to 10/10 - but generally only with unusual games that go outside the usual templates (like Alien Isolation).

    User reviews on the other hand, tend to be much more polarised. It's by no means unusual for games to pick up 10/10s from some users and 1/10s for another. Personal biases are much more likely to feature in user reviews ("I'm giving this game a 1/10 because I don't like something the developer said on twitter" or "I'm giving this game a 10/10 because I've spent the last 2 years boring everybody rigid about how good it is going to be and don't want to backtrack"). Often, the scores tend to average out in more or less the same place as the professional reviews once you have enough of both, but with much more divergence on the user reviews.

    So which is more useful?

    By and large - and with some important caveats - I find the professional reviews more honest and useful. A lot of people complain about the clustering of scores in the 6/10 to 8/10 range, but the nature of the modern games industry (quite risk-averse, with a lot of project oversight) means that most commercially produced games tend to fall into that range. If you assume a 6/10 is "not great, but overall more good than bad" and an 8/10 is "high enjoyable but not ground-breaking", then you're left with a spectrum into which most major releases fit. The industry does throw out the occasional piece of brilliance - which is usually recognised. And sometimes, things go wrong and it throws out the odd turkey (Aliens: Colonial Marines being perhaps the most recent example). When those things happen, most of the big review sites do seem to reflect them.

    But those caveats I mentioned before are important. The first is that at the end of the day, the people doing the professional reviews are still human and they still have their own biases, preconceptions and agendas. True, they have people watching them to make sure that they don't give free reign to those... but occasionally, those checks and balances fail. In fact, most of the big review sites have a few known quirks that you learn to watch for. Eurogamer, for instance (which despite the criticism I'm about to hand out, I do, in general, rate highly), has a real Nintendo-nostalgia fetish and a habit of over-scoring first party Nintendo games. At the same time, until fairly recently, it went through a phase of trying to shoehorn political correctness into its reviews and marking down a few games which committed real or perceived transgressions (though I've noticed less of this recently).

    The next big caveat with professional reviews is around bugs. The big review sites are often given pre-release copies of games, so that the reviews can go live before release. Indeed, a lack of pre-release reviews is often an early sign that a game will be a turkey (again... Aliens: Colonial Marines had a review embargo until its release day). Thing is, sometimes those review copies are unfinished code. And sometimes they aren't. But regardless, there is a tendancy for professional reviewers to either ignore or to be instructed to ignore bugs, on the basis that "they'll be fixed for release". And, surprise surprise, they often aren't fixed for release. User reviews are often your first warning that a game is a buggy mess - though on PC you do have to try to separate out the inevitable complaints that pop up on every new release's forms to the effect that "it won't run on my 8 year old PC running a

    1. Re:Professional vs User reviews by Darinbob · · Score: 1

      I agree here, the "pros" are very much pro-game, no matter what it is. I've seen reviews that point out all sort of flaws and it still gets 7 out of 10. A recent game full of hype, all reviews glowing, probably get game of the year, yet I look at one game play video and I can tell it's crap and dishonors the source material. But by the professional reviews it looks to be the perfect game for any gamer at all and should even be purchased by non gamers.

      Users reviews however will often bring up stuff the pros leave out. Reboots of a franchise for example, the pros will say how great the graphics are, they'll use language to appear to the mass market, whereas the fans of the series will actually point out how the reboot screws things up or is dumbed down or how the gameplay is not representative. The pros are really out of touch, they play too many games without enough time to adequately review them, they're heavily swayed by the fluff, and are completely inadequate to review niche games.

      For example, no professional reviewer, paid by the big console makers, is going to write "this game was dumbed down for the console crowd". No pro will say "they should have left it alone instead of trying to a misguided reboot."

      On the other hand, sometimes the fans get too emotional. Something that doesn't live up to their expectations is downgraded too harshly, or something that insults the source material becomes unforgiveable (I'm still raging over Mission Impossible movies).

      I'd like to see ratings that are based on the style of play or a comparison to other genres or games. Ie, "if you loved assassin's creed, then you'll love this game, but if you loved Thief The Dark Project you will despise it with all your heart". Doesn't help out the mass market who play anything and everything, but it might help out the more selective game players who've never even seen a console.

  15. Because they don't care by Sycraft-fu · · Score: 1

    Valve is extremely lazy, and Steam has allowed that. They have by far the majority of digital PC game sales, and most PC sales are digital these days. So they make tons of money doing very little work. This has allowed them to do what they really like doing: Faffing about with random projects, not worrying about any kind of deliverables.

    Unfortunately, lacking any real competition, Steam has no reason to really get better. The only store I would say is actually better than Steam is GOG, but they have the problem of not using DRM, which means many publishers won't release on them.

    Steam isn't likely to get much better unless it has to. If some other company makes a good games store, as in one that actually hosts and deals with distribution, not just one that sells Steam keys (as many do) and it starts to eat in to Steam's market, then maybe Valve will give a shit, get a bigger dev team, hire some CS people, etc. However until then, they'll continue to just mess around.

    They aren't beholden to investors, they make so much money (like $40 million PER EMPLOYEE per year) they don't have to adhere to any kind of schedule, and they are the one and only place many will go to buy games. As such, they can have it be a big shitfest and it doesn't really matter.

    Personally, I go out of my way to buy any game I can on GOG, rather than Steam, if said game is available on GOG. They actually curate their store, make sure minimum quality standards are met, make sure old titles run, and don't put out unfinished games (aka Early Access). However, there's a lot I can't get there, and I'm one of the few people who do. Most gamers have this "Steam is the only place to buy, Gabe is god, all hail Valve," attitude.

  16. Re:Ethics can't be patched in by Luckyo · · Score: 1

    It hasn't even started yet.

    https://www.youtube.com/watch?...

  17. Re:Ethics can't be patched in by Luckyo · · Score: 2

    And in case you're actually interested: here's the current status of the boycott campaign known as #gamergate:

    https://gitorious.org/gamergat...

    12 mysogynist kids apparently have indeed moved on. The >12.000 adults pissed off at utter lack of ethics and betrayal by people they expected to stand by them and their hobby on the other hand are just getting started.

  18. Re:Doesn't even look like an algorithm by hey! · · Score: 3, Insightful

    It's not an algorithm, except in the trivial sense. It's a formula for calculating an adjusted rating value that discounts extreme ratings for items with small numbers of reviewers.

    This actually matches what you do intuitively when you see an item with a single rating of 5.0 at the top of a list, just above another item with an average rating of 4.9 from a thousand users. You mentally deduct a bit from the "top rated" item because you know it's probably too high. Likewise a 1.0 rating from a single user is probably too low, so you mentally add a bit to that.

    The question is, how much to deduct or add from the score?

    The approach suggested is to ask a slightly different question. Instead of "what is the average rating of the product", you ask "what percentage of positive ratings can I be 95% certain the product would score above have if *everyone* rated it?" It turns out there's a number of mathematical formulas that are supposed to tell you precisely that.

    There's still a lot of arbitrariness in this approach. Why 95%? I'm reasonably sure that results would be just as intuitively reasonable if we chose 80% instead. But if 95% seems to generate intuitively reasonable results there's no particular reason to monkey with that parameter.

    BUT, I think, the level of arbitrariness involved probably means we could choose a simpler approximation than the Wilson interval if we could dream one up. The more familiar Wald interval taught in basic statistics courses is somewhat simpler, but not so much that it's worth worrying about, at least not if you're doing the calculation on a database server which typically has a few CPU cycles to spare.

    If I were to attempt something like this on a massive scale in an environment where CPU cycles were precious, I'd probably devise some kind of simple algebraic scaling formula that tweaked scores toward the mean, depending on the number of ratings. The results wouldn't be quite as good as the Wald or Wilson intervals, but maybe not so much less good that anyone would notice.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  19. A system that works by bsdasym · · Score: 1

    What we really need is something like Pandoras system. Something where my own ratings, as a user, are factored in. If 10,000 random users rank a game as 9/10 but I thought it was a 3/10, we obviously disagree on some things. Match me against people who review similarly to me if you want to help me find games to buy. Use the statistical algorithm as a fallback.

  20. Dumb All Over by Cajun+Hell · · Score: 1

    If instead of talking about Steam, we were talking about iTunes Store or Google Play or XBox Live, 100% of the Steam users here would immediately start laughing about how stupid "those people" are, to be using the store to determine what to buy. That is obviously the very last intell source that you'd use. THAT WOULD BE STUPID.

    But somehow, if you're a Steam user, all your common sense happen to be inapplicable, whenever we happen to be talking about Steam (and you get your common sense back whenever you talk about the iTunes store or XBox Live). You and they can each look down on each other, correctly secure that you're wiser than the other, and oblivious to the fact that you're also dumber than the other.

    And you both chuckle at the guy who uses Amazon's star ratings to determine which widget to buy from Amazon. How fucking moronic is that guy? Doesn't he know how to Google for reviews? He stares back at you, being dumb in his Amazon purchases, yet shaking his head at how idiotic you two behave, when you're shopping for software.

    But anyway, no, obviously of course, you wouldn't ever actually use Steam, to determine what games to buy on Steam. (Steam's rating system is totally irrelevant, because they're selling the things they're trying to rate. It's impossible for anyone to do a good job of that, unless you define the job as Fuck The Users.) To determine what to buy on Steam, you use the same method as you'd use for any other store: you go read disinterested third party reviews published on disinterested third party media, just like you expect the Amazon and Apple and Microsoft and Google customers to do, and you shake your head with sadness and despair for humanity's dim future, every time you see people doing it exactly, perfectly wrong.

    But NOOOO, the one store I use, happens to also be the first store in history to have done it right and be trustworthy! Because I am SPECIAL!!! My vendors never have conflicts of interest!

    "Dumb all over, yes we are, dumb all over, near and far, dumb all over, black and white. People, we is not wrapped tight." -- FZ

    --
    "Believe me!" -- Donald Trump
  21. Re:Doesn't even look like an algorithm by Darinbob · · Score: 1

    You can't really rate games on Steam that well. If you write a review you get a choice of thumbs up or thumbs down. Unless they've got some AI programming analzying the words that are written then how do they rate them? If they rely on number of thumbs up versus thumbs down, then how to they measure a tepid response?

  22. Re:Next: Fix Rotten Tomatoes by Triklyn · · Score: 1

    i trust one reviewer, mark kermode, unless he's out, and i have to make due with his replacement. His recommendation is gospel truth.

  23. Re:Next: Fix Rotten Tomatoes by CrashNBrn · · Score: 1

    Rotten Tomatoes should be thrown in the garbage.

    The wife used to go there for *anything* that was being considered for going-to-the-movies/netflix/rental/etc. Then I asked her what a few shows we had seen that I had chosen (thus we didn't go to RT prior)... they were universally (both critic and fans) panned.

    I'll stick with choosing movies by Actor and Director, it's at least right 66-75% of the time. There are also some Production companies/subsidiary studios that seem to output quality near every time too (Clint Eastwood's company for instance).