Slashdot Mirror


'Tit for Tat' Defeated In Prisoner's Dilemma Challenge

colonist writes "Tit for Tat, the reigning champion of the Iterated Prisoner's Dilemma Competition, has been defeated by a group of cooperating programs from the University of Southampton. The Prisoner's Dilemma is a game with two players and two possible moves: cooperate or defect. If the two players cooperate, they both have small wins. If one player cooperates and the other defects, the cooperator has a big loss and the defector has a big win. If both players defect, they both have small losses. Tit for Tat cooperates in the first round and imitates its opponent's previous move for the rest of the game. Tit for Tat is similar to the Mutual Assured Destruction strategy used by the two nuclear superpowers during the Cold War. Southampton's programs executed a known series of 5 to 10 moves which allowed them to recognize each other. After recognition, the two Southampton programs became 'master and slave': one program would keep defecting and the other would keep cooperating. If a Southampton program determined that another program was non-Southampton, it would defect." Update: 10/14 15:08 GMT by J : If anyone wants to try writing their own PD strategy and see how it fares in a Darwinian contest, I'll host a tournament of Slashdot readers. Here are the docs, sample code, notes on previous runs, and my email address.

83 of 356 comments (clear)

  1. Scary Stuff by mfh · · Score: 5, Funny
    FTA:
    • If you confess and your partner denies taking part in the crime, you go free and your partner goes to prison for five years.
    • If your partner confesses and you deny participating in the crime, you go to prison for five years and yor [sic] partner goes free.
    • If you both confess you will serve four years each.
    • If you both deny taking part in the crime, you both go to prison for two years.
    This sounds pretty much like the RIAA might be involved. I would deny everything if I were you!
    --
    The dangers of knowledge trigger emotional distress in human beings.
    1. Re:Scary Stuff by parvenu74 · · Score: 2, Interesting

      Or under the Patriot Act, whether you confess or not you go to jail for an undetermined period of time, during which charges may or may not be filed against you...

    2. Re:Scary Stuff by Unoti · · Score: 2, Insightful

      But, if you confess, then freedom marches forward!

  2. That's not really so special by Perianwyr+Stormcrow · · Score: 4, Interesting

    In other words, an in-group can work vs. tit for tat if it outnumbers it. I'd like to see a trial with a slow trickle of immigration of tit for tats into a large population of S/M programs. That might be illuminating. I suspect the outcome would be that tit for tat still does well.

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

    1. Re:That's not really so special by Minwee · · Score: 4, Funny

      I was with you up until you started talking about tits and S/M... Am I still on slashdot, or did I wander onto alt.com by mistake?

    2. Re:That's not really so special by Anonymous Coward · · Score: 5, Informative

      What's being ignored is that the total profit of all the colluding algorithms is less than that of Tit-for-Tat, which makes the solution unviable in real-world Prisoner Dilemma situations. (bidding on large construction projects under certain auction formats, etc)

      As an analogy of unprofitable collusion, I could win the World Series of Poker by hiring enough shills and paying their time and entry fees. I would lose money by doing this, probably more than I could recoup with post-tournament income via endorsements/books/whatever.

      The parent is correct. Tit-for-Tat is still superior in equal numbers, and a modified Tit-for-Tat that can spoof the recognition algorithm of colluders will trounce them.

    3. Re:That's not really so special by harrkev · · Score: 4, Insightful

      Yup! It is the "outnumbers" thing which (in my opinion) makes things unfair.

      Had this been an actual prisoner's dilemma, this winning strategy would require recruiting a large number of thugs who LIKE going to prison and are willing to "take one for the team."

      Although cooperation is not explicitly defined as being against the rules, IMHO, it goes against the "spirit" of the competition. The point is that each algorithm is supposed to act in a greedy manner.

      This will no doubt spark a LOT of discussion, but to me, they "cheated." (OK. Maybe "worked the system" is a better phrase).

      --
      "-1 Troll" is the apparently the same as "-1 I disagree with you."
    4. Re:That's not really so special by jamie · · Score: 3, Informative
      If you'd like to try your hand at it, please feel free to email me the code any program you'd like to try. (Or just describe to me the strategy you want to take and I can probably write code for it.)

      I've written perl scripts to run Prisoner's Dilemma tournaments on my website (the perl source code for all the species in the last run is on that page) and new submissions for the next tournament are welcome. Since my tournament does Darwinian selection on every agent that plays in it -- agents which don't earn points eventually die off, those which earn a lot of points reproduce -- self-sacrificing pairs of strategies won't do very well.

      I haven't yet written code which can trickle in new members over time, but that really wouldn't be very hard to add (I could probably set it up in an hour if you are really interested :)

    5. Re:That's not really so special by KDan · · Score: 2, Insightful

      That's what I thought while reading the article... also making use of that code is assuming a meta-knowledge of the game, ie some sort of way you can have prepared for that particular and specific instance of the game where the specific problem is the prisoner's dilemna with its known simple outcomes. Whereas tit-for-tat is a much more generic theory that can apply to a game which you don't know about yet (eg, say, one that has N possible outcomes rather than only 4), by simply stating "try to choose the mutually beneficial outcome first and then mirror your opponent's moves", the program they devised makes use of specific knowledge of how the game works (eg with those recognition sequences..). Because of that it is clearly inferior and more a hack of PD than a game theory idea.

      But, says Kendall, "Everybody in our field knows the name of Anatol Rapoport, who won the Axelrod competition. So if you can win the 20th-anniversary one, in our field there's a certain historical significance."

      But in this guy's case the significance will be lost because he didn't win through any significant idea, but through a hack. As he says earlier, it's the research that counts, not the outcome.

      Daniel

      --
      Carpe Diem
    6. Re:That's not really so special by phoenix321 · · Score: 2, Interesting

      I think this is a fairly stunning result that is able to explain, why aristocracy and monarchy proved pretty successful for thousands of years.

      If Most sacrifice themselves for a few "chosen" ones they can do better than most others against the rest of the "world". Pretty convincing.

      If this strategy succeeds, we have a fairly sound theory why and how monarchy evolved from simple tribal structures. Secret societies, hierarchies and everything else would suddenly seem logical.

      Does not leave a feelgood-residue like having read Axelrod, but at least we know it now...

  3. That's why... by Stile+65 · · Score: 4, Funny

    ...fraternities and secret societies work so well!

    I'm off to join the Freemasons. Be back in a few.

    --
    I claim first use of "Error No. 0B" - or "No. 0B error." It'll be the new ID 10T!
  4. Practicality by Perianwyr+Stormcrow · · Score: 2, Funny

    I generally hope that knowledge of the prisoner's dilemma will never become a practical factor in my life.

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

    1. Re:Practicality by Daniel_Staal · · Score: 4, Insightful

      Sorry, you've probably already lost that one. The prisoner's dilemma is quite useful in normal life, or at least the thinking that gives rise to the solution is. It applies any time there is significant advantage to be gained by working together, but also much advantage to be the one 'cheating'.

      For /., try this interpretation:
      If we both share our source code, we will both will be more productive.
      If I share my source code, and you don't, you can be more productive. (Assuming you can use mine.)
      If neither of us share, we both will have to re-create other's work...

      --
      'Sensible' is a curse word.
    2. Re:Practicality by clausiam · · Score: 2, Insightful
      It applies any time there is significant advantage to be gained by working together, but also much advantage to be the one 'cheating'

      As in VotePairing :-)

    3. Re:Practicality by naoursla · · Score: 2, Informative

      Unfortunately, PD has a dominant strategy to cheat your opponent. No matter what your oppoent does, you can do better by cheating.

      Iterated PD is more interested since it lets your opponent punish you for cheating. So you get into some interesting social issues.

      How well a given strategy does depends on the strategies other in its community are using. If the population is heavily cheater based, then agents that cooperate will lose big time. However, if there are enough cooperators, then they will form a coalition of sorts, and even though they will lose to the cheaters, in the end they will come out on top.

    4. Re:Practicality by gadget+junkie · · Score: 2, Informative

      "[...]How well a given strategy does depends on the strategies other in its community are using. If the population is heavily cheater based, then agents that cooperate will lose big time. However, if there are enough cooperators, then they will form a coalition of sorts, and even though they will lose to the cheaters, in the end they will come out on top."

      I remember reading about the tournament in a "Science" magazine article, back when the original tournament was done; Tit for Tat won irrespectively of the number of strategies it played against.
      Furthermore, in a variant in which the winning strategy "spawned" more often, and the loser did not, Tit for Tat became a majority of the population irrespective of the initial sample, except in the extreme case of only one tit for tat and defectors. This is explained better here. The most interesting thing was, for me, the fact that Tit for Tat was superior to a strategy in which the program responded with a delay, i.e. it made the opponent's move two turns down the line. So, remember this if you have kids, or a pet, or both like me: whatever reaction you deem appropriate, it should be done soon, or not at all.

      NOTE: the payoff was described as such:if A cooperates and B cooperates, both get X points; if A "defects" it gets W points to Z for B;if both defect, they get Y, where:

      W > X > Y > Z, all positive integers.

      --
      "If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
    5. Re:Practicality by spitzak · · Score: 4, Insightful

      Yes I agree that public domain code is very much the same as the prisoner delimma.

      The GPL is an attempt to make it *not* the prisoner delimma by forcing the other side to cooperate if you do. This eliminates the losing part of the cooperation choice and thus it is no longer a delimma.

  5. only thing I can say is... by jjeffries · · Score: 3, Funny
    HOW ABOUT A NICE GAME OF CHESS?

    Lameness filter encountered. Post aborted!
    Reason: Don't use so many caps. It's like YELLING. It's not at all like a TALKING COMPUTER. You are a bad man. Go away.

    1. Re:only thing I can say is... by OptimizedPrime · · Score: 2, Funny

      So what you're saying is my algorythm won this year since I kept it written on a post it on my desk instead of submitting it?

  6. Uh, isn't that just cheating? by DoorFrame · · Score: 4, Interesting

    I mean, the whole point of the Prisoner's Dilemma is that you don't have all the information. You don't know what your partner/opponent is going to do and you have decide based entirely on what little information you have based on your history with your partner/opponent. What these people are doing is creating a pattern to be recognized by another player, and then working as a team. And, it's not like they're people where one person might change their mind and decide to defect unilaterally... they're programs. Once they've locked onto each other as the same program, that's it. They'll play to their advantage until the end.

    The real trick is to find a program that can beat other DIFFERENT programs, not beat itself. This seems really stupid, or am I missing something?

    1. Re:Uh, isn't that just cheating? by Urban+Garlic · · Score: 4, Insightful

      Well, part of the interest is that these programmers found a way, within the rules, to get more information, by means of their "secret handshake". The important lesson (to my mind) is that the environment can be manipulated in surprising ways to get a desired result. That's creativity and innovation doing its thing.

      Interestingly, this strategy is also fairly "brittle", I think, in that simple rule-changes could foil it. Requiring only one submission per team, for example, or scoring teams according to the total (or average) scores of all their programs, would complicate any strategy of collusion.

      --
      2*3*3*3*3*11*251
    2. Re:Uh, isn't that just cheating? by Have+Blue · · Score: 2, Informative

      That's why this is the Iterated PD Challenge, not just the classic PD. If the competitors only played ONE round of PD each, the contest would not be very interesting. Their past performance in the contest itself is part of the "history" they are using to evaluate their choices.

    3. Re:Uh, isn't that just cheating? by julesh · · Score: 3, Insightful

      I suspect you could come up with a solution that beats this system by mimicing it, then changing its behaviour suddenly.

      That's why this isn't a good answer to the problem -- not because it's somehow "cheating", but because its a strategy that only works in limited circumstances and fails spectacularly in others. Kind of like chasing down losses when gambling.

    4. Re:Uh, isn't that just cheating? by Minwee · · Score: 4, Insightful

      I don't see it as cheating. It's a lot like Bridge -- The rules say that you can't show your partner your hand and you can't tell them what you have, but you are allowed to use prearranged bidding conventions to pass information across the table. All that the Southhampton agents did was use a bidding strategy. They did act as a team, but they had no out-of-game way of knowing that they were up against a team member. That doesn't break any rules, and it did work. The Southampton team took the top three spots in the competition. If you insist on comparing the entrants to people, consider this. They worked as a team, for the good of the team, knowing that at least some of them would win even if the others bombed. People do that kind of thing all the time outside of competitions. Why should it be so out of place here?

    5. Re:Uh, isn't that just cheating? by gadget+junkie · · Score: 2, Interesting

      "So the payoff of alternating is 2.5. This is still lower than "tit for tat" if the "tit for tat" programme manages to cooperate all the time. But this will generally not be the case."

      In reality this WILL be the case. Tit for Tat cooperates at the first turn, and then copies the opponent, so the payoff is 3 with no volatility of returns.For all the effort they make, both TFTs could go have a beer and let a computer take over ;-).
      S/M has a lower return, PLUS it pays something in the handshake period.

      Now for something really interesting: nobody has really spent time on the economy of effort of the TFT strategy. all the other strategies that proved viable, not only did not win, but they needed substantially more resources (line of code, etc). In my view, that's part of the reason of the survival of cooperation in its present form: it is simple, it works, and when you play with someone using it you're buddies for life.

      --
      "If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
    6. Re:Uh, isn't that just cheating? by snilloc · · Score: 2, Informative
      I'm not sure how this particular iterated PD game works, but in the serious academic version, there is uncertainty about which round is the last round. This accounts for "last round" defections, and also the subsequent collapse of any cooperation strategy by working backwards induction through the game and defecting on the "real" last round to one-up your opponent. (Knowing that your opponent is likely to defect on round N, you defect on N-1, N-2, N-3, back to the start of the game with a dominant "defect" strategy).

      Sort of like the parable of the teacher who said there'd be a pop quiz but you wouldn't know what day it was. Obviously it wasn't Friday, because on Thursday you'd know the quiz was friday. Likewise for Thursday because if the quiz wasn't friday, and if you got to wednesday w/o a quiz, then you would know the quiz was thursday, and therefore the quiz couldn't be on thursday... leading to the conclusion that there is no quiz. Not a perfect example (since then the teacher could give the quiz on any day and it would be equally unknowable given the possibility of "no quiz"), but I hope it clarifies some.

  7. Does this defeat the purpose? by Snowspinner · · Score: 5, Interesting

    This seems to me to be an unfair way to "win." The point of the PD simulation is to talk about whether, in the absence of any social consequences, it is better to screw someone over for money or to work cooperatively with them. It's not a perfect model for that question, but that is still the question that makes us care about the PD in the first place.

    All this has done is make a meta-PD game in which the two programs create a meta-game in which they agree to cooperate. That is to say, this is a solution to the PD problem that relies on the cooperation of a cohort (Someone to keep choosing loyalty while you defect and get all the money). Which is exactly not the point of PD.

    So the real headline, I think, is "Trivial flaw found in definition of Prisoner's Dillema problem. University of Southhampton wastes money demonstrating flaw instead of writing a goddamn paper like a normal person would."

  8. ...by cheating! by The-Bus · · Score: 4, Insightful
    If the program recognized that another player was not a Southampton entry, it would immediately defect to act as a spoiler for the non-Southampton player. The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team. Another twist to the game was the addition of noise, which allowed some moves to be deliberately misrepresented. In the original game, the two prisoners could not communicate. But Southampton's design lets the prisoners do the equivalent of signaling to each other their intentions by tapping in Morse code on the prison wall. Kendall noted that there was nothing in the competition rules to preclude such a strategy, though he admitted that the ability to submit multiple players means it's difficult to tell whether this strategy would really beat Tit for Tat in the original version. But he believes it would be impossible to prevent collusion between entrants.


    Yeah, that's not the Prisoner's Dilemma. Or even the Iterated PD. This whole "signaligng Morse code" on the prison walls is nonsense, because it was not part of the original plan. Just because it's not in the rules doesn't mean you can do it. In Chess there's no rule specifically against me bringing a SuperGrape(TM) onto the board. The SuperGrape(TM) immediately destroys all pawns on a color of my choosing.

    No, it doesn't work that way.

    While this is an interesting experiment, it's not a true victory.
    --

    Small potatoes make the steak look bigger.

    1. Re:...by cheating! by dr_labrat · · Score: 2, Informative

      I'm not sure you read the article correctly.

      this is not cheating, what is happening is throughout the iterations the programs can experience the equivalent of morse code in the patterns of defections and co-operations in the form of the penalties.

      i.e.: 4yrs 4yrs free free 2years.

      As this is iterative, and no actual lifespan (in human terms) the pattern can become quickly evident. Kind of like a "penalty handshake".

      In this way one program can detect the intentions of what is happening, without actually communicating directly.

      --
      The secret of success is honesty and fair dealing. If you can fake those, you've got it made. (Marx)
    2. Re:...by cheating! by wo1verin3 · · Score: 2, Funny

      See I was trying to bring a supergrape in to play against my grandfather and he didn't believe me.. thank god i have someone to back me up now!

    3. Re:...by cheating! by billbaggins · · Score: 4, Informative
      Actually, this is exactly the sort of thing that the organizers were hoping would happen. From the FAQ, question 12:
      But we don't want to [impose limits on the number of entries] as it will be interesting to see if people can come up with strategies that cooperate with themselves within the whole population.
      --
      "The best argument against democracy is a five minute chat with the average voter."
      --Winston Churchill
    4. Re:...by cheating! by Kupek · · Score: 2, Informative

      The whole reason people are interested in the Prisoner's Dilemma (and Iterated) is that they are models for situations encountered in science. This isn't a comepetition like chess where we're trying to see who (as in the humans) are the best at it. We're trying to see what interesting results come of it. This is an interesting result.

  9. Interesting Idea, but... by TJ_Phazerhacki · · Score: 2, Interesting
    How difficult is something like this, really? One of my graduate level friends did some sociology work on a Game theory system like this, and if you know the rules, you can really beat the system.


    Just curious, thats all. Anyone have any experience in the field?

    --
    Physics is nothing like religion. If it was, we'd have an easier time trying to raise money!
  10. Evolutionarily stable? by Dr.+Manhattan · · Score: 4, Insightful
    From TFA:"Our initial results tell us that ours is an evolutionarily stable strategy -- if we start off with a reasonable number of our colluders in the system, in the end everyone will be a colluder like ours," he said.

    It's not clear to me how the entries determined who would be the 'master' and who would be the 'slave'. It seems that if you had lots of 'colluders' around who could be induced to 'suicide' for another's benefit, you'd very quickly get cheaters who worked to be the 'master' in all situations.

    This strikes me as a lot more reminiscient of the Hawk/Dove situation.

    --
    PHEM - party like it's 1997-2003!
  11. Tit for Tat by alexo · · Score: 4, Funny

    Why should Tat get all the fun?

  12. Yes, actually by Perianwyr+Stormcrow · · Score: 3, Informative

    But the proper test is really whether the master half of these programs can do better than tit for tat on a large scale basis. I suspect that the S/M program will still do less simply because it plays a pattern during the interaction phase which is likely to result in tit for tat still coming out ahead- if there is one tit for tat, it won't do so well since the costs of being tit for tat are relevant if you don't know the master sign and most of those you interact with are expecting to hear it. But that's already well known. If tit for tat's numbers start growing, it does better. You see, tit for tat has an identification mechanism too, which is simply that it always starts out nice and immediately gets nasty if it gets fucked. If the number of tit for tats increases to a reasonable critical mass, they can have enough positive reactions to do very well. In fact, they'd become a secret society within the S/Ms!

    In short, if tit for tat is isolated, it won't do so well since everyone is fucking with it. If there are just a few tit for tats out there, their power increases significantly with each one added.

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

    1. Re:Yes, actually by ErroneousBee · · Score: 2

      I think youve missed the point of the S/M. It can play tit for tat with everyone except opponents identifying themselves as slaves. It will always win in this situation. Unless someone cracks the code of the slaves and starts abusing them too, or its gets its own slavebots to identify and target the opponents master. Also, it is possible for slaves to try and identify what opponents are using what strategies and communicate that to the master who can adjust to exploit it.

      Its means that a new evolution in game theory experiments is occuring, in that now a cold war situation is arising where alliances will have to form against other alliances.

      I think a similar thing is occuring in online casinos, where bot networks are working against real players. This may be fraud, but its very hard to detect.

      --
      **TODO** Steal someone elses sig.
  13. Spirit of the PD by johnthorensen · · Score: 4, Interesting

    Not precisely cheating, as the rules are set up to play this way...but this certainly violates the spirit of the original Prisoner's Dilemma. Why?

    Real prisoners only get to choose ONCE.

    By taking advantage of the multiple-iteration aspect of the simulation with this sort of 'portknocking' strategy, the winning programs kind of take a cheap shot at the original PD.

    Of course, it's all hypothetical anyway, and come to think of it Tit For Tat technically takes advantage of the multiple-iteration aspect as well by doing whatever the opponent did the last time...

    Ah well, at least the Wikipedia entry makes a distinction between regular "Prisoner's Dilemma" and "Iterated Prisoner's Dilemma".

    1. Re:Spirit of the PD by amorsen · · Score: 3, Informative

      Err, of course this is the iterated prisoner's dilemma. It is quite easy to do the optimal thing in the non-iterated case: defect. You couldn't make a competition out of that.

      --
      Finally! A year of moderation! Ready for 2019?
    2. Re:Spirit of the PD by b1t+r0t · · Score: 2, Informative
      Yep. If you know that a given move is the final move, the optimal action is to defect. When it's not iterated, there is only one move, so of course it's going to be the last move.

      I presume there is no way the entrants can know how many moves will be in a given round, or Tit-for-Tat could be slightly be beaten by a modified TfT which always defected in the final round.

      --

      --
      "Open source is good." - Steve Jobs
      "Open source is evil." - Microsoft
  14. The important codicil to the story is... by aug24 · · Score: 4, Insightful

    "The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team."

    J.

    --
    You're only jealous cos the little penguins are talking to me.
  15. The article got it wrong by nels_tomlinson · · Score: 5, Informative
    The article got it wrong: they compared the tit-for-tat strategy for the iterated prisoner's dilemma to mutual assured destruction. That's wrong, since nuclear war is usually considered to be a one-time game: once you've blown each other up, there is no next round. Tit-for-tat requires that there always be a following round.

    Repeated games have radically different outcomes than one-time games. It's long been known that where cooperation is possible, cooperation can beat solitary strategies in repeated games. I really don't think there's anything surprising here.

    1. Re:The article got it wrong by Anonymous Coward · · Score: 5, Insightful

      "The article got it wrong: they compared the tit-for-tat strategy for the iterated prisoner's dilemma to mutual assured destruction. That's wrong, since nuclear war is usually considered to be a one-time game: once you've blown each other up, there is no next round. Tit-for-tat requires that there always be a following round."

      The nuclear MAD comparison is apt, because of the time lag between launch detection and detonation. During the flight time of the first launch, there is time for several rounds to occur.

      Actually, the nuclear standoff could be considered an ongoing PD game with both sides playing Tit-For-Tat strategies. The rounds occur every few minutes with both sides asking "did the other side screw us yet" and responding "no, so we won't screw them yet". This PD game has consisted of millions and millions of turns already, with both players using historical knowledge to influence their current choices.

  16. Asian mentality by pubjames · · Score: 3, Interesting


    ok, here's a weird thought. In many Asian countries, the mentality is to work as a group, rather than individually, with the individual sacrificing themselves for the group if necessary. In the USA and most of the "western" world, we tend to act more as individuals. We tend to think think our system is better, but what if we're wrong? Perhaps, as this experiment shows, the Asian mentality may actually be the superior strategy?

    China has been most consistently the biggest superpower over mankinds history, and it looks like it's going to be that way again in a couple of decades. Perhaps these things are related...

    1. Re:Asian mentality by Perianwyr+Stormcrow · · Score: 3, Insightful

      China has usually been cultured but not powerful. Chinese history is a long sequence of conquests by powerful outsiders (Manchurians, Mongols, Europeans.)

      --

      What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

    2. Re:Asian mentality by Anonymous Coward · · Score: 3, Insightful

      Actually they have been powerful, many, many times. But always at the hight of the current dynasty.

      The cycles usually work as such: There's a period of chaos (warring states, etc), usually ending up with some external power coming in and conquering. Then a majority kingdom is established (didn't always own all of what we now call China, or sometimes more than present day, but whatever). Then there'd be a period of hightened trade. The influence of external nations would prompt both an interest in other cultures and a florishing of culture within the country. Then the nation would gain in power, boarders would become more defined, government would be stable and well established. The civilization would reach the peak of it's power. This high period could last anywhere from tens of years (later dynasties) to thousands (early zhao?).

      Then the country would start to get too bureaucratic, too dogmatic. The boarders would be systematically locked down, the country would isolate itself. Xenophobia would reign. Finally interior corruption would fragment the government into separate regions. Civil wars would begin, and with this discord the next batch of foreigners would invade.

      The moral: Isolation and xenophobia suck.

      Mod -2 Too Much Fucking Information ;)

    3. Re:Asian mentality by Jerf · · Score: 2, Insightful

      Perhaps, as this experiment shows, the Asian mentality may actually be the superior strategy?

      Oh, this is a bad time to get all multicultural.

      Sure, it works out great for the Masters, who get to the winners circles on the backs of their Slaves.

      Meanwhile, if you want to call Tit-for-Tat the Western strategy, everybody mostly wins after a while, even though few do really well.

      I don't believe either categorization. I'm just pointing out that if you're going to base your argument on this article, you are saying that it is good that a few individuals come out better, at the expense of a lot of other individuals, in the putative Asian system of thought. Which I find barbaric, though YMMV.

      I once defined a political axis as "people who know they would be kings, vs. people who think they would be serfs". Sounds like I can guess where you come out on that.

    4. Re:Asian mentality by mdfst13 · · Score: 2, Insightful

      "In many Asian countries, the mentality is to work as a group, rather than individually, with the individual sacrificing themselves for the group if necessary."

      But that isn't what this system does. Individuals do not sacrifice themselves for the good of the group; the group sacrifices itself to build up individuals. It is more like a feudal joust. If the king enters, all his opponents withdraw, making him the defacto winner.

  17. It is not the first by Flyboy+Connor · · Score: 4, Interesting
    Axelrod never claimed that Tit-for-Tat was the best PD-playing program. He just stated that Tit-for-Tat would play well against any other combination of programs. Actually, IIRC, in the second tournament he organised Tit-for-Tat came in second. There was a different program that managed to exploit faults in other programs.

    It is easy to score better than Tit-for-Tat in Axelrod's (original) tournament. He included a program that played random moves. It is not difficult to recognise this program after, say, ten moves have been played. You can always defect against random, because its moves are unrelated to its history. So, a program that plays Tit-for-Tat by default, but always defects against Random, scores better than Tit-for-Tat.

    Does this dillute Tit-for-Tat's accomplishment? Of course not. Tit-for-Tat still plays well. And it is such a simple strategy that it can be programmed in two lines ("C on move 1, then copy opponent's previous move"), which none of the other programs achieve. Tit-for-Tat is simple, elegant, and strong. It's beautiful.

    Southamptom entries, on the other hand, are complex, sneaky, and cheating against (perhaps unwritten, but nonetheless agreed-upon) rules. They're ugly. They only prove that backstabbing cheating bastards may defeat just-and-fair if the referee is looking the other way for a moment.

    1. Re:It is not the first by jaaron · · Score: 2, Insightful

      Southamptom entries, on the other hand, are complex, sneaky, and cheating against (perhaps unwritten, but nonetheless agreed-upon) rules. They're ugly. They only prove that backstabbing cheating bastards may defeat just-and-fair if the referee is looking the other way for a moment.

      Sorry to be the one to break it to you, but sometimes life is just that way. :)

      --
      Who said Freedom was Fair?
  18. It always works for card games by CyberGarp · · Score: 2, Insightful

    Communication between secret partners has been one of the most undefeatable stratgies in cards for a long time. Didn't take a computer to figure that out. Someone just figured out how to do in the rules given for this competition.

    --

    I used to wonder what was so holy about a silent night, now I have a child.
  19. Wait a Minute! by Waffle+Iron · · Score: 2, Funny
    What I want to know is:
    What is tat?
    Where do I get it?
    And how do I exchange it for the other thing?

    --Dennis Miller (IIRC)

  20. Kin Selection in Genetic Algorithms by Baldrson · · Score: 3, Informative
    This is a clever demonstration of kin selection among groups of competing algorithms.

    A mathematical treatment of population genetics in groups was given by W. D. Hamilton in "Innate Social Aptitudes of Man". In the last sentence of that paper, Hamilton, the originator of modern kin selection theory, states:

    One hears that game theorists, trying to persuade people to play even two-person games like 'Prisoner's Dilemma', often encounter exasperated remarks like: 'There ought to be a law against such games!' Some of the main points of this paper can be summarized as an answer to this comment: that often, in real life, there is a law, and we can see why, and that sadly we also see the protean nature of this Dilemma, which, when suppressed at one level, gathers its strength at another.
    What Hamilton is referring to is the fact that in any structure of components vs composite, there is the opportunity to defect. An individual gene can defect against the organism within which it resides via, say, meiotic drive. An individual may defect against his tribe made up of his close relatives. A tribe may defect against the others making up a nation. A nation may defect against others making up a geographic race. A geographic race may defect against others making up humanity as a whole.

    It is indeed a dilemma but it isn't without a rigorous treatement within genetic theory.

    Steve Sailer has written an an excellent review of the politically touchy issue of ethnic nepotism given from Hamilton's group selective perspective.

  21. Secret societies & paranoia by G4from128k · · Score: 2, Insightful

    This story illustrates the power of groups and societies to coordinate to the detriment of individuals and outsiders. The Southampton team used a "secret handshake" to recognize members of the society and discriminate against outsiders. It is a natural explanation for people's fear of closed/secret societies -- people fear the group's ability to break the rules of individualistic "fair play."

    If the agents in the game were capable of higher order reasoning and could see these coordinated actions between members, then they would become paranoid -- all the Southampton team members were "out to get them."

    --
    Two wrongs don't make a right, but three lefts do.
  22. It's interesting stuff by Perianwyr+Stormcrow · · Score: 2, Insightful

    Tit for tat has a secret handshake too, but it's a code of ethics. It is robust in any iterated situation. That's what makes it neat.

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

  23. Tit for Tat always has been beatable... by jafiwam · · Score: 2, Insightful

    Except Tit for Tat is more robust than other plans, deals well with a wide variety of opponents, and is easy for opponents to "figure out" and is "forgiving" so it does not get caught in endless loops of mutual punishment easily.

    Being that, beating Tit for Tat isn't that big of a deal. Doing BETTER than Tit for Tat consistently _IS_ a big deal.

    The game is a positive sum game, so it pays off to end up in a cooperative (or semi-cooperative) sequence over repeated "defections".

    For some good reading on the Prisoner's Dilemma Game and how it fits in some biological systems read;

    "The Evolution of Cooperation" by Robert Axelrod (and newer books)

    "The Selfish-Gene" by Richard Dawkins

    There may be more recent books too, it's been while since I studied the subject.

    Having one plan that can beat Tit for Tat

  24. I must be missing something by Wind_Walker · · Score: 3, Interesting
    Wouldn't an algorithm that defected every time (call it Traitor) beat the Tit for Tat program?

    1st iteration - Traitor defects, TfT cooperates, TfT loses and Traitor wins.
    Nth iteration - both defect, minor losses for both

    Thus Traitor beats TfT... What am I missing?

    1. Re:I must be missing something by Gogl · · Score: 2, Interesting

      You're right that in a one-on-one matchup, always defect would beat TfT. However, the point of TfT isn't that it would win every single one-on-one matchup but that it does extremely well versus any number of other strategies. Your "always defect" would beat TfT, but if you played Grim Trigger then you wouldn't do that well, whereas TfT would do very well playing with Grim Trigger (Grim Trigger is a strategy that cooperates until the opponent defects, and then it defects forever).

      As has been stated in this thread, the claim isn't particularly that TfT is the best strategy of all time in all circumstances, but that it is an elegant and versatile strategy that fares well in a variety of situations.

    2. Re:I must be missing something by Pretzalzz · · Score: 2, Interesting

      This isn't a masochistic game, you win by scoring above 0, and lose by scoring less than 0. In Tit for Tat v Traitor, they both lose.

  25. Did the same thing a few years ago... by Anonymous Coward · · Score: 5, Informative

    The length of the code is one of the largest problems to overcome. Performing any signal other than all-cooperate produces a net loss of 1 or 4 points per round for your team in traditional (0,1,3,5) IPD. Simple signalling, ie 4th round defect was very effective. While the master/slave aspect was amazingly effective in my research, the "spoiler" was not. A small population of master/slaves could invade an arbitrariliy large block of TitForTat if evolution was by duplicating winner and removing loser after n iterations. The population of "spoilers" stagnates very quickly in a large TFT population. TFT should be considered a friend, not an enemy because they are a positive growth environment. Going "spoiler" on any non-TFT/ally was quite effective as any bot not prone to cooperate posed the only real risk of "master" losing.

  26. The winner basically cheated (good for him :) by jamie · · Score: 4, Interesting
    It's pretty trivial that if two or more Dilemma agents are able to recognize each other, they have an advantage over those which cannot. I've got a Prisoner's Dilemma simulation running on my website -- I wrote some code for it over the summer and have been playing around with it on and off.

    Once I experimented with letting the agents recognize which "species" they were in and which "species" their opponent was. The runaway winner, of course, was the one which always cooperated with itself, and was less nice to every other species. (In my version, "less nice" meant playing Tit-For-Tat, but the idea's the same.)

    Being able to do this is like having the teacher's edition. If recognizing which species other agents belong to is allowed, that's a pretty trivial strategy. It's not called cooperation. It's called xenophobia, or to put it into the most familiar anthropomorphization, racism.

    (The life lesson, if I may go out on a limb, is that in an environment where some recognize a quality called "race" and discriminate based on it, being unable to see that quality is a liability. Being truly color-blind means you are unable to recognize not only race but racism, which means you will be taken advantage of.)

    When I ran my first tournament and got some interesting results based on this, I realized that knowledge of what "species" an agent belongs to is too powerful, it throws a monkey wrench into the works. So I scrapped it and moved on to stuff I found more interesting.

    But the winner of this PD tournament was even craftier; he submitted a ton of entries, all of which were xenophobic in this way, except that they all recognized one "species" as the top dog. The other "species" essentially committed suicide to give the highest score to the top dog. That wouldn't have worked in my tournament, since they literally would have committed suicide (my agents starve to death if they don't score high enough) and that would have shaped the resulting environment. Every tournament is artificial in some way, and the human submitting entries to this one was clever enough to take advantage of these particular artificialities.

    Since it's now been shown that inter-agent communication is possible, that's going to be fair game for every tournament from now on. The next step is going to be designing tournaments to work with this trick, not against it. As I wrote to this tournament's organizers:

    Since that's such a powerful strategy, I think the next step in PD tournaments is not to try to overcome it, but to embrace it: allow agents to communicate, not just with their own species, but with whoever they're playing against. My guess is that mere xenophobia would be eclipsed by the much more powerful strategy of joining the ongoing discussion about which agents can and can't be trusted. That's the next big feature I want to try.

    1. Re:The winner basically cheated (good for him :) by jamie · · Score: 2, Informative

      In the particular case I ran, it didn't make any difference. Now, though, I've added errors to what each agent tries to play. Tit-For-Tat responds to those errors and, playing against another Tit-For-Tat, quickly plunks down to (say) 90% cooperation plus or minus error, and random-walks around from there. "Always cooperate," though, sticks at 100% plus or minus error, so it does a little better.

  27. That's right by Anonymous Coward · · Score: 4, Informative

    That's right, traitor (hawk) beats TfT in any given trial.

    BUT, in an environment made up of a few players playing each strategy, then you have the following matchups:

    Hawk vs Hawk. Horrible horrible loss for both of them.
    TfT vs Hawk. Hawk wins, but only by a single round.
    TfT vs TfT. Both TfT 'win' - neither betray the other.

    So, overall, TfT does better than hawk.

    The interesting part isn't beating TfT (which, as you point out, isn't THAT hard to do) but in doing consistently better than it against a wide variety of programs. Which is what TfT has long been the baseline for.

  28. new exploit? by An+ominous+Cow+art · · Score: 2, Funny

    Curse these researchers, now black hats will be using this technique to let exploit code escape from chroot prisons!

  29. Tit for Tat has already been beaten by bigHairyDog · · Score: 2, Informative

    Tit for Tat is outperformed by "Tit for Two Tats", because it is better at avoiding long runs of damaging mutual recrimination. That was 5 years ago. The performance of any of these strategies is only determined by the opponent strategies that they face, which is arbitrary. It is therefore meaningless to talk of one strategy being 'better' than another - most advanced strategies can beat Tit for Tat given the right opponents.

    --

    foo mane padme hum

  30. Except that's not the "prisoner's dilemma" at all! by tomhudson · · Score: 2, Insightful
    Southampton's programs executed a known series of 5 to 10 moves which allowed them to recognize each other
    The whole idea behind the prisoner's dilemma is that neither party is privy to what the other party is currently doing.

    By using this "recognition system", the program is capable of "knowing" in a deterministic fashion what some of the other programs will do in advance.

    In other words, at the very least, a cheat.

  31. Don't you see the beauty? by Q2Serpent · · Score: 3, Interesting

    They "cheated", and the other guy didn't, so they won big! Wasn't that the whole premise?

    -Serpent

    1. Re:Don't you see the beauty? by kisrael · · Score: 4, Insightful

      They "cheated", and the other guy didn't, so they won big! Wasn't that the whole premise?

      Well, they kind of went for a win on the "metalevel", utilizing the circumstances of the competition rather than solving the originally stated issue in an abstract way. On the one hand that's cool because evolution can work like that sometimes, but on the other hand, it really isn't answering the original question any more. (the question is "what's probably the best strategy for any given individual in Prisoner's Dilemna" and they changed the question to "how can we get some individuals to be super-players with the way this prisoner's dilemna simulator is setup"

      --
      SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
    2. Re:Don't you see the beauty? by gadget+junkie · · Score: 2, Interesting

      "the question is "what's probably the best strategy for any given individual in Prisoner's Dilemna" and they changed the question to "how can we get some individuals to be super-players with the way this prisoner's dilemna simulator is setup""

      . How very true.I doubt that this solution is applicable in real life, if only for the fact that one of the assumptions would be that a subset of a winning team consistently and repeatably wants to be defeated. Mother nature took care of those long ago.

      --
      "If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
    3. Re:Don't you see the beauty? by kisrael · · Score: 2, Interesting

      subset of a winning team consistently and repeatably wants to be defeated. Mother nature took care of those long ago.

      Well, it's an interesting philisophical point. It depends on how you define "win" and "lose"...certainly some specices have formed partnerships with other, often larger species, and if you define "win" as something besides "just survive", they might be seen as subjugating themselves to the other creature, so that the partnership prospers, even if their life doesn't seem that swell.

      In other words, nature is more complex than the Prisoner's Dilemna, and sometimes ends up finding stuff more like the "cheaters" solution.

      --
      SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
  32. Re:"This must be banned" by maxwell+demon · · Score: 2, Insightful

    Isn't this the way the terrorist organisation works? The actual attackers totally lose (they lose even their life), and their masters profit from it. The experiment shows that tit for tat isn't a good strategy against this.

    --
    The Tao of math: The numbers you can count are not the real numbers.
  33. Not a cheat by Carmody · · Score: 2, Interesting

    I've been giving talks on the Prisoner's Dilemma for a few years. (No original research, just following the thing and explaining the game to the Youth)

    It is kind of an orthodoxy in the literature: Tit for Tat always ties or loses by a little bit, but in tournaments, it is the best strategy.

    Well - it ain't. Someone found a way around it. Instead of urging rule-changes to prevent this new challenger, we should all be happy and excited that PD tournaments have just got MORE INTERESTING.

    I can't wait to see what happens next - what new programs will emerge to have the advantages of Tit for Tat but also the ability to defend against Master-Slave programs that communicate with each other.

    The game has changed - now let's leave it alone and watch.

    --
    God is real unless declared integer
  34. Interestingly, that's what the omerta is all about by Perianwyr+Stormcrow · · Score: 2, Insightful

    The omerta, or code of silence, is the ideal that the mob works toward when caught. If you get caught, you simply clam up and take whatever's thrown at you as a point of honor. It is instructive, however, that this of course does not apply universally (everyone knows that the mob is rife with snitches.)

    --

    What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

  35. nested bogosity by nusratt · · Score: 2, Interesting

    Everyone is posting about how this is bogus because it's really not the same game as PD.

    But even if you don't agree with that view, another important question is:
    in what meaningful sense is this new strategy a "victory"?
    After all, it achieves "victory" for half of the cooperators, at the cost of sacrificing the other half.

    To use one nuclear-war analogy, it's a choice between strategy "A",
    where you acquiesce to the death of half of your populace, with the reward that the remaining populace is completely unaffected --
    and strategy "B", with the guaranteed result that no one dies but everyone is injured.

    Which populace would *you* choose to join on the eve of war?

  36. So, GPL is Tit for Tat by john_lewmanny · · Score: 2, Interesting

    And GPL is more or less 'Tit for Tat' in which it will only cooperate with those also cooperating.

  37. The next game by Gyorg_Lavode · · Score: 2, Interesting

    I think what will become more interesting is that, now that we know the best lone player (tit for tat) can be defeated by players playing together, can we write our players to look for a player trying to communicate to another player so as to take advantage of it. Can my player play tit for tat against normal players, but, when it sees a S/M player, convince the S/M player to play slave for my gain?

    --
    I do security
  38. Creativity is cheating by greg_barton · · Score: 2

    For all of those saying, "Isn't this just cheating?" I say this:

    Creativity is "just cheating." Creativity is breaking the rules in a novel way that sheds new light on reality. And isn't that the holy grail of AI?

    So, was this just cheating? Hell yes. And it's fantastic.

  39. Missing option... by balaam's+ass · · Score: 5, Insightful

    I agree that this defnition of the "Prisoner's Dilemma" is no more than a "meta-game," and not really a problem of philosophical ethics (though it may appear to be to some people.)

    What I find disturbing this is the way that the problem is framed presupposes no underlying system of ethics. To wit....
    * If you confess and your partner denies taking part in the crime, you go free and your partner goes to prison for five years. * If your partner confesses and you deny participating in the crime, you go to prison for five years and yor partner goes free. * If you both confess you will serve four years each. * If you both deny taking part in the crime, you both go to prison for two years. What do you do?

    How about: Tell the truth? Regardless of what your partner does, tell the truth. I find it disturbing that the problem is framed in a way that the actual truth of the matter is irrelevant. (i.e. the problem would be unchanged if I replaced "You and your partner have committed a crime and are caught" with "You and a friend have been accused of a crime which you may or may not have committed.")

    I'm not trolling or off-topic here. I'm dead serious. This formulation of the PD is ethically doomed from the get-go, and thus the results of the experiment may be of interest to mathematical game theorists of this particular game, but I find it unwise to think the results make any significant implications about ethics (or anything else for that matter).

    Someone will counter that since this is a "Prisoner's" dilemma the person involved must be a criminal with no "ethical" principles other than an interest in self-preservation (i.e. the person is already debased as can contribute nothing meaningful on the subject of ethics! ;-) ). I'd say that just because someone committed a crime does not mean they necessarily want to continue committing crimes...

  40. Pavlov, Grim, and the other strategies. by DrRobin · · Score: 4, Interesting

    As a microbiologist with interest in evolution, I have followed this field from afar for years. Looking over the results, I was surprised at how relatively poorly "Pavlov" (win-stay lose-shift) did, since it performs so strongly in noisy, evolutionany, versions of the game. [see:
    http://www.ncbi.nlm.nih.gov/entrez/query.fc gi?hold ing=npg&cmd=Retrieve&db=PubMed&list_uids=8316296&d opt=Abstract
    It was also a bit dismaying to see how well "Grim" (hold a grudge forever) did in both games. In evolutionary versions of the game, Pavlov helps keep down the population of "suckers" (thereby decreasing the food supply for more predatory and parasitic strategies) while still rewarding "provokable" cooperators (thereby increasing the total aggregate "reward" of the ecosystem.
    Also, one essential part of the payoff structure that deserves emphasis is that the payoff for cooperating has to be more than half the average of the winner and loser's payoff for defection, else one benefits by simply alternating each turn. This is a little bit like the winners did here, where they got the top spots at the cost of a lower total take for their "team". One real world example of slashdot interest where this might make sense is if you take these losses in order to eliminate your rivals from the game and then reap monopoly benefits once you control the game (not to mention any names...).
    Maybe someone who has analyzed the results in more detail could comment on how the various well known strategies fared and why.

  41. Key element - guaranteed draw strategy by BobaFett · · Score: 3, Interesting

    While entering a team into a tournament scored for individuals and then sacrificing the whole team for one player is by no means a new idea, what makes it so remarkably successfull here is the existance of a "guaranteed draw" strategy (in this case, always defect). The best individual response to "always defect" is to defect yourself, anything else is a suicide, so if you always defect you can force a draw. Then all your team loses to one team member, and he is the winner.

    Compare this with, for example, a chess tournament. You could secretly enter a team and have them all lose to you. While this will keep you from ending last, it won't assure victory, unless all players are roughly equal. If there is a very strong player, he'll win against all your team, yourself included. So you can cheat by redistributing players of comparable strenghts, but at least you can't rob a clear champion of his deserved victory.

    This is not the case in the PD tournament. But let's redefine the problem slightly: say, if both sides cooperate, each gets a dollar. If then defect, each pays a dollar. Sucker's reward is paying 10 dollars. Now the Southampton team's strategy boils down to using the tournament to give all their money to one player, while paying a hefty tax in the process. There is a cheaper way to do this, just give all money to one guy outside the tournament :) But now we can gauge any strategy: enter one player or a team, recognize your own team members or not, transfer money between team members as you wish, but can you make money, overall, from this tournament?

  42. Does this really apply to human behaviour? by Thangodin · · Score: 2, Interesting

    Optimistic Tit-for-Tat models human behaviour well in a social setting--we give others the benefit of the doubt, and continue to cooperate when others do. When someone violates our trust, we stop trusting them and punish them, but if they act beneficially towards us again, we might be willing to forgive. Most notable, OTFT produces the best overall score, which in competition between social groups is the deciding factor.

    The Southampton strategy is dependent upon large numbers of people who will sacrifice all for the good of the other, and not for the good of their community (the collective performance is worse than OTFT.) I can see sacrifice for the greater good, but this is sacrifice to another person without hope of recompensation or an increase in general wellbeing. This does happen in human societies (I think it's happening now in some political systems), but only when the winner has managed to convince the losers that its all in everyone's best interest. What Southampton has added to this mix is a capacity for extreme self-delusion that directly contravenes the economic assumption of informed choice and self-interest. For purposes of economic modelling, Southampton should probably be disqualified, or these assumptions dropped. But this should also tell you something about what could happen to those nice economic models when they hit the messy world of human beings, who for the most part aren't very informed and often work against their own best interests as a result.

    The consequence for a societal group running Southhampton against an OTFT group would be the defeat of the Southhampton group every time. Selection works at individual AND group levels. So the challenge should probably be two-tier: run the programs individually against each other, and run them as tribes against each other.

  43. Kobiyashi Maru by Bugmaster · · Score: 2, Insightful

    So, essentially, the winning program(s) hacked (or exploited, if you prefer) the game in order to win ? That's pretty clever, but does this count as a true victory ? It's sort of like what Captain Kirk did to rig his Kobiyashi Maru scenario. Sure, he won on a technicality, but in doing so he missed the whole point of the challenge.

    --
    >|<*:=
  44. sed for the rescue by alexo · · Score: 2, Funny
    > Hell, I've got a whole warehouse full of Tat. Where do I exchange it for some tit?

    Easy:
    s/wa/who