'Tit for Tat' Defeated In Prisoner's Dilemma Challenge

Scary Stuff by mfh · 2004-10-14 02:07 · Score: 5, Funny

FTA:

If you confess and your partner denies taking part in the crime, you go free and your partner goes to prison for five years.
If your partner confesses and you deny participating in the crime, you go to prison for five years and yor [sic] partner goes free.
If you both confess you will serve four years each.
If you both deny taking part in the crime, you both go to prison for two years.

This sounds pretty much like the RIAA might be involved. I would deny everything if I were you!

--
The dangers of knowledge trigger emotional distress in human beings.

Re:Scary Stuff by parvenu74 · 2004-10-14 02:10 · Score: 2, Interesting

Or under the Patriot Act, whether you confess or not you go to jail for an undetermined period of time, during which charges may or may not be filed against you...
Re:Scary Stuff by Anonymous Coward · 2004-10-14 02:51 · Score: 1, Interesting

The intresting part of game theory contests like this is when you introduce freewill and project the competition onto real world situations. Game theory is great in understanding the best mean outcomes for a given competative scenario. The problem in the real world comes when one outcome is simply unacceptable to one or more competing parties. Then the parties must play to avoid a given outcome rather than achieve the best result. The situation often determines which of the many game theory strategies offers the best chance of success because in the real world the situation and the sticky issues of free will often determine what "success" means where as in most game theory expiraments the definition of "success" is predetermined and agreed upon.

Still very interesting stuff.
Re:Scary Stuff by Unoti · 2004-10-14 03:23 · Score: 2, Insightful

But, if you confess, then freedom marches forward!
Re:Scary Stuff by davidsyes · 2004-10-14 03:58 · Score: 1

Either I dreamt of this thread, or I saw this somewhere else in the previous 2 weeks...

--
Previously: "Linux... Toward the Sunrise..." Now: "Linux... Toward the-- No, now, part of Every Sunrise"
Re:Scary Stuff by yerfatma · 2004-10-14 04:50 · Score: 1

The problem in the real world comes when one outcome is simply unacceptable to one or more competing parties.
Why can't your situation be described by game theory? Giving proper weights to the various outcomes should provide a useful model.
Re:Scary Stuff by gadget+junkie · 2004-10-14 04:51 · Score: 1

...that adds to the problems explained in prospect theory. the outcome also depends on how the problem is "framed" as anyone that has been offered a compex financial product should know.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Scary Stuff by anonymous+cowherd+(m · 2004-10-14 07:15 · Score: 1

It can. The fundamental work on this was done by John Nash. The "optimal" solution is called the Nash equilibrium.

--
http://neokosmos.blogsome.com
Re:Scary Stuff by Old+Wolf · 2004-10-14 08:22 · Score: 1

Just a note on this interpretation of the PD: 'C' (Cooperate) corresponds to 'deny' and 'D' (defect) corresponds to 'confess' under this explanation, don't get confused if you read some other analysis of the problem that refers to C and D and you have this interpretation in mind.
Re:Scary Stuff by snilloc · 2004-10-14 12:24 · Score: 1

Not exactly. In many games there can be several Nash equilibria, any of which could be a possible outcome.

That's not really so special by Perianwyr+Stormcrow · 2004-10-14 02:09 · Score: 4, Interesting

In other words, an in-group can work vs. tit for tat if it outnumbers it. I'd like to see a trial with a slow trickle of immigration of tit for tats into a large population of S/M programs. That might be illuminating. I suspect the outcome would be that tit for tat still does well.

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

Re:That's not really so special by Minwee · 2004-10-14 02:18 · Score: 4, Funny

I was with you up until you started talking about tits and S/M... Am I still on slashdot, or did I wander onto alt.com by mistake?
Re:That's not really so special by Anonymous Coward · 2004-10-14 02:20 · Score: 5, Informative

What's being ignored is that the total profit of all the colluding algorithms is less than that of Tit-for-Tat, which makes the solution unviable in real-world Prisoner Dilemma situations. (bidding on large construction projects under certain auction formats, etc)

As an analogy of unprofitable collusion, I could win the World Series of Poker by hiring enough shills and paying their time and entry fees. I would lose money by doing this, probably more than I could recoup with post-tournament income via endorsements/books/whatever.

The parent is correct. Tit-for-Tat is still superior in equal numbers, and a modified Tit-for-Tat that can spoof the recognition algorithm of colluders will trounce them.
Re:That's not really so special by Anonymous Coward · 2004-10-14 02:24 · Score: 0, Funny

"Am I still on slashdot?"

Well, you made an lame attempt at humor without spending the two seconds required to read the post and summary and comprehend them.

So, yes, you are still on Slashdot.
Re:That's not really so special by Perianwyr+Stormcrow · 2004-10-14 02:24 · Score: 1

fhqwhgads

S/M or slave/master is the Southampton set of programs.

--
What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey
Re:That's not really so special by Minwee · 2004-10-14 02:31 · Score: 0, Offtopic

Oh, and look. An anonymous coward called me lame in an attempt to make himself look clever.

Yep, definitely slashdot. Thanks!
Re:That's not really so special by harrkev · 2004-10-14 02:45 · Score: 4, Insightful

Yup! It is the "outnumbers" thing which (in my opinion) makes things unfair.

Had this been an actual prisoner's dilemma, this winning strategy would require recruiting a large number of thugs who LIKE going to prison and are willing to "take one for the team."

Although cooperation is not explicitly defined as being against the rules, IMHO, it goes against the "spirit" of the competition. The point is that each algorithm is supposed to act in a greedy manner.

This will no doubt spark a LOT of discussion, but to me, they "cheated." (OK. Maybe "worked the system" is a better phrase).

--
"-1 Troll" is the apparently the same as "-1 I disagree with you."
Re:That's not really so special by jamie · 2004-10-14 02:59 · Score: 3, Informative

If you'd like to try your hand at it, please feel free to email me the code any program you'd like to try. (Or just describe to me the strategy you want to take and I can probably write code for it.)
I've written perl scripts to run Prisoner's Dilemma tournaments on my website (the perl source code for all the species in the last run is on that page) and new submissions for the next tournament are welcome. Since my tournament does Darwinian selection on every agent that plays in it -- agents which don't earn points eventually die off, those which earn a lot of points reproduce -- self-sacrificing pairs of strategies won't do very well.
I haven't yet written code which can trickle in new members over time, but that really wouldn't be very hard to add (I could probably set it up in an hour if you are really interested :)
Re:That's not really so special by KDan · 2004-10-14 03:21 · Score: 2, Insightful

That's what I thought while reading the article... also making use of that code is assuming a meta-knowledge of the game, ie some sort of way you can have prepared for that particular and specific instance of the game where the specific problem is the prisoner's dilemna with its known simple outcomes. Whereas tit-for-tat is a much more generic theory that can apply to a game which you don't know about yet (eg, say, one that has N possible outcomes rather than only 4), by simply stating "try to choose the mutually beneficial outcome first and then mirror your opponent's moves", the program they devised makes use of specific knowledge of how the game works (eg with those recognition sequences..). Because of that it is clearly inferior and more a hack of PD than a game theory idea.

But, says Kendall, "Everybody in our field knows the name of Anatol Rapoport, who won the Axelrod competition. So if you can win the 20th-anniversary one, in our field there's a certain historical significance."

But in this guy's case the significance will be lost because he didn't win through any significant idea, but through a hack. As he says earlier, it's the research that counts, not the outcome.

Daniel

--
Carpe Diem
Re:That's not really so special by blankman · 2004-10-14 03:33 · Score: 1

tit for tat works by imitating previous moves by its opponents. its success depends totally on which opponent it imitates in this case (S/M master = win, S/M slave = lose)
Re:That's not really so special by Anonymous Coward · 2004-10-14 03:52 · Score: 0

You know it was lame. Just take your medicine.
Re:That's not really so special by Anonymous Coward · 2004-10-14 04:04 · Score: 0

Well, I don't think the tool should be illegal, but only the use of it.

And his tool has *only* bad uses, no redeeming ones, and was made for the express intent of hijacking other people's computers to blast people with spam.

So I think it's his behavior, and not his code, that should be deemed illegal.
Re:That's not really so special by Anonymous Coward · 2004-10-14 04:56 · Score: 0

Had this been an actual prisoner's dilemma, this winning strategy would require recruiting a large number of thugs who LIKE going to prison and are willing to "take one for the team." ...it's working in Iraq and Israel/Palestine, however, or maybe that's my interpretation to the mind job the "bomber mullahs" do on those they recruit to blow themselves up with car bombs or with explosives on their bodies.

What if you could model a game that had a few "master" controllers that couldn't/didn't participate directly in PD, but could "direct" or influence the tendancies of the drones...

Oh, wait. Now it sounds like the US Presidential Election campaigns, with talk of "battleground states", "swaying undecided voters", "strengthing the core block", etc.
Re:That's not really so special by the_duke_of_hazzard · 2004-10-14 05:09 · Score: 1

Tit-for-tat was never the "ideal" solution (there can't be one, by definition) - it always depends on the environment (ie the other players/programs in a population). Tit for tat was simply a simple and robust algorithm that works best for many environments, but modifications to it have performed better in certain environments.
Of course if your environment consists of programs of the same type of program, then it's easy to beat tit-for-tat. As I say, it all depends on the environment. I'm surprised this is a story, to be honest.
Re:That's not really so special by WhiteDragon · 2004-10-14 05:11 · Score: 1

But in this guy's case the significance will be lost because he didn't win through any significant idea, but through a hack. As he says earlier, it's the research that counts, not the outcome.
That reminds me of another famous example of winning through a hack. When you are in an impossible situation, hack the situation!

--
Did you mount a military-grade, variable-focus MASER on an unlicensed artificial intelligence?
Re:That's not really so special by OECD · 2004-10-14 05:24 · Score: 1

...and a modified Tit-for-Tat that can spoof the recognition algorithm of colluders will trounce them.

Except then it wouldn't be Tit-For-Tat (T4T). The S/M programs work by using a predefined set of moves to "tap out" a kind of code that other S/M programs will recognize. T4T would have to spoof this code to flush out the S/M programs, so it wouldn't fall back into T4T mode until five (or more) moves in, which really screws with the T4T strategy.
The only way that would work is if there were a large population of programs that had a similar five-move recognition code, which would essentially be starting the game on move six.

--
One man's -1 Flamebait is another man's +5 Funny.
Re:That's not really so special by gadget+junkie · 2004-10-14 05:41 · Score: 1

...maybe M/$ would have been better ;-)

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:That's not really so special by tsg · 2004-10-14 06:15 · Score: 1

As an analogy of unprofitable collusion, I could win the World Series of Poker by hiring enough shills and paying their time and entry fees. I would lose money by doing this, probably more than I could recoup with post-tournament income via endorsements/books/whatever.

A better analogy would be one where you and a number of friends entered the WSOP and your friends all lost to you (whenever they were at the same table as you). Whatever you won from the tournament you would split with your friends. A bigger bank roll in no-limit poker gives you a significant advantage over other players. By sacrificing themselves, they give you the opportunity to win more than they are likely to win by acting alone which is what, in essence, the S/M programs were doing and in fact has a better chance of doing since they are guaranteed to play against each other.

One of the interesting things in game theory is how it can be used to analyze how a player's best interests can be served by playing to lose.

--
People's desire to believe they are right is much stronger than their desire to be right.
Re:That's not really so special by phoenix321 · 2004-10-14 07:30 · Score: 2, Interesting

I think this is a fairly stunning result that is able to explain, why aristocracy and monarchy proved pretty successful for thousands of years.

If Most sacrifice themselves for a few "chosen" ones they can do better than most others against the rest of the "world". Pretty convincing.

If this strategy succeeds, we have a fairly sound theory why and how monarchy evolved from simple tribal structures. Secret societies, hierarchies and everything else would suddenly seem logical.

Does not leave a feelgood-residue like having read Axelrod, but at least we know it now...
Re:That's not really so special by Anonymous Coward · 2004-10-14 07:41 · Score: 0

"Tit for tat" didn't alert you?
(sorry *hides*)
Re:That's not really so special by Matthew+Weigel · 2004-10-14 09:07 · Score: 1

As a sum it's not profitable. However, this is basically how organized crime works: "if you get to the big time, kid, you'll have someone else be a patsy for you. Until then, you're the patsy."

If you look hard at the statistics, playing that game is bad... but for the ones that rise to the top, it's very profitable.

--
--Matthew
Re:That's not really so special by ignavus · 2004-10-14 12:41 · Score: 1

"America is great because America is good..."

"Sauron was great because Sauron was good, and if ever Sauron ceased to be good, he would cease to be great."

Oh, wait ... "Sauron the Great" wasn't good. Oh, well, back to the mindless patriotism drawing board.

--
I am anarch of all I survey.
Re:That's not really so special by cyberformer · 2004-10-15 04:18 · Score: 1

Not just organized crime. Sounds like the way most companies work.

Where's the news? by Anonymous Coward · 2004-10-14 02:09 · Score: 0

Tit-for-tat, while a simple and effective strategy, isn't perfect. This certainly isn't the first time it has been beaten. What's the big deal?

Re:Where's the news? by gadget+junkie · 2004-10-14 08:17 · Score: 1

I believ it IS the first time.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)

Gameshow by Dan+East · 2004-10-14 02:10 · Score: 1

The rules are similar to those of the gameshow Friend or Foe.

Dan East

--
Better known as 318230.

Re:Gameshow by mofolotopo · 2004-10-14 03:18 · Score: 1

The rules are similar, but IIRC the makere of Friend or Foe screwed up the reward structure so that the game wasn't a true prisoner's dilemma. Specifically, the penalty for both defecting was actually the same as the penalty for being exploited (you go home with nothing). Cooperating while the other defects is supposed to be worse.

That's why... by Stile+65 · 2004-10-14 02:11 · Score: 4, Funny

...fraternities and secret societies work so well!

I'm off to join the Freemasons. Be back in a few.

--
I claim first use of "Error No. 0B" - or "No. 0B error." It'll be the new ID 10T!

Re:That's why... by !the!bad!fish! · 2004-10-14 02:15 · Score: 1

... Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team.
Be aware that you may be required to sacrifice yourself.

--
Kids today are tyrants. They contradict their parent, gobble their food, and tyrannize their teachers. - Socrates 400 BC
Re:That's why... by ch-chuck · 2004-10-14 02:36 · Score: 1

Otherwise known as "old boy networks" that actually exist, for better or worse.

--
try { do() || do_not(); } catch (JediException err) { yoda(err); }
Re:That's why... by CGP314 · 2004-10-14 02:40 · Score: 1

I'm off to join the Freemasons.

If you plan to run for the presidency, don't forget to join Skull and Bones.

-Colin
Re:That's why... by gd23ka · 2004-10-14 02:52 · Score: 1

Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team. Which is exactly where most of the lower ranks of secret societies end up. Start your own secret society, it is fun, educational and above all it is profitable. If you want to join MY secret society then during your initation you will be expected to first stick your left thumb into your ass, the other in your mouth and then the left thumb into your mouth and the other into your ass and then back again over and over while chanting: "NOS ES TURBATUS UT RECOLLIGO VICTUS PRO NOSTRUM CANI!" Of course I have a lot of other initation rites for you to go through but these I have to keep secret from the unitiated and depend on your sex.
Re:That's why... by DarkSarin · 2004-10-14 03:07 · Score: 1

Which is a little bit difficult, all things considered.

--
"We don't know what we are doing, but we are doing it very carefully,..." Wherry, R.J. Personnel Psychology (1995)
Re:That's why... by yerfatma · 2004-10-14 04:54 · Score: 1

Unfortuantely, I'm reading /. at work, so my left thumb is busy right now.

Practicality by Perianwyr+Stormcrow · 2004-10-14 02:12 · Score: 2, Funny

I generally hope that knowledge of the prisoner's dilemma will never become a practical factor in my life.

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

Re:Practicality by Daniel_Staal · 2004-10-14 02:29 · Score: 4, Insightful

Sorry, you've probably already lost that one. The prisoner's dilemma is quite useful in normal life, or at least the thinking that gives rise to the solution is. It applies any time there is significant advantage to be gained by working together, but also much advantage to be the one 'cheating'.

For /., try this interpretation:
If we both share our source code, we will both will be more productive.
If I share my source code, and you don't, you can be more productive. (Assuming you can use mine.)
If neither of us share, we both will have to re-create other's work...

--
'Sensible' is a curse word.
Re:Practicality by clausiam · 2004-10-14 03:34 · Score: 2, Insightful

It applies any time there is significant advantage to be gained by working together, but also much advantage to be the one 'cheating'
As in VotePairing :-)
Re:Practicality by R.Caley · 2004-10-14 03:38 · Score: 1

I generally hope that knowledge of the prisoner's dilemma will never become a practical factor in my life.
Supermarkets.
You defect by walking out without paying, they defect by putting water rather than beer in the bottle.

--
_O_ .|< The named which can be named is not the true named
Re:Practicality by naoursla · 2004-10-14 03:46 · Score: 2, Informative

Unfortunately, PD has a dominant strategy to cheat your opponent. No matter what your oppoent does, you can do better by cheating.

Iterated PD is more interested since it lets your opponent punish you for cheating. So you get into some interesting social issues.

How well a given strategy does depends on the strategies other in its community are using. If the population is heavily cheater based, then agents that cooperate will lose big time. However, if there are enough cooperators, then they will form a coalition of sorts, and even though they will lose to the cheaters, in the end they will come out on top.
Re:Practicality by gadget+junkie · 2004-10-14 04:47 · Score: 2, Informative

"[...]How well a given strategy does depends on the strategies other in its community are using. If the population is heavily cheater based, then agents that cooperate will lose big time. However, if there are enough cooperators, then they will form a coalition of sorts, and even though they will lose to the cheaters, in the end they will come out on top."

I remember reading about the tournament in a "Science" magazine article, back when the original tournament was done; Tit for Tat won irrespectively of the number of strategies it played against.
Furthermore, in a variant in which the winning strategy "spawned" more often, and the loser did not, Tit for Tat became a majority of the population irrespective of the initial sample, except in the extreme case of only one tit for tat and defectors. This is explained better here. The most interesting thing was, for me, the fact that Tit for Tat was superior to a strategy in which the program responded with a delay, i.e. it made the opponent's move two turns down the line. So, remember this if you have kids, or a pet, or both like me: whatever reaction you deem appropriate, it should be done soon, or not at all.

NOTE: the payoff was described as such:if A cooperates and B cooperates, both get X points; if A "defects" it gets W points to Z for B;if both defect, they get Y, where:

W > X > Y > Z, all positive integers.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Practicality by spitzak · 2004-10-14 05:51 · Score: 4, Insightful

Yes I agree that public domain code is very much the same as the prisoner delimma.

The GPL is an attempt to make it *not* the prisoner delimma by forcing the other side to cooperate if you do. This eliminates the losing part of the cooperation choice and thus it is no longer a delimma.
Re:Practicality by naoursla · 2004-10-14 06:24 · Score: 1

One factor that might have hurt TFT in this tournament is the addition of noise. If TFT thinks the opponent cheated, even if they didn't, TFT will cheat on the next turn. If it is a TFT vs TFT situation, both players will alternately cooperate/defect.

I think this winning team's solution was interesting. Having the ability to recognize a cheater so that you can punish them is necessary for the coop/coop solution. I've heard it speculated that one reason humans have evolved distinctive faces is to make it easier to distinguish social cooperators from cheaters. I personally believe that a lot of the social problems in society today stem from the fact that we live in a mostly anonymous society.

I wrote about some of this last year in my one and only ./ journal entry
Re:Practicality by gadget+junkie · 2004-10-14 07:51 · Score: 1

"If it is a TFT vs TFT situation, both players will alternately cooperate/defect."

In reality, both would continue to cooperate ad infinitum, since no one would defect at the first turn.

As to an anonymous society, I really do not know, but I remember another "real life" view of the prisoner's dilemma especially relevant in this Internet era: sales via catalogue.The formalisation, if you come to think of it, is almost the same, especially if you make a a turnstile-like mechanism of exchange.
this is Offtopic, butI wonder if other slashdotters can help me on that: I remember having read in a sci-fi book an encounter between two civilizations in which, to assess which goods could be traded, the aliens had built a carousel mechanism by which you could take something out only by putting something in return. I remember that the aliens took only cigarettes, tat were like cocaine for them (that tells you how old are the book and I :-( ). Does someone remember the title and author?

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Practicality by Anonymous Coward · 2004-10-14 08:08 · Score: 1, Interesting

It's "dilemma", not "delimma"... same root word as "Lemma", meaning theory, in math.
Re:Practicality by spitzak · 2004-10-14 08:41 · Score: 1

Thanks. I thought the spelling was wrong, but could not figure out how to fix it. I was copying the spelling from the earlier post.
Re:Practicality by Matthew+Weigel · 2004-10-14 09:03 · Score: 1

In reality, both would continue to cooperate ad infinitum, since no one would defect at the first turn.

Not if there is noise, which is precisely what naoursla was talking about. However, the alternation won't necessarily happen either; noise can also cause them to stop.

--
--Matthew
Re:Practicality by dnoyeb · 2004-10-14 11:18 · Score: 1

This was obvious to me at first, but I read the first reply to your statement. In a one on one, you are correct, cheating always wins.

But in a group competition, you can loose if you never cooperate. So even this basic one is interesting.
Re:Practicality by skh · 2004-10-14 11:33 · Score: 1

It's refreshing to find someone on /. who welcomes knowing the correct spelling of a word.
Re:Practicality by naoursla · 2004-10-14 13:40 · Score: 1

In a group playing non-iterated PD, a defector playing against a cooperator will beat a cooperator playing against a defector. I still think defecting is the correct strategic plan (which isn't how I run my life since these things are usually iterative -- often unexpectedly so).

An interesting note, when designing the PD payoffs, if the payoffs are such that CD+DC > 2*CC, then it is in both agents' interests to swap between coop and defect out of synch with the other.

only thing I can say is... by jjeffries · 2004-10-14 02:14 · Score: 3, Funny

HOW ABOUT A NICE GAME OF CHESS?

Lameness filter encountered. Post aborted!
Reason: Don't use so many caps. It's like YELLING. It's not at all like a TALKING COMPUTER. You are a bad man. Go away.

Re:only thing I can say is... by cHALiTO · 2004-10-14 02:21 · Score: 1, Funny

Clearly, the only winning move is not to play :)

--
"Luck is my middle name," said Rincewind, indistinctly. "Mind you, my first name is Bad." -- Terry Pratchett
Re:only thing I can say is... by OptimizedPrime · 2004-10-14 02:58 · Score: 2, Funny

So what you're saying is my algorythm won this year since I kept it written on a post it on my desk instead of submitting it?
Re:only thing I can say is... by SamBeckett · 2004-10-14 03:37 · Score: 1

is that the lesbian form of algorithm ??
Re:only thing I can say is... by Lars+T. · 2004-10-14 03:41 · Score: 1

Lameness filter encountered. Post aborted!
Reason: You copied text from the "Post aborted!" page.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:only thing I can say is... by gadget+junkie · 2004-10-14 08:23 · Score: 1

" Clearly, the only winning move is not to play :) "

No, the only winning strategy is not to lose. both player knew that overkill was a mug's game, but they needed to have a second hit available. Interestingly enough, as Tom Clancy has surmised, it is impossible to get rid of ALL nukes. Think about it. It's game theory as well.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)

Uh, isn't that just cheating? by DoorFrame · 2004-10-14 02:14 · Score: 4, Interesting

I mean, the whole point of the Prisoner's Dilemma is that you don't have all the information. You don't know what your partner/opponent is going to do and you have decide based entirely on what little information you have based on your history with your partner/opponent. What these people are doing is creating a pattern to be recognized by another player, and then working as a team. And, it's not like they're people where one person might change their mind and decide to defect unilaterally... they're programs. Once they've locked onto each other as the same program, that's it. They'll play to their advantage until the end.

The real trick is to find a program that can beat other DIFFERENT programs, not beat itself. This seems really stupid, or am I missing something?

--

--
RumorsDaily

Re:Uh, isn't that just cheating? by Urban+Garlic · 2004-10-14 02:23 · Score: 4, Insightful

Well, part of the interest is that these programmers found a way, within the rules, to get more information, by means of their "secret handshake". The important lesson (to my mind) is that the environment can be manipulated in surprising ways to get a desired result. That's creativity and innovation doing its thing.

Interestingly, this strategy is also fairly "brittle", I think, in that simple rule-changes could foil it. Requiring only one submission per team, for example, or scoring teams according to the total (or average) scores of all their programs, would complicate any strategy of collusion.

--
2*3*3*3*3*11*251
Re:Uh, isn't that just cheating? by Have+Blue · 2004-10-14 02:25 · Score: 2, Informative

That's why this is the Iterated PD Challenge, not just the classic PD. If the competitors only played ONE round of PD each, the contest would not be very interesting. Their past performance in the contest itself is part of the "history" they are using to evaluate their choices.
Re:Uh, isn't that just cheating? by julesh · 2004-10-14 02:27 · Score: 3, Insightful

I suspect you could come up with a solution that beats this system by mimicing it, then changing its behaviour suddenly.

That's why this isn't a good answer to the problem -- not because it's somehow "cheating", but because its a strategy that only works in limited circumstances and fails spectacularly in others. Kind of like chasing down losses when gambling.
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 02:27 · Score: 0

The point is that by alternating between master and slave mode, the new programs' strategy does better than the tit for tat strategy, when the game is played repeatedly.

The game is played repeatedly. At each instance of the game points are awarded.

Of course, these programs lose points when they are the slave, but they are more than compensated for this when they get to run as a master. So on the whole, they do better, precisely because they are cooperating.

(Disclaimer: I haven't read the article.)
Re:Uh, isn't that just cheating? by Minwee · 2004-10-14 02:29 · Score: 4, Insightful

I don't see it as cheating. It's a lot like Bridge -- The rules say that you can't show your partner your hand and you can't tell them what you have, but you are allowed to use prearranged bidding conventions to pass information across the table. All that the Southhampton agents did was use a bidding strategy. They did act as a team, but they had no out-of-game way of knowing that they were up against a team member. That doesn't break any rules, and it did work. The Southampton team took the top three spots in the competition. If you insist on comparing the entrants to people, consider this. They worked as a team, for the good of the team, knowing that at least some of them would win even if the others bombed. People do that kind of thing all the time outside of competitions. Why should it be so out of place here?
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 02:32 · Score: 0

It's not really interesting. It completely defeats the purpose of the exercise.
Re:Uh, isn't that just cheating? by julesh · 2004-10-14 02:49 · Score: 1

It's a lot like Bridge -- The rules say that you can't show your partner your hand and you can't tell them what you have, but you are allowed to use prearranged bidding conventions to pass information across the table.

Actually, the rules I understand for bridge specifically disallow prearranged bidding conventions that aren't well known standard conventions. But the rules of IPD don't, so I would guess it's OK there.

But it doesn't work, of course, because all you need to do to beat this strategy is mimic it until it recognises you and then not behave as it expects.
Re: Uh, isn't that just cheating? by Alwin+Henseler · 2004-10-14 02:50 · Score: 1

Well, part of the interest is that these programmers found a way, within the rules, to get more information, by means of their "secret handshake"
So in the beginning, it was essentially 'everyone for himself'. Then innovation: devise a way team members can recognise each other, and work in groups to defeat opponents.
Next logical step: cheating. How about behaving like a member of such a team, but then double-cross them later? It would be interesting to see how such a strategy would work out.
Re:Uh, isn't that just cheating? by iabervon · 2004-10-14 03:13 · Score: 1

People certainly use this strategy in real life, where they will attempt to recognize and sacrifice themselves to benefit particular other people. Much of the animal kingdom uses this as the main social interaction: 1) Is this another wolf? 2) Am I dominant? 3) Profit!

The issue, as is often the case in game theory, is that the mathematical form of the competition does not match the human-language description of the competition. The fact that a program could encounter another program it is not competing against was not considered in the design, but it clearly matters. As such, this is an interesting result in the theory of the design of the competition. In a game thoery competition, you follow only the letter of the law, not the intent.

It would be interesting to permit saved state between pairings, permit random number generation, and require all teammates to have identical programs. Then I suspect that the strategy which produces the winning instance would have each instance randomly determine a strength, and act to recognize the same program, determine the stronger instance, and then give all the points to that one.

It's not really any more of a cheat than tit-for-tat, in any case, which loses every pairing unless the other player never defects (in which case it's a tie), but generally wins the whole competition. The next step is clearly to have multiple instances, and lose the competition most of the time, but win with one instance.
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 03:25 · Score: 0

I don't believe that it is 'cheating'. Not to quibble, but I would call it applying a previously unforseen meta-strategy. My interest in competitions such as this is for generated strategy in increasingly complex decisions (see also http://web.cs.ualberta.ca/~darse/rsbpc.html), so the SH strategy doesn't interest me, but whether it is deemed (by the competition) a viable strategy or banned, it has performed a service. If a viable strategy, than the next batch has to prepare for it AND the TFT, and if banned, it revealed the option in order that it could be banned.
Re:Uh, isn't that just cheating? by Lars+T. · 2004-10-14 03:28 · Score: 1

Yeah, but that is only the result of one match. The "cheating" part comes in because they entered many "players" which in advance cooperated so that all but a few would a) knowingly lose their match against the "always master", and b) all make sure that against all others players they minimized both their own and the other's score. IOW they played a meta-PD where they knew about the strategy, with the majority not trying to win, but working together to get the chosen few of the group to win. All this only worked because of the way the tournament was played, a KO system, something in rounds with small groups, or just a different way of scoring (not the sum of all points) would have eliminated the "cheaters" pretty fast.
IOW their strategy isn't necesarily the best way to play the (iterated) PD, but the best way to win that tournament.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 03:39 · Score: 0

Yeah, technically this isn't the prisoner's dilemma. It is the PD with a twist, which is fine and their solution is neat... but they shouldn't be advertising it as the new best strategy for the PD. In the single shot version, there is one rational solution: not cooperate. In indefinately repeated versions, the tit-for-tat solution comes out ahead, but other "contracts" will serve as equilibria as well. My favorite is the Grim Trigger.
Re:Uh, isn't that just cheating? by KDan · 2004-10-14 04:13 · Score: 1

Defect - Defect = 0 points - 0 points
Defect - Coop = 10 points - -10 points
Coop - Defect = -10 points - 10 points
Coop - Coop = 5 points - 5 points

The above is the point structure normally used in PD (see http://pespmc1.vub.ac.be/PRISDIL.html ). As should be obvious from that, the way they won is not by having the M/S alternate regularly (because in that case there are no point 'created' in total so the average score of each would be 0), but by completely sacrificing the slave to the master (ie slave ends up with -N and master with +N). This is a totally ridiculous strategy seeing as the game is designed for "selfish agents" and an agent that sacrifices itself for another is obviously not selfish!

Daniel

--
Carpe Diem
Re:Uh, isn't that just cheating? by geekpolitico · 2004-10-14 04:24 · Score: 1

It would be interesting to arrange this competition in an iterative fashion.

1. Every algorithm submits the same number of entrants using that alogrithm.

2. Conduct the competition.

3. Eliminate the bottom 50% performers.

4. Rinse, Lather, Repeat until like Highlander there is only one.

Southampton may have a tough time in this format. They would lose collaborators over successive iterations, while Tit For Tat would still do well.
Re:Uh, isn't that just cheating? by homer_ca · 2004-10-14 04:30 · Score: 1

"They worked as a team, for the good of the team, knowing that at least some of them would win even if the others bombed."

This strategy is common in bicycle racing. The lead rider is supported by his teammates. Among other things, they'll go in front and let him draft off them so he'll be more rested at the end.
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 04:45 · Score: 0

You are right, with a payoff matrix that makes no sense.

However, the following payoff matrix is used in the competition (see http://www.prisoners-dilemma.com/)

---C---D
C 3,3 0,5
D 5,0 1,1

So the payoff of alternating is 2.5. This is still lower than "tit for tat" if the "tit for tat" programme manages to cooperate all the time. But this will generally not be the case. Clearly the success of this particular program, like the success of all programs (including t-f-t), depends on the other strategies present in the population (note that the agent in question was programmed to play defect in response to all other agents)
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 04:50 · Score: 0

But in that case they would have been better off cooperating..
Re:Uh, isn't that just cheating? by bokmann · 2004-10-14 04:57 · Score: 1

You could also call it 'playing the meta-game'.

If I 'know' the pattern for the secret handshake, I could write a program that behaved the same way, then screwed the other program once I had convinced it I was its master (or slave).

The 'meta-game' is a very real concept in games... This is why chess masters study the previous games of their oppoents to learn their weaknesses, and is probably a MAJORITY of the strategy in games like Texas Hold'em, where you need to know how your opponent behaves when bluffing, etc.
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 05:18 · Score: 0

Right, but it is understood, and used by all the teams, at least in the Pro peleton, CatI/II in the US, etc. But you still get singletons going off of the front... which helps make it so exciting. Will the breakaway stay away, or will the peleton suck them back in 2 KM from the finish?

It certainly doesn't work that way in Cat III/IV racing!
Re: Uh, isn't that just cheating? by mdfst13 · 2004-10-14 05:21 · Score: 1

"How about behaving like a member of such a team, but then double-cross them later?"

But you don't have to double-cross them. They double-cross themselves. The whole point of the exercise is to give the master program a perfect score. If you just pretend to be a master, they will try to give you a perfect score. On average, a member of the team will do less well than a non-member; it's just that some members will get perfectly good scores and other members will get perfectly bad scores. If you pretend to be a member, they will help you without ever expecting you to help them.
Re:Uh, isn't that just cheating? by kwatz · 2004-10-14 05:31 · Score: 1

I do think it's an interesting approach, but I don't believe it should be compared with TFT directly. In Southhampton's case, their "strategy" is actually a pair of prisoner's dilemma strategies, whereas TFT is one. Either half of the pair alone will probably not perform as well as TFT.

Another facet that's interesting is that this approach is only as good as observed in that competition when the pattern they use to detect friendly agents is unknown to the general population. Once I learn their pattern, I can easily create a strategy that acts like a Southhampton agent but defects at key points when it is expected to cooperate. If we agree that colluding agents isn't against the spirit of the competition, I can create a whole group of anti-Southhampton agents. The best performing Southhampton agent (the 'master' in the 'master-slave' pair, who always exploits the slave) no longer collects any points, and the slave can now be consistently exploited by non-Southhampton agents. Naturally, the same idea holds true for TFT - I'm not saying Southhampton is unique in this respect. However, it's possible that Southhampton is more vulnerable to this kind of premeditated 'attack.'
Re:Uh, isn't that just cheating? by Iamthewalrus · 2004-10-14 05:49 · Score: 1

I suspect you could come up with a solution that beats this system by mimicing it, then changing its behaviour suddenly.

Unlikely. I'm sure that the programmers added in a check so that even if the "handshake" comes out correctly, the agents will start defecting as soon as their opponent does something that is not consistent with the M/S paradigm. You might be able to always claim to be a master, though.

--
Help prevent the slashdot effect; stop reading the articles.
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 05:50 · Score: 0

The part that's cheating in this is the following: Prisoner's Dilemma is a game that involves two players, and only those two. It's repeated many times in this competition, but the basic game is that it only involves two players, and they're trying to do best for themselves. These 60 some entries changed the game from a two player game to a multiplayer game. It does have interesting implications, yes. But it's not the original game, and the rules have been changed not by agreement or the organizers, but by an entrant. Thus, cheating. (and interesting, too).

Trying to think of a better analogy, it might be like auto racing--with the 3 or 4 car teams that exist. One could use the strategy that one car will go for the win, and the others will try to cause huge accidents to narrow down the field (or targeted accidents to take out top contenders). I'd think that most would consider this cheating. Given the people who watch racing though, probably would be seen as more exciting and actually increase ratings.

And one additional reason this is cheating--if everybody did this, the competition would come down to how well you could get cooperation between the programs, not how well they play the game. And even in this case, with one team doing this, it comes down to how well they identify each other, not how well they play game.
Re: Uh, isn't that just cheating? by HiThere · 2004-10-14 06:06 · Score: 1

Rather like those beetles that pretend to be bees, and live inside the hive?

--

I think we've pushed this "anyone can grow up to be president" thing too far.
Re:Uh, isn't that just cheating? by wendyg · 2004-10-14 07:05 · Score: 1

I (the journalist who wrote the Wired News story) asked both the team leader and the organizer about that (and asked both whether they thought the strategy wasn't cheating), and the organizer said that the problem with limiting each team to one player is that you could never assure that two teams weren't acting in collusion.

I think it's cheating in terms of the original spirit of the game, but it wasn't against the rules of the competition, and both Kendall and Jennings said that in the end they thought the agent research was more important than the competition itself.

wg
Re:Uh, isn't that just cheating? by julesh · 2004-10-14 07:45 · Score: 1

The way to beat it:

- Work exactly like it does until the last round on which you would normally cooperate
- Defect on that round

I think this would improve your score to its detrement.
Re:Uh, isn't that just cheating? by gadget+junkie · 2004-10-14 08:37 · Score: 2, Interesting

"So the payoff of alternating is 2.5. This is still lower than "tit for tat" if the "tit for tat" programme manages to cooperate all the time. But this will generally not be the case."

In reality this WILL be the case. Tit for Tat cooperates at the first turn, and then copies the opponent, so the payoff is 3 with no volatility of returns.For all the effort they make, both TFTs could go have a beer and let a computer take over ;-).
S/M has a lower return, PLUS it pays something in the handshake period.

Now for something really interesting: nobody has really spent time on the economy of effort of the TFT strategy. all the other strategies that proved viable, not only did not win, but they needed substantially more resources (line of code, etc). In my view, that's part of the reason of the survival of cooperation in its present form: it is simple, it works, and when you play with someone using it you're buddies for life.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Uh, isn't that just cheating? by Anonymous Coward · 2004-10-14 10:07 · Score: 0

Interestingly, this strategy is also fairly "brittle", I think, in that simple rule-changes could foil it. Requiring only one submission per team, for example, or scoring teams according to the total (or average) scores of all their programs, would complicate any strategy of collusion.

No way! You can allow only one entry per team, but how do you know that multiple teams aren't colluding? You can't!

I thought this was a very interesting "distributed attack" on the system.

To default this strategy you need to not let those colluding teams outnumber you...

It just goes to show how important SOCIAL issues are, sometimes more than TECHNICAL issues.

This may not appeal to geeks because they are not social. It should appeal to true hackers because it is thinking outside the box.

(Interesting similarity to _Bringing Down The House_, no?)
Re:Uh, isn't that just cheating? by feepness · 2004-10-14 11:44 · Score: 1

it is simple, it works, and when you play with someone using it you're buddies for life.

Actually no. If you are doing TFT, you are buddies until the first time they piss you off (or vice-versa).
Re:Uh, isn't that just cheating? by imaginate · 2004-10-14 12:11 · Score: 0, Troll

That means they violated TFT.

If they violated it once, they can violate it again, sacrificing themselves to get back in your favor (like real life).
Re:Uh, isn't that just cheating? by snilloc · 2004-10-14 12:53 · Score: 2, Informative

I'm not sure how this particular iterated PD game works, but in the serious academic version, there is uncertainty about which round is the last round. This accounts for "last round" defections, and also the subsequent collapse of any cooperation strategy by working backwards induction through the game and defecting on the "real" last round to one-up your opponent. (Knowing that your opponent is likely to defect on round N, you defect on N-1, N-2, N-3, back to the start of the game with a dominant "defect" strategy).
Sort of like the parable of the teacher who said there'd be a pop quiz but you wouldn't know what day it was. Obviously it wasn't Friday, because on Thursday you'd know the quiz was friday. Likewise for Thursday because if the quiz wasn't friday, and if you got to wednesday w/o a quiz, then you would know the quiz was thursday, and therefore the quiz couldn't be on thursday... leading to the conclusion that there is no quiz. Not a perfect example (since then the teacher could give the quiz on any day and it would be equally unknowable given the possibility of "no quiz"), but I hope it clarifies some.
Re: Uh, isn't that just cheating? by blue+trane · 2004-10-14 17:23 · Score: 1

So your program that figured out their recognition sequence would score, on average, higher than their programs - since your program would have no slaves, it would simply take advantage of their slaves. So you would win...
Re:Uh, isn't that just cheating? by KDan · 2004-10-14 20:40 · Score: 1

Not only that, but it's also a generic algorithm applicable to many varied situations, whereas some other algorithms (like the one which won) are only applicable to PD.

Daniel

--
Carpe Diem

Does this defeat the purpose? by Snowspinner · 2004-10-14 02:14 · Score: 5, Interesting

This seems to me to be an unfair way to "win." The point of the PD simulation is to talk about whether, in the absence of any social consequences, it is better to screw someone over for money or to work cooperatively with them. It's not a perfect model for that question, but that is still the question that makes us care about the PD in the first place.

All this has done is make a meta-PD game in which the two programs create a meta-game in which they agree to cooperate. That is to say, this is a solution to the PD problem that relies on the cooperation of a cohort (Someone to keep choosing loyalty while you defect and get all the money). Which is exactly not the point of PD.

So the real headline, I think, is "Trivial flaw found in definition of Prisoner's Dillema problem. University of Southhampton wastes money demonstrating flaw instead of writing a goddamn paper like a normal person would."

--
Philip Sandifer's academic website

Re:Does this defeat the purpose? by Anonymous Coward · 2004-10-14 03:38 · Score: 0

No, it does not defeat the purpose.

I'm glad you got (+5) Interesting instead of Insightful or Informative, because your post is misleading. The point of this PD contest is to set up some simple rules to see what strategies work in the presence of prior knowledge about the opponent's choices. While the teamwork method shown may not arise in most human interactions, it seems clear that it could, given this meta-game.

The original competition revealed Tit-for-Tat as a strategy which in fact never beats a single particular opponent, beat all opponents in the average. If you know the one-off PD, you know the perfectly analyzable best choice in that situation is always to defect - but TFT cooperates the first time. The point of this competition is to find strategies for the iterated prisoner's dilemma, where opponents know each other's prior moves. Yes, it's a different game; that's the point you missed.
Re:Does this defeat the purpose? by Daetrin · 2004-10-14 05:11 · Score: 1

I think if you want to look at it from a socialogical standpoint what they did makes a lot of sense. They've just demonstrated that a large group that cooperates can do well by screwing the system as long as they can convince peons to sacrifice themselves for the winners.
Sounds like a pretty good description of organized crime and corrupt corporations.
So like real life, the people who play fair do moderatly well, the heads of gangs and large corporations do great, and their underlings and everyone who deals with them gets screwed.
Of course in real life you can't get all the peons to sacrfice themselves at every occasion. You have to make sure that the percieved rewards are greater than the percieved risks or you don't get any more volunteers. However given the nature of human greed and the frequent human inability to calculate probabilities, that usually isn't too hard.

--
This Space Intentionally Left Blank
Re:Does this defeat the purpose? by blonde+rser · 2004-10-14 10:58 · Score: 1

The point of the PD simulation is to talk about whether, in the absence of any social consequences, it is better to screw someone over for money or to work cooperatively with them.

How do you figure that? If that were true then why are the players aloud to keep track of their opponents previous moves? If there are no consequences, no memory of your prior moves, then the answer is it better to screw someone over. The fun and interest of this sort of play is the social consequences.
Re:Does this defeat the purpose? by Snowspinner · 2004-10-23 04:55 · Score: 1

I think you miss the point of my claim about social consequences. The issue prior moves is a social consequence WITHIN THE GAME.

What I mean is that, if you screw someone over, a Mafia hitman is not going to murder your family. That is to say, I'm referring to social consequences that come from situating the game in some kind of social context.

--
Philip Sandifer's academic website
Re:Does this defeat the purpose? by Snowspinner · 2004-10-23 04:59 · Score: 1

No, no, I understand completely. But even in the case of an iterated prisoner's dillemma, the relevent prior knowledge is the question of opponent actions - not opponent identity. That is to say, what it's testing is still behavior as shaped by other people's behavior. The handshake, though possible under this model, still fundamentally changes what the game is modeling because it introduces the variable of knowing who you're dealing with. This is still a substantive change.

--
Philip Sandifer's academic website
Re:Does this defeat the purpose? by Snowspinner · 2004-10-23 05:01 · Score: 1

Yes, I agree wholeheartedly. It does model all of these things.

My point was that the prisoner's dillemma model is, I think, far more interesting when it doesn't model those things, and when it instead models a sort of pure state of social interaction.

--
Philip Sandifer's academic website

...by cheating! by The-Bus · 2004-10-14 02:15 · Score: 4, Insightful

If the program recognized that another player was not a Southampton entry, it would immediately defect to act as a spoiler for the non-Southampton player. The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team. Another twist to the game was the addition of noise, which allowed some moves to be deliberately misrepresented. In the original game, the two prisoners could not communicate. But Southampton's design lets the prisoners do the equivalent of signaling to each other their intentions by tapping in Morse code on the prison wall. Kendall noted that there was nothing in the competition rules to preclude such a strategy, though he admitted that the ability to submit multiple players means it's difficult to tell whether this strategy would really beat Tit for Tat in the original version. But he believes it would be impossible to prevent collusion between entrants.

Yeah, that's not the Prisoner's Dilemma. Or even the Iterated PD. This whole "signaligng Morse code" on the prison walls is nonsense, because it was not part of the original plan. Just because it's not in the rules doesn't mean you can do it. In Chess there's no rule specifically against me bringing a SuperGrape(TM) onto the board. The SuperGrape(TM) immediately destroys all pawns on a color of my choosing.

No, it doesn't work that way.

While this is an interesting experiment, it's not a true victory.

--

Small potatoes make the steak look bigger.

Re:...by cheating! by Perianwyr+Stormcrow · 2004-10-14 02:20 · Score: 1

I don't see how this is really against the rules, but it also doesn't give a very interesting solution that tells us anything more than we already knew.

I'd say that a real test would involve players vs. 10 times as many tit for tats as there are entrants.

--
What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey
Re:...by cheating! by julesh · 2004-10-14 02:23 · Score: 1

Err... what's the difference between this and iterated prisoner's dilemma?
Re:...by cheating! by dr_labrat · 2004-10-14 02:24 · Score: 2, Informative

I'm not sure you read the article correctly.

this is not cheating, what is happening is throughout the iterations the programs can experience the equivalent of morse code in the patterns of defections and co-operations in the form of the penalties.

i.e.: 4yrs 4yrs free free 2years.

As this is iterative, and no actual lifespan (in human terms) the pattern can become quickly evident. Kind of like a "penalty handshake".

In this way one program can detect the intentions of what is happening, without actually communicating directly.

--
The secret of success is honesty and fair dealing. If you can fake those, you've got it made. (Marx)
Re:...by cheating! by wo1verin3 · 2004-10-14 02:25 · Score: 2, Funny

See I was trying to bring a supergrape in to play against my grandfather and he didn't believe me.. thank god i have someone to back me up now!
Re:...by cheating! by Anonymous Coward · 2004-10-14 02:32 · Score: 0

Mmmmmmmmmmm............ SuperGrapes
Re:...by cheating! by SlamMan · 2004-10-14 02:35 · Score: 1

Couldn't agree more. The whole issue behind the prisoner's dilemma is trust and payoff. The Tit for Tat wins because it changes the rules of the game; buy adding in other factories. Its like saying you won at Monopoly buy bribing two of the other three players to play poorly. It works, but its doesn't exactly count as a winning strategy.

--
Mod point free since 2001
Re:...by cheating! by billbaggins · 2004-10-14 02:47 · Score: 4, Informative

Actually, this is exactly the sort of thing that the organizers were hoping would happen. From the FAQ, question 12:
But we don't want to [impose limits on the number of entries] as it will be interesting to see if people can come up with strategies that cooperate with themselves within the whole population.

--
"The best argument against democracy is a five minute chat with the average voter."
--Winston Churchill
Re:...by cheating! by AGMW · 2004-10-14 03:08 · Score: 1

Mmmmmmmmmmm............ SuperGrapes
I'll have the soup please.

--
Eclectic beats from Leeds, UK
handmadehands.co.uk
Re:...by cheating! by Anonymous Coward · 2004-10-14 03:14 · Score: 0

Dammit, you actually tricked me into googling for "chess supergrape".
Re:...by cheating! by Anonymous Coward · 2004-10-14 03:36 · Score: 0

psst. you should read up on the difference between the verb 'buy' and the adverb(?) 'by'
Re:...by cheating! by Lars+T. · 2004-10-14 03:38 · Score: 1

It's a lot like Karate Kid, where the evil guy tells the fighters from his dojo to injure the hero - even if that gets them disqualified - so the evil dojo wins the tournament. Against the rule? No. Ethical? Well, The Karate Kid won in the end, so it obviously wasn't ;-)

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:...by cheating! by SlamMan · 2004-10-14 04:32 · Score: 1

Or you could not care about pedantic spelling mistakes on Slashdot.

--
Mod point free since 2001
Re:...by cheating! by Kupek · 2004-10-14 05:02 · Score: 2, Informative

The whole reason people are interested in the Prisoner's Dilemma (and Iterated) is that they are models for situations encountered in science. This isn't a comepetition like chess where we're trying to see who (as in the humans) are the best at it. We're trying to see what interesting results come of it. This is an interesting result.
Re:...by cheating! by kwatz · 2004-10-14 05:41 · Score: 1

To expand on what you're saying...

Axelrod explicitly states that "there is no mechanism available to the players to make enforceable threats or commitments" (1984). "Enforceable" is the key. We might agree to both cooperate next time we play, but there's nothing stopping me from defecting instead. The Southhampton strategy creators are essentially making these commitments.
Re:...by cheating! by gadget+junkie · 2004-10-14 08:48 · Score: 1

"If the program recognized that another player was not a Southampton entry, it would immediately defect to act as a spoiler for the non-Southampton player. The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team."

That is possible in real life, think mutual funds families:

1.you decide which fund will be best of breed;

2. take money out of the looosers, and pump the winner;

3. advertise on the WSJ that you have the best fund in the industry;

4. PROFIT!!

BTW, don't let the SEC know: they usually bite your head off and use it for bowling.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)

Interesting Idea, but... by TJ_Phazerhacki · 2004-10-14 02:15 · Score: 2, Interesting

How difficult is something like this, really? One of my graduate level friends did some sociology work on a Game theory system like this, and if you know the rules, you can really beat the system.

Just curious, thats all. Anyone have any experience in the field?

--
Physics is nothing like religion. If it was, we'd have an easier time trying to raise money!

Evolutionarily stable? by Dr.+Manhattan · 2004-10-14 02:17 · Score: 4, Insightful

From TFA:"Our initial results tell us that ours is an evolutionarily stable strategy -- if we start off with a reasonable number of our colluders in the system, in the end everyone will be a colluder like ours," he said.

It's not clear to me how the entries determined who would be the 'master' and who would be the 'slave'. It seems that if you had lots of 'colluders' around who could be induced to 'suicide' for another's benefit, you'd very quickly get cheaters who worked to be the 'master' in all situations.

This strikes me as a lot more reminiscient of the Hawk/Dove situation.

--
PHEM - party like it's 1997-2003!

Re:Evolutionarily stable? by Perianwyr+Stormcrow · 2004-10-14 02:28 · Score: 1

Yes, it's not stable at all. I don't see that this strategy would expand beyond people emulating the masters if they ever figured out the code. No one wants to willingly be the late entrants in a pyramid scheme.

--
What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey
Re:Evolutionarily stable? by hyphz · 2004-10-14 03:54 · Score: 1

> I don't see that this strategy would expand
> beyond people emulating the masters if they
> ever figured out the code. No one wants to
> willingly be the late entrants in a pyramid
> scheme.

Well, one thing I can think of is writing a program which remembers the opponent's moves in previous games it has played and replays them to new opponents to see what the results are...

Tit for Tat by alexo · 2004-10-14 02:17 · Score: 4, Funny

Why should Tat get all the fun?

Re:Tit for Tat by jaraxle · 2004-10-14 02:48 · Score: 1

To paraphrase George Carlin (or was it Dennis Miller?):

If everyone's giving tit for tat, where is this "tat" and how can I get me some?

~jaraxle
Re:Tit for Tat by stalky14 · 2004-10-14 03:56 · Score: 1

Hell, I've got a whole warehouse full of Tat. Where do I exchange it for some tit?
...Sean.

Silly... by patniemeyer · 2004-10-14 02:17 · Score: 1

Southampton's programs executed a known series of 5 to 10 moves which allowed them to recognize each other. After recognition, the two Southampton programs became 'master and slave': one program would keep defecting and the other would keep cooperating.

Am I the only one who thinks this is just kind of obvious and silly?

Also, what kind of "moves" can be made by a "prisoner" that can be seen by the other prisoner?

Pat

Re:Silly... by Anonymous Coward · 2004-10-14 02:33 · Score: 0

According to what one player chooses in every turn, you can write a binary number that you can use for encoding any message.

Yes, actually by Perianwyr+Stormcrow · 2004-10-14 02:18 · Score: 3, Informative

But the proper test is really whether the master half of these programs can do better than tit for tat on a large scale basis. I suspect that the S/M program will still do less simply because it plays a pattern during the interaction phase which is likely to result in tit for tat still coming out ahead- if there is one tit for tat, it won't do so well since the costs of being tit for tat are relevant if you don't know the master sign and most of those you interact with are expecting to hear it. But that's already well known. If tit for tat's numbers start growing, it does better. You see, tit for tat has an identification mechanism too, which is simply that it always starts out nice and immediately gets nasty if it gets fucked. If the number of tit for tats increases to a reasonable critical mass, they can have enough positive reactions to do very well. In fact, they'd become a secret society within the S/Ms!

In short, if tit for tat is isolated, it won't do so well since everyone is fucking with it. If there are just a few tit for tats out there, their power increases significantly with each one added.

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

Re:Yes, actually by ErroneousBee · 2004-10-14 03:22 · Score: 2

I think youve missed the point of the S/M. It can play tit for tat with everyone except opponents identifying themselves as slaves. It will always win in this situation. Unless someone cracks the code of the slaves and starts abusing them too, or its gets its own slavebots to identify and target the opponents master. Also, it is possible for slaves to try and identify what opponents are using what strategies and communicate that to the master who can adjust to exploit it.

Its means that a new evolution in game theory experiments is occuring, in that now a cold war situation is arising where alliances will have to form against other alliances.

I think a similar thing is occuring in online casinos, where bot networks are working against real players. This may be fraud, but its very hard to detect.

--
**TODO** Steal someone elses sig.
Re:Yes, actually by Anonymous Coward · 2004-10-14 05:08 · Score: 0

...but perhaps what will actually end up though will be analogous to just before WWI, where all of the European countries were tied up in a complex web of treaties, mutual assistance pacts, etc. How will the "switzerland" of PD code, "Tit for Tat", fare then?

Eventually, the code for detecting behavior and modifying self's behavior (the feedback loop) will get too complicated, and there will be "analysis paralysis" in various code loops. But maybe this is a good thing, also. Let it run to its fantastic extreme, and see what comes out...

"Dr. Strangelove" comes to mind...

Spirit of the PD by johnthorensen · 2004-10-14 02:19 · Score: 4, Interesting

Not precisely cheating, as the rules are set up to play this way...but this certainly violates the spirit of the original Prisoner's Dilemma. Why?

Real prisoners only get to choose ONCE.

By taking advantage of the multiple-iteration aspect of the simulation with this sort of 'portknocking' strategy, the winning programs kind of take a cheap shot at the original PD.

Of course, it's all hypothetical anyway, and come to think of it Tit For Tat technically takes advantage of the multiple-iteration aspect as well by doing whatever the opponent did the last time...

Ah well, at least the Wikipedia entry makes a distinction between regular "Prisoner's Dilemma" and "Iterated Prisoner's Dilemma".

Re:Spirit of the PD by amorsen · 2004-10-14 02:39 · Score: 3, Informative

Err, of course this is the iterated prisoner's dilemma. It is quite easy to do the optimal thing in the non-iterated case: defect. You couldn't make a competition out of that.

--
Finally! A year of moderation! Ready for 2019?
Re:Spirit of the PD by jefu · 2004-10-14 03:31 · Score: 1

Actually in a single iteration of the PD, the best strategy is always to defect. Indeed, defecting is the best strategy on any Iterated Prisoners Dilemma where the number of iterations is known. It is only over time, with an unknown number of iterated plays with the same opponent that tit-for-tat (or tit-for-two-tats which I think I remember is slightly more robust) that the game becomes interesting.
And however hypothetical it may seem, the IPD is an interesting model for the development of trust, of social behavior and for things like negotiations between mistrustful software agents.
Re:Spirit of the PD by b1t+r0t · 2004-10-14 03:37 · Score: 2, Informative

Yep. If you know that a given move is the final move, the optimal action is to defect. When it's not iterated, there is only one move, so of course it's going to be the last move.
I presume there is no way the entrants can know how many moves will be in a given round, or Tit-for-Tat could be slightly be beaten by a modified TfT which always defected in the final round.

--

--
"Open source is good." - Steve Jobs
"Open source is evil." - Microsoft
Re:Spirit of the PD by Pendersempai · 2004-10-14 06:26 · Score: 1

The original PD would make for a terrible competition because the best strategy is already well known: Defect. You have to iterate to make things at all interesting.
Re:Spirit of the PD by cyberformer · 2004-10-15 04:44 · Score: 1

But defecting isn't the optimal choice. The whole point of the prisoners' dilemma is that the optimal action is for both particpants to cooperate, but each is idividually tempted to defect. That's why it's called a dilemma.

Defecting may seem to be a better strategy, but keep in mind that whatever factors (genes, environmenmt or in this case, programming) influence your decision to defect are also likely to be influencing the other player.

The important codicil to the story is... by aug24 · 2004-10-14 02:19 · Score: 4, Insightful

"The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team."

J.

--
You're only jealous cos the little penguins are talking to me.

Positive message for our children by lowe0 · 2004-10-14 02:20 · Score: 0, Troll

So, the lesson here is... if they're not your friend, fuck them over the first chance you get.

No denying it though - it obviously works damn well. Just ask the Bush administration.

The article got it wrong by nels_tomlinson · 2004-10-14 02:21 · Score: 5, Informative

The article got it wrong: they compared the tit-for-tat strategy for the iterated prisoner's dilemma to mutual assured destruction. That's wrong, since nuclear war is usually considered to be a one-time game: once you've blown each other up, there is no next round. Tit-for-tat requires that there always be a following round.

Repeated games have radically different outcomes than one-time games. It's long been known that where cooperation is possible, cooperation can beat solitary strategies in repeated games. I really don't think there's anything surprising here.

--
See what I've been reading.

Re:The article got it wrong by Anonymous Coward · 2004-10-14 02:36 · Score: 5, Insightful

"The article got it wrong: they compared the tit-for-tat strategy for the iterated prisoner's dilemma to mutual assured destruction. That's wrong, since nuclear war is usually considered to be a one-time game: once you've blown each other up, there is no next round. Tit-for-tat requires that there always be a following round."

The nuclear MAD comparison is apt, because of the time lag between launch detection and detonation. During the flight time of the first launch, there is time for several rounds to occur.

Actually, the nuclear standoff could be considered an ongoing PD game with both sides playing Tit-For-Tat strategies. The rounds occur every few minutes with both sides asking "did the other side screw us yet" and responding "no, so we won't screw them yet". This PD game has consisted of millions and millions of turns already, with both players using historical knowledge to influence their current choices.
Re:The article got it wrong by samyool · 2004-10-14 02:38 · Score: 1

well, this isn't necessarily true.

If there were no further rounds in mutually assured destruction, then the initial stockpile of weapons would have been enough. Instead, both sides increased production and acquisition in phases to (roughly) match each other. Likewise with increased placement of devices in new strategic locations.
Re:The article got it wrong by dajak · 2004-10-14 06:04 · Score: 1

The difference between PD and MAD is that you can announce what you are going to do, and you can publicly change your position on what you are going to do. This is an essential characteristic of a legal system. See Why agents comply with norms, and why they should by Giovanni Sartor for a very accessible introduction.
The prisoners are not supposed to negotiate, and that is the essential difference. The MAD doctrine doesn't work if you keep it secret, or the opponent doesn't believe it is what you will do.
Re:The article got it wrong by drlake · 2004-10-14 09:19 · Score: 1

MAD isn't iterated as you note, but it also really isn't accurate to call it PD. In MAD, you get the same score whether you retailate against an attack or not. Of course, the real problem with treating MAD as a 2x2 game is that it really is an extended form game, with the players taking their moves in sequence rather than simultaneously. You may be able to simulate MAD reasonably well with an extended form of PD, but that's not what we're dealing with here.
Re:The article got it wrong by oiuyt · 2004-10-15 03:12 · Score: 1

Also note that a first-strike capability that would wipe out a significant response would create a "last turn" situation, where it's arguable that defecting is the best strategy. I'd argue that half a destroyed world without an Evil Empire is worse than a cold-war, but....

-B

SuperGrape by kahei · 2004-10-14 02:25 · Score: 1

Ah, you're using the UK SuperGrape rules. I think most competitions nowadays use the Russian rules (SuperGrape takes _all_ pieces except King on a given color).

Of course in USA play the SuperGrape is almost worthless as your opponent can simply invoke the Citrus Rule next turn (which usually destroys the whole board).

--
Whence? Hence. Whither? Thither.

Re:So, in real world terms.... by afd8856 · 2004-10-14 02:25 · Score: 0, Redundant

You have the nukes. Why shouldn't they have them?

--
I'll do the stupid thing first and then you shy people follow...

Asian mentality by pubjames · 2004-10-14 02:26 · Score: 3, Interesting

ok, here's a weird thought. In many Asian countries, the mentality is to work as a group, rather than individually, with the individual sacrificing themselves for the group if necessary. In the USA and most of the "western" world, we tend to act more as individuals. We tend to think think our system is better, but what if we're wrong? Perhaps, as this experiment shows, the Asian mentality may actually be the superior strategy?

China has been most consistently the biggest superpower over mankinds history, and it looks like it's going to be that way again in a couple of decades. Perhaps these things are related...

Re:Asian mentality by Perianwyr+Stormcrow · 2004-10-14 02:31 · Score: 3, Insightful

China has usually been cultured but not powerful. Chinese history is a long sequence of conquests by powerful outsiders (Manchurians, Mongols, Europeans.)

--
What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey
Re:Asian mentality by Anonymous Coward · 2004-10-14 02:55 · Score: 0

Having to read speeches written by Japanese school children for a living, I can tell you that this is false. The little bastards would stab each other in the back the first chance they get, and then write a poor English sentence about it.

The other day almost none of the kids said they would give up their seats for old people on the train. It's the end of Japan I tell ya, the end of Japan!
Re:Asian mentality by Anonymous Coward · 2004-10-14 03:03 · Score: 3, Insightful

Actually they have been powerful, many, many times. But always at the hight of the current dynasty.

The cycles usually work as such: There's a period of chaos (warring states, etc), usually ending up with some external power coming in and conquering. Then a majority kingdom is established (didn't always own all of what we now call China, or sometimes more than present day, but whatever). Then there'd be a period of hightened trade. The influence of external nations would prompt both an interest in other cultures and a florishing of culture within the country. Then the nation would gain in power, boarders would become more defined, government would be stable and well established. The civilization would reach the peak of it's power. This high period could last anywhere from tens of years (later dynasties) to thousands (early zhao?).

Then the country would start to get too bureaucratic, too dogmatic. The boarders would be systematically locked down, the country would isolate itself. Xenophobia would reign. Finally interior corruption would fragment the government into separate regions. Civil wars would begin, and with this discord the next batch of foreigners would invade.

The moral: Isolation and xenophobia suck.

Mod -2 Too Much Fucking Information ;)
Re:Asian mentality by anaesthetica · 2004-10-14 03:12 · Score: 1

China has been most consistently the biggest superpower over mankinds history
Yeah, except for when they were ruled by the Mongols, and when they were conquered by the Japanese, and when they were carved up by the Europeans, and when they were dominated by the United States.

--
The Rise and Fall of Online Community
Re:Asian mentality by Len+Budney · 2004-10-14 03:14 · Score: 1

In many Asian countries, the mentality is to work as a group, rather than individually, with the individual sacrificing themselves for the group...Perhaps, as this experiment shows, the Asian mentality may actually be the superior strategy?

If we grant your rather broad assumption about the "asian mentality", it's still hard to see what makes that "the superior strategy". As TFA said, the strategy created a vast pool of losers, and a small number of very big winners. Is that really "superior"? For the three big winners, it clearly is--but what about the millions of peasants on whose back the winners stand?

Len.
Re:Asian mentality by Anonymous Coward · 2004-10-14 03:32 · Score: 0

You'd be surprised at the "average" American who would, say, force an airliner down in a field killing the few on the plane rather than hundreds on the ground.

What you speak of is the rule of logic as opposed to humanity. In the end, it may be the better choice, though, the less humane choice. Vulcans verses humans. I think most Americans use humane logic.

If you are speaking to business purposes...many Americans still give great sacrifices to their employers. This is changing though, as many companies have demonstrated a total lack of respect and loyalty to their workers. Hence, workers are becoming unwilling to sacrifice for work and will jump ship as soon as a slightly better oppurtunity arises. This is obviously bad for business, the economy, and workers. What I am saying is that if an American worker is treated right, he WILL balance what is good for his company with what is good for himself, some times even taking it in the butt if the benefit for his company is greater. When he is disrespected, his own status becomes the sole concern.

In a military sense, a dictatorship with self-sacrificing servants is very strong. It can act swiftly and without internal bickering. This is obvious to anyone. A democracy is, on the surface, weaker. However, history shows that democratic, western societies have pretty much ruled the roost. The Muslims noticed this back in the 18th-19th centuries and it scared the devil out of them (temporarily.) This caused much upheavel in due to some Muslims recongnizing the good and power (mainly power) of democracy and wanting to incorporate such items into their beliefs. However, democracy is incompatible with slavery, concubines, 12 wives, and putting *victims* to death. Christians did some horrid things over the years...many of the same things mentioned above. However, these were evils of men against men...not requirements written in the Bible. In Islam, these items are required in the "good" book. Their prophet practiced them himself. You be the judge.

One note on internal bickering...it is good that we have that in the U.S. HOWEVER, it should be private bickering...not Ds and Rs battling it out in the UN, Paris, Baghdad, CNN, etc. During war, a unified front should be presented publicly, CSPAN should shut off its cameras, and privately, let the fisty cuffs commence.

China will be a great power. There will be unrest there as people continue to seek freedom, however. China has civil war to worry about. Creating massive projects such as the Great Wall and the Three Gourges Dam is much easier in dictatorial societies...not in societies where subjects willingly die to build such projects for the greater good. Dictatorial societies have built many great things over the years: the Pyramids, the first satellite, the Great Wall, etc. Everyone of them has fallen, hard. Depraved slaves, no matter how noble in spirit, are not as effective in technology and science as individuals working for their own "selfish" interests. I suspect that the first Soviet astronaut would have gone had he known his ship would blow up anyway, but, if he changed his mind at the last instant, he probably would have met a worse fate.
I end this abruptly as duty calls....
Re:Asian mentality by Jerf · 2004-10-14 03:47 · Score: 2, Insightful

Perhaps, as this experiment shows, the Asian mentality may actually be the superior strategy?

Oh, this is a bad time to get all multicultural.

Sure, it works out great for the Masters, who get to the winners circles on the backs of their Slaves.

Meanwhile, if you want to call Tit-for-Tat the Western strategy, everybody mostly wins after a while, even though few do really well.

I don't believe either categorization. I'm just pointing out that if you're going to base your argument on this article, you are saying that it is good that a few individuals come out better, at the expense of a lot of other individuals, in the putative Asian system of thought. Which I find barbaric, though YMMV.

I once defined a political axis as "people who know they would be kings, vs. people who think they would be serfs". Sounds like I can guess where you come out on that.
Re:Asian mentality by Anonymous Coward · 2004-10-14 04:00 · Score: 0

That's the American way! Don't diss it!
Re:Asian mentality by mdfst13 · 2004-10-14 05:37 · Score: 2, Insightful

"In many Asian countries, the mentality is to work as a group, rather than individually, with the individual sacrificing themselves for the group if necessary."

But that isn't what this system does. Individuals do not sacrifice themselves for the good of the group; the group sacrifices itself to build up individuals. It is more like a feudal joust. If the king enters, all his opponents withdraw, making him the defacto winner.
Re:Asian mentality by cypher_6502 · 2004-10-14 07:26 · Score: 1

You can readily observe this in nature, ants, termites, bees, and wasps have survived for millions of years on this strategy; working together as a group. Pay attention next time you encounter an anthill, termite mound and bee hive.
Re:Asian mentality by Anonymous Coward · 2004-10-14 08:58 · Score: 0

In Islam, these items are required in the "good" book. Their prophet practiced them himself. You be the judge.

The Bible (in the Old Testament) has more God directed evil than the Koran. King David in Psalms sings about how wonderful it is to dash in the heads of the babies of his enemies. The Jews are ordered by God to kill all the men, all the old women and all the nonvirgin females (that are non jew) in the land West of the river Jordon - and to rape the non-virgin women (take them against their will as brides - multiple brides per man). Abraham married his half sister then lied about it so a powerful ruler would have sex with her.

Crack open your Bible and read it.
Re:Asian mentality by Anonymous Coward · 2004-10-14 09:06 · Score: 0

Boarders?

Avast ye matey! There be barbarians at the border!
Re:Asian mentality by Vitriol+Angst · 2004-10-14 10:16 · Score: 1

I would say a cooperative strategy is superior, in almost every case. I think the western stereotype is a badly understood Darwinian survival of the fittest model. But new research in evolution, biology and anthropology show that most diverse systems compete in a symbiotic way. Meaning that a lion is not always evolving to have bigger teeth and the antelope not always getting faster or breeding more. The prey allow for the predator and vice versa. Otherwise, both would be in an arms race of survival with little margin to tolerate scarce food and new invading parasites (for example).

Also, the concept of the Alpha male turns out to be a little wrong; while many species have a more dominant male that gets to mate more frequently, almost no packed animal groups have leaders with absolute say. Scientists have observed primates, wolves and even cows are almost democratic in deciding pack needs and that no leader can bully a pack where more than 50% want to do something different. The idea that most pack animals survive in a Democratic fashion of majority rule and that this is a trait for survival goes against conventional wisdom.

I think that a future winner of this event will submit cooperative packs that pass information to eachother about the relative "integrity" of opponents. The more sophisticated and communicative the group, the more honesty is rewarded. I wish I could remember it the mathemetician, but a few years ago I read about a man who had a very good mathematical proof of altruism. Showing that statistically, honest and principled individuals in a cooperative/competitivel society eventually win out. That "good guys" would outbreed selfish individuals.

This is a facinating competition, and I think that it could only be helped by NOT confining the rules with what you call CHEATING. By not having any rules at all, you can more closely test true strategies. There are many examples of creatures in nature that sacrifice themselves for a genetically related group (bees for one). I like things that can bust up our pre-conceived notions.

--
>>"ad space available -- low rates!!!"
Re:Asian mentality by Fortress · 2004-10-14 14:07 · Score: 1

China has usually been cultured but not powerful. Chinese history is a long sequence of conquests by powerful outsiders (Manchurians, Mongols, Europeans.)

It is interesting to note that none of the conquests stuck for more than a generation because of their powerful culture. China changes her invaders more than her invaders change her.
Re:Asian mentality by Kowelli · 2004-10-15 07:49 · Score: 1

What about working as a group, but with the individuals not sacrificing themselves for the group? That's the best strategy, but it needs conscience and matureness of the majority of individuals to be achieved.
What I'm saying here, in a philosophical or 'spiritual' level, if you want, is: Individuality must not be sacrifized for the group, but increased, so their higher manifestations could emerge(conscience, and the knowledge that 'the other is me'), and so the group would start cooperating without sacrificing nobody.

Mankind is actually in that kind of 'learning', both as individuals, and as a 'group'.

Best regards,

Albert

It is not the first by Flyboy+Connor · 2004-10-14 02:28 · Score: 4, Interesting

Axelrod never claimed that Tit-for-Tat was the best PD-playing program. He just stated that Tit-for-Tat would play well against any other combination of programs. Actually, IIRC, in the second tournament he organised Tit-for-Tat came in second. There was a different program that managed to exploit faults in other programs.

It is easy to score better than Tit-for-Tat in Axelrod's (original) tournament. He included a program that played random moves. It is not difficult to recognise this program after, say, ten moves have been played. You can always defect against random, because its moves are unrelated to its history. So, a program that plays Tit-for-Tat by default, but always defects against Random, scores better than Tit-for-Tat.

Does this dillute Tit-for-Tat's accomplishment? Of course not. Tit-for-Tat still plays well. And it is such a simple strategy that it can be programmed in two lines ("C on move 1, then copy opponent's previous move"), which none of the other programs achieve. Tit-for-Tat is simple, elegant, and strong. It's beautiful.

Southamptom entries, on the other hand, are complex, sneaky, and cheating against (perhaps unwritten, but nonetheless agreed-upon) rules. They're ugly. They only prove that backstabbing cheating bastards may defeat just-and-fair if the referee is looking the other way for a moment.

Re:It is not the first by jaaron · 2004-10-14 03:00 · Score: 2, Insightful

Southamptom entries, on the other hand, are complex, sneaky, and cheating against (perhaps unwritten, but nonetheless agreed-upon) rules. They're ugly. They only prove that backstabbing cheating bastards may defeat just-and-fair if the referee is looking the other way for a moment.

Sorry to be the one to break it to you, but sometimes life is just that way. :)

--
Who said Freedom was Fair?
Re:It is not the first by Artifakt · 2004-10-14 03:20 · Score: 1

Except it is precisely life that isn't that way. This new strategy is like allowing unselfish genes in an evolutionary model. In a living model, a strategy like this allows "winning" for a few given example of the 'species', but only by driving the whole 'species' towards extinction. Extinction is certainly an odd definition of winning.

--
Who is John Cabal?
Re:It is not the first by Baldrson · 2004-10-14 03:42 · Score: 1

Southamptom entries, on the other hand, are complex, sneaky, and cheating against (perhaps unwritten, but nonetheless agreed-upon) rules. They're ugly. They only prove that backstabbing cheating bastards may defeat just-and-fair if the referee is looking the other way for a moment.
No -- they also prove that the unwritten but nonetheless agree-upon rules are not allowing for kin selection. By not allowing for kin selection you are violating another and more important layer of unwritten, but nonetheless agreed-up rules: That PD tournaments be relevant. Perhaps its time to write down these more important rules of the PD tournaments so that the protean nature of the PD is demonstrated between levels of selection, as described by Hamilton.

--
Seastead this.
Re:It is not the first by rmaraujo · 2004-10-14 03:53 · Score: 1

In the computer tournaments described in Axelrods book (The Evolution of Co-operation), Tit for Tat wins in BOTH rounds. What he does claim is that it depends largely on the kind of other agents present in the game: if you have one TFT against many "defect always", TFT will certainly lose. Anyway, this "victory" over tit for tat feels like a cheat: you have agents that are not selfish and exist only to lose...
Re:It is not the first by Anonymous Coward · 2004-10-14 08:33 · Score: 0

No, Tit for Tat won the second tournament too.

You're missing the point. It is trivial to design a program that does better than just TFT (always defect, for example) if there only were those two. However, the object to design one that does better than TFT against a number of opponents who you know nothing about is certainly not easy. On the contrary, history has proven that it is extraordinarily difficult (as is evident by TFT winning both tournaments).
Re:It is not the first by Anonymous Coward · 2004-10-14 08:35 · Score: 0

It is easy to score better than Tit-for-Tat in Axelrod's (original) tournament. He included a program that played random moves. It is not difficult to recognise this program after, say, ten moves have been played. You can always defect against random, because its moves are unrelated to its history. So, a program that plays Tit-for-Tat by default, but always defects against Random, scores better than Tit-for-Tat.

Then you have to recognize that the program behaves randomly. That isn't necessarily trivial.

Actually.. by Anonymous Coward · 2004-10-14 02:29 · Score: 0

..it's half the fun you would normaly get ;-)

ITERATED dilemma by Anonymous Coward · 2004-10-14 02:29 · Score: 0

The possible moves are:
* Defect (turning in your mate) - this means you avoid the 'worst' case (when your opponent turns you in but you stay silent) but also avoids the 'best' case (When you both stay silent)
* Stay silent (the reverse of the above)

Simple PD is interesting but trivial. A quick
count of the matrix of possible moves shows that your best option is to always defect. The counterintuitiveness of this is why it is interesting.

HOWEVER - in this game, the simple game is iterated.
And you quickly learn that when the 'last' game is known, everyone struggles to defect then - and it quickly devolve sinto the simple case.
Interesting things happen once the game is open ended. Dove strategies (which sometimes, or even always, stay silent) start to win.

The whole point of the iterated game is to try and learn about your opponent - applying a strategy which is 'better' than theirs.
The long time 'best' of these is Tit-For-Tat. Which, as named, is the simple strategy of doing to your opponent whatever they did to you last round.
That way you play as well as possible against a hawk (a player who always defects) and also as well as possible against a dove.
Against itself, it ALSO plays as well as posisble, since they both sit there never seeing a defection and so never defecting - thus getting the highest possible score.

The interesting part of this study is that they claim that their colluding program doesn't just win against the existing programs by overwhelming them - but that it is also a stable strategy. That is, a program which tends to copy good strategies that it sees, will tend to adopt this strategy and ALSO do well.

It does, however, seem very cheaty.

Re:ITERATED dilemma by Anonymous Coward · 2004-10-14 04:43 · Score: 0

Nitpick: the absolute best scenario is when you betray and the other player doesn't, rather than when neither betrays.

That's why this strategy scores more: when it finds its opposite, it'll rack up optimal results like there's no tomorrow while tit-for-tat generally has a best scenario of total nonbetrayal, which scores lower.

Nothing personal by tit4tat · 2004-10-14 02:31 · Score: 1

This is not the first time that tit-for-tat has failed to win an iterated prisoner's dilemma competition. (Cannot find a good link to past results of similar competitions, but here's a link to results of one simulation.)

The cooperation strategy used by the Southhampton programs is interesting. At first it may seem unrealistic and not very informative about human behavior, especially in the context of life and death decisions. In other words, what incentive do I have to cooperate with a "brother" to my detriment, when at the end of the day, he lives and I die? But you can think of rationales for such a strategy (e.g., familial ties, idealistic reasons). These rationales may not be rational, strictly speaking -- If I die, what do I really care what happens after I'm gone? I cannot know or benefit from the beyond. (Or can I?) -- but being able to identify, account for and respond to them rationally would be beneficial.

Re:Nothing personal by Anonymous Coward · 2004-10-14 02:58 · Score: 0

Rationalism is not the same as solipsism.
Re:Nothing personal by boutell · 2004-10-14 03:22 · Score: 1

The selfish gene theory suggests that we may behave altrustically for the sake of others to whom we are genetically related; for instance, when a bird risks death to warn the flock of a predator rathern than simply fleeing alone, its genes are better propagated by the survival of the flock of relatives than by its solitary survival. Human altruism, which of course often extends beyond family members, may have developed from this basis.

--
Check out the Apostrophe open-source CMS: http://www.apostrophenow.com/

This sucks! by Libor+Vanek · 2004-10-14 02:33 · Score: 0

This is very easy to defeat! Just imagine - imitate "master" behaviour and then (after recognized by opponent) just abuse it...

It always works for card games by CyberGarp · 2004-10-14 02:34 · Score: 2, Insightful

Communication between secret partners has been one of the most undefeatable stratgies in cards for a long time. Didn't take a computer to figure that out. Someone just figured out how to do in the rules given for this competition.

--

I used to wonder what was so holy about a silent night, now I have a child.

Wait a Minute! by Waffle+Iron · 2004-10-14 02:34 · Score: 2, Funny

What I want to know is:
What is tat?
Where do I get it?
And how do I exchange it for the other thing?

--Dennis Miller (IIRC)

Re:Wait a Minute! by Anonymous Coward · 2004-10-14 09:08 · Score: 0

What is tat?

Tat is slang for big floppy man-tits.

Where do I get it?

Reading slashdot, chances are about even you already have them...

And how do I exchange it for the other thing?

As soon as you find out, please let the rest of us know.

Horribly wrong, too. I hope. by Anonymous Coward · 2004-10-14 02:34 · Score: 0

If Mutually-Assured-Destruction really were an implementation of one-time-only PD, then only non-rational play would explain the outcome.

Or, to put it colloquially:

Either world leaders are insane, or we're all dead.

Kin Selection in Genetic Algorithms by Baldrson · 2004-10-14 02:35 · Score: 3, Informative

This is a clever demonstration of kin selection among groups of competing algorithms.

A mathematical treatment of population genetics in groups was given by W. D. Hamilton in "Innate Social Aptitudes of Man". In the last sentence of that paper, Hamilton, the originator of modern kin selection theory, states:

One hears that game theorists, trying to persuade people to play even two-person games like 'Prisoner's Dilemma', often encounter exasperated remarks like: 'There ought to be a law against such games!' Some of the main points of this paper can be summarized as an answer to this comment: that often, in real life, there is a law, and we can see why, and that sadly we also see the protean nature of this Dilemma, which, when suppressed at one level, gathers its strength at another.

What Hamilton is referring to is the fact that in any structure of components vs composite, there is the opportunity to defect. An individual gene can defect against the organism within which it resides via, say, meiotic drive. An individual may defect against his tribe made up of his close relatives. A tribe may defect against the others making up a nation. A nation may defect against others making up a geographic race. A geographic race may defect against others making up humanity as a whole.

It is indeed a dilemma but it isn't without a rigorous treatement within genetic theory.

Steve Sailer has written an an excellent review of the politically touchy issue of ethnic nepotism given from Hamilton's group selective perspective.

--
Seastead this.

Re:Kin Selection in Genetic Algorithms by Snowspinner · 2004-10-23 05:07 · Score: 1

I tend to think kin selection was deliberately, rather than accidentally, excluded from the model.

--
Philip Sandifer's academic website

Secret societies & paranoia by G4from128k · 2004-10-14 02:36 · Score: 2, Insightful

This story illustrates the power of groups and societies to coordinate to the detriment of individuals and outsiders. The Southampton team used a "secret handshake" to recognize members of the society and discriminate against outsiders. It is a natural explanation for people's fear of closed/secret societies -- people fear the group's ability to break the rules of individualistic "fair play."

If the agents in the game were capable of higher order reasoning and could see these coordinated actions between members, then they would become paranoid -- all the Southampton team members were "out to get them."

--
Two wrongs don't make a right, but three lefts do.

Not unfair by Anonymous Coward · 2004-10-14 02:36 · Score: 0

A strategy is just simply a function that maps an observed history into some sort of behaviour.
So this is not unfair. It does, however, require the presence of other nice agents that cooperate.
I conject that given this last fact it would be easy to design strategies that do better by exploiting this cooperative behaviour: simple design a strategy so that the program initially seems to cooperate (by making the correct "handshakewith the nice agents, but even when pretends to be in slave mode quickly starts to defect. It would probably do even better probably; and then you would just have the old prisoner's delemma back (i.e. cooperation is unsustainable).

Re:Not unfair by trewornan · 2004-10-14 03:39 · Score: 1

It would probably do even better
There's no "probably" about it - when it's in master mode it can reap all the benefits and in slave mode it can defect and reduce it's losses. A tit for tat program which can recognise/fake the handshake and respond accordingly is going to trounce this.

It's interesting stuff by Perianwyr+Stormcrow · 2004-10-14 02:36 · Score: 2, Insightful

Tit for tat has a secret handshake too, but it's a code of ethics. It is robust in any iterated situation. That's what makes it neat.

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

"The Prisoner's Dilemma" game online by helfen · 2004-10-14 02:36 · Score: 1

I did a research on this topic and I found a fun little game. You can play in "The Prisoner's Dilemma" against a computer opponent - , and discover some strategies.

Re:"The Prisoner's Dilemma" game online by Nimloth · 2004-10-14 03:21 · Score: 0

Wow, well that's a nice algorithm:

Albert: 1057 years You: 12 years
Albert should learn his lesson after 50 times not confessing.

Tat gets tit? by FerretFrottage · 2004-10-14 02:37 · Score: 1

Well then tat must really suck

--
"Look Lois, the two symbols of the Republican Party: an elephant, and a fat white guy who is threatened by change."

Tit for Tat always has been beatable... by jafiwam · 2004-10-14 02:39 · Score: 2, Insightful

Except Tit for Tat is more robust than other plans, deals well with a wide variety of opponents, and is easy for opponents to "figure out" and is "forgiving" so it does not get caught in endless loops of mutual punishment easily.

Being that, beating Tit for Tat isn't that big of a deal. Doing BETTER than Tit for Tat consistently _IS_ a big deal.

The game is a positive sum game, so it pays off to end up in a cooperative (or semi-cooperative) sequence over repeated "defections".

For some good reading on the Prisoner's Dilemma Game and how it fits in some biological systems read;

"The Evolution of Cooperation" by Robert Axelrod (and newer books)

"The Selfish-Gene" by Richard Dawkins

There may be more recent books too, it's been while since I studied the subject.

Having one plan that can beat Tit for Tat

Re:Tit for Tat always has been beatable... by barawn · 2004-10-14 02:57 · Score: 1

Being that, beating Tit for Tat isn't that big of a deal. Doing BETTER than Tit for Tat consistently _IS_ a big deal.

I think you missed some points of the research - the idea was to find "how many colluders do I need to beat Tit for Tat?"

Tit for Tat is quite possibly the best "single person" strategy. Against any other opponents not working together, Tit for Tat will typically win in a long enough iteration.

Now, we know that Tit for Tat isn't the best in cooperating environments - as someone else here pointed out, if you've got a Random opponent, a "use Tit for Tat, watch for Random, if Random, slam him to the wall." strategy will probably do better. That's akin to "cooperation", or at least "collusion" - said person will have more information.

The interesting point here is how many colluders you needed here - this showed it was about 20 or so. That's a large number. Given the total number of entrants, it says something about roughly how large an organization must be before it can score better than the ideal "logical person".

Given the fact that the systems that the PD applies to don't preclude collusion, this may have some interesting applications there. Certainly one can imagine applications in biology - cells, after all, do kill themselves to ensure the survival of the macroorganism. There might be some information here on the evolution of multicellular organisms.

NOOOOOOOOO!!!!!!! by Marquis+de+Sade · 2004-10-14 02:39 · Score: 1

I beat you thusly anyway.

I must be missing something by Wind_Walker · 2004-10-14 02:41 · Score: 3, Interesting

Wouldn't an algorithm that defected every time (call it Traitor) beat the Tit for Tat program?

1st iteration - Traitor defects, TfT cooperates, TfT loses and Traitor wins.
Nth iteration - both defect, minor losses for both

Thus Traitor beats TfT... What am I missing?

Re:I must be missing something by Anonymous Coward · 2004-10-14 02:52 · Score: 1, Informative

You aren't missing anything. In fact, TfT never beats *any* other strategy. The best it can do is tie.

Also worth noting is that the Southampton programs *were* cooperating -- with each other.
Re:I must be missing something by p3d0 · 2004-10-14 04:39 · Score: 1

What you are missing is that nobody ever claimed TfT would never lose.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:I must be missing something by Anonymous Coward · 2004-10-14 04:55 · Score: 0

Welcome to the game.

10 players. 5TFT, 5Traitors 45 plays round robin.

each TFT plays 20 rds against TFT. 3pts x20 =60pt
each TFT gets cheated once by each traitor 0pts x5=0pts
each TFT cheats each cheater 4 times = 1pt x20=20pts

At the end of the day each TFT scores 80pts

Each traitor plays 5 rds against an unwitting TFT = 5pts x5 = 25pts
Each traitor plays 20 rds against knowing TFTs = 1pt x 20 = 20pts
Each traitor plays 20 rds against traitors = 1pt x20=20pts

At the end of the day each Traitor scores 65pts

Equal TFT destroys Traitor 80/65 on average. Actually the difference increases with every additional round played.
Re:I must be missing something by Gogl · 2004-10-14 06:41 · Score: 2, Interesting

You're right that in a one-on-one matchup, always defect would beat TfT. However, the point of TfT isn't that it would win every single one-on-one matchup but that it does extremely well versus any number of other strategies. Your "always defect" would beat TfT, but if you played Grim Trigger then you wouldn't do that well, whereas TfT would do very well playing with Grim Trigger (Grim Trigger is a strategy that cooperates until the opponent defects, and then it defects forever).

As has been stated in this thread, the claim isn't particularly that TfT is the best strategy of all time in all circumstances, but that it is an elegant and versatile strategy that fares well in a variety of situations.
Re:I must be missing something by Pretzalzz · 2004-10-14 08:18 · Score: 2, Interesting

This isn't a masochistic game, you win by scoring above 0, and lose by scoring less than 0. In Tit for Tat v Traitor, they both lose.

So, Southampton lost big by Flyboy+Connor · 2004-10-14 02:41 · Score: 1

Since the two Southampton programs work together, I would say they both earn half of their combined winnings. From that point of view, Southampton lost big. Of course, if you are allowed to completely discount your losing entries, it is easy to defeat Southampton: simply create one master with TWO slaves.

Once I "won" a PD-tournament by fiddling with the organising engine. The friendly engine supplied the programs with the recent histories, and I simply inserted all cooperates in my opponent's table. The scores were only counted at the end. This was in a trial session, and I only wrote the program to expose a problem in the engine. I wouldn't dream of entering the program for real, because it would defeat the purpose of the tournament. I can't say I have much respect for the Southampton team that didn't have any qualms about cheating.

Did the same thing a few years ago... by Anonymous Coward · 2004-10-14 02:42 · Score: 5, Informative

The length of the code is one of the largest problems to overcome. Performing any signal other than all-cooperate produces a net loss of 1 or 4 points per round for your team in traditional (0,1,3,5) IPD. Simple signalling, ie 4th round defect was very effective. While the master/slave aspect was amazingly effective in my research, the "spoiler" was not. A small population of master/slaves could invade an arbitrariliy large block of TitForTat if evolution was by duplicating winner and removing loser after n iterations. The population of "spoilers" stagnates very quickly in a large TFT population. TFT should be considered a friend, not an enemy because they are a positive growth environment. Going "spoiler" on any non-TFT/ally was quite effective as any bot not prone to cooperate posed the only real risk of "master" losing.

Re:Did the same thing a few years ago... by Neb+Namwen · 2004-10-14 12:05 · Score: 1

A small population of master/slaves could invade an arbitrariliy large block of TitForTat if evolution was by duplicating winner and removing loser after n iterations.
How? Since the slave serves the master by losing to it, after a few generations the slaves will all be extinct.

Evolution figured this one out LONG ago: by museumpeace · 2004-10-14 02:43 · Score: 1

In an organism as ancient and lowly as the slime mold, a genetic feedback mech evolved so that cheating would balance with altruism...You should hope your species hangs around as long as the slime mold [> 1 billion years!] see this article at BetterHumans and elsewhere.

--
SLASHDOT: news for people who can't concentrate on work or have no life at all and got tired of yelling back at the TV.

so difficult! by earthstar · 2004-10-14 02:43 · Score: 0

Man, this game has stuff!!!
It takes so much of my brain's processing power like no other game!

On a side note, i bet this story will get over 1000 comments.Sure.

--

Why does yahoo do this

Breaking news! by eddy · 2004-10-14 02:44 · Score: 1

So...

In a championship of iterated games where I got to play against myself occasionally, I was able to win the aggregated championship score due to my "insightful" ability to optimize playing against myself.

Did I get that right?

--
Belief is the currency of delusion.

Re:So, in real world terms.... by Anonymous Coward · 2004-10-14 02:45 · Score: 0

You have the nukes. Why shouldn't they have them?

Because the North Korean government does things like kidnap citizens from other countries, conducts experiments on human subjects, and starves their population.

All in all, since the North Korean really can't build nukes without China's tacit acquiescence, I'd say we should go to the Chinese and say "If North Korea doesn't give up its nukes now, we'll support the nuclear arming of Japan, South Korea, and Taiwan."

That'd go over well with the Commies in Peking.

Oh, and you should see from both game theory and the results of Jimmy Carter's previous negotiations during the Clinton administration with North Korea why negotiating directly with North Korea only is a bad idea. Not that being a bad idea ever stopped John Kerry from opening his yap.

Did you also know that a few days after Kerry went off about North Korea in the first debate, the North Koreans pulled out of negotiations and want to wait until after the US elections? I guess they think they can get a better deal from Kerry.

Telepathy? by earthstar · 2004-10-14 02:47 · Score: 0

While the PD game works by those questions tha thave to be individually answered,I have a thought.

Is telepathy scientifically possible? ( communicating to another person without any contact - not even eye,but by sheer WILL)

--

Why does yahoo do this

Re:Telepathy? by earthstar · 2004-10-14 02:53 · Score: 0

Usually Iam reminded of telepathy only when iam stuck up with a question i didnt know the answer to, in an exam hall,and wished that i could ask my pal by meer 'waves' ;-) .
But this prisoners dilemma reminded me of that.!

--

Why does yahoo do this

The winner basically cheated (good for him :) by jamie · 2004-10-14 02:50 · Score: 4, Interesting

It's pretty trivial that if two or more Dilemma agents are able to recognize each other, they have an advantage over those which cannot. I've got a Prisoner's Dilemma simulation running on my website -- I wrote some code for it over the summer and have been playing around with it on and off.

Once I experimented with letting the agents recognize which "species" they were in and which "species" their opponent was. The runaway winner, of course, was the one which always cooperated with itself, and was less nice to every other species. (In my version, "less nice" meant playing Tit-For-Tat, but the idea's the same.)

Being able to do this is like having the teacher's edition. If recognizing which species other agents belong to is allowed, that's a pretty trivial strategy. It's not called cooperation. It's called xenophobia, or to put it into the most familiar anthropomorphization, racism.

(The life lesson, if I may go out on a limb, is that in an environment where some recognize a quality called "race" and discriminate based on it, being unable to see that quality is a liability. Being truly color-blind means you are unable to recognize not only race but racism, which means you will be taken advantage of.)

When I ran my first tournament and got some interesting results based on this, I realized that knowledge of what "species" an agent belongs to is too powerful, it throws a monkey wrench into the works. So I scrapped it and moved on to stuff I found more interesting.

But the winner of this PD tournament was even craftier; he submitted a ton of entries, all of which were xenophobic in this way, except that they all recognized one "species" as the top dog. The other "species" essentially committed suicide to give the highest score to the top dog. That wouldn't have worked in my tournament, since they literally would have committed suicide (my agents starve to death if they don't score high enough) and that would have shaped the resulting environment. Every tournament is artificial in some way, and the human submitting entries to this one was clever enough to take advantage of these particular artificialities.

Since it's now been shown that inter-agent communication is possible, that's going to be fair game for every tournament from now on. The next step is going to be designing tournaments to work with this trick, not against it. As I wrote to this tournament's organizers:

Since that's such a powerful strategy, I think the next step in PD tournaments is not to try to overcome it, but to embrace it: allow agents to communicate, not just with their own species, but with whoever they're playing against. My guess is that mere xenophobia would be eclipsed by the much more powerful strategy of joining the ongoing discussion about which agents can and can't be trusted. That's the next big feature I want to try.

Re:The winner basically cheated (good for him :) by PTBarnum · 2004-10-14 04:14 · Score: 1

You say that you had a strategy that would always cooperate if playing against itself, and would play tit-for-tat otherwise, but tit-for-tat will already always cooperate with itself. In what way does this make the winner "less nice" to foreigners?
Re:The winner basically cheated (good for him :) by Anonymous Coward · 2004-10-14 04:16 · Score: 0

That is, mere xenophobia would turn into scientific fascism. :(
It seems that if one is suitably prejudiced, one's better off than anyone playing "fair". I'll try to ignore that knowledge. :)
Re:The winner basically cheated (good for him :) by p3d0 · 2004-10-14 04:36 · Score: 1

My guess is that mere xenophobia would be eclipsed by the much more powerful strategy of joining the ongoing discussion about which agents can and can't be trusted.
Wow. That is cool.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:The winner basically cheated (good for him :) by jamie · 2004-10-14 04:48 · Score: 2, Informative

In the particular case I ran, it didn't make any difference. Now, though, I've added errors to what each agent tries to play. Tit-For-Tat responds to those errors and, playing against another Tit-For-Tat, quickly plunks down to (say) 90% cooperation plus or minus error, and random-walks around from there. "Always cooperate," though, sticks at 100% plus or minus error, so it does a little better.
Re:The winner basically cheated (good for him :) by Anonymous Coward · 2004-10-14 05:05 · Score: 0

This highlights a flaw in the compitition.

The contest rules have an assumed relationship between the two agents all ready built-in. How could one agent defect against another if the two agents don't know anything about each other, in any scenario? What could they offer to get a reward?

The artifical agents entered into the compitition then represent the point where the prisoners are isolated; it is assumed selfish interests will always take over. If both agents are equal in their resolve not ot defect, this is because they agreed on that prior to their capture.

Compitition beween two strangers is not the "prisoners dilemma", even if the rewards/consequenses are the same.
Re:The winner basically cheated (good for him :) by figment · 2004-10-14 09:29 · Score: 1

Unfortunately the cheating made it defeat the entire purpose of IPD in the first place. The idea behind it (and game theory in general), is that a person will choose actions that will maximize his profit/utility/happiness/etc. With this said, IPD is useful because it can be drawn into games such as repeated-period pricing in a Cournot duopoly - where each firm maximizes their profit.

SO fastforward to what we have today, a strategy where we create 500bazillion smaller firms all for the purpose of going bankraupt, so that one parent firm makes all the money. Huh? Right, that doesn't make any sense.

The *correct* way to think about it, is that if you're going to have said cooperation, the coalitions that cooperate will split the prizes between them, such that if this team had 80 entries, their end utility (ie end profit) would be 1/80th of the one program that won. In this case, then drawn back into the real-world example of a firm, you can see why this isn't a valid strategy, due to the start-up costs of making a bazillion fake firms who all lose money, etc.

So yes, this team won. Yes, it was a pretty ingenius way of winning. But no, it has little if any use to actual game theory.
Re:The winner basically cheated (good for him :) by jamie · 2004-10-14 10:35 · Score: 1

Right, good analysis. What does it mean to say the human "won" when, on balance, his submitted strategies did worse than the median? If there were an entry fee and the pot split proportional to final score, he'd be out of a lot of money. So another way of looking at this is that the human took advantage of an artificially-constructed winning condition.

Cheat? by oniony · 2004-10-14 02:51 · Score: 1

Hmm, super-cooperation. They are cooperating outside of the problem to achieve a goal outside of the problem. I think this is just a cheat. The programs are not out for personal gain at all and so are not truly participtaing.

--

Powered by onion juice.

That's right by Anonymous Coward · 2004-10-14 02:52 · Score: 4, Informative

That's right, traitor (hawk) beats TfT in any given trial.

BUT, in an environment made up of a few players playing each strategy, then you have the following matchups:

Hawk vs Hawk. Horrible horrible loss for both of them.
TfT vs Hawk. Hawk wins, but only by a single round.
TfT vs TfT. Both TfT 'win' - neither betray the other.

So, overall, TfT does better than hawk.

The interesting part isn't beating TfT (which, as you point out, isn't THAT hard to do) but in doing consistently better than it against a wide variety of programs. Which is what TfT has long been the baseline for.

new exploit? by An+ominous+Cow+art · 2004-10-14 02:52 · Score: 2, Funny

Curse these researchers, now black hats will be using this technique to let exploit code escape from chroot prisons!

Tit for Tat has already been beaten by bigHairyDog · 2004-10-14 02:52 · Score: 2, Informative

Tit for Tat is outperformed by "Tit for Two Tats", because it is better at avoiding long runs of damaging mutual recrimination. That was 5 years ago. The performance of any of these strategies is only determined by the opponent strategies that they face, which is arbitrary. It is therefore meaningless to talk of one strategy being 'better' than another - most advanced strategies can beat Tit for Tat given the right opponents.

--

foo mane padme hum

Re:Tit for Tat has already been beaten by tepples · 2004-10-14 08:58 · Score: 0, Troll

Tit for Tat is outperformed by "Tit for Two Tats"

So now do I have to get two tattoos in order to get a woman to expose herself to me?
Re:Tit for Tat has already been beaten by emurphy42 · 2004-10-14 13:12 · Score: 1
The performance of any of these strategies is only determined by the opponent strategies that they face, which is arbitrary. It is therefore meaningless to talk of one strategy being 'better' than another - most advanced strategies can beat Tit for Tat given the right opponents.
Players of the card game Magic: the Gathering (where you get to choose which cards go into your deck) refer to this as the "metagame":
1. Strategy X becomes popular
2. Cards that defend against X (but are less useful against other strategies) become popular
3. X becomes less popular
4. Cards that defend against X become less popular
5. X becomes popular again
A Google search on "Tit for Two Tats" turns up, among othe things, some excerpts from The Selfish Gene by Richard Dawkins. While Tit for Two Tats would have won the first tournament, it lost the second one, because the metagame had changed.
Programmers in the second tournament had all been provided with the results of the first, including Axelrod's analysis of why Tit for Tat and other nice and forgiving strategies had done so well. It was only to be expected that the contestants would take note of this background information, in one way or another. In fact, they split into two schools of thought. Some reasoned that niceness and forgivingness were evidently winning qualities, and they accordingly submitted nice, forgiving strategies. John Maynard Smith went so far as to submit the super-forgiving Tit for Two Tats. The other school of thought reasoned that lots of their colleagues, having read Axelrod's analysis, would now submit nice, forgiving strategies. They therefore submitted nasty strategies, trying to exploit these anticipated softies!

But once again nastiness didn't pay. Once again, Tit for Tat, submitted by Anatol Rapoport, was the winner, and it scored a massive 96 per cent of the benchmark score. And again nice strategies, in general, did better than nasty ones. All but one of the top 15 strategies were nice, and all but one of the bottom 15 were nasty. But although the saintly Tit for Two Tats would have won the first tournament if it had been submitted, it did not win the second. This was because the field now included more subtle nasty strategies capable of preying ruthlessly upon such an out-and-out softy.
I'm reminded now of Iocaine Powder, the program that won the first Ro-Sham-Bo (iterated rock-paper-scissors) contest. Its strategy was basically the following: "Consider a whole bunch of different strategies, figure out which one would have given me the highest score if I had used it from the beginning, and use that one next turn." I wonder whether that concept has been adapted to the Iterated Prisoner's Dilemma, and if so, how well it holds up?

Actually, it's quite interesting.... by StressGuy · 2004-10-14 02:54 · Score: 1

There are a lot of post about the Southamton programs "cheating" and I agree. However, the interesting part of the prisoner's dilemma, or the zero-sum game is that this is one of those areas where math and philosophy overlap. This math problem yields a facinating insight into society. I would see the "cheating" Southamton method as analogous to a "charismatic leader or organization" who has amassed an array of followers willing to sacrifice themselves for his (or thier) benefit.

Think of people or organizations that have fanatical followers. I think these guys might be on to something quite fascinating.

--
A goal is a dream with a deadline

Re:Actually, it's quite interesting.... by Control+Group · 2004-10-14 03:17 · Score: 1

Er
PD is specifically not a zero-sum game: there can be a net gain from cooperation, such that the system as a whole comes out ahead.
This is why it's important that the lone cooperator serve 5 years, mutual cooperators serve 2 years, and mutual defectors serve 4 years. From a system standpoint, the end result is either 4 total years (mutual cooperation), 5 total years (single defection) or 8 total years (mutual defection).
Since the systemic outcomes differ, it's a positive-sum game. In a zero sum game, the systemic result would be the same regardless of the participant's cooperation. The fact that it's not zero sum is precisely what makes it interesting.

--

Reality has a conservative bias: it conserves mass, energy, momentum...

"This must be banned" by Tablizer · 2004-10-14 02:55 · Score: 1

After recognition, the two Southampton programs became 'master and slave'

This game will be banned under the Patriot Act. Ashcroft will claim that if Osoma got ahold of these results, he will use it to justify making us all Islamic slaves, with Osoma the Master.

--
Table-ized A.I.

Re:"This must be banned" by maxwell+demon · 2004-10-14 03:18 · Score: 2, Insightful

Isn't this the way the terrorist organisation works? The actual attackers totally lose (they lose even their life), and their masters profit from it. The experiment shows that tit for tat isn't a good strategy against this.

--
The Tao of math: The numbers you can count are not the real numbers.

Models real world? by MoebiusStreet · 2004-10-14 02:55 · Score: 1

At first blush it seems like a "so what?", but in the context of current events, I wonder if it does give us a little insight.

As quoted above, we see that there are a few winners who got there only by making other players siphon their own potential into the designated winner.

Does this differ significantly from the US presidential elections? I mean, here we have two people who have convinced a nation that one of these two are the only ones who can be winners, and all of us other players can only playing our game their way. They're ensuring their own success, and for some reason we peons are going along with the game.

Re:Models real world? by Anonymous Coward · 2004-10-14 02:59 · Score: 0

Sour grapes that the Libertarian candidate doesn't stand any chance to win?

WOPR and fries? by pgfault · 2004-10-14 02:56 · Score: 0

Hopefully they affixed plenty of LEDs to the front of their server hardware and forced their developers to use 300bps acoustic couplers.

"How about a nice game of Tit for Tat?"

That seems cheating to me. by rguiu · 2004-10-14 02:56 · Score: 1

I can find a similar example. Having a league of an sport that most of the results are draws or very equal.
Someone takes part not with a team, but with 15 teams, 14 of the teams will lose without efford against the 15th, but will fight hard with the other 5 external teams, my team number 15th will have at least 14 victories and may be some of the other 5....easy champion.

That seems what they used, now with that strategy someone could send 1 million entries and will be the winner easily.

What I'd like to know is ... by Anonymous Coward · 2004-10-14 02:57 · Score: 0

How did all the Southampton entries do as a combined aggregate?

Tit for Tit by Tablizer · 2004-10-14 02:58 · Score: 0, Redundant

I'll willingly become a slave if my partners have nice tits. I call this algorithm tit for tit :-p

--
Table-ized A.I.

Cooperation is the loser... by CODiNE · 2004-10-14 02:58 · Score: 1

I don't really get this, looking at the rules it seems the best outcome is to screw people over. Cooperation gives small wins for both... is this just to illustrate a zero-sum game or is this supposed to actually model real-life social interaction? I do not believe that in reality if everyone cooperated there would be a small payoff for all, and that the secret to life is to screw unto others. I know the situation is two prisoners being interogated, which is not the same as say... two neighbors fighting over leaves falling on each other's lawns from opposite tress for example. Yet it seems to be used as a general society model for human interaction. Notice

A common view is that the puzzle illustrates a conflict between individual and group rationality. A group whose members pursue rational self-interest may all end up worse off than a group whose members act contrary to rational self-interest.

So there it implies that the prisoners dilemma parallels everyday living and society, and yet it is not really an appropriate model for day to day life is it? Maybe I'm missing the point here, it just seems the message is "Being a selfish jerk is better than a nice guy, so live appropriately".

-Don.

--
Cwm, fjord-bank glyphs vext quiz

Re:Cooperation is the loser... by Anonymous Coward · 2004-10-14 03:44 · Score: 0

The article does a bad job of explaining it. PD can be demonstrated a lot of ways with different numbers, but the basic idea is:

a)Both players start with X points
b)If you cooperate, nothing happens
c)If you defect, you gain N points while the opponent loses 2N points

Cooperating is zero-sum while defecting is negative-sum.

Tit-for-tat is only the optimal strategy in cases where the game will be repeated an unknown number of times. In a never-repeated game, always defecting is optimal, and oddly in any fixed number of repeats always defecting is also optimal (everyone will defect the last round, so defect then...so the last round is already effectively decided, therefore everyone will defect the next-to-last round, so defect then...et cetera).

Personally I think this isn't news at all, it's saying "hey, I found a way to make one of my teams win if I enter two teams to compete against everybody else's one!" If I owned two baseball teams and had A always forefit to B, B would of course get a good record.
Re:Cooperation is the loser... by Chyeld · 2004-10-14 03:48 · Score: 1

Really though, isn't that how life is? Ignoring the whole Religious "the meek shall inherit the earth" spiel we are fed, when exactly have you seen a situation where not being selfish rewards you more over being selfish?

We'd like to think that if we all play by the rules, are polite and considerate to everyone, that we'll be sucessful. But the sad truth, of this world at least, is that life is typically far easier if you worry about yourself first, and others second.

That doesn't mean that it's ok to do so, society was build upon the efforts of trillions who cared more about other people than themselves (to some sense or another), and it can only survive if there continues to be a stream of such people.

A key to life is to realize and accept both of these facts, that our continued survival depends on mutual cooperation AND that the short term rewards for not cooperating are always going to be a tempation. And to plan accordingly.
Re:Cooperation is the loser... by narcc · 2004-10-14 07:08 · Score: 1

Reminds me of a couple of interesting problems

Auction a dollar

and Divide the dollar

--
Required reading for internet skeptics

Its a tit-for-tat world by essreenim · 2004-10-14 03:02 · Score: 1

Defecting gets you a win ..
Copying the other players moves gets you a win ..

Except that's not the "prisoner's dilemma" at all! by tomhudson · 2004-10-14 03:03 · Score: 2, Insightful

Southampton's programs executed a known series of 5 to 10 moves which allowed them to recognize each other

The whole idea behind the prisoner's dilemma is that neither party is privy to what the other party is currently doing.

By using this "recognition system", the program is capable of "knowing" in a deterministic fashion what some of the other programs will do in advance.

In other words, at the very least, a cheat.

Bridge conventions by BridgeBum · 2004-10-14 03:03 · Score: 1

With a name like BridgeBum, how could I not reply to this? :-)

I'll address the tournament rules of bridge, rather than the Laws of Contract Bridge, which are different.

Basically, the governing bodies in power running tournaments can and do restrict agreements (known as 'conventions') that are allowed between partnerships. However, these restrictions are not so stifling as to only allow known agreements...if that were the case, no invention could happen.

The rules relating to conventions basically fall into two categories:

1) Any agreements you have must be made available to your opponents upon request. There may also be proceedural changes ('Alerts') for agreements which are unusual in nature to help inform the opponents when they should inquire.

2) Any conventions you play are 'categorized' and must fall into the allowed categories for the event. In the US/Canada/Mexico part of the world, they have in general the most restrictive rules about conventions, but even within those frameworks, inovation can and does happen.

Another good general rule: the higher the level of competition (National event, International, etc.), the more liberal the rules are about conventions.

In parts of the world outside North America, the local tournaments tend to be more liberal. Austrailia/New Zealand probably have the loosest rules restrictions of anywhere in the world. (I'm not sure there *are* any restrictions, except for the disclosure rule #1 above.)

More information:

American Contract Bridge League
World Bridge Federation

--
My UID is the product of 2 primes.

Re:Bridge conventions by trewornan · 2004-10-14 03:33 · Score: 1

In UK rules the only restriction on bidding is that you can't bid one no-trump if you hold a singleton. Really is a mystery to me why they object so strongly to this though.
Re:Bridge conventions by rcs1000 · 2004-10-14 03:38 · Score: 1

Totally OT... but I thought you'd enjoy this.

When I was at University, I was friends with (and played a lot of poker with) the Bridge crowd.

Once, right at the end of term, the Bridge team was playing in a local tournament and was a player short. (The team had actually already won the county league, so this was strictly for fun.) Tom, one of my friends, and a member of the British Under-21 Bridge Team asked me if I'd like to come along and play. I pointed out that I didn't really understand Stayman, Blackwood, or any of that stuff. No problems, he said, this is all for fun, just play naturally.

So, there am I (a very average bridge player) paired with one of the country's best, playing a form of the game (duplicate pairs) I never even knew existed.

About our third table we came up against a very serious looking pair who presented us with cards explaining their bidding convention, and asked what we were using. Tom explained that we had no system, and was roundly disbelieved.

Anyway, I bid 2 Hearts, and was doubled. So, I redoubled.

"WHAT DOES THAT MEAN???!? WHAT CONVENTION ARE YOU USING???" asked the woman, very upset and thinking we're passing secret messages.

Tom replied "I believe it means he thinks he's going to make two hearts."

Beautiful!

--
--- My dad's political betting
Re:Bridge conventions by Anonymous Coward · 2004-10-14 03:39 · Score: 0

In UK rules the only restriction on bidding is that you can't bid one no-trump if you hold a singleton. Really is a mystery to me why they object so strongly to this though.

You're wrong, I'm afraid. Most events are played at EBU level 3 (more or less the old General License) or level 4 (more or less Restricted License).

Level 4 gives you more of a free rein, but you are still unable to play HUMs (such as strong pass systems) and various other things. The EBU website has the details.
Re:Bridge conventions by Shimbo · 2004-10-14 04:31 · Score: 1

In UK rules the only restriction on bidding is that you can't bid one no-trump if you hold a singleton.

That's not true; in the UK we get laughed at by our European colleagues for having a conventions booklet over 50 pages long. And as it happens, you have been allowed to open 1NT with a singleton for a couple of years, even at level 3.

Stored state: Is that allowed? by peterpi · 2004-10-14 03:08 · Score: 1

Isn't storing state from one game to the next a form of cheating? I always assumed the process should be stateless.

Don't you see the beauty? by Q2Serpent · 2004-10-14 03:12 · Score: 3, Interesting

They "cheated", and the other guy didn't, so they won big! Wasn't that the whole premise?

-Serpent

Re:Don't you see the beauty? by kisrael · 2004-10-14 03:45 · Score: 4, Insightful

They "cheated", and the other guy didn't, so they won big! Wasn't that the whole premise?

Well, they kind of went for a win on the "metalevel", utilizing the circumstances of the competition rather than solving the originally stated issue in an abstract way. On the one hand that's cool because evolution can work like that sometimes, but on the other hand, it really isn't answering the original question any more. (the question is "what's probably the best strategy for any given individual in Prisoner's Dilemna" and they changed the question to "how can we get some individuals to be super-players with the way this prisoner's dilemna simulator is setup"

--
SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
Re:Don't you see the beauty? by gadget+junkie · 2004-10-14 05:37 · Score: 2, Interesting

"the question is "what's probably the best strategy for any given individual in Prisoner's Dilemna" and they changed the question to "how can we get some individuals to be super-players with the way this prisoner's dilemna simulator is setup""

. How very true.I doubt that this solution is applicable in real life, if only for the fact that one of the assumptions would be that a subset of a winning team consistently and repeatably wants to be defeated. Mother nature took care of those long ago.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Don't you see the beauty? by kisrael · 2004-10-14 06:02 · Score: 2, Interesting

subset of a winning team consistently and repeatably wants to be defeated. Mother nature took care of those long ago.

Well, it's an interesting philisophical point. It depends on how you define "win" and "lose"...certainly some specices have formed partnerships with other, often larger species, and if you define "win" as something besides "just survive", they might be seen as subjugating themselves to the other creature, so that the partnership prospers, even if their life doesn't seem that swell.

In other words, nature is more complex than the Prisoner's Dilemna, and sometimes ends up finding stuff more like the "cheaters" solution.

--
SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
Re:Don't you see the beauty? by Anonymous Coward · 2004-10-14 07:03 · Score: 0

You doubt the applicability to real life? Look around you.

Our world is filled with people who accept the short end of the stick because of a belief that they deserve to lose. Why else would so many of the workers at large corporations accept such low wages while certain players (i.e. top execs) get huge compensation packages? Many of them accept the notion that if they work hard, apply themselves, etc etc, that they might one day be one of those top execs. But clearly, most of them cannot attain top dog status because the numbers don't allow it.

I could go on and on about the real world strategies used to convince players to take one for the team, as it were, but I think you can get the general idea.
Re:Don't you see the beauty? by gadget+junkie · 2004-10-14 08:06 · Score: 1

"certainly some specices have formed partnerships with other, often larger species, and if you define "win" as something besides "just survive", they might be seen as subjugating themselves to the other creature, so that the partnership prospers, even if their life doesn't seem that swell."

To the extent that the prisoners' dilemma formalizations are adaptable to evolution theory, the superiority of Tit for Tat is still valid in this case. think about how many successive generations it must have taken for symbiosis to appear, evidently all cooperating.

--
"If a boss demands loyalty, give him integrity. But if he demands integrity, give him loyalty." (John Boyd, 1927-1997)
Re:Don't you see the beauty? by kisrael · 2004-10-14 08:27 · Score: 1

I think you're right in that in general the Tit for Tat of "trust but verify" is often a good strategy. My main point was evolution, like the team that made the "cheating" solution, fills niches that you don't know were available and always questions assumptions...I read about attempts to breed circuits, like a minimal number of gate oscillator...they thought they had a real champion, but what they actually had bred was a radio antenna, pulsing in frequency with their PCs RF noise! "Evolution in the lab" unfortunately often leads to surprising bittleness and dependence on lablike conditions of temperature, light, etc...you really need a lab with something approaching the variety of conditions on the earth to get evolution to do your bidding!

--
SO YOU'RE GOING TO DIE: The Comic for Dealing with Death
Re:Don't you see the beauty? by cyberformer · 2004-10-15 04:23 · Score: 1

There is the "one day I'll be at the top" hope, which is usually wrong. But there are also sound practical reasons why people work at extremely low wages. In particular, they know they're likely to get fired if they complain.

Everyone could get together and demand higher wages (ie. a union), but cooperation among many participants is much harder to organize than among two
Re:Don't you see the beauty? by Anonymous Coward · 2004-10-18 00:43 · Score: 0

...defeated by a group of cooperating programs from the University of Southampton. The Prisoner's Dilemma is a game with two players
Well, with the winning group plus the defeated individual, I make it at least three playes. So yes, in a way they "cheated". It's interesting but not exactly startling when you think about it.

Best IPD strategy is a misnomer by Anonymous Coward · 2004-10-14 03:12 · Score: 0

The results depend greatly on the makeup of the tournament. Even with zero cooperating strategies like Southampton's, it's not hard to select a group of rules such that tit-for-tat loses the round robin tournament.

A much more interesting question is whether a population of S/M rules like Southampton's would survive in an evolutionary simulation (i.e. the population of rule x at iteration t depends on how well rule x did at iteration t-1). I suspect not, since the slave rules will die out quickly, leaving the masters with no victims to leech off of.

The whole concept, contrary to what the wired writeup implies, is very uninteresting. It does not occur in nature or strategic economic/political models since there are no unconditional slave strategies in such systems, i.e. they don't survive and are clearly unfit, in the evolutionary sense of the term.

One easy way to find a "winning" strategy for non-evolutionary round robin simulations, given the other participating rules, is to optimize for a weighted average of the other rules' decisions, creating a "compound rule" that simply consults all other strategies and makes its decision based what the others would do. In a sufficiently diverse set of rules, such a winning compound rule can almost always be found.

VK

IDRIPD (I did research on the iterated prisoner's dilemma)

Neural networks by wed128 · 2004-10-14 03:19 · Score: 1

Seems like this is the perfect application for evolutionary programming...

Not a cheat by Carmody · 2004-10-14 03:21 · Score: 2, Interesting

I've been giving talks on the Prisoner's Dilemma for a few years. (No original research, just following the thing and explaining the game to the Youth)

It is kind of an orthodoxy in the literature: Tit for Tat always ties or loses by a little bit, but in tournaments, it is the best strategy.

Well - it ain't. Someone found a way around it. Instead of urging rule-changes to prevent this new challenger, we should all be happy and excited that PD tournaments have just got MORE INTERESTING.

I can't wait to see what happens next - what new programs will emerge to have the advantages of Tit for Tat but also the ability to defend against Master-Slave programs that communicate with each other.

The game has changed - now let's leave it alone and watch.

--
God is real unless declared integer

I must have been unclear.... by StressGuy · 2004-10-14 03:22 · Score: 1

The PD is indeed different than the zero-sum game but they are related. When I said the PD *or* the zero-sum game I wasn't referring to them as synonyms, I was mearly acknowledging thier relationship. I seem to recall that there is a "collective farming" analogy to the zero-sum game.

--
A goal is a dream with a deadline

Re:I must have been unclear.... by Control+Group · 2004-10-14 07:50 · Score: 1

Ah...I see how you meant it, now. Sorry, I lent excess weight to the comma between dilemma and or.
My apologies for what must have seemed incredibly condescending. My bad.

--

Reality has a conservative bias: it conserves mass, energy, momentum...

New Game by Anonymous Coward · 2004-10-14 03:22 · Score: 0

I don't like Tit for Tat. How about Tit for Cock?

OPEN source ! who did it? by earthstar · 2004-10-14 03:24 · Score: 0

who did open source?

thats one of the biggest cooperative things..........did asians do it? OR
The western world?

--

Why does yahoo do this

Am I missing something? by earthstar · 2004-10-14 03:30 · Score: 0

i played the Prisoners Dilemma at , http://www.princeton.edu/~mdaniels/PD/PD.html

I always confessed ,and i kept winning.Albert kept losing.I stopped thegame after albert got 300 years and I got none.

Am i missing something?

--

Why does yahoo do this

i had to reread your post like five times by Trepidity · 2004-10-14 03:34 · Score: 1

I kept reading "The prisoner's dilemma is quite useful in normal life, or at least the drinking game that gives rise to the solution is."

--
10 PRINT CHR$(205.5+RND(1)); : GOTO 10

Easy to defeat by Bill,+Shooter+of+Bul · 2004-10-14 03:36 · Score: 1

Just create one that cheats. You just need to figure out the signature of the other S/M programs, emulate it and exploit it. You know in advance now how its going to respond every time.

--
Well.. maybe. Or Maybe not. But Definitely not sort of.

Beating Tit for Tat is easy by BlueStraggler · 2004-10-14 03:38 · Score: 1

Tit for Tat never wins a Prisoner's Dilemma challenge - it never defects first, and it never defects more than its opponent. Thus it will always tie or lose. It's in the aggregate score over all players combined that it comes out on top.

Furthermore, in the original Prisoner's Dilemma tournament, there were several rather obvious strategies that would have defeated Tit for Tat in aggregate score, but simply were not submitted.

Parent is right, mod up by GFW · 2004-10-14 03:41 · Score: 1

I read all the visible posts looking for one that talked about evolutionary models and the slaves dying out. Parent is the only one I saw, but I might have missed another. Anyway, Parent is right, long term this slavery strategy doesn't work (just as in human history ...) On the compound rule strategy, if you know Southampton is taking part, you write a program that takes the first few moves to see if it's opponent is a Southampton Slave and takes advantage if it is. If not, it then plays a couple of cooperations in a row (to fix any recriminations with a tit-for-tat-like opponent) then plays simple tit-for-tat.

In my Journal by naoursla · 2004-10-14 03:41 · Score: 1

My one and only journal entry is about Prisoner's Dilemma (that I wrote during a class on auctions and game theory).

http://slashdot.org/~naoursla/journal

I always thought... by Singletoned · 2004-10-14 03:45 · Score: 1

I always thought that the Prisoner's Dilemma was whether to pick up the soap when you drop it in the shower.

Tit for Tat?! by Evil+Willow · 2004-10-14 03:46 · Score: 1

My only question is whats Tat and where do I trade it in???

It IS iterated PD by Baldrson · 2004-10-14 03:50 · Score: 1

Yeah, that's not the Prisoner's Dilemma. Or even the Iterated PD

It is perfectly legitimate iterated PD. There is nothing in the rules of iterated PD that says records of moves cannot be kept and acted on. Indeed, that's precisely the idea of the iterated PD as explicitly stated.

--
Seastead this.

Team Scoring by radtea · 2004-10-14 03:50 · Score: 1

From TFA: The result is that Southampton had the top three performers -- but also a load of utter failures at the bottom of the table who sacrificed themselves for the good of the team.

Effectively, the Southampton group entered a team in a competition that scores individuals, so the "winners" were individual programs that had the backing of many other individual programs.

An alternative, and arguably superior, means of scoring entries would be one in which teams were scored rather than individual programs. In this contest the Southampton team would not do nearly so well.

If individual scoring continues we can expect to see entries that will attempt to recognize the Southampton programs and put a spanner in their works. These programs will be known as "labour unions", "socialist agitators" or "liberals".

The Southamptonites might counter with a raft of programs that attempt to identify the spoilers, but in this environment it will be hard to do much against them, as their activity will be fundamentally aimed at wrecking rather than winning.

The whole thing starts to look depressingly familiar, doesn't it? And all because the scoring system can be gamed by allowing one program to exploit the efforts of others...

--
Blasphemy is a human right. Blasphemophobia kills.

It fulfills the purpose. by Baldrson · 2004-10-14 03:54 · Score: 1

You cannot ignore kin selection in any attempt to model game theory if that model is to be relevant at all.

--
Seastead this.

Elmer Fudd by Sleen · 2004-10-14 03:57 · Score: 1

Rabbit Season!
Duck Season!
Rabbit Season!
Duck Season!
Rabbit Season!
Rabbit Season!
Duck Season fire now!!! Pow!!

Pulls beak back around...

hmmm.. by jlramirez · 2004-10-14 03:58 · Score: 1

wonder if it can classify images in these categories: * Hardcore * Softcore * Positions * Misionary * 69 ... ... ...

--
"Me claiming Satan exist is just as valid as you claiming an atom exists" - 1inChrist

Re:hmmm.. by jlramirez · 2004-10-14 04:01 · Score: 1

lol, please forgive. This comment was supposed to be posten on the desktop google news item.

--
"Me claiming Satan exist is just as valid as you claiming an atom exists" - 1inChrist

Nature "cheats" exactly this way. by Baldrson · 2004-10-14 03:59 · Score: 1

Genes are given before life. They affect the way memories are used in exactly the same sense that algorithms given before play in the iterated prisoner's dilemma affect the way records of opponenets' moves are used:

kin selection.

--
Seastead this.

Iterated PD doesn't "contradict" PD by Baldrson · 2004-10-14 04:05 · Score: 1

The iterated PD is a different game than the PD and it is in fact more "in the spirit" of natural science than is the PD where there can be no memory of what opponents have done. Indeed, the way these guys "abused" the iterated PD is "in the spirit" of natural science since it gets to the heart of something all evolutionary biologists now accept: kin selection.

--
Seastead this.

Nothing new by RichDice · 2004-10-14 04:05 · Score: 1

While I have to admit this master/slave approach sounds neat the first time you hear about it, I have to break it to the ./ers: it's been done before. Many times.

I remember The Perl Journal sponsoring a PD competition a few years ago, and I think there was an entry (well, a set of entries) like this. [ And that certainly wasn't new even back then. ] Interesting, in the post-mortem for that competition, the judge was disappointed that no-one had tried winning the competition by rewriting the Perl symbol table and substituting in an "always defect" subroutine for your opponent. (The PD version of Orbital Mind Control Lasers, I suppose.)

So, while this is an interesting and fun little news story, I wish it weren't presented as though "'Tit for Tat' has been defeated! This is a first, a breakthrough!!!!!" It's not.

Cheers,
Richard

Interestingly, that's what the omerta is all about by Perianwyr+Stormcrow · 2004-10-14 04:09 · Score: 2, Insightful

The omerta, or code of silence, is the ideal that the mob works toward when caught. If you get caught, you simply clam up and take whatever's thrown at you as a point of honor. It is instructive, however, that this of course does not apply universally (everyone knows that the mob is rife with snitches.)

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

nested bogosity by nusratt · 2004-10-14 04:13 · Score: 2, Interesting

Everyone is posting about how this is bogus because it's really not the same game as PD.

But even if you don't agree with that view, another important question is:
in what meaningful sense is this new strategy a "victory"?
After all, it achieves "victory" for half of the cooperators, at the cost of sacrificing the other half.

To use one nuclear-war analogy, it's a choice between strategy "A",
where you acquiesce to the death of half of your populace, with the reward that the remaining populace is completely unaffected --
and strategy "B", with the guaranteed result that no one dies but everyone is injured.

Which populace would *you* choose to join on the eve of war?

So, GPL is Tit for Tat by john_lewmanny · 2004-10-14 04:20 · Score: 2, Interesting

And GPL is more or less 'Tit for Tat' in which it will only cooperate with those also cooperating.

Re:So, GPL is Tit for Tat by Daniel_Staal · 2004-10-14 05:07 · Score: 1

I had noticed that. ;)

You'll notice how well it is doing...

--
'Sensible' is a curse word.

The next game by Gyorg_Lavode · 2004-10-14 04:22 · Score: 2, Interesting

I think what will become more interesting is that, now that we know the best lone player (tit for tat) can be defeated by players playing together, can we write our players to look for a player trying to communicate to another player so as to take advantage of it. Can my player play tit for tat against normal players, but, when it sees a S/M player, convince the S/M player to play slave for my gain?

--
I do security

Re:Interestingly, that's what the omerta is all ab by dmaxwell · 2004-10-14 04:23 · Score: 1

If omerta is coupled with benefits like good legal representation and protection from the worst elements of prison society, it could still be quite effective. Of course, it is also understood that the reverse is true and that any snitch who walks is a marked man. It does work out to a variation on iterated PD. If the mob becomes known for screwing over those who play by omerta rules, the number of snitches rises.

selfish by slothman32 · 2004-10-14 04:29 · Score: 1

Isn't defecting sort of like a selfish strategy? Someone might want to be nice and cooperate even though it hurts them. If this really happened and the police were asking me this. I wouldn't just squeal on the other person. I would effectively want a contract that says If I squeal but he does also mine doesn't count. That way I can't lose. If me and my partners agreed that even if we defected we would still use that method it's like a cooperation.

--
Why don't you guys have friends or journals?

Electronic Survivor by sleepingsquirrel · 2004-10-14 04:32 · Score: 1

So basically, we're talking about a computerized version of the TV show Survivor? That could be very interesting. I'm betting a good strategy for that game is to create a small alliance which as a unit plays tit-for-tat against the others.

Creativity is cheating by greg_barton · 2004-10-14 04:36 · Score: 2

For all of those saying, "Isn't this just cheating?" I say this:

Creativity is "just cheating." Creativity is breaking the rules in a novel way that sheds new light on reality. And isn't that the holy grail of AI?

So, was this just cheating? Hell yes. And it's fantastic.

Don't forget the "average" win by Clueless+Moron · 2004-10-14 04:37 · Score: 1

Mutual cooperation (which TfT's get when they meet) lead to each player getting 0.5 years, for an average of 0.5 years.

The "sacrificial" algorithm described in this article means the defector gets 0 years, but the cooperator gets 10 years, for an average of a whopping 5 years. That stinks; the only way to get worse is to do constant mutual defection (for an average of 6 years).

Most PD games I've seen in the past have involved "community", or team, scores. In those situations, TfT has always done well because they get the best possible average score when they meet each other, and the only way to beat TfT is to do chronic defection, which leads to a pretty pyrrhic victory, because your only advantage over the TfT was the one single defection you did at the start.

If I had to live in a PD society, I'd pick a TfT one in a flash!

Missing option... by balaam's+ass · 2004-10-14 04:51 · Score: 5, Insightful

I agree that this defnition of the "Prisoner's Dilemma" is no more than a "meta-game," and not really a problem of philosophical ethics (though it may appear to be to some people.)

What I find disturbing this is the way that the problem is framed presupposes no underlying system of ethics. To wit....
* If you confess and your partner denies taking part in the crime, you go free and your partner goes to prison for five years. * If your partner confesses and you deny participating in the crime, you go to prison for five years and yor partner goes free. * If you both confess you will serve four years each. * If you both deny taking part in the crime, you both go to prison for two years. What do you do?

How about: Tell the truth? Regardless of what your partner does, tell the truth. I find it disturbing that the problem is framed in a way that the actual truth of the matter is irrelevant. (i.e. the problem would be unchanged if I replaced "You and your partner have committed a crime and are caught" with "You and a friend have been accused of a crime which you may or may not have committed.")

I'm not trolling or off-topic here. I'm dead serious. This formulation of the PD is ethically doomed from the get-go, and thus the results of the experiment may be of interest to mathematical game theorists of this particular game, but I find it unwise to think the results make any significant implications about ethics (or anything else for that matter).

Someone will counter that since this is a "Prisoner's" dilemma the person involved must be a criminal with no "ethical" principles other than an interest in self-preservation (i.e. the person is already debased as can contribute nothing meaningful on the subject of ethics! ;-) ). I'd say that just because someone committed a crime does not mean they necessarily want to continue committing crimes...

Re:Missing option... by cnoocy · 2004-10-14 08:59 · Score: 1

I've always thought of the prisoners in the Prisoner's Dilemma as being political prisoners of some sort, whose crime is nebulous, and who cannot trust the authorities to be just. Fair, yes, but not just.

--
This sig is not the Zahir. Lucky for you.
Re:Missing option... by Anon,+a+Moscow+RD · 2004-10-14 10:27 · Score: 1

How did this get modded "Funny"? I didn't laugh.
Re:Missing option... by Anonymous Coward · 2004-10-14 10:28 · Score: 0

I find it unwise to think the results make any significant implications about ethics (or anything else for that matter).
Where the Prisoner's Dilemma does come about is in foreign policy (among other places). Do you build up huge stockpiles of weapons, or do you start disarming in hopes your enemy will do the same?
The Prisoner's Dilemma has absolutely nothing to do with prisoners whatsoever. You're looking for ethics behind a very concrete problem, when indeed the Prisoner's Dilemma is designed to be abstract. It's not a problem; rather, it's a representative for a class of problems.
Re:Missing option... by Mr2001 · 2004-10-14 11:50 · Score: 1

The Prisoner's Dilemma isn't supposed to provide insight into ethics. Obviously the ethical thing to do is to tell the truth. The PD is, however, applicable to fields like economics, which is basically the study of greedy human behavior. It's no good for telling us how people should act, but it's great for predicting how people do act in the real world.

--
Visual IRC: Fast. Powerful. Free.
Re:Missing option... by balaam's+ass · 2004-10-14 14:40 · Score: 1

Thanks, this is a helpful distinction. However, I find that many people nevertheless DO try to use it as a basis of ethics. It fact, I recall a prominent Parade magazine article in which none other than Carl Sagan was arguing precisely that, and was advocating that people base ALL their interpersonal relationships on Tit-for-Tat. (Someone will counter that Sagan was a total quack... I won't try to defend him. :-) )
Re:Missing option... by Anonymous Coward · 2004-10-14 19:05 · Score: 0

Go to your local library and get "The Selfish Gene". It will explain the significance of TfT. It is a big deal if a better strategy is found.
Re:Missing option... by Anonymous Coward · 2004-10-14 19:29 · Score: 0

The success of strategies such as tit-for-tat are an argument that even a completly self-intrested person will tend to behave somewhat ethically in an fair situation such as IPD.

Pavlov, Grim, and the other strategies. by DrRobin · 2004-10-14 05:09 · Score: 4, Interesting

As a microbiologist with interest in evolution, I have followed this field from afar for years. Looking over the results, I was surprised at how relatively poorly "Pavlov" (win-stay lose-shift) did, since it performs so strongly in noisy, evolutionany, versions of the game. [see:
http://www.ncbi.nlm.nih.gov/entrez/query.fc gi?hold ing=npg&cmd=Retrieve&db=PubMed&list_uids=8316296&d opt=Abstract
It was also a bit dismaying to see how well "Grim" (hold a grudge forever) did in both games. In evolutionary versions of the game, Pavlov helps keep down the population of "suckers" (thereby decreasing the food supply for more predatory and parasitic strategies) while still rewarding "provokable" cooperators (thereby increasing the total aggregate "reward" of the ecosystem.
Also, one essential part of the payoff structure that deserves emphasis is that the payoff for cooperating has to be more than half the average of the winner and loser's payoff for defection, else one benefits by simply alternating each turn. This is a little bit like the winners did here, where they got the top spots at the cost of a lower total take for their "team". One real world example of slashdot interest where this might make sense is if you take these losses in order to eliminate your rivals from the game and then reap monopoly benefits once you control the game (not to mention any names...).
Maybe someone who has analyzed the results in more detail could comment on how the various well known strategies fared and why.

Re:Pavlov, Grim, and the other strategies. by Anonymous Coward · 2004-10-14 13:54 · Score: 0

These various "winning" strategies make me wonder if a lot of human behaviors can be explained as winning strategies. What constitutes winning, of course, depends on what one values and also one's perspective (long, short, individual, group).

Just look at all of the odd behaviors that exist in the realm of romatic relationships. Some people are "players" constantly go after and then dumping the best catches around. Others are constantly getting dumped or are constantly getting into relationships with people that treat them like garbage. While a few people find someone right away that is good for them and they settle down.
Re:Pavlov, Grim, and the other strategies. by DrRobin · 2004-10-14 14:34 · Score: 1

I was hoping someone who knows more about this than I would step in, but on the off chance that anyone is still reading this sub-thread, I think there is a deep real world principle at stake. In noisy, evolutionary simulations of the IPD, what happened in the ref I cited was that for some conditions TFT would arise and spread until all-cooperate variants could begin to proliferate under the protection of the cloud of TFT's. Once the all-C (what I called, a bit too harshly, "suckers") population reached a high enough level, all-defect (predators) swept through and caused the whole "ecosystem" to collapse into a low-payoff all-defect state. Under some conditions of noise, mutation rate, and payoff matrix, though, the "pavlov" strategy (where you make the same move if it worked well for you the last time with that partner, and make the opposite move if it did not) would arise and become stable at high overall payoff over very long simulations, much less susceptible to crashing into all-defect. "Pavlov" is perfectly willing to punish defectors, reward rational cooperates, but take advantage of naive cooperators, which made it much more stable in the long run. I think the difference between this and the recent contest is that the contest had a fixed set of entrants that could not be invaded by new all-C or all-D mutants. Without trying to oversimplify a deep subject, I would suggest an expansion of the classical population biology notions of ecological niches to include producers, predators, parasites, and policers. There is also a whole rich literature in widening the basic PD schema to include things like partial cooperation and "reputaton" building. Maybe the IPD would be fun for a real slashdot network game...

Re:Interestingly, that's what the omerta is all ab by mdfst13 · 2004-10-14 05:13 · Score: 1

No, there are four situations in the prisoner's dilemma:

Neither confess
I confess, you don't
You confess, I don't
Both confess

The way that the dilemma works is that each of us is personally better off if we confess. Thus, the greedy solution is that we both confess. The point of omerta is to move us into the neither confess section. In neither confess, both are better off than both confess.

The way that this program works is that it picks one to confess and the other not to confess. This is obviously stupid for the one who does not confess. Since this is a program though, you can get the program to behave that way.

It is the rough equivalent of playing chess by controlling both sides. If you do, it is possible for black to checkmate in two moves. However, this would never happen in a real chess match, because no one would actually make the two stupid moves by white.

Re:Stored state: Is that allowed? by Anonymous Coward · 2004-10-14 05:22 · Score: 0

If that were true we'd just have a long string of one-off prisoner's dilemmas, with everyone always defecting.

I, for one, ... by Anonymous Coward · 2004-10-14 05:26 · Score: 0

... would like to welcome our new Southampton master overlords.

real life is more complex than the game by Anonymous Coward · 2004-10-14 05:41 · Score: 0

One of the premise of this dilema is that the defector becomes free when the other cooperates.

In real life, this is not as simple as this, since usually both players are intertwined to each other in some way, such that if one 'dies', then the other free player cannot sustain itself long enough to survive without the 'dead' (or jailed) player.

I've seen many theories behind evolutionary survival that doesn't take this account.

I believe species will only go through this dilemma if there isn't other choices available, .e.g., they didn't have to commit the crime in the first place (Most probably the environment was set up that way by outside factor, such as greed, fun in competition, not enough resources to share, etc.)

The real solution is not to restrict to these choices, but to make sure it doesn't come to these choices in the first place.

The game would be more interesting if we incorporate some type of penalty or reward based on survival of the whole, not just one player.

Surviving diversity is a progress, while surviving of restricted few is doom to total annihilation.

But why be "average" by Anonymous Coward · 2004-10-14 05:55 · Score: 0

TFT is the equivalent of "No Child Gets Ahead" If you are content to take a random walk in life go for it. Don't be fooled by people telling you the "optimum" score per round is 6pts for cooperation as that only applies to pairs. Without much effort I designed a strategy "casino" that provided randomized "rewards and punishment" for defection. Against someone with a probability of cheating its personal average could go above 3.0 (6.0/2) Naturally this was at the expense of the cheater having to pay back a lot more than was won. On average my strategy did better against cheaters/randomizing algorithms than TfT. Against TfT it was a draw since my algorithm never cheats first. As long as there were cheaters my algorithm would invade any size TFT.

To make the invasion work my randomizer caused some to "Win Big" These big winners would spawn generations of cheaters usually at the expense of TFT which the cheaters bring the average down on quickly. Thus increasing my strategy's advantage over TFT.

This applies to the real world. Casinos and lotteries use the same principle. Give someone a 1:40,000,000 chance of becoming the richest person around. If there are 300,000,000 people in the population it is worth the negative expectation of buying the lotto ticket. Your odds of being the richest by not playing is 1:300,000,000 Your odds of being the richest by playing 1:40,000,000. Of course, the house, as always is the big winner.

Getting back to IPD, the "secret" to beating TfT is to encourage as much cheating as possible in the general population. By making your personal strategy cooperate until cheated upon you will always be doing as-well or better on average than any TfT player.

Enough Grim in the simulation will help keep the TFT from being eliminated by the cheaters early on. Grim's survival was always poor as it could never re-attained cooperation with the occasional cheaters which is needed for long-term viability.

Key element - guaranteed draw strategy by BobaFett · 2004-10-14 06:04 · Score: 3, Interesting

While entering a team into a tournament scored for individuals and then sacrificing the whole team for one player is by no means a new idea, what makes it so remarkably successfull here is the existance of a "guaranteed draw" strategy (in this case, always defect). The best individual response to "always defect" is to defect yourself, anything else is a suicide, so if you always defect you can force a draw. Then all your team loses to one team member, and he is the winner.

Compare this with, for example, a chess tournament. You could secretly enter a team and have them all lose to you. While this will keep you from ending last, it won't assure victory, unless all players are roughly equal. If there is a very strong player, he'll win against all your team, yourself included. So you can cheat by redistributing players of comparable strenghts, but at least you can't rob a clear champion of his deserved victory.

This is not the case in the PD tournament. But let's redefine the problem slightly: say, if both sides cooperate, each gets a dollar. If then defect, each pays a dollar. Sucker's reward is paying 10 dollars. Now the Southampton team's strategy boils down to using the tournament to give all their money to one player, while paying a hefty tax in the process. There is a cheaper way to do this, just give all money to one guy outside the tournament :) But now we can gauge any strategy: enter one player or a team, recognize your own team members or not, transfer money between team members as you wish, but can you make money, overall, from this tournament?

That's my point by Perianwyr+Stormcrow · 2004-10-14 06:06 · Score: 1

The omerta insists that someone up the scale is going to be better off if you sacrifice yourself. The implication is that someday it might be you up there, and that you have to rely on the lower folks burning themselves to better your chances.

--

What we call folk wisdom is often no more than a kind of expedient stupidity.-Edward Abbey

Does this really apply to human behaviour? by Thangodin · 2004-10-14 06:18 · Score: 2, Interesting

Optimistic Tit-for-Tat models human behaviour well in a social setting--we give others the benefit of the doubt, and continue to cooperate when others do. When someone violates our trust, we stop trusting them and punish them, but if they act beneficially towards us again, we might be willing to forgive. Most notable, OTFT produces the best overall score, which in competition between social groups is the deciding factor.

The Southampton strategy is dependent upon large numbers of people who will sacrifice all for the good of the other, and not for the good of their community (the collective performance is worse than OTFT.) I can see sacrifice for the greater good, but this is sacrifice to another person without hope of recompensation or an increase in general wellbeing. This does happen in human societies (I think it's happening now in some political systems), but only when the winner has managed to convince the losers that its all in everyone's best interest. What Southampton has added to this mix is a capacity for extreme self-delusion that directly contravenes the economic assumption of informed choice and self-interest. For purposes of economic modelling, Southampton should probably be disqualified, or these assumptions dropped. But this should also tell you something about what could happen to those nice economic models when they hit the messy world of human beings, who for the most part aren't very informed and often work against their own best interests as a result.

The consequence for a societal group running Southhampton against an OTFT group would be the defeat of the Southhampton group every time. Selection works at individual AND group levels. So the challenge should probably be two-tier: run the programs individually against each other, and run them as tribes against each other.

Tit for Tat? by Pan+T.+Hose · 2004-10-14 06:30 · Score: 1

Tit for Tat? I don't know the Bible very well, but wasn't it Eye for Eye, Tit for Tit?

--
Sincerely,
Pan Tarhei Hosé, PhD.
"Homo sum et cogito ergo odi profanum vulgus et libido."

There's a despair.com Demotivator about this by Hentai · 2004-10-14 06:32 · Score: 1

I saw this article, and immediately thought of this poster. Apparently, truer words were never spoken.

--
-Hentai [in vita non pacem est]

"FAIR" is a four letter word... by karlandtanya · 2004-10-14 06:34 · Score: 1

The same set of rules is available for all competitors to read, understand, and apply to their designs.
Whining about whether an innovative strategy is "fair" is merely demonstrates the loser's desire to bring the winner down to his own level of incompetence. Rather than to improve his own game.
If the rules allow it, it's legal. Fair has nothing to do with it.
The players acted within the rules.
Some of them won the game; others lost. That's why it's called a competition.
Some losers didn't understand the rules as well as the winners did.
That's why they're losers. They didn't win.
Sucks to lose, doesn't it? Now, go home and apply what you learned. Maybe next time you can win.

Just to satisfy your apparent desire to derive social and perhaps even moral implications from this competition, I suggest you read some of the other comments to the article. You will note that in this case, we can apply lessons from the simulation to everyday life. The concept here is called "taking one for the team".

--
"Reality is that which, when you stop believing in it, it doesn't go away." - Philip K. Dick

Re:"FAIR" is a four letter word... by Snowspinner · 2004-10-23 05:04 · Score: 1

"Fair" also has nothing to do with this, which is a mathematical simulation, not a sports competition.

--
Philip Sandifer's academic website

Kobiyashi Maru by Bugmaster · 2004-10-14 06:35 · Score: 2, Insightful

So, essentially, the winning program(s) hacked (or exploited, if you prefer) the game in order to win ? That's pretty clever, but does this count as a true victory ? It's sort of like what Captain Kirk did to rig his Kobiyashi Maru scenario. Sure, he won on a technicality, but in doing so he missed the whole point of the challenge.

--
>|<*:=

Re:Kobiyashi Maru by Anonymous Coward · 2004-10-14 12:08 · Score: 0

I though that was the whole point of the contest. He was clever enough to win an unwinnable battle.

You're full of it! by Anonymous Coward · 2004-10-14 07:22 · Score: 0

NOS ES TURBATUS UT RECOLLIGO VICTUS PRO NOSTRUM CANI... It makes us angry to collect the food for our dogs???????????

best strategy ever by Anonymous Coward · 2004-10-14 07:24 · Score: 0

always defect, your bound to win or tie.

Re:Interestingly, that's what the omerta is all ab by Jardine · 2004-10-14 07:26 · Score: 1

If you get caught, you simply clam up and take whatever's thrown at you as a point of honor.

I figured that was less to do with honour and more to do with not getting killed.

Problem with averages by rawb · 2004-10-14 07:40 · Score: 1

Something slightly different would work much better, I believe, at least if our goal is to maximize total for your group instead of just one instantiation of your program.

Specifically... it's nice to have 100 slaves starve themselves and work for you, where you get 5 and they get 0. If you intend on letting your slaves starve and then die, then you've just done very very well for yourself.

If you and your slave cooperated, however, you'd each get 3, a total of 6, which is greater than 5 of course.

So perhaps the best thing to do is cooperate with your slaves for a total of 6, and then take some away for yourself. Then you don't have to waste 2 going to find more slaves ;)

Or... use propaganda to convince your slaves to give you 2 of their 3 from the cooperation because you need to use those 2 to keep them safe from terrorists. Or call it "taxes", or tithe, or some such.

In this way you'll be doing right by the group while still being an exploitative bastard of an overlord, and your people will love you for it. Don't outright screw your group by not cooperating... cooperate with them! Then take the benefits.

sed for the rescue by alexo · 2004-10-14 07:46 · Score: 2, Funny

> Hell, I've got a whole warehouse full of Tat. Where do I exchange it for some tit?

Easy:

s/wa/who

Where are the tits? by Anonymous Coward · 2004-10-14 07:48 · Score: 0

And for that matter, what is a 'tat'? Whatever a 'tat' is, I doubt anyone will trade its own tits to get it.

T4T that good? by Matrix14 · 2004-10-14 07:51 · Score: 1

Is this that important? I know I wrote a Prisoner's Dilemma bot back in 2000 that could consistently beat Tit-for-Tat, and my classmates wrote bots that consistently beat mine. I had thought Tit-for-Tat was just the champion of a particularly famous PD tournament.

Old Tactic by biomass · 2004-10-14 08:26 · Score: 1

Had a friend who took 2nd in a core war competion
due to someone else who took this kind of statagy.
You could submit as many candidates as you like.
The winner submitted a bunch who were identical
except for one which had the "queen bee" flag set.
The drones could determine when they were battling
against the "queen bee", and if so, would go belly up.

Tit for Tat is subobtimal, but robust. by Anonymous Coward · 2004-10-14 08:31 · Score: 0

Tit for Tat tends not to be an optimal strategy in most populations. But it is a very robust one; it works well in many different populations. What has happened previous times algorithms designed specifically to beat tit-for-tat were introduced was that they weren't as good against other such algorithms as tit-for-tat was (they weren't as robust), and tit-for-tat ended up winning anyway.

Tits for Tattoos by tepples · 2004-10-14 08:44 · Score: 1

where is this "tat" and how can I get me some?

If you want to attract women who admire tattoos ("tit for tat"), start here and then replace fort wayne IN with your city and state/province. However, to satisfy most employers' policies, make sure to get your tat where a work uniform would ordinarily cover it up.

"Yeah, he's getting ink done" by tepples · 2004-10-14 08:50 · Score: 1

You can get a tat (short for tattoo) by opening the yellow pages and turning to "tattooing". Then hang out in bars, show women your new tat, and if you're lucky, they may show you their...

Just make sure you get a 13 and not a 31.

Electoral College by tepples · 2004-10-14 08:54 · Score: 1

The NBA Finals isn't scored by total points. Nor is the World Series or the US Presidential election. Given a population of Hawks and TfTs, which would win more games?

Re:Electoral College by LtOcelot · 2004-10-14 10:00 · Score: 1

TfTs, since members of the population can work together. Hawks have the advantage only when this option is removed.

CCG Meta-games by Aexia · 2004-10-14 09:21 · Score: 1

You see the same thing happen constantly in games like Magic the Gathering. There'll be a "hot new deck" archetype out there dominating the tournaments. Then, out of the blue, someone with an otherwise lousy deck designed specifically to beat that dominant deck takes a few high profile tournaments.

Everyone scrambles to retool their decks to beat the counter-deck and the process begins anew. Usually, a format will eventually reach some sort of equilibrium...

It's basically evolution. Adapt your deck to the environment or lose.

An excellent PD book by kjones692 · 2004-10-14 09:32 · Score: 1

Here is an excellent Prisoner's Dilemma book that deals not only with the Prisoner's Dilemma itself but with all of mathematical game theory, and connects it all to real-world situations in fascinating ways. It really changed the way I look at these things... now I can see Prisoner's Dilemmas everywhere.

For example: Some of my teachers at school favor a lenient teaching style, going easy on actual work, giving troublemakers benefit of the doubt, and so on. This could be seen as "cooperation", where other teachers "defect" by assuming that kids are always trying to do the least work for the most benefit and get away with it, and so are strict and give lots of work. The students "cooperate" by not taking advantage of the curriculum, not cutting corners, doing work thoroughly, not cheating on tests, participating more in class, etc. They "defect" by trying to get away by doing as little work as possible.

Now, the greatest benefit for all involved is mutual cooperation; a lenient teaching style combined with students who won't take advantage of that lenience. However, I've noticed that most teachers (primarily the young, naïve ones) choose to cooperate, and the kids respond by defecting; given an opportunity, they'll slack. Older, more experienced teachers defect by being strict from the start, and students who cooperate by doing work thoughtfully and thoroughly get screwed and fall behind those who cut corners and cheat (the defectors).

It's not an exact parallel, since you can generally tell what the other side is doing, but it's close, and reasonable too.

So the point is, read the book. It's good.

--

Love the Third Amendment?

Re:An excellent PD book by iridium_ionizer · 2004-10-14 14:15 · Score: 1

I think there could be a more applicable dilemma out there that could be almost as simple to explain. Why not do the bread dilemma (BD)? Workers of varying strength obtain bread. A much stronger worker can take away bread (as a thug/boss), but if they take away too much bread then the workers will die and he will have less bread.

Would this predict modern wage disparity or unions?

My thoughts by Anonymous Coward · 2004-10-14 09:54 · Score: 0

I think the human reaction to the way this program behaves is more interesting that the actual contest.

"The program cheated!"
"There's no rule saying it couldn't do what it did!"
etc.

To make things interesting again by shimmin · 2004-10-14 10:00 · Score: 1

One method of discouraging this strategy would be to run the tournament based on a ladder system rather than a round robin: the slaves fall to the bottom, where they will never interact with the masters again.

TFT and M/S in Online Games by Anonymous Coward · 2004-10-14 10:18 · Score: 0

I don't need a perl simulation to evaluate the strategies. Online gaming does it nicely for me.

There are always team killers (defectors) in most online games eg. half-life/far cry/call of duty.

The most successful strategy in discouraging defections in those games is TFT (making team killers sit out after a kill).

The M/S strategy is found in clans, clan games or password protected games. ... and the problem with those games is the very few quantity of players it results in. Makes it harder to play a game.

perfect knowledge? by underbider · 2004-10-14 11:06 · Score: 1

Does it make it easier that there is secrecy? The programs don't know the other programs strategies.

Spatialization by Agilis · 2004-10-14 11:07 · Score: 1

Spatialization vs Round Robin of Iterated PD actualy can yield vastly different outcomes for identical 'worlds'. And my gut feeling is that these colluding strategies would not do nearly as well when they can only interact with a limited number of strategies in it's field of vision.

In our group's research we've been using Spatialized IPD among other projects, to come to look at prejudice reduction, which is a real-world version of fair-play and cooperation version betrayal.

There are some papers here for anyone interested. Some of the results are simple and yet very striking.
http://www.computationalphilosophy.org/

Fast forward to today where... by Aexia · 2004-10-14 12:12 · Score: 1

SO fastforward to what we have today, a strategy where we create 500bazillion smaller firms all for the purpose of going bankraupt, so that one parent firm makes all the money. Huh? Right, that doesn't make any sense.

Billions of people work themselves to death so that the wealthiest top 0.1% can become even wealthier.

I know! It's absurd!

Re:Fast forward to today where... by figment · 2004-10-14 14:52 · Score: 1

Your logic is totally incorrect.
Let's take the example of a large multinational corporation, and a single individual who pretty much is a peon on the corporate structure.
This single person works because he wants to. He makes himself better off by working, because from it, he gains a wage. He must enjoy it to some extent, or at least he must enjoy it more than sitting at home and not working at all. If he preferred to stay home and not get paid, then he would choose that instead. It doesn't matter that the company makes way more money than he does, he is in some manner content (although possibly unhappy) since because he gets paid, he can pay the bills. In summary, working is better than sitting on your ass at home not getting paid.
Contrast to the IPD game. Each of the sacrificial IPD players could have made themselves better off, but instead chose not to. Depending on the payoff tables, it is most likely that the mean payoff (if they had split the prize 80 ways) would be less than if each person had play tit-for-tat. So with the real world example, this would be analgous for someone to go into work for a big multinational corporation for no wage at all.

Depends... by Aexia · 2004-10-14 13:30 · Score: 1

On the duplication algorithm. The worker bees may die but the queen bee might be able to replace them fast enough and with the benefit of pooling resources, grow faster than competing algorithms.

I don't get it... by Anonymous Coward · 2004-10-14 15:57 · Score: 0

Aren't you just hacking the system by the creation of an insurance benefits system, So that if a Slave "takes one for the team" they receive a greater reward then the punishment given by the game. In this case the benefit being the winning of the competition. With the flaw of this system being that the size of such a group would be limited based on the reward given by wining if you limited everything to the game. But this being Ok since everyone else would be playing fair or tit for tat.

Or in laymen's terms creating.

Then how do you get around the problem of trust?

If you have a secret society then doesn't that imply some sort of hierarchy? For example, like Slashdot, something as arbitrary as a number given based on the order in which member join, So that you have some way of working out who the masters and slaves are going to be in a given situation (once you identify them), So what eventually happens is that a low ranking member of the society is going be 100% sure of how a high ranker is going to vote while a high ranker will never be sure of whether a low ranker is going to sell them out (after all they lose either way). With such a clear depreciation of trust doesn't any advantage disappear?

Of course, you could get around this problem by making everything equal with everyone just picking balls out of a hat. But then aren't you just replacing one game with another? How are you suppose to have this game when everyone isn't meant to know each other? And in the ideal situation of total self-interest at play, then why would the losers of such a lottery not take the advantage screwing the high rankers? After all why do the high rankers think lowers are going to play fair even when - by definition - they are already creating. Hell how are you suppose to know if the other player is even in the secret society or that they maybe just some dude who got lucky in the bidding process.

It doesn't make sense.

Mosteller 1987 by Anonymous Coward · 2004-10-14 17:05 · Score: 0

In the second prisoner's dilemma, you hsould not be still at 2/3 since the warden just stated that you and b will be parolled. The stating of the question as who else besides me will be parolled assumes that you are one of the people being parolled. Unless the warden is a real dick, then you should be able to assume that you are included as one of the two that will be making parole.

This is the SAME as Tit for Tat. by Fantastic+Lad · 2004-10-14 19:06 · Score: 1

Here's why. . .

TFT is based on the fundamental principal that cooperative behavior is more effective than selfish behavior, (so long as people have the brains and back-bone to face defectors when they show up.)

Southampton's new program is simply another aspect of cooperative behavior. The apparent superiority stems from the fact that the it is utilizing the energy potential of two deliberate cooperators rather than a single 'smart' program trying to shepherd a selfish program into cooperative behavior. Or to put it another way, Southampton uses two programs which are working in a co-linear fashion toward the same goal, while the original TFT program must work by itself toward that goal with the equivalent of a retarded child who, with effort, can sometimes assist, but who will also put up annoying resistance from time to time.

Saying that Southampton's program 'beats' the original TFT program is like saying, Cooperative Behavior 'trumps' Cooperative Behavior, which curiously is itself an example of self-serving terminology. Cooperators don't think in terms of 'winning' against other units within the same system. (Good guys don't play chess.)

In any case. . . imagine now three programs which are able to work together to maximize energy potentials. Or four. The amazing part is that the harvested energy potentials don't grow in a linear fashion. They have the ability to grow in a geometric curve. Hmm! (One might think of Groklaw, which has the ability to do thousands of hours of legal research on levels which even a large corporation cannot afford to match. Service to Others v.s. Service to Self.)

The logic presented by these rudimentary programs, if built upon, can become the most sensible energy management system that can ultimately exist; that is, a cell-type network of other-serving individuals willing to give whenever needed, however needed, and who can be assured of being given to when in need themselves.

Naturally, such a system requires that every member be two things;

1) Willing to co-operate fully and without selfish tendency. And,

2) Mastering the 'Tat'. --That is, overcoming our social programming so that we can in fact be brutal when we are being brutalized.

Cutting off a selfish defector when their defection becomes apparent is clearly one of the most difficult things in this world for people to do. If we could bring ourselves to not "Turn the Other Cheek" and to not, "Forgive and Forget", and generally not "Die on the Cross," then things like Enron and rogue presidents could have been easily recognized early on, (when the first business irregularities showed up in the energy biz, or when Shrub was blowing up frogs with fire-crackers), and have been prevented before they ever got rolling. If people had a higher level of awareness, stronger back-bones and less indoctrination, then society would work, I think, with far fewer wars and psychopathic leaders.

But anyway, my point is that such systems CAN and do work in our contemporary environment, albeit, in a limited fashion. On the large scale, though. . , things founder partly because the selfish have taken too much territory and the spineless (who are also often the mindless), are so massive in number. As such, a sensible system is going to be log-jammed with issues of self-service and self-destruction.

But then, I largely see this world as a school whereby people learn the hard way that selfishness destroys, and that the only road out comes through dropping those aspects of ourselves and growing aware, strong and courageous.

-FL

Re:This is the SAME as Tit for Tat. by Kowelli · 2004-10-15 07:24 · Score: 1

I deeply agree with you.
Speaking of the game in particular, it is clear that this is an example of cooperative behaviour. These players are simply co-operating(note the term, co-operation, operation in conjunt). The fact that one of them sacrifices for the other does not change this. I wonder what the outcome would be if they don't act this way, but cooperating in the full sense of the term. Or, as you said before, if many of them were cooperating together. The outcome would grow geometrically.
Generically speaking, is amazing how simple and clear truths, that can be seen with a little of introspection and meditation by and unprejudiced mind, are so intensely scrutinized and analyzed by so called modern 'science'. Only to come back later to the same basic truths. This time, confirmed by 'science' :-)

Best regards,

Albert

Is this a virus? by melvster · 2004-10-14 21:01 · Score: 0

A certain code injected into a computer system causes it to become a slave of the original master computer. Then the slave works for the master.

Other work on collusion in games by sjmurdoch · 2004-10-14 21:31 · Score: 1

Last year I co-authored a paper - Covert Channels for Collusion in Online Computer Games (PDF 151K) which dealt with a similar subject. Rather than IPD, it deals with a Connect-4 competition, but many of the ideas are the same.

It also discusses the link between communication in games like this and the concern of covert channels in (generally military) multi-level secure systems. Another interesting area is the link between these types of competitions and voting algorithms, since they may be a good way of designing collusion resistant competitions, or proving that they are impossible.

--
Steven Murdoch.
web: http://www.cl.cam.ac.uk/users/sjm217/

This does not solve anything ! by glacote02 · 2004-10-15 01:39 · Score: 0

A Nash equilibrium is when you have no profitable deviation _if your opponents stick to their strategies_ . It is a folk theorem that in an infinitely repeated game any payoff in the convex hull of rational payoffs (including highest payoffs) is a Nash payoff.

The competition however involves your facing an unknown strategy. What they did was construct a profile of strategies which reach a high payoff, and tweak it to allow them to identify each other. The profile of strategies is not even Nash; they did not prove anything, they just fooled the evaluation method.

Sidenote: notes on the repeated PD.
1) If the game lasts one single step, whatever your opponent does it is always better for you to defect. So you defect and so does your opponent.
2) If the game lasts 10542 steps, you know that at the 10542-th step you and your opponent will both defect. So there is no point in cooperating at stage 10541, so you also both defect. And so on. Thus the only sustainable combination of strategies (= Nash) is to defect from day one.
3) If the game has unknown length or is infinite, then cooperating becomes sustainable. Actually any payoff in the convex hull is a Nash payoff.
This is with perfectly rational players; real world players are not however.

Now have finite state automata play the repeated prisoner's dilemma, and define their "size" as their number of states. A finite state automaton of size n can not "count" up to n+1; then even in the finitely repeated PD, if its length is bigger than both lengthes then cooperation becomes sustainable. The actual result (due to A. Neyman http://ratio.huji.ac.il/dp/dp69.pdf, Th1 p9) is that as soon as _one_ of both players is approximately not larger than the exponential of the length of the game, then any payoff in the convex hull of rational payoffs can be approximated.

Similar tight results for push-back automata or Turing machines of bounded Kolmogorov complexity are unknown yet.

The interesting question is to design a Nash pair of strategies which reach the highest payoff but do so with a limited number of allowed lines of code (= Kolmogorov complexity). This is definetely no trivial problem: even if I claim to always cooperate, once I know that my opponent is dumb it may be easy (?) for me to pretend to cooperate but later betray him nonetheless ...

Genes Don't Have Ethics by cyberformer · 2004-10-15 05:06 · Score: 1

The original prisoners' dilemma is really about honor among thieves: Criminals do better in a cooprating group like the Mafia than as individuals who will sell each other out at the first opportunity.

But when applied to evolutionary theory, it's talking about genes, which on their own don't have any ethics (or other motivation). It shows that organisms which follow a "tit-for-tat" rule are more likley to survive and reproduce than those that follow some other strategy.

Eventually, you get a population where nearly every organism is folowing "tit-for-tat." Add consciousness and empathy, and you get the Golden Rule.

Re:Genes Don't Have Ethics by Anonymous Coward · 2004-10-15 06:18 · Score: 0

No, you don't. What you get is a simple mathematical game, which you claim models gene interaction.

How is it that genes "cooperate"? Can they "defect" against the enzyme copying them? No.

"Add consciousness and empathy"... And just how do you intend to do that?

Generalized Prisoners' Dilemma explores signalling by ahauser · 2004-10-16 09:04 · Score: 1

If you are interested in a more-challenging version of the iterated prisoners' dilemma, one that allows signalling and coalitions without shills, try the generalized PD. The citation is:

Fader, Peter and John R. Hauser (1988), "Implicit Coalitions in a Generalized Prisoner's Dilemma," Journal of Conflict Resolution, Vol. 32, No. 3, (September), 553-582.

The authors are now on the Wharton and MIT Sloan faculties, respectively.

Hybrid Tit for Tat is likely next by harborpirate · 2004-10-18 05:55 · Score: 1

The next logical step in this is likely to be a Tit for Tat program which starts off by imitating the "code" of the Master Slave, identifying itself as the Master. If the other program does not play along after a certain number of moves, it makes a cooperate move and ignores the move that the other program makes, in order to "seed" a Tit for Tat. Then it goes into Tit for Tat mode.

This program would take slight losses in some cases, but would likely come out ahead of Southampton due to its ability to cooperate with Tit for Tat.

In fact, I'm surprised that Southampton didn't choose to start their "code" with cooperations to test for Tit for Tat first. Southamptons entries could make a suspicious defect in response to the cooperate at some point to initiate the code. The "Master" entries could have had a higher score by cooperating with any entries that always cooperate, such as Tit for Tat will as long as it is not betrayed.

--
// harborpirate
// Slashbots off the starboard bow!

Slashdot Mirror

'Tit for Tat' Defeated In Prisoner's Dilemma Challenge

356 comments