Fluctuation andNoiseLetters Vol.0,No.0(2009)000–000 (cid:13)c WorldScientificPublishingCompany 0 1 0 2 Equilibrium Solution to the Lowest Unique Positive Integer Game n a J 7 SEUNGKIBAEKandSEBASTIANBERNHARDSSON Department of Theoretical Physics, Ume˚a University,901 87 Ume˚a, Sweden ] [email protected], [email protected] O C Received(receiveddate) . h Revised(reviseddate) t Accepted (accepted date) a m [ Weaddresstheequilibriumconceptofareverseauctiongamesothatnoonecanenhance theindividualpayoffbyaunilateralchangewhenalltheothersfollowacertainstrategy. 1 Inthis approach the combinatorial possibilitiesto consider become verymuch involved v even for a small number of players, which has hindered a precise analysis in previous 5 works. We herepresentasystematic wayto reachthe solutionforageneral number of 6 players, and show that this game is an example of conflict between the group and the 0 individualinterests. 1 Keywords: Lowestuniquepositiveintegergame;Nashequilibrium;projectionoperator; . socialdilemma 1 0 0 1. Introduction 1 Game theory deals with a situation where each player’s payoff is dependent not : v only on her own behavior but also on other players’. Players will generally have i X conflictinginterestswitheachother,buttheymaygetbetteroffbyinteractingwith r others in various ways [1]. For example, in the minority game [2], a player gets a a point when her choice is minor among players. Thus, even if a choice happened to be successful at previous rounds, it is hard to remain minor with that choice as more and more players will also select it. Although it is not possible for all the playersto win together,the averageprobabilityof winning canbecome enhancedif they behave in proper ways. Recently, an extreme version of the minority game was proposed as a special caseofthereverseauction,whichiscalledthelowest uniquepositive integer(LUPI) game [3]. This game has n >2 players, and each of them may choose one integer from 1 to n. A player wins a point by choosing the lowest unique number. That means, simply choosing the number 1 is not a good strategy since it is very likely to be chosen by other players, too. In Ref. [4], this LUPI game was analyzed in terms of the Nash equilibrium (NE): Let us imagine that every player participates Equilibrium Solution to the LUPI game in a game with her own strategy, which is publicly known to the other players. If no player can improve her payoff by changing her strategy alone, the set of the players’ strategies is called a NE. Even though this is an adequate framework to analyzeagame,theactualenumerationofcombinatorialpossibilitiesbecomeshard to manage as the number of players increases. Although Ref. [4] tried to find an expressionfor generaln, many of the possible cases were missing in the probability calculations, even for n=4. According to the suggestedformula in Ref. [4], all the loosingbidsshouldbeabovethewinningnumberorallshouldbebelowthewinning number. Furthermore, if they are below the winning number, they should all be on the same number. However, there exist many more cases that would generate the same winningnumber. We henceforthpresentawayto takeallthe possibilities into account correctly for general n and analyze the results thereby obtained. 2. Equilibrium Suppose that each player draws numbers from her own probability distribution, which we will denote as her strategy. As explained above, we will find a NE in this strategy space. Among several different NE’s in the LUPI game, we will be concerned with one that can be prevalent among these n players, as studied in Ref. [4]. This approach actually corresponds to the stability concept in the evo- lutionary game theory [5]. Let us imagine that n 1 people are using a certain − strategy, ~p = (p ,p ,...p ), where p means the probability to choose number i. 1 2 n i The idea is to choose ~p so that the chance of winning for the nth player becomes independent of her strategy. We consider all the possible cases that those n 1 − people can make. These can be symbolically represented by n n−1 Z = p , (1) 0 i ! i=1 X fromwhichallthecombinationscanbeobtainedasseparateterms. Forexample,in athree-playersgamewegetZ =(p +p +p )2 =p2+p2+p3+2p p +2p p +2p p , 0 1 2 3 1 2 3 1 2 1 3 2 3 where the first three terms on the right-hand side mean that players 1 and 2 chose the same number 1, 2 or 3, respectively. The other three terms correspond to the caseswhenplayers1or2havechosentwodifferentnumbers. Together,these terms representallthepossibleoutcomesofthegame(asseenfromthethirdplayer)since they add up to the probability 1 due to the normalization condition np =1. i i Now, starting from Eq. (1), we extract all the cases where there is a winner, P among the n 1 players, at number 1. These are characterized by all the terms − which contain only one p . In other words, if we write Eq. (1) as a polynomial in 1 p : 1 Z =A(p ,...,p )+B(p ,...,p )p +C(p ,...,p )p2+..., 0 2 n 2 n 1 2 n 1 those cases are expressed by the second term and one can easily find it by B = Baek and Bernhardsson dZ /dp . In other words, we find the cases with no winner at number 1 as 0 1|p1=0 dZ 0 Z = Z p 1 0 1 − dp 1(cid:12)p1=0 (cid:12) n n(cid:12)(cid:12)−1 n n−2 = p (n 1)p p . i 1 i ! − − i=1 i6=1 X X Ifwearefurthertoexcludethecaseswherethereisawinneratnumber2,weapply the same operation on Z as follows: 1 dZ 1 Z = Z p 2 1 2 − dp 2(cid:12)p2=0 (cid:12) n n(cid:12)(cid:12)−1 n n−2 = p (n 1)p p i 1 i ! − − i=1 i6=1 X X n−2 n−3 n n (n 1)p p +(n 1)(n 2)p p p . 2 i 1 2 i − − − − i6=2 i6=1,2 X X Generalizingthisidea,wecanfindcaseswherethereisnowinneruptoanarbitrary number i by the following recursion relation: dZ i−1 Z =Z p . (2) i i−1 i − dp i (cid:12)pi=0 (cid:12) (cid:12) Formally, we may define Li as the linearity pro(cid:12)jection operator with respect to pi. Let us denote a generic polynomial of p ,...,p as Q, which may have an index 1 n to indicate more than one polynomial. The operator L can be algebraically repre- i sentedasL [Q]=p E D [Q],whereE iseliminationofp bysubstitutingzeroand i i i i i i · D is differentiation with respect to p . In addition, a and b mean some coefficients i i independent of p . We then have the following relations for these operators: i L [apn]=ap δ • i i i 1,n L L [Q]=L [Q] i i i • · L L [Q]=L L [Q] i j j i • · · L [aQ +bQ ]=aL [Q ]+bL [Q ] i 1 2 i 1 i 2 • E L [Q]=L E [Q] j i i j • · · L [Z ]=0, if i j i j • ≤ where δ is the Kronecker delta. It immediately follows that 1,n k Z = (1 L ) [Z ]. (3) k i 0 ( − ) i=1 Y Equilibrium Solution to the LUPI game Let us now calculate the probability for the nth player to win the LUPI game by choosing number i. Then within the other n 1 people, there should be no winner − uptoi 1andnooneshouldchoosethenumberi. The lastconditioncanbeeasily − imposedbythesametrick. Thatis,wesimplysubstitutezeroforp toexcludeevery i case where p appears. Therefore, the probability of our concern is simply written i asc (p~)=E [Z ]= i−1(1 L ) E [Z ]andthenthplayer’sexpectedpayoff i i i−1 j=1 − j · i 0 is n o Q n W(~π;~p)= c (p~)π , i i i=1 X where ~π =(π ,π ,...,π ) is the nth player’s strategy. 1 2 n For example, n−1 n c = Z = p =(1 p )n−1, (4) 1 0|p1=0 i − 1 i6=1 X n−1 n−2 n n c = Z = p (n 1)p p 2 1|p2=0 i − − 1 i i6=2 i6=1,2 X X = (1 p )n−1 (n 1)p (1 p p )n−2, (5) 2 1 1 2 − − − − − c = Z 3 2|p3=0 n−1 n−2 n n = p (n 1)p p i 1 i − − i6=3 i6=1,3 X X n−2 n−3 n n (n 1)p p +(n 1)(n 2)p p p 1 i 1 2 i − − − − i6=2,3 i6=1,2,3 X X = (1 p )n−1 (n 1)p (1 p p )n−2 3 1 1 3 − − − − − (n 1)p (1 p p )n−2 2 2 3 − − − − + (n 1)(n 2)p p (1 p p p )n−3. (6) 1 2 1 2 3 − − − − − From the normalization condition, p = 1 n−1p , indeed we have only n 1 n − i=1 i − degrees of freedom in choosing p~ so that P n−1 W(~π;p~)= (c c )π +c . i n i n − i=1 X The NE solution, ~p , should satisfy c (p~ ) = constant for every i so that W NE i NE cannot be better by changing ~π. That is, the chance of winning is the same on all numbers. For n = 3, all the above calculation coincides with that presented in Ref. [4]. Note that we haven 1 degreesof freedomand n 1 equations so we can − − solvethese simultaneously. Figure1(a)showssolutions for somen values, obtained by using the Newton method. In Fig. 1(b), we plot its scaled version, considering that the horizontal axis naturally scales with n, and the vertical axis roughly with 1/n. Baek and Bernhardsson 0.3 3 n=9 n=9 (b) 10 10 0.2 11 2 11 pi 12 n pi 12 0.1 1 (a) Uniform 0 0 0 2 4 6 8 10 12 0 0.2 0.4 0.6 0.8 1 i i/n Fig1. (a)NEstrategies,p~NE,withvaryingthenumberofplayers,n. (b)Scaledplotswithrespect ton. Asnincrease,thesolutiondeviates morefromtheuniformsolution(dotted). 100 n=6 1 9 10-1 12 Maximum ci10-2 ∞ n W 0.5 (b) UnifoNrmE (a) Ref. [3] Ref. [4] 10-3 0 0 2 4 6 8 10 12 14 3 6 9 12 i n Fig2. (a)Theexpected chanceofwinningbychoosingi,whentheothern−1playersfollowthe uniform solution defined as pi = 1/n. (b) Comparison of outcomes scaled with n. The uniform solutiongivesanoutcomethatrapidlyconverges tothetheoretical maximumW =1/n(dotted), while the NE outcome remains below that. The other two curves represent outcomes from the strategies suggestedinpreviousworks. Setting c =c from Eqs. (4) and (5), we find 1 2 (1 p )n−1 (1 p )n−1 =(n 1)p (1 p p )n−2. (7) 2 1 1 1 2 − − − − − − Sincetheright-handsideshouldbepositive,wecanconcludethatp <p forevery 2 1 finite n, which means the equilibrium solution cannot be uniform. Its implication is remarkable: Let us imagine that everyoneemploys the same strategy p~. In other words, ~π =p~. The question is what the strategy ~p should be in order to maximize the expected payoff, W(p~;~p). For example, for n=3, we find W(p~;p~)=(1 p )(1 p )(1 p ), 1 2 3 − − − which is maximized at a uniform solution, ~p = (1/3,1/3,1/3). This is clearly ∗ differentfromthe NEsolution, ~p =(2√3 3,2 √3,2 √3)[4]. The expression NE − − − is not so simple for n > 3, but solving W(p~;~p) = 0 under p = 1, we still ∇p~ i i see that the uniform solution gives the best outcome to the population on average. P Although weconfirmedthis only for n 12,this resultis plausible sincethe riskof ≤ overlappingchoices will be minimized by the uniform solution. On the other hand, we already know that the equilibrium strategy cannot be uniform. Indeed, if all the people follow ~p = (1/n,...,1/n), a player’s chance of winning rapidly decays ∗ with her chosen number i [Fig. 2(a)]. For example, one may use the formulation abovetoobtainc =e−1(1 e−1)i−1forlargen,whichcoincidewiththemultinomial i − analysisinRef.[3]. Therefore,eachplayerismotivatedtodeviatefromthisuniform Equilibrium Solution to the LUPI game solutionbychoosinglownumbersmorefrequently. Inshort,thereexistsadilemma between the best and the equilibrium strategy. Note that since each play only has one winner, the theoretical upper limit for the gain per play is 1/n. Let us then comparethesethreecases: theequilibriumoutcome,thebestpossibleoutcome,and the theoreticalupperlimit[Fig2(b)]. While the uniformsolutionquicklyconverges to the theoretical maximum, the NE solution remains suboptimal in this plot. For comparison, we also plot outcomes from the strategies in previous works: Ref. [3] suggested a strategy with p = p = 1/2 which gives W(p~,p~) = 21−n, while in 1 2 Ref. [4] the solutionis approximatelywritten as p =2−i for i<n with p =21−n. i n Note that the latter one actually performs better than presented in Ref. [4], since we have taken all the possibilities into consideration. In any case, these become smaller than the outcome from our NE solution. As Eq.(3)indicates, the number oftermsto considerincreasesexponentiallyas n grows, making the exact enumeration of probabilities intractable. Although our currentcomputationalresourcesallowustoarriveonlyupton=12,ourprocedure providesasystematicwaytotakeallthecombinatorialcasesintoaccountforanyn, inprinciple. We believethatthe behaviorsshowninFig.2giveaclue to anticipate p~ in the limit of large n. Specifically, we sketch a way to perform this task: The NE NE solution is obtained when c = c for all i and j. This means that we can i j set c to a certain constant c for i = 1,...,n. The point is that c has only one i 0 i more variable, p , than c , so that we can attack each variable p one by one. i i−1 i For example, Eq. (4) with c = c gives p (c ) = 1 c1/(n−1). Substituting this 1 0 1 0 − 0 value into Eq. (5) and with c = c gives a value for p (c ), and so on. If the 2 0 2 0 correct NE-value is chosen so that c = c then the equilibrium solution, ~p 0 NE NE is obtained. When there exist several real solutions for p then the correct root i can be found using the restriction 0 < p < 1 and the normalization condition i i p < 1. The procedure for a three-players (n = 3) game, where we have j=1 j c =c =28 16√3 0.287 (Fig. 2b), goes as follows: P0 NE − ≈ c =c =(1 p )2 p 0.4643. 1 0 1 1 − ⇒ ≈ c =c =(1 p )2 2p (1 p p ) 2 0 2 1 1 2 − − − − [inserting p ] p 0.2684. 1 2 ⇒ ⇒ ≈ c =c =(1 p )2 2p (1 p p ) 2p (1 p p )+2p p 3 0 3 1 1 3 2 2 3 1 2 − − − − − − − [inserting p and p ] p 0.2673 0.0246i. 1 2 3 ⇒ ⇒ ≈ ± The fact that the last solution is complex indicates that the value of c is not 0 exactly correct. Adding more digits decreases the imaginary part while the real part moves closer to the correctsolution (e.g., c =0.287187gives p =0.267949 0 3 − 5.04978 10−4i). Sincethereisnogeneralformulaforrootsofpolynomialsofdegree × larger than four, it seems that this procedure should be carried out numerically. Figure 3 shows the behavior of c for i = 1 to 4 and n = 9. The solution of p is i i obtained in each step by finding the crossing point c = c (where p has been i 0 i−1 inserted into c ). i To sum up, the benefit of this procedure is that if we know the correct NE- value, c = c , then we can calculate the probabilities one by one up to any i 0 NE Baek and Bernhardsson 1 c 1 c 2 c 3 c 4 c NE ci 0.1 0 0.2 0.4 p i Fig 3. The expressions ci as functions of pi for n = 9 players. A Nash equilibrium solution is obtained when ci = cNE for all i ≤ n. p1 is found by the condition c1 = cNE. Inserting this value into c2 allows for a determination of p2 as the crossing point between c2 and cNE, and so on. OrthevalueofcNE canbeobtainedinthiswaybyfindingc0 whichfulfillsthenormalization n condition Pi=1pi =1. The two dotted horizontal lines represent the interval within which cNE mustlieifweusetheexpressionsuptoc4 insteadofcn. we want or are capable of doing within our computational resources, for any n. In manysituationsitmight,forexample,beenoughtoknowthesolutionuptocertain i above which p ’s are negligible. The method of solving for all i simultaneously i instead results in an all-or-nothing situation. Of course, the down side is that we need to know c in advance. However,recalling that c =W W(p~ ;~p ), NE NE NE NE NE ≡ we see the limiting behavior lim nc = lim nW in Fig. 2(b). Putting n→∞ NE n→∞ NE such a c = c into this procedure may thus yield p~ for large n. The solution 0 NE NE obtained in this way up to i = 4 for n = 9 and c = 0.0985 c (see Fig. 3) is 0 NE ≈ p~=(0.2515,0.2349,0.2087,0.1643,...). Thiscanbecomparedtotheresultobtained fromsolvingforallisimultaneously,whichisp~=(0.2515,0.2348,0.2086,0.1641,...). ThismethodcouldalsobeusedtofindtheactualNEsolution,withoutknowing c , by iterating the procedure until a self-consistent solution is found for all i. NE If this is again beyond our computational power, one could at least determine an interval in which c = c lies. Let’s say that we use expressions for c up to 0 NE i i = j. If c is too small then the normalization gets violated by j p > 1. If 0 i=1 i c is too large then the crossing points result in probabilities that are too small to 0 P add up to one in total, as long as p is a monotonically decreasing function of i. i Thus, the largest possible sum of all n probabilities is obtained if p is uniform for i i j, so the sum j p +(n j)p must be larger or equal to one. The interval ≥ i=1 i − j obtained for n =9 and j =4 is marked by the two horizontal, dotted lines in Fig. P 3 (c = 0.078 and c = 0.146). This interval becomes narrower as higher 0,min 0,max values of i are addressed. Note that when c is changed all the curves c for i > 1 0 i also change since we get new p ’s with j <i to be inserted into c . j i 3. Discussion We have shown a way of taking all the combinatorial possibilities into account for the LUPI (lowestunique positive integer)game andproposedaprocedureto find a Equilibrium Solution to the LUPI game Nashequilibriumforageneralnumberofplayers,n. Itturnsout,however,thatthe number of terms to consider grows exponentially in n, and that the computational capability of our personal computer can only solve the problem up to n = 12 players. To dealwith this problemwe alsosuggestanalternativewayofsolvingfor the equilibrium strategy for each value of i at a time, by inserting an estimate of the average expected payoff of each player. This procedure allows the problem to be solved sequentially instead of solving the whole problem simultaneously. Wealsohavefoundthatauniformsolutiondistributesnearlyoptimaloutcomes equally among all the players. However, this would be driven to a suboptimal equilibrium solution, so we may regard this n-person game as an example of a social dilemma to manifest a conflict between group and individual interests. Acknowledgments S.K.B.acknowledgesthesupportfromtheSwedishResearchCouncilwiththeGrant No. 621-2002-4135. References [1] H.Gintis. Gametheoryevolving:aproblem-centered introductiontomodelingstrategic interaction. Princeton University Press, Princeton, 2000. [2] D. Challet, M. Marsili, and Y.-C. Zhang. Minority games : interacting agents in financial markets. Oxford University Press, Oxford,2005. [3] Q. Zeng, B. R. Davis, and D. Abbott. Reverse auction: the lowest unique positive integer game. Fluct. Noise Lett., 7:L439–L447, 2007. [4] A.P.Flitney. Commentson‘reverseauction:thelowestuniquepositiveintegergame’. Fluct. Noise Lett., 8:C1–C4, 2008. [5] J. W. Weibull. Evolutionary Game Theory. MIT Press, Cambridge, 1995.