Shapley value

Summary

The Shapley value is a solution concept in cooperative game theory. It was named in honor of Lloyd Shapley, who introduced it in 1951 and won the Nobel Memorial Prize in Economic Sciences for it in 2012.[1][2] To each cooperative game it assigns a unique distribution (among the players) of a total surplus generated by the coalition of all players. The Shapley value is characterized by a collection of desirable properties. Hart (1989) provides a survey of the subject.[3][4]

Lloyd Shapley in 2012

The setup is as follows: a coalition of players cooperates, and obtains a certain overall gain from that cooperation. Since some players may contribute more to the coalition than others or may possess different bargaining power (for example threatening to destroy the whole surplus), what final distribution of generated surplus among the players should arise in any particular game? Or phrased differently: how important is each player to the overall cooperation, and what payoff can he or she reasonably expect? The Shapley value provides one possible answer to this question.

For cost-sharing games with concave cost functions, the optimal cost-sharing rule that optimizes the price of anarchy, followed by the price of stability, is precisely the Shapley value cost-sharing rule.[5] (A symmetrical statement is similarly valid for utility-sharing games with convex utility functions.) In mechanism design, this means that the Shapley value solution concept is optimal for these sets of games.

Formal definition edit

Formally, a coalitional game is defined as: There is a set N (of n players) and a function   that maps subsets of players to the real numbers:  , with  , where   denotes the empty set. The function   is called a characteristic function.

The function   has the following meaning: if S is a coalition of players, then  (S), called the worth of coalition S, describes the total expected sum of payoffs the members of   can obtain by cooperation.

The Shapley value is one way to distribute the total gains to the players, assuming that they all collaborate. It is a "fair" distribution in the sense that it is the only distribution with certain desirable properties listed below. According to the Shapley value,[6] the amount that player i is given in a coalitional game   is

 
 

where n is the total number of players and the sum extends over all subsets S of N not containing player i, including the empty set. Also note that   is the binomial coefficient. The formula can be interpreted as follows: imagine the coalition being formed one actor at a time, with each actor demanding their contribution   as a fair compensation, and then for each actor take the average of this contribution over the possible different permutations in which the coalition can be formed.

An alternative equivalent formula for the Shapley value is:

 

where the sum ranges over all   orders   of the players and   is the set of players in   which precede   in the order  . Finally, it can also be expressed as

 

which can be interpreted as

 

In terms of synergy edit

From the characteristic function   one can compute the synergy that each group of players provides. The synergy is the unique function  , such that

 

for any subset   of players. In other words, the 'total value' of the coalition   comes from summing up the synergies of each possible subset of  .

Given a characteristic function  , the synergy function   is calculated via

 

using the Inclusion exclusion principle. In other words, the synergy of coalition   is the value   , which is not already accounted for by its subsets.

The Shapley values are given in terms of the synergy function by[7][8]

 

where the sum is over all subsets   of   that include player  .

This can be interpreted as

 

In other words, the synergy of each coalition is divided equally between all members.

Examples edit

Business example edit

Consider a simplified description of a business. An owner, o, provides crucial capital in the sense that, without him/her, no gains can be obtained. There are m workers w1,...,wm, each of whom contributes an amount p to the total profit. Let

 

The value function for this coalitional game is

 

Computing the Shapley value for this coalition game leads to a value of mp/2 for the owner and p/2 for each one of the m workers.

This can be understood from the perspective of synergy. The synergy function   is

 

so the only coalitions that generate synergy are one-to-one between the owner and any individual worker.

Using the above formula for the Shapley value in terms of   we compute

 

and

 

The result can also be understood from the perspective of averaging over all orders. A given worker joins the coalition after the owner (and therefore contributes p) in half of the orders and thus makes an average contribution of   upon joining. When the owner joins, on average half the workers have already joined, so the owner's average contribution upon joining is  .

Glove game edit

The glove game is a coalitional game where the players have left- and right-hand gloves and the goal is to form pairs. Let

 

where players 1 and 2 have right-hand gloves and player 3 has a left-hand glove.

The value function for this coalitional game is

 

The formula for calculating the Shapley value is

 

where R is an ordering of the players and   is the set of players in N which precede i in the order R.

The following table displays the marginal contributions of Player 1.

 

Observe

 

By a symmetry argument it can be shown that

 

Due to the efficiency axiom, the sum of all the Shapley values is equal to 1, which means that

 

Properties edit

The Shapley value has many desirable properties.

Efficiency edit

The sum of the Shapley values of all agents equals the value of the grand coalition, so that all the gain is distributed among the agents:

 

Proof:  

since   is a telescoping sum and there are |N|! different orderings R.

Symmetry edit

If   and   are two actors who are equivalent in the sense that

 

for every subset   of   which contains neither   nor  , then  .

This property is also called equal treatment of equals.

Linearity edit

If two coalition games described by gain functions   and   are combined, then the distributed gains should correspond to the gains derived from   and the gains derived from  :

 

for every   in  . Also, for any real number  ,

 

for every   in  .

Null player edit

The Shapley value   of a null player   in a game   is zero. A player   is null in   if   for all coalitions   that do not contain  .

Stand-alone test edit

If   is a subadditive set function, i.e.,  , then for each agent  :  .

Similarly, if   is a superadditive set function, i.e.,  , then for each agent  :  .

So, if the cooperation has positive externalities, all agents (weakly) gain, and if it has negative externalities, all agents (weakly) lose.[9]: 147–156 

Anonymity edit

If   and   are two agents, and   is a gain function that is identical to   except that the roles of   and   have been exchanged, then  . This means that the labeling of the agents doesn't play a role in the assignment of their gains.

Marginalism edit

The Shapley value can be defined as a function which uses only the marginal contributions of player   as the arguments.

Characterization edit

The Shapley value not only has desirable properties, it is also the only payment rule satisfying some subset of these properties. For example, it is the only payment rule satisfying the four properties of Efficiency, Symmetry, Linearity and Null player.[10] See[9]: 147–156  for more characterizations.

Aumann–Shapley value edit

In their 1974 book, Lloyd Shapley and Robert Aumann extended the concept of the Shapley value to infinite games (defined with respect to a non-atomic measure), creating the diagonal formula.[11] This was later extended by Jean-François Mertens and Abraham Neyman.

As seen above, the value of an n-person game associates to each player the expectation of his contribution to the worth or the coalition or players before him in a random ordering of all the players. When there are many players and each individual plays only a minor role, the set of all players preceding a given one is heuristically thought as a good sample of the players so that the value of a given infinitesimal player ds around as "his" contribution to the worth of a "perfect" sample of the population of all players.

Symbolically, if v is the coalitional worth function associating to each coalition c measured subset of a measurable set I that can be thought as   without loss of generality.

 

where  denotes the Shapley value of the infinitesimal player ds in the game, tI is a perfect sample of the all-player set I containing a proportion t of all the players, and   is the coalition obtained after ds joins tI. This is the heuristic form of the diagonal formula.

Assuming some regularity of the worth function, for example assuming v can be represented as differentiable function of a non-atomic measure on I, μ,   with density function  , with   (   the characteristic function of c). Under such conditions

 ,

as can be shown by approximating the density by a step function and keeping the proportion t for each level of the density function, and

 

The diagonal formula has then the form developed by Aumann and Shapley (1974)

 

Above μ can be vector valued (as long as the function is defined and differentiable on the range of μ, the above formula makes sense).

In the argument above if the measure contains atoms   is no longer true—this is why the diagonal formula mostly applies to non-atomic games.

Two approaches were deployed to extend this diagonal formula when the function f is no longer differentiable. Mertens goes back to the original formula and takes the derivative after the integral thereby benefiting from the smoothing effect. Neyman took a different approach. Going back to an elementary application of Mertens's approach from Mertens (1980):[12]

 

This works for example for majority games—while the original diagonal formula cannot be used directly. How Mertens further extends this by identifying symmetries that the Shapley value should be invariant upon, and averaging over such symmetries to create further smoothing effect commuting averages with the derivative operation as above.[13] A survey for non atomic value is found in Neyman (2002)[14]

Generalization to coalitions edit

The Shapley value only assigns values to the individual agents. It has been generalized[15] to apply to a group of agents C as,

 

In terms of the synergy function   above, this reads[7][8]

 

where the sum goes over all subsets   of   that contain  .

This formula suggests the interpretation that the Shapley value of a coalition is to be thought of as the standard Shapley value of a single player, if the coalition   is treated like a single player.

Value of a player to another player edit

The Shapley value   was decomposed in[16] into a matrix of values

 

Each value   represents the value of player   to player  . This matrix satisfies

 

i.e. the value of player   to the whole game is the sum of their value to all individual players.

In terms of the synergy   defined above, this reads

 

where the sum goes over all subsets   of   that contain   and  .

This can be interpreted as sum over all subsets that contain players   and  , where for each subset   you

  • take the synergy   of that subset
  • divide it by the number of players in the subset  . Interpret that as the surplus value player   gains from this coalition
  • further divide this by   to get the part of player  's value that's attributed to player  

In other words, the synergy value of each coalition is evenly divided among all   pairs   of players in that coalition, where   generates surplus for  .

In machine learning edit

The Shapley value provides a principled way to explain the predictions of nonlinear models common in the field of machine learning. By interpreting a model trained on a set of features as a value function on a coalition of players, Shapley values provide a natural way to compute which features contribute to a prediction [17] or contribute to the uncertainty of a prediction.[18] This unifies several other methods including Locally Interpretable Model-Agnostic Explanations (LIME),[19] DeepLIFT,[20] and Layer-Wise Relevance Propagation.[21]

See also edit

References edit

  1. ^ Shapley, Lloyd S. (August 21, 1951). "Notes on the n-Person Game -- II: The Value of an n-Person Game" (PDF). Santa Monica, Calif.: RAND Corporation.
  2. ^ Roth, Alvin E., ed. (1988). The Shapley Value: Essays in Honor of Lloyd S. Shapley. Cambridge: Cambridge University Press. doi:10.1017/CBO9780511528446. ISBN 0-521-36177-X.
  3. ^ Hart, Sergiu (1989). "Shapley Value". In Eatwell, J.; Milgate, M.; Newman, P. (eds.). The New Palgrave: Game Theory. Norton. pp. 210–216. doi:10.1007/978-1-349-20181-5_25. ISBN 978-0-333-49537-7.
  4. ^ Hart, Sergiu (May 12, 2016). "A Bibliography of Cooperative Games: Value Theory".
  5. ^ Phillips, Matthew; Marden, Jason R. (July 2018). "Design Tradeoffs in Concave Cost-Sharing Games". IEEE Transactions on Automatic Control. 63 (7): 2242–2247. doi:10.1109/tac.2017.2765299. ISSN 0018-9286. S2CID 45923961.
  6. ^ For a proof of unique existence, see Ichiishi, Tatsuro (1983). Game Theory for Economic Analysis. New York: Academic Press. pp. 118–120. ISBN 0-12-370180-5.
  7. ^ a b Grabisch, Michel (October 1997). "Alternative Representations of Discrete Fuzzy Measures for Decision Making". International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems. 5 (5): 587–607. doi:10.1142/S0218488597000440. ISSN 0218-4885.
  8. ^ a b Grabisch, Michel (1 December 1997). "k-order additive discrete fuzzy measures and their representation". Fuzzy Sets and Systems. 92 (2): 167–189. doi:10.1016/S0165-0114(97)00168-1. ISSN 0165-0114.
  9. ^ a b Herve Moulin (2004). Fair Division and Collective Welfare. Cambridge, Massachusetts: MIT Press. ISBN 9780262134231.
  10. ^ Shapley, Lloyd S. (1953). "A Value for n-person Games". In Kuhn, H. W.; Tucker, A. W. (eds.). Contributions to the Theory of Games. Annals of Mathematical Studies. Vol. 28. Princeton University Press. pp. 307–317. doi:10.1515/9781400881970-018. ISBN 9781400881970.
  11. ^ Aumann, Robert J.; Shapley, Lloyd S. (1974). Values of Non-Atomic Games. Princeton: Princeton Univ. Press. ISBN 0-691-08103-4.
  12. ^ Mertens, Jean-François (1980). "Values and Derivatives". Mathematics of Operations Research. 5 (4): 523–552. doi:10.1287/moor.5.4.523. JSTOR 3689325.
  13. ^ Mertens, Jean-François (1988). "The Shapley Value in the Non Differentiable Case". International Journal of Game Theory. 17 (1): 1–65. doi:10.1007/BF01240834. S2CID 118017018.
  14. ^ Neyman, A., 2002. Value of Games with infinitely many Players, "Handbook of Game Theory with Economic Applications," Handbook of Game Theory with Economic Applications, Elsevier, edition 1, volume 3, number 3, 00. R.J. Aumann & S. Hart (ed.).[1]
  15. ^ Grabisch, Michel; Roubens, Marc (1999). "An axiomatic approach to the concept of interaction among players in cooperative games". International Journal of Game Theory. 28 (4): 547–565. doi:10.1007/s001820050125. S2CID 18033890.
  16. ^ Hausken, Kjell; Mohr, Matthias (2001). "The Value of a Player in n-Person Games". Social Choice and Welfare. 18 (3): 465–83. doi:10.1007/s003550000070. JSTOR 41060209. S2CID 27089088.
  17. ^ Lundberg, Scott M.; Lee, Su-In (2017). "A Unified Approach to Interpreting Model Predictions". Advances in Neural Information Processing Systems. 30: 4765–4774. arXiv:1705.07874. Retrieved 2021-01-30.
  18. ^ Watson, David; O’Hara, Joshua; Tax, Niek; Mudd, Richard; Guy, Ido (2023). "Explaining Predictive Uncertainty with Information Theoretic Shapley" (PDF). Advances in Neural Information Processing Systems. 37. arXiv:2306.05724. Retrieved 2023-12-19.
  19. ^ Ribeiro, Marco Tulio; Singh, Sameer; Guestrin, Carlos (2016-08-13). ""Why Should I Trust You?"". Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY, USA: ACM. pp. 1135–1144. doi:10.1145/2939672.2939778. ISBN 978-1-4503-4232-2.
  20. ^ Shrikumar, Avanti; Greenside, Peyton; Kundaje, Anshul (2017-07-17). "Learning Important Features Through Propagating Activation Differences". PMLR: 3145–3153. ISSN 2640-3498. Retrieved 2021-01-30.
  21. ^ Bach, Sebastian; Binder, Alexander; Montavon, Grégoire; Klauschen, Frederick; Müller, Klaus-Robert; Samek, Wojciech (2015-07-10). Suarez, Oscar Deniz (ed.). "On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation". PLOS ONE. 10 (7). Public Library of Science (PLoS): e0130140. Bibcode:2015PLoSO..1030140B. doi:10.1371/journal.pone.0130140. ISSN 1932-6203. PMC 4498753. PMID 26161953.

Further reading edit

  • Friedman, James W. (1986). Game Theory with Applications to Economics. New York: Oxford University Press. pp. 209–215. ISBN 0-19-503660-3.

External links edit