Running head: The Shapley Value approach to variance decomposition


USING THE SHAPLEY VALUE APPROACH TO VARIANCE DECOMPOSITION IN STRATEGY RESEARCH: DIVERSIFICATION, INTERNATIONALIZATION, AND CORPORATE GROUP EFFECTS ON AFFILIATE PROFITABILITY

DMITRY SHARAPOV*
Imperial College Business School
Tanaka Building, South Kensington Campus
London SW7 2AZ, U.K.
dmitry.sharapov@imperial.ac.uk
* corresponding author

PAUL KATTUMAN
University of Cambridge
Cambridge Judge Business School, Trumpington Street 
Cambridge CB2 1AG, U.K.
p.kattuman@jbs.cam.ac.uk

DIEGO RODRIGUEZ
Universidad Complutense de Madrid and GRIPICO
Facultad de Ciencias Económicas y Empresariales
28223 Pozuelo de Alarcón – Madrid, Spain
drodri@ccee.ucs.es

F. JAVIER VELAZQUEZ
Universidad Complutense de Madrid and GRIPICO
Facultad de Ciencias Económicas y Empresariales
28223 Pozuelo de Alarcón – Madrid, Spain
javel@ccee.ucs.es

Running head: The Shapley Value approach to variance decomposition

Key words: variance decomposition, Shapley Value regression, corporate group, diversification, internationalization 


USING THE SHAPLEY VALUE APPROACH TO VARIANCE DECOMPOSITION IN STRATEGY RESEARCH: DIVERSIFICATION, INTERNATIONALIZATION, AND CORPORATE GROUP EFFECTS ON AFFILIATE PROFITABILITY

Abstract

Research Summary: Variance decomposition methods allow strategy scholars to identify key sources of heterogeneity in firm performance. However, most extant approaches produce estimates that depend on the order in which sources are considered, the ways they are nested, and which sources are treated as fixed or random effects. In this paper, we propose the use of an axiomatically justified, unique, and effective solution to this limitation: the “Shapley Value” approach. We show its effectiveness compared to extant methods using both simulated and real data, and use it to explore how the importance of business group effects varies with group diversification and internationalization in a large, representative sample of European firms. We thus demonstrate the method’s superior accuracy and its usefulness in asking and answering new questions.

Managerial Summary: A key contribution of strategic management research to managerial practice is identifying drivers of firm performance that operate at firm, corporation, industry, and national levels. A branch of this research measures the relative importance of factors at these different levels in producing variation in firm performance, thus helping top managers focus efforts on aspects of their businesses most likely to yield performance differences. However, estimates produced by extant methods are sensitive to method used, and to modelling choices made. This paper proposes the use of the “Shapley Value” approach, which is free from such sensitivity, shows its effectiveness compared to extant methods, and uses it to explore how the importance of factors at the level of the business group varies with group diversification and internationalization.    


Variance decomposition methods have been vital in research on whether sources of heterogeneity in firm performance reside at the business unit, corporation, or industry level (e.g., Schmalensee, 1985; Rumelt, 1991; McGahan & Porter, 1997; Misangyi, Elms, Greckhamer, & Lepine, 2006; Guo, 2017). More recently, these methods have also proven useful in evaluating whether a range of other influences on firm performance actually “matter” in explaining its variation across firms, including country and regional effects (Makino, Isobe, & Chan, 2004; Chan, Makino, & Isobe, 2010; McGahan & Victer, 2010; Ma, Tong, & Fitza, 2013), ownership (Fitza & Tihanyi, 2018), and Chief Executive Officers (Crossland & Hambrick, 2007; Fitza, 2017; Quigley & Graffin, 2017). 
	However, while the methods used for variance decomposition have been improved in a number of ways (for an overview, see Guo, 2017: 1328-1330), most extant approaches share an important limitation: unless the effects under study are orthogonal, the estimates are sensitive to choices regarding the order in which the effects are introduced into the models; which effects are treated as fixed versus random; and which effects are considered to be nested in others.[footnoteRef:1] This implies that these methods produce estimates of the share of variance accounted for by different effects that may be lower- or upper-bound estimates, or anywhere in between, depending on the above choices. [1:  While random effects variance decomposition methods, which we will refer to as Variance Components Analysis (VCA), do not share this order-dependence limitation when used to estimate models without a nesting structure, this is because they make strong assumptions regarding effect distributions. We provide a further discussion of the VCA approach and compare the results that it generates to those produced by other methods below.   ] 

	In this paper, we draw on the statistical literature (Young, 1985; Grömping, 2007; Pintér, 2011) to propose an axiomatically justified, unique, and effective solution to this limitation: the Shapley Value approach. The Shapley Value for a given effect is its contribution to model explanatory power, averaged (with weights) over all possible sequential orders in which the effects could be introduced into the regression model. In the following sections, we introduce the Shapley Value method, before showing its effectiveness compared to currently used approaches, using both a simulation and an empirical application. The latter employs the Shapley Value approach to explore how the importance of corporate group effects changes depending on the extent of the group’s diversification and internationalization, using data on a large and representative sample of European firms. We conclude by discussing how the Shapley Value method could be used in future research, both by improving the reliability of the evidence in long-running debates regarding the sources of heterogeneity in firm performance, as well as by allowing strategy scholars to ask new questions regarding the importance of different effects under different conditions (e.g., Arora et al., 2016).
THE SHAPLEY VALUE METHOD	
A fundamental question in strategic management research is why some businesses are successful while others are not. The approach taken in a continuing literature on business performance variation that started with Schmalensee (1985) is based on the view that the key first step is to identify the levels at which important, performance-relevant factors operate. If for example, controlling for all else, performance differences between businesses are notably greater than performance differences between corporate groups, industries, or countries that they belong to, then focusing attention on understanding what business-level features and strategies mark out high performing businesses from poor performers would be productive. As in that literature, the focus of this paper is on estimating the relative dispersion importances of the levels at which factors affecting performance operate. The dispersion importance of a factor can be estimated based on the extent to which it accounts for variation in business performance in a regression model.[footnoteRef:2] If the regressor sets were all orthogonal to each other, then the overall variance in profitability will decompose exactly between the regressor sets and yield unique estimates of their dispersion importances. But corporate groups commonly span more than one industry, and more than one country, while industries span all countries. Due to this cross-nesting, firm profitability will not decompose exactly or uniquely between these levels [2:  This notion of relative dispersion importance is different from the more common notion of the level importance of a factor, which relates to the extent to which the factor accounts for the expected value of performance. It is thus possible for a regressor to have high level importance, by being influential in determining the expected value of performance, while having low dispersion importance, if it does not vary much within the population being studied. We thank the editor for this point.] 

	The ANOVA and variance components analysis (VCA) approaches used in the early studies (Schmalensee, 1985; Rumelt, 1991) and widely adopted since (e.g., McGahan & Porter, 1997, 2002; Hawawini, Subramanian, & Verdin, 2003), have acknowledged limitations in dealing with this problem. The fixed-effects ANOVA approach assigns the covariance between effects to the effect introduced first into the specification, along with a share of the unique contribution of any omitted but correlated effect to model explanatory power (Grömping, 2007, see Online Appendix 1 for details). Authors have tended to address the resulting indeterminacy by presenting estimates from a number of different paths from the null to the full model, but have generally not reconciled results from different model paths in a consistent manner. Estimated effect contributions thus remain sensitive to the choice of model paths presented.
	While VCA does not share the order-dependence limitation of other methods and does allow for covariances between effects if these are specified in the model, these covariances are assumed to be random. The appropriateness of this strong assumption has been criticized (Guo, 2017), as has the method’s lack of power in finding small but significant effects (Brush & Bromiley, 1997; Hough, 2006). The critique relating to the sum of squares procedure used to estimate the variance components in earlier papers has been overcome through advances in estimation methods. Recent work has taken a random effects approach to variance decomposition using the same maximum likelihood or restricted maximum likelihood estimation techniques as multilevel models (see below), without nesting some effects in others or using fixed effects (Hough, 2006; Marchenko, 2006). However, the reliance on strong assumptions regarding independence and joint normality of the random effects remains. 
Alternative approaches have also been used, including simultaneous equation modeling (Brush, Bromiley, & Hendrickx, 1999), non-parametric estimation (Ruefli & Wiggins, 2003) and multilevel modeling (Hough, 2006; Misangyi et al., 2006; Guo, 2017). These are not exempt from criticism (McGahan & Porter, 2005; Hough, 2006; Guo, 2017). The multilevel approach appears to be the most promising, as it explicitly takes the cross-nested structure of variation in firm performance into account. Examples include observations across time being nested within firms, and firms in turn being cross-nested within both business groups and industries (Misangyi et al., 2006), or firms being nested within corporate groups (Majumdar & Bhattacharjee, 2014). Thus, this approach allows for the estimation of random effects variance components like the VCA method but allows for general multilevel error structures in deriving more accurate variance component estimates (Hough, 2006; Guo, 2017). It can produce estimates of the dispersion importance of both random and fixed effects. However the estimates produced by a cross-nested multilevel approach will also depend on choices of which effects are considered to be nested in others, and which cross-nesting effects (e.g., industry or corporate group) are treated as being random versus fixed (see Misangyi et al., 2006: 580-581).[footnoteRef:3]   [3:  While Misangyi et al. (2006) justify their choices based on the guidance provided by the methodological literature, which states that the cross-nested effect with a larger number of observations be treated as random, and the one with a smaller number of observations as fixed, they accept that there is no theoretical justification for this choice. It is also easy to see how following the methodological guidance could lead to different choices across studies, depending on sample properties and the level of granularity at which certain effects, e.g., industry, are classified.  ] 

	In this paper we focus on a fully rationalized and unique solution to the general problem of fairly allocating model  to regressors that may be correlated.[footnoteRef:4] The conceptual basis of the Shapley Value allocation, as this approach is called in the statistical literature, lies in interpreting the regression as a cooperative game in which the regressors are players and the model  is the collective value that is to be divided fairly among the regressor-players (Pintér, 2011). The underlying principle, first formulated in transferable utility co-operative game theory as a solution to the problem of sharing gains from cooperation among members of a coalition (Shapley, 1953), is to allocate to each member in the coalition, the marginal contributions that she can make to its joint output. If there is any degree of similarity between coalition members in their ability to contribute to output, then any member’s marginal contribution will be sensitive to the set of other members who have already contributed. The Shapley Value method neutralizes this dependence by allocating to each member the expectation of her marginal contribution, taken over all possible sequences in which she can contribute to the coalition. This logic translates in a straightforward way to regression-based decomposition of the variation in firm performance among different (groups of) variables. The Shapley Value allocation of dispersion importance of regressors is general and has been shown to be the only allocation that meets the axiomatic requirements for a proper decomposition (Grömping, 2007).[footnoteRef:5]  Online Appendix 1 presents the technical details of the approach and estimator properties. [4:  When sets of effects are involved,  (adjusted-) corrects  for the degrees of freedom and is the estimator of explained variance that is to be divided up between regressor sets. ]  [5:  Pintér (2011), following Young (1985), proved that in regression games, the solution concept satisfies the following three essential requirements of dispersion importance estimators, if and only if it is the Shapley Value solution: 
Efficiency: The full model  must be decomposed exactly among the regressor variables.
Equal treatment: If two regressors are equivalent in the sense that the full model  is unchanged regardless of which the two are included in the model, then their dispersion importances must be equal. 
Monotonicity: In comparing two regression models, if a regressor variable contributes more to the explanatory power of the first model than to the explanatory power of the second, then its dispersion importance must be higher in the first model than in the second.] 

We now compare the performance of the Shapley Value approach and alternative methods. For this purpose, we first use a simulation with a known data generating process. Next, we proceed to a large-sample empirical application. The latter seeks to answer two questions: how much corporate group effects matter in explaining variance in the profitability of European firms; and of how their importance is affected by the extent of corporate group diversification and internationalization.  
SIMULATION
To evaluate the performance of the Shapley Value approach compared to the ANOVA, multilevel, and VCA methods, we use a Monte Carlo simulation. We generate a simulated data set with industry, corporate group, firm and year effects with defined variance-covariance structures, which together determine the profitabilities of the simulated firms. We then apply the Shapley Value approach, alongside the alternative methods, and compare the results with the known data generating process. Our simulated full model is specified as:

where  is the profitability of corporate group ’s business unit in industry  at time ,  is the overall average profitability,  is the profitability component characterizing industry , and  is the profitability component characterizing corporate group . As described in more detail in Online Appendix 2, these effects are constructed to be correlated with each other. A firm belonging to corporate group  and operating in industry  has the profitability component , itself constructed to be independent of the corporate group and industry effects.  are year-specific profitability components, and  are normally distributed error terms, uncorrelated with any of the other effects. We compare the proportions of total variance allocated to the effects by the Shapley Value approach as well as alternative methods, against the true values following from the parameters used in the data generating process described in Online Appendix 2. 
	We examine estimates from the ANOVA, multilevel, VCA, and Shapley Value approaches. For the ANOVA approach we examine 2 paths from null to full model (in the spirit of McGahan & Porter, 1997), introducing the effects in the orders: a) year, industry, corporate group, firm; and b) year, corporate group, industry, firm. For industry and corporate group effects we present the ANOVA estimates from both paths as upper and lower bound estimates. For the multilevel approach, we follow the Hierarchical Linear Modeling (HLM) approach of Misangyi et al. (2006) but provide estimates for models both when corporate group effects are treated as being random and industry effects as fixed, and vice-versa. The VCA results are produced by estimating a crossed-effects model (one where no effect is considered to be nested in another) using Stata’s xtmixed command. For the Shapley Value approach we find the weighted average of the contribution of each set of effects to explaining model  over all possible orders in which the effects can be introduced.
	The results based on simulated data for 1,000 firms belonging to 500 corporate groups operating in 250 industries over 4 years, with each corporate group operating in two industries, are presented in Online Appendix 2. It can be seen that the Shapley Value approach provides more accurate estimates of effect contribution to variance in profitability than commonly used alternative methods and is the appropriate method for use in variance decomposition.
EMPIRICAL APPLICATION
Research has shown that being an affiliate of a corporate group can have a significant bearing on firm performance (Almeida & Wolfenzon, 2006; Belenzon & Berkovitz, 2010; Bertrand, Mehta, & Mullainathan, 2002; Carney, Gedajlovic, Heugens, Van Essen, & Van Oosterhout, 2011; Chang & Hong, 2000).[footnoteRef:6] A meta-analysis of 141 studies found the relationship between group affiliation and firm performance to be negative and significant on average, with institutional factors and strategic actions at firm and group levels playing important roles in this relationship (Carney et al., 2011). While such meta-analytic estimates are useful to understand the level importance of group affiliation as a driver of firm performance, it is also necessary to consider the dispersion importance of corporate group effects that is independent of the variation in other performance drivers at the firm, industry and country levels. The empirical application of the Shapley Value method that follows demonstrates its usefulness in providing more precise estimates of the importance of diverse sources of performance variation, compared to previously used variance decomposition methods. It also highlights the potential of Shapley Value approach to ask and answer new questions that contribute to our understanding of the implications of corporate group membership for affiliates. We illustrate this by examining how the relative dispersion importance of corporate group effects in explaining firm performance varies with the extent of diversification and internationalization of the group.  [6:  While some prior work has used the terminology of “business groups” or “family business groups”, we follow Belenzon, Berkovitz, & Rios (2013) in using the concept of “corporate groups”, which is codified in European legal, cultural, and economic institutions, and in which groups are defined through equity-ownership ties. ] 

	Diversification and internationalization will be germane to how much corporate groups matter in explaining the variance in affiliate performance. How germane, will depend on the extent to which the potentially homogenizing corporate group influence on group members (e.g., Barney, 1997; Bowman & Helfat, 2001; Caves, 1996; Gulati, Nohria, & Zaheer, 2000; Hitt, Hoskisson, & Kim, 1997; Kali & Sarkar, 2011), will be realized in practice, given the likely costs and difficulties accompanying diversification and internationalization (e.g., Chari, Devaraj, & David, 2008; Crossland & Hambrick, 2007; Hashai, 2015; Levinthal & Wu, 2010; Lu & Beamish, 2004; Nohria & Ghoshal, 1994; Rawley, 2010; Zhou, 2011). To see which influence dominates, we apply the Shapley Value method and compare the resulting estimates of firm, corporate group, industry, country, and year effects against those produced by ANOVA, HLM, and VCA methods. We then use the approach to analyze how the extent of diversification and internationalization determines the importance of corporate group effects in explaining variation in firm performance. 
Data and methods
We chose a setting that enables us to compare the estimates of corporate group effects produced by the Shapley Value approach against those produced by other methods: one that accommodates diversification both within and across corporate group affiliates, as well as corporate group internationalization. Specifically, we study the population of non-financial firms across 25 European countries using the complete version of the Amadeus database maintained by Bureau van Dijk. The database contains balance sheet information and additional data for about 14 million European firms. For comparability with previous studies, we use return on assets (ROA), to measure profitability.[footnoteRef:7] We restrict the sample to firms that provide full information on ROA, industrial classification of activity, and their number of employees, over the period 2002-2006, prior to advent of the global financial crisis in 2007. We drop firms corresponding to two specific industries: NACE codes 7415 (Chain services and non-financial holdings) and 7487 (Other Business activities). This is akin to the exclusion of depositary institutions in research using the Compustat database. [7:  ROA is defined as the ratio between profits before taxes and fixed assets. Hawawini et al. (2003) use value-based measures of performance as well as ROA in their analysis and find the results to be similar.] 

The basic statistical unit in our analysis is the firm: a legal unit that reports its own accounts and is legally distinct from other entities that it owns or is owned by. The firm's country is the country in which it reports accounts. Industries are defined according to the European Statistical Classification of Economic Activities (NACE) at the 4-digit level. 
In order to account for corporate group influences on the performance of firms in a precise manner, we define a corporate group as the set of firms which, though legally distinct, are bound together by ties of majority share ownership. This is considered to be sufficient to provide a clear basis for effective managerial control (OECD, 2005, p. 49). This definition, along with the features of our data and the European legal environment, allow us to accurately account for the extent of corporate group diversification and internationalization. As we have information on every industry that every firm operates in, our measures of diversification capture diversification both among and within corporate group affiliates. In terms of internationalization, a firm wishing to operate outside of its home country must, in the European context, open a subsidiary in the target host country, which will legally be another firm, and will enter in our dataset as such.
We identify an ownership link when a firm has owns more than 50% of the equity of another firm. This threshold is sufficient to enable the apex owning firm to determine corporate policy of owned firms, by choosing appropriate directors if necessary.[footnoteRef:8] This method of delineating corporate groups is in line with research on groups in Europe (Belenzon et al., 2013), and elsewhere (e.g., Cestone & Fumagalli, 2005; Morck, 2005). We impose the additional condition that ownership be above the 50% threshold for at least two years, as we do not want to include transient ties which are unlikely to provide the same kinds of benefits as established ones (Gulati et al., 2000: 208). Using these criteria, we identify 887,443 links between pairs of firms. [8:  This is also possible by controlling more than half the shareholders' voting power indirectly. We are restrictive in requiring controlling ownership.] 

As we require information on the industrial classification of each firm at the level of 4-digit NACE for the repeated sampling procedure that we use, we exclude links when (mainly non-European) subsidiary firms are not in the Amadeus database. We are left with 450,782 links between 628,055 firms. Of these, 28.7% are solely main firms, 66.1% are solely subsidiaries and the remaining 5.1% are simultaneously main (with at least one subsidiary) and subsidiary.
	From these ownership links we identify 179,089 corporate groups. The majority of groups are constituted by a unique link between two firms. The average group consists of 2.5 links, but the biggest group has 1,096 links. Overall, 66.1% of all corporate groups have all their firms in the same country, and larger groups are more likely to be internationalized.
To bring our analysis methods to data, we use a stratified random sampling procedure and draw 100 samples of 5,000 firms each that are representative of the underlying population of firms along country, industry, and size dimensions. The stratification criteria are sourced from the Structural Business Statistics (SBS) database of the Statistical Office of the European Commission (Eurostat), which provides information on the numbers of firms in each European Union country and Norway, classified by industry and size-class. The final stage in our sampling procedure is the selection of firms with and without corporate group membership. Half of each of our samples is drawn from the population of group members. As the tracing of corporate groups is completed before sampling, we are able to identify cases of firms belonging to the same corporate group even when they are linked through firms that are not included in the sample. 
We use the re-sampling approach for three reasons. First, the Amadeus data is not representative of the underlying population of firms; the database is known to be biased towards larger firms. Re-sampling enables us to overcome this bias and to provide estimates that are representative of the underlying population of firms. Second, the large size of the Amadeus database (14 Million firms) makes it computationally infeasible to use all available data to estimate fixed- or random-effect regression models. Third, re-sampling enables us to obtain the bootstrapped sampling distributions, and corresponding standard errors and confidence intervals, as explained below.  
To judge statistical significances of the estimates generated by Shapley Value, ANOVA, HLM, and VCA methods, we need their sampling distributions. With our randomly drawn re-samples being representative of a sufficiently large population, we can estimate bootstrapped confidence intervals. This is preferable to using the asymptotic distributions of the estimators for inference. We report simple 95-percent 2-tailed confidence limits (Efron & Tibshirani, 1993). In terms of procedure, we (re)sample (size = 5000) from the dataset with replacement; calculate the Shapley Value, ANOVA, HLM, and VCA estimates; and repeat this step 100 times, obtaining 100 sets of bootstrapped estimates for each method. These bootstrap statistics are then rank ordered, and the confidence limits are obtained as the 2.5% and 97.5% percentiles. These percentile intervals are nonparametric in that the critical values are obtained by rank, without restrictive assumptions such as normality, and are straightforward to estimate as well as interpret.
RESULTS
Table 1 provides the summary statistics across our 100 (stratified) samples drawn from the database. Each of these samples comprises 5000 firms, spanning 25 countries and 44 two-digit industries. From any corporate group contained in it, each sample contains between 2 and 14 affiliates, operating in up to 6 industries, and domiciled in up to 7 countries.
Tables 1 and 2, and Figure 1 about here
Table 2 presents the results. These results take note of potential participation by firms in more than one industry by including an additional dummy variable to indicate each secondary industry in which each firm operates. This makes our estimates robust to changes in firms’ primary industries[footnoteRef:9] while also correcting for the likely downward bias on industry effects that can arise from only primary industries being reckoned in regression-based variance decomposition (Bowman & Helfat, 2001: 14-15). Corporate effects are likely to be underestimated in samples that include unaffiliated firms (Bowman & Helfat, 2001). We therefore obtain estimates for samples that include only firms belonging to corporate networks.[footnoteRef:10] [9:  A limitation of the AMADEUS data that we use is that information on firm-industry affiliation is provided only for the most recent year of the data. Including secondary industries in our estimation of industry effects ensures that our estimates are not affected by cases in which a firm’s primary industry changed between the first (2002) and last years of our data (2006), as long as the new primary industry was previously in the firm’s set of secondary industries.  ]  [10:  When unaffiliated firms are also included, estimates of both industry and corporate group effects are lower, while firm effects are higher. These results are available on request from the authors.] 

Overall, the baseline Shapley Value results presented in the first column suggest that firm effects constitute the most important component in the variance of firm profitability in Europe, with a mean Shapley Value of 34.1%. Corporate group effects are second in importance, accounting for 12.5% of the variance in firm profitability. Industry effects appear to account for only 4.1% of variance in firm profitability. Both country and year effects are of little importance, though the nulls of no country or year effects at all are ruled out by the confidence intervals. 
The small magnitude of the Shapley Value for year effects (0.1% of the variance in firm profitability – of the same order of magnitude as the year component reported in the literature) is not surprising, given that the period of analysis was relatively stable economically. The Shapley Value estimate of country effects (0.4% of the variance) is lower than that obtained by Makino et al. (2004), whose data on subsidiaries of Japan headquartered firms span a wider and more heterogeneous set of countries (79 in all). Estimates in McGahan & Victer (2010) of home-country effects lie in the range of 2.6%-3.0%, but they find comparably small home country-effects among European firms, falling to 0.3% with a matched Amadeus sample.
Table 2 also presents estimates of effect importance produced by ANOVA, HLM, and VCA approaches. The ANOVA results presented in columns 2 and 3 are the effect contributions to model explanatory power (increase in  due to the introduction of the effect in question) based on the two model paths closest to those used in the existing literature. In both cases, year effects are introduced first into the model, then country effects, while firm effects are introduced last. The results presented in column `ANOVA G, I’ are estimates from a model path in which corporate group effects are introduced third into the model, while industry effects are fourth; column `ANOVA I, G’ corresponds to industry effects being introduced third into the model, before corporate group effects. These results show that the magnitudes of ANOVA estimates of industry and corporate group effects are dependent on the order in which the effects are introduced, with those that are introduced third claiming around 2.5% more of the total variance compared to those introduced fourth. It is interesting to note, as in the simulation results, that the ANOVA estimates for industry and corporate group effects are much larger than those produced by the Shapley Value, HLM, and VCA approaches, while firm effects are estimated to be much smaller. Based on comparison with the Shapley Value and HLM estimates, and the simulation results, it seems likely that this reflects a bias in the ANOVA approach if it is used to consider only some particular model path to the exclusion of others. Among model paths considered here, industry and corporate group effects claim a large share of their covariances with firm effects, leading to the overestimation of the former and the underestimation of the latter. 
The next three columns of Table 2 present HLM results for models in which country (`HLM C’), industry (`HLM I’), or both country and industry (`HLM C&I’) fixed effects are added to models with random firm and corporate group effects.[footnoteRef:11] While the estimates of these alternative models are not very different from one another, firm effect estimates are somewhat higher than those produced by the Shapley Value approach, while industry effects appear to be lower, although the differences are far less pronounced compared to the ANOVA approach. The VCA results presented in the last column of the table are broadly similar to those from the HLM models, but with higher estimates for country effects, lower estimates for corporate group effects, and wider confidence intervals compared to all other methods. The wider confidence intervals are likely due to the samples having to be carefully split (ensuring that group members are kept together in the resulting sub-samples) in order to achieve convergence in the estimation of crossed-effects models with a large number of random effects. This is a further downside of the VCA approach in practice.   [11:  We chose to treat corporate group effects as random effects following both the methodological guidance mentioned in Misangyi et al. (2006) and the results of our simulation, which suggest that doing so produces estimates with lower RMSEs than if we were to treat corporate group effects as fixed and industry effects as random instead. ] 

The above results are the baseline, unrestricted estimates of the importance of corporate group effects. We now proceed to examine whether and how corporate group effects change with the extents of diversification and internationalization of the group. Once again, we compare the results yielded by the Shapley Value approach with those produced by alternative methods.
	We begin by considering the relationship between group size, in terms of the overall number of the group’s affiliates, and the extent of corporate effects. To do this we analyze sub-samples from our data that include corporate groups containing k or more ownership links (affiliates), allowing k to range from 1, the baseline model, to 10, representing the largest groups. 
To explore how corporate group influence on affiliate performance changes with the extent of related diversification, we estimate the importance of corporate effects in groups spanning k1 or more NACE 4-digit industries (k1 ranging from 1 to 10) within the same NACE 2-digit sector. To examine how unrelated diversification affects the influence of corporate groups on firm performance, we consider samples including groups spanning k2 or more 2-digit NACE industries, k2 ranging from 1 to 10. Finally, to evaluate the evidence in relation to internationalization, we consider samples of groups that span at least k3 countries, allowing k3 to range from 1 to 10.
	Figure 1 presents the results from the Shapley Value approach (top-left panel), ANOVA when corporate group effects are introduced before industry effects (top-right panel, corresponding to the `ANOVA G, I’ column in table 3)[footnoteRef:12], HLM with both country and industry fixed-effects (bottom-left panel, corresponding to the sixth column of table 3), and VCA (bottom-right panel), calculated using the above defined counter-cumulative sequences of sub-samples that differ in the extent group of diversification and internationalization. The bootstrapped 95% confidence intervals of these estimates are shown as light-grey lines on the figures. [12:  Results from `ANOVA I, G’ are similar and are available on request from the authors.] 

While the results from the Shapley Value and ANOVA approaches appear to show similar patterns, the substantial differences in estimates of effect magnitudes between ANOVA and other approaches persist in this analysis. The ANOVA estimates are substantially larger in all cases. When compared to the Shapley Value and ANOVA results, those produced by the VCA and HLM approaches seem to be less precise, strikingly so in the HLM case, especially once corporate group span increases beyond 2. Interestingly, while the bootstrapped confidence intervals for the ANOVA, VCA, and Shapley Value approaches are largely symmetric, the HLM confidence intervals are highly skewed – bounded by zero on one side but sometimes very large on the other. Overall, these results suggest that neither the ANOVA nor the HLM or VCA approaches are well-suited for the investigation of contingencies affecting the importance of certain effects in explaining variation in firm performance. On that basis we focus on the Shapley Value results below.  
The proportion of variance in firm profitability accounted for by corporate group effects does not change much with an increasing number of ownership links in the group, suggesting that the results below are indeed driven by diversification and internationalization, rather than simply group scale. The proportion of the variance in firm profitability that is accounted for by corporate group effects falls precipitously as groups come to operate in more than four sub-industries within the same 2-digit sector. Specifically, the Shapley Value of corporate group effects remains roughly stable for groups operating in between one and four related industries, fluctuating between 12.5% and 11.5%, before declining sharply to 3.1% as the number of sub-industries within the 2-digit sector increases from 4 to 7. Operating in an increasing number of unrelated industries is associated with the share of variance in firm profitability accounted for by corporate group effects falling from 12.5% in the unrestricted case to 8.5% for groups spanning at least 5 (unrelated) 2-digit NACE industries. It falls further to 6.3% for groups that span at least 10 unrelated industries. Internationalization is associated with the corporate group effect falling to 10.8% (from 12.5%) for groups spanning at least two countries, and to 9.5% for groups spanning at least 3 countries. Beyond this degree of internationalization, the share of explained by corporate group effects remains stable.
DISCUSSION
This paper has sought to add the Shapley Value method to the toolkit of strategy researchers by establishing its reliability in performing variance decompositions. We highlighted the axiomatic rationale underpinning the method, and the uniqueness and superior properties of its estimates when effects are correlated. We compared it with extant methods and showed its greater accuracy and precision, both with a simulated dataset and in an empirical application. Finally, the Shapley Value approach has also been shown to be better suited for answering novel research questions, such as how the dispersion importance of corporate groups varies with the extent of the group’s diversification and internationalization. We now proceed to discuss these contributions, and to consider the opportunities for future research opened up by the Shapley Value approach. 
	The results from the simulation and the empirical application demonstrate the value of the Shapley Value approach compared to extant variance decomposition methods. The Shapley Value approach provides more accurate measures of effect importance compared to an ANOVA approach, whose estimates are sensitive to the order in which effects are introduced; and also compared to HLM, whose estimates vary more across different samples drawn from the same population - increasingly so as sample size decreases. The Shapley Value estimates are also more accurate and precise than those produced by VCA. 
	Our analysis of how diversification and internationalization affects corporate group influence demonstrates that beyond providing more accurate answers to old research questions, the Shapley Value method enables scholars to address novel research questions that could not be reliably answered using extant methods. In particular, an analysis of the contingencies that influence effect importance - which, as demonstrated, cannot be reliably accomplished using extant methods - has the potential to illuminate the ongoing debate about sources of variation in firm profitability (e.g., Crossland & Hambrick, 2007; Ma et al., 2013; Fitza, 2017; Guo, 2017; Quigley & Graffin, 2017; Fitza & Tihanyi, 2018). The greater reliability of the Shapley Value approach when applied to datasets with fewer observations should also prove an important advantage for researchers seeking to understand novel organizational phenomena such as the drivers of success in crowdfunding campaigns (e.g., Dushnitsky & Fitza, 2018), the factors influencing the growth trajectories of decentralized autonomous organizations (e.g., Hsieh, Vergne, Anderson, Lakhani, & Reitzig, 2018), and the role of artificial intelligence technologies in shaping the performance of firms and markets (Agrawal, Gans, & Goldfarb, 2019). 
 	Our results suggest that corporate groups that span a greater number of industries and countries account for a smaller proportion of the variation in profitability of affiliate firms, particularly so as the extent of related diversification increases. The increase in within-group variance in profitability accompanying related diversification is therefore significantly greater than any increase in between-group variance. This implies that even the best-managed corporate groups struggle to effectively exploit their group-level advantages in a large number of related industries in the face of adjustment and coordination costs. It is particularly noteworthy that the effect of increasing unrelated diversification on the proportion of variation in the profitability of affiliates accounted for by the corporate group is smaller than that associated with increasing related diversification. This suggests that, in the case of increasing unrelated diversification, either within-group heterogeneity in affiliate profitability increases by less, or that between-group heterogeneity (in the cross-section) increases by more. An explanation for the former could relate to the reallocation of capital from better performing affiliates to poorer-performing ones. The latter could result from decisions on the extent of a group’s unrelated diversification sometimes being made by those unable to manage unrelated businesses effectively. Both possibilities have been discussed in the literature on corporate finance focused on potential agency problems in conglomerates (see Maksimovic & Phillips, 2007, for a review). In future research it may be fruitful to examine the types of corporate structures under which either or both issues would be most pronounced. 
Interestingly, the relationship between corporate group effects and the extent of group internationalization is different. In the European setting, the sizes of group effects fall as groups come to span a minimum of two, and then of three countries, but remain largely stable with further internationalization. Whether this finding holds for groups operating across more institutionally heterogeneous countries is a fruitful international management research question.
	The Shapley Value approach has limitations. First, it is computationally expensive. As 2K coalitions can be constituted out of K regressors, the number of regressions required to find the Shapley Value increases exponentially with the number of regressors. Second, the size and power properties of tests of hypotheses using the approach requires more research. Third, while the Shapley Value approach satisfies the essential and desirable property of proper and exact decomposition into non-negative dispersion importances, with any regressor having a non-zero coefficient in the full model always receiving non-zero dispersion importance, the converse of this - that the share allocated to a regressor with coefficient equal to zero in the full regression should be zero - is not satisfied. However, this unsatisfied feature is not a desirable property when there is model uncertainty and potential mediation effects. Finally, regressors with high Shapley Values are natural candidates to prioritize when the objective is to influence performance. But it must be noted that Shapley Values are based on the full set of regressors in the model. The correlation structure among the regressors must therefore not be ignored in drawing practical conclusions on ways to enhance performance.
	In this paper we have sought to show that the Shapley Value method improves the reliability of estimates apportioning heterogeneity in firm performance. It also allows strategy scholars to ask new questions regarding contingencies driving effect importance. We hope that the Shapley Value approach will be a valuable addition to the methodological toolbox of strategic management research. 

Acknowledgements: The authors are grateful for the suggestions and guidance of associate editor Sea-Jin Chang and three anonymous reviewers. The paper was also shaped by feedback from Gerry McNamara and seminar participants at the Academy of Management Annual Meeting 2011, VI Jornadas de Integración Económica 2010, Universidad Complutense de Madrid, and Cambridge Judge Business School.

REFERENCES
Agrawal A, Gans J, Goldfarb A (eds.) 2019. The Economics of Artificial Intelligence: An Agenda. 	University of Chicago Press: Chicago, IL.
Almeida H, Wolfenzon D. 2006. Should business groups be dismantled? The equilibrium costs of 	efficient internal capital markets. Journal of Financial Economics, 79: 99-144.
Arora A, Gittelman M, Kaplan S, Lynch J, Witchell W, Siggelkow N. 2016. Question-based 	innovations in strategy research methods. Strategic Management Journal, 37: 3-9.
Barney JB. 1997. Gaining and sustaining competitive advantage. Addison-Wesley: Reading, MA.
Belenzon S, Berkovitz, T. 2010. Innovation in business groups. Management Science, 56(3): 519-535.
Belenzon S, Berkovitz T, Rios LA. 2013. Capital markets and firm organization: How financial 	development shapes European corporate groups. Management Science, 59(6): 1326-1343.
Bertrand M, Mehta P, Mullainathan, S. 2002. Ferreting out tunneling: An application to Indian 	business groups. Quarterly Journal of Economics, 117: 121-148.
Bowman EH, Helfat CE. 2001. Does corporate strategy matter? Strategic Management Journal, 22: 1–	23.
Brush TH, Bromiley, P. 1997. What does a small corporate effect mean? A variance components 	simulation of corporate and business effects. Strategic Management Journal, 18: 825-835. 
Brush TH, Bromiley P, Hendrickx M. 1999. The relative influence of industry and corporation on 	business segment performance: an alternative estimate. Strategic Management Journal, 20: 519–	547.
Carney M, Gedajlovic ER, Heugens PPMAR, Van Essen M, Van Oosterhout JH. 2011. Business 	group affiliation, performance, context, and strategy: A meta-analysis. Academy of Management 	Journal, 54(3): 437-460.  
Caves RE. 1996. Multinational enterprise and economic analysis (2nd ed.) Harvard University Press: 	Cambridge, MA.
Cestone G, Fumagalli C. 2005. The strategic impact of resource flexibility in business groups. 	RAND Journal of Economics, 36(1): 193-214. 
Chan CM, Makino S, Isobe T. 2010. Does subnational region matter? Foreign affiliate performance 	in the United States and China. Strategic Management Journal, 31: 1226-1243.
Chang SJ, Hong, J. 2000. Economic performance of group-affiliated companies in Korea: 	Intragroup resource sharing and internal business transactions. Academy of Management Journal, 	43(3): 429-448.
Chari MDR, Devaraj S, David P. 2008. The impact of information technology investments and 	diversification strategies on firm performance. Management Science, 54(1): 224-234.
Crossland C, Hambrick DC. 2007. How national systems differ in their constraints on corporate 	executives: a study of CEO effects in three countries. Strategic Management Journal, 28: 767–789.
Dushnitsky G, Fitza M. 2018. Are we missing the platforms for the crowds? Comparing investment 	drivers across multiple crowdfunding platforms. Journal of Business Venturing Insights, 10.
Efron B, Tibshirani RJ. 1993. An introduction to the bootstrap. Chapman & Hall: London, UK.
Fitza M. 2017. How much do CEOs really matter? Reaffirming that the CEO effect is mostly due to 	chance. Strategic Management Journal, 38: 802-811.
Fitza M, Tihaniy L. 2018. How much does ownership form matter? Strategic Management Journal, 38: 	2726-2743.
Grömping U. 2007. Estimators of relative importance in linear regression based on variance 	decomposition. The American Statistician, 61(2): 139-147.
Gulati R, Nohria N, Zaheer A. 2000. Strategic networks. Strategic Management Journal, 21: 203–215.
Guo G. 2017. Demystifying variance in performance: A longitudinal multilevel perspective. Strategic 	Management Journal, 38: 1327–1342.
Hashai N. 2015. Within-industry diversification and firm performance – An S-shaped hypothesis. 	Strategic Management Journal, 36: 1378-1400.
Hawawini G, Subramanian V, Verdin P. 2003. Is performance driven by industry- or firm-specific 	factors? A new look at the evidence. Strategic Management Journal, 24: 1-16.
Hitt MA, Hoskisson RE, Kim H. 1997. International diversification: Effects on innovation and firm 	performance. Academy of Management Journal, 40: 767-798.
Hough JR. 2006. Business segment performance redux: a multilevel approach. Strategic Management 	Journal, 27: 45–61.
Hsieh Y-Y, Vergne J-P, Anderson P, Lakhani K, Reitzig M. 2018. Bitcoin and the rise of 	decentralized autonomous organizations. Journal of Organization Design, 7: 14.
Kali R, Sarkar J. 2011. Diversification and tunneling: Evidence from Indian business groups. Journal 	of Comparative Economics, 39: 349-367. 
Levinthal D, Wu B. 2010. Opportunity costs and non-scale free capabilities: Profit maximization, 	corporate scope and profit margins. Strategic Management Journal, 31: 780-801. 
Lu JW, Beamish PW. 2004. International diversification and firm performance: The S-curve 	hypothesis. Academy of Management Journal, 47(4): 598-609.
Ma X, Tong T, Fitza M. 2013. How much does subnational region matter? Evidence from 	multinational corporations subsidiaries in a large emerging economy. Journal of International 	Business Studies, 44(1): 66-87.
Majumdar SK, Bhattacharjee A. 2014. Firms, markets, and the state: Institutional change and 	manufacturing sector profitability variances in India. Organization Science, 25: 509-528. 
Makino S, Isobe T, Chan CM. 2004. Does country matter? Strategic Management Journal, 25: 1027–	1043.
Maksimovic V, Phillips GM. 2007. Conglomerate firms and internal capital markets. In B. Espen 	Eckbo (Ed.) Handbook of Corporate Finance (pp. 423–77). North-Holland: Amsterdam, NL.
Marchenko Y. 2006. Estimating variance components in Stata. Stata Journal, 6: 1-21.
McGahan AM, Porter ME. 1997. How much does industry matter, really? Strategic Management 	Journal, 18: 15–30.
McGahan AM, Porter ME. 2002. What do we know about variance in accounting profitability? 	Management Science, 48: 834–851.
McGahan AM, Porter ME. 2005. Comment on `Industry, corporate, and segment effects and business performance: a non-parametric approach' by Ruefli and Wiggins. Strategic Management Journal, 26: 873–	880.
McGahan AM, Victer R. 2010. How much does home country matter to corporate profitability? 	Journal of International Business Studies, 41: 142–165.
Misangyi VF, Elms H, Greckhamer T, Lepine JA. 2006. A new perspective on a fundamental debate: 	a multilevel approach to industry, corporate, and business unit effects. Strategic Management 	Journal, 27: 571–590.
Morck R. 2005. How to Eliminate Pyramidal Business Groups: The Double Taxation of Inter-Corporate 	Dividends and Other Incisive Uses of Tax Policy. MIT Press: Cambridge, MA.
Nohria N, Ghoshal S. 1994. Differentiated fit and shared values: Alternatives for managing 	headquarters-subsidiary relations. Strategic Management Journal, 15(6): 491-502.
OECD. 2005. Measuring Globalisation: OECD Handbook on Economic Globalisation Indicators. OECD 	Publishing: Paris, France.
Pintér M. 2011. Regression games. Annals of Operations Research, 186: 263–274.
Quigley TJ, Graffin SD. 2017. Reaffirming the CEO effect is significant and much larger than chance: 	A comment on Fitza (2014). Strategic Management Journal, 38: 793-801.
Rawley E. 2010. Diversification, coordination costs, and organizational rigidity: Evidence from 	microdata. Strategic Management Journal, 31: 873-891.
Ruefli TW, Wiggins RR. 2003. Industry, corporate, and segment effects and business performance: a 	non-parametric approach. Strategic Management Journal, 24: 861–879.
Rumelt RR. 1991. How much does industry matter? Strategic Management Journal, 12: 167–185.
Schmalensee R. 1985. Do markets differ much? American Economic Review, 75(2): 341-351.
Shapley LS. 1953. A value for n-person games. In Contributions to the Theory of Games II, Kuhn 	H, Tucker AW (eds). Annals of Mathematics Studies, 28: 307–317. Princeton University Press: 	Princeton, NJ.
Young HP. 1985. Monotonic solutions of cooperative games. International Journal of Game Theory, 14: 	65–72.
Zhou YM. 2011. Synergy, coordination costs, and diversification choices. Strategic Management Journal, 	32: 624-639.


Running head: The Shapley Value approach to variance decomposition


21

		
Figure 1: Comparison of variance decomposition methods in estimating corporate group effect share of total variance as minimum number of firms/industries/countries which they span increases (Means, 100 samples, bootstrapped 95% confidence intervals in grey)


Table 1: Sample summary statistics (average over 100 samples) 
	
	Mean
	Minimum
	Maximum

	Countries
	25
	25
	25

	2-digit Industries
	44
	44
	44

	3-digit Industries
	192
	183
	200

	4-digit Industries
	415
	400
	435

	Corporate Groups
	1072
	1052
	1089

	Firms in Corporate Groups
	2504
	2495
	2505

	Firms per Corporate Group
	2.11
	2
	14

	Industries per Corporate Group

	2-digits
	1.59
	1
	6

	3-digits
	1.73
	1
	8

	4-digits
	1.8
	1
	9

	Industries per Firm (4-digits)
	2.21
	1
	33

	Countries per Corporate Group
	1.2
	1
	7


Table 2: Contributions to Explanatory Power by Effect Type (Means, bootstrapped 95% confidence intervals in brackets, 100 samples)
	
	Shapley
	ANOVA G, I
	ANOVA I, G
	HLM C
	HLM I
	HLM C&I
	VCA

	Country
	0.36% 
[0.20, 0.50]
	0.84% 
[0.43, 1.33]
	0.84% 
[0.43, 1.33]
	0.43% 
[-0.08, 0.87]
	
	2.21% 
[1.21, 4.55]
	2.30%
[0.00, 17.97]

	Industry
	4.07% 
[3.43, 4.84]
	6.48% 
[5.01, 7.80]
	8.92% 
[7.45, 10.30]
	
	1.62% 
[0.81, 4.14]
	
	2.39%
[1.01, 4.24]

	Corporate Group
	12.46% [11.53, 13.36]
	25.53% [23.44, 27.59]
	23.09% [21.23, 25.31]
	12.58% 
[9.20, 12.91]
	11.29% 
[7.97, 12.26]
	10.84% 
[7.79, 12.16]
	9.61%
[5.57, 13.76]

	Year
	0.06% 
[0.01, 0.13]
	0.04% 
[-0.004, 0.1]
	0.04% 
[-0.004, 0.1]
	0.14% 
[0.02, 0.15]
	0.14% 
[0.02, 0.15]
	0.14% 
[0.02, 0.15] 
	0.08%
[0.00, 0.33]

	Firm
	34.11% [32.75, 36.10]
	18.17% [16.47, 20.18]
	18.17% [16.47, 20.18]
	37.14% [36.69, 42.57]
	37.24% [35.48, 41.28]
	37.11% [35.30, 40.93]
	37.84% [32.94, 42.95]


ONLINE APPENDIX 1: SHAPLEY VALUE TECHNICAL DETAILS AND PROPERTIES
In the context of estimating relative dispersion importances of the levels at which factors affecting firm performance operate, the starting point is a linear regression model of profitability at firm level. The dependent variable  pertains to the profitability of firm  belonging to a corporate group  while active in industry  and located in country , observed in year.  The regression model is:

The effects of interest are represented by the estimated parameter sets (,  ,  ,  ,  ) attached to the sets of binary regressors indicating: the firms , corporate networks (, industries , countries  and years [footnoteRef:13] To estimate dispersion importances of these effects, we need to decompose the total variance in profitability explained by these factors, which is captured by model  - a consistent estimator of the explained variance of the dependent variable in a regression model[footnoteRef:14] - into exact components that are specific to each of the above regressor sets. [13:  The firm effect for firm ,  , is obtained as , where  is a column vector with th element equal to 1 representing firm , 0 otherwise, andis the parameter vector of firm fixed effects in the regression model.  Similarly, the “corporate group effect” for group is  industry effect for industry  is , country effect for country  is ,  and year effect for year  is . It is straightforward to group regressors (for example, firm dummy variables as above) into sets, so that they always enter the model together, or not at all.  In what follows we use `regressor’ to refer to single regressor variables as well as to such regressor sets.  It is also straightforward to expand this model and introduce continuous variables into the specification to capture some of the observed performance relevant factors directly.]  [14:  When sets of effects are involved,  (adjusted-) corrects  for the degrees of freedom and is the estimator of explained variance that is to be divided up between regressor sets. For example, the dispersion importance of firm effects is reckoned as the change in model  when all firm dummy variables are removed from the model jointly. This can be interpreted as the contribution that factors that reside at the level of the firm, as a group, make to the variance of profitability over and above the contributions of other regressor sets.] 

If the regressor sets were all orthogonal to each other, then the overall variance in profitability would decompose exactly between the regressor sets and yield unique estimates of their dispersion importances. But corporate groups commonly span more than one industry, and more than one country, while industries span all countries. Due to this cross-nesting, firm profitability model  will not decompose exactly or uniquely between these levels[footnoteRef:15]  [15:  As  Sum of the variances within each effect set, ,  ,  ,  ,    plus twice the covariances between each pair from the above effect sets. ] 

To fix ideas, consider a simplified example where there are only two levels at work in determining firm profitability: firms, and corporate groups to which firms belong. The regression model is:  , where the firm indicator variables constitute the   matrix , and group indicator variables constitute the square   matrix . The full model  , the share of variance of  that is explained by  and  together, can be written as . If firm indicator variables in  alone are included in the model, the allocation of  to coalition , written as , is that part of the variance of   that can be explained by firms: . As corporate group indicator variables in  are not yet in the model, to the extent there is any correlation between the effects of  and of ,  will include a share that would be attributed to  in any rationalizable allocation based on the full model. The nesting of firms within corporate groups makes the contribution of  to model  sensitive to whether or not  is already in the regression.  If group indicator variables in  now enter the regression model, the (incremental) contribution of  to  is   The nesting of firms within corporate groups - in general terms, multicollinearity between regressors - makes the contribution of each effect set to model  sensitive to whether or not the other effect set is already in the regression model.
The Shapley Value solution concept, applied to this regression game, demands symmetric treatment of both effects sets. Both must have the opportunity to contribute to model at the first position, and hence at the second position as well.  The four possible specifications, based on sequential orderings of the regressor sets on the paths from the intercept-only model to the full model are:  i.a)  is the only regressor set included; i.b)  is added to ; ii.a)  is the only regressor set included; and ii.b)  is added to . The Shapley Value estimate of each coalition’s dispersion importance is the average of its incremental contributions over both orderings. In the simple example above, the dispersion importance of firm effects is a weighted average of the incremental contributions to  of  over (i.a) and (ii.b) specifications. Likewise, the dispersion importance of corporate group effects is the average of the incremental contributions to  of  over (i.b) and (ii.a).  
The basic idea of the Shapley allocation as described above was first proposed with ad-hoc justification by Lindeman, Merenda and Gold (1980), and independently by Kruskal (1987). Chevan and Sutherland (1991) generalized the idea. Stufken (1992) noticed that the approach was equivalent to the Shapley value solution concept in transferable utility cooperative games (Shapley, 1953).  Budescu (1993) and Azen and Budescu (2003) used extensions of essentially the same idea, calling it dominance analysis.  Lipovetsky and Conklin (2001) cast the approach explicitly in the game theoretic form applying the Shapley Value concept. Pintér (2011), following Young (1985), presented axiomatic rationalization of the Shapley value allocation of dispersion importance of regressors (see footnote 4 in the paper).  A unified treatment and a detailed review is offered in Grömping (2007), with discussion of application of Shapley value in other statistical estimation problems.
	To understand the properties of the Shapley Value approach to measuring dispersion importances, it is useful to consider the linear regression model in its general form:[footnoteRef:16]  [16:  This specification can be readily generalised to include interactions and categorical variables. ] 


The Shapley Value allocation, which follows from the minimal essential properties of a dispersion importance measure - efficiency, equal treatment and monotonicity (see footnote 4 in the main text of the paper) - requires that each regressor ,  be permitted to contribute to model at each possible sequential position going from the null model to the full model specification.  This requires allowing all possible subsets of the set of regressors bar  to contribute ahead of , and finding the overall average of incremental contributions of  to model . The number of possible sequential orderings in which regressors are entered into the model, i.e., the number of possible permutations of the regressor indices , is Let a permutation of the regressor indices be denoted .  Let the set of regressors appearing before in the order be denoted .  The value of regressors in coalition  is denoted .  The incremental value of   when it joins coalition  is  . Averaging over all possible incremental values, the Shapley Value dispersion importance of  is given by: 

An equivalent and computationally more economic way to write this, in terms of the incremental value of  to all possible subsets of the full regressor set excluding , is by considering all possible sub-models:

Where  is an element of the power set of the set of all regressors excluding , and  is the cardinality of .  
To see the working of the approach with more clarity, consider a simple example (Grömping, 2007) where there are only two regressors in the model: 
.
The variance equation is: 


The part of the sum of squares of  that is explained by , when it is the first and only regressor, and both  and  are mean-centered, can be written as:


This shows that as the only regressor in the model,  captures the entire covariance term in the variance equation (the middle term above), and in addition, to the extent that  and  are correlated,  captures a part of the unique contribution of the second regressor to model explained sum of squares (the last term above).  The incremental variance explained by  when it joins  is limited to what remains of the full model explained sum of squares: .  Likewise, the incremental contribution to full model sum of squares by  when it joins  when it enters first is: .  
	The Shapley Value dispersion importance of each regressor is the average over the two orders. This resulting structure of the allocation of covariation among the effects is the only one that meets the axiomatic requirements for a proper decomposition of variance. The Shapley value for is:
-
And correspondingly for . Thus, each regressor receives half of the covariance term in the variance equation. Depending on the extent of correlation, the regressor with larger coefficient/variance combination (, gives over a part of its contribution to the regressor with smaller corresponding value.[footnoteRef:17]  [17:  Note that is the coefficient one would obtain for the standardized regressor . ] 

A number of useful implications follow: i) any regressor with a non-zero coefficient in the full regression model will register positive dispersion importance; ii) a regressor that does not contribute to the fit of any of the sub-models will register zero dispersion importance; iii) in the case of correlated regressors, a variable can receive a non-zero estimate of dispersion importance if it contributes to the fit of any sub-model, even if its coefficient in the full model is zero.  This property is particularly useful because the true causal relationship is almost always unknown (Grömping, 2007).
One limitation of the Shapley value estimator of dispersion importance is that, like extant estimators, its finite sample analytic sampling distribution is not available. The asymptotic distribution can be used for inference under the strong assumption that the regressors are multivariate normal. The better approach to inference is to use bootstrapped confidence intervals, as we do in this paper.
REFERENCES
Azen R, Budescu DV. 2003. The dominance analysis approach for comparing predictors in multiple 	regression. Psychological Methods, 8: 129–148.
Budescu DV. 1993. Dominance analysis: A new approach to the problem of relative importance in 	multiple regression. Psychological Bulletin, 114: 542–551.
Chevan, A, Sutherland M. 1991. Hierarchical partitioning. The American Statistician, 45: 90–96.
Grömping U. 2007. Estimators of relative importance in linear regression based on variance 	decomposition. The American Statistician, 61(2): 139-147.
Kruskal W. 1987. Relative importance by averaging over orderings. The American Statistician, 41: 6-10.
Lindeman RH, Merenda PF, Gold RZ. 1980. Introduction to Bivariate and Multivariate Analysis. Scott, 	Foresman: Glenview, IL.
Lipovetsky S, Conklin M. 2001. Analysis of regression in game theory approach. Applied Stochastic 	Models in Business and Industry, 17: 319–330.
Pintér M. 2011. Regression games. Annals of Operations Research, 186: 263–274.
Shapley LS. 1953. A value for n-person games. In Contributions to the Theory of Games II, Kuhn H, Tucker AW (eds). Annals of Mathematics Studies, 28: 307–317. Princeton University Press: 	Princeton, NJ.
Stufken J. 1992. On hierarchical partitioning. The American Statistician, 46: 70–71.
Young HP. 1985. Monotonic solutions of cooperative games. International Journal of Game Theory, 14: 	65–72.
ONLINE APPENDIX 2: SIMULATION DETAILS
The challenge in apportioning the variance of profitability is to divide covariances between the various sets of effects considered, and allocate them exactly and uniquely to these effects sets in a way that is true to the data generation process. The data generating process in the Monte Carlo simulation described below is designed to help evaluate the different variance allocation methods used in prior research and the Shapley Value approach discussed in this paper, relative to each other.
In the simulation, profitability is determined by industry, corporate group, firm and year effects. By design, the industry effects and the corporate group effects covary with each other, while the firm effects and the year effects do not covary with any other effect. Thus, the general variance covariance structure relating to profitability is:

In order to compare the different approaches in an unconfounded way, we specify a transparent, symmetric, joint distributional structure for industry and corporate group effects (see next paragraphs for details). Due to this symmetric structure in which the two sets of effects have equal mean impacts on profitability, with near-equal variances, and necessarily symmetric covariance, the only rationalizable allocation of the covariation between the two effects is its equal division between them.[footnoteRef:18]  Thus the proportions of total variance attributable to the industry and the corporate group effects are  and  respectively. As the year effects and firm effects do not covary with any other effect the true proportions of attributable to them are  and  respectively.   [18:  In Appendix 1 it is shown that in the more general case, when multiple effects have different impacts on the dependent variable, and/or have different variances, then the Shapley value allocation of covariation between the effects will be unequal. The precise structure of the allocation follows from the axioms - the minimal essential properties of dispersion importance: efficiency, equal treatment and monotonicity - that underpin the Shapley Value allocation. ] 

	We compare these true shares with the proportions of total variance that are attributed to each effect as estimated by ANOVA, multilevel (HLM), and VCA methods, as well as by the Shapley Value regression method. Our objective in this simulation is to identify the method that produces estimates closest to the true values, and which can therefore be identified as the method most suitable for this type of analysis.
	The results presented here are based on data simulated for 500 corporate groups operating in 250 industries over 4 years, with each corporate group operating in two industries, resulting in 1,000 firms. First, we draw 250 industry effects and 250 average corporate group effects within an industry from a bivariate normal distribution with mean zero and variance 25 for both effects, and with a fixed correlation  between them . To examine the accuracy of the methods when applied to data with different correlation structures we allow the correlation  in the data generating process to vary from 0 to 0.9. 
Next, we generate two individual corporate group effects for each industry by adding a standard normal random variable to the average corporate group effects. We now have 500 corporate groups, each operating in a primary industry. To assign a second industry to each corporate group while maintaining the correlation structure between industry and corporate group effects, we generate a hypothetical second industry effect for each corporate group by adding an random variable  to their primary industry effect. We then match this hypothetical second industry effect to the closest existing industry effect that is different from the corporate group’s primary industry effect. The data now consists of 1,000 observations of corporate group profitability in 250 industries, making up 1,000 firms. 
	We next generate the firm effects, . The firm effects are thus drawn independently from the industry and corporate group effects. To construct the year effects, we create an additional three copies of the data set and assign a year effect,  to each of the four annual data sets. The final step is to construct the profitability measure as the sum of industry, corporate group, firm and year effects, plus a classical error term, . We run the data generating process 100 times for each value of the correlation between industry and corporate group effects ranging from 0 to 0.9 in steps of 0.1, making 1,000 runs of the simulated data generating process in total. 
	Given the parameters of our data generating process, we can calculate the components of the variance decomposition. ,  (variance of the average corporate group effect plus variance of the random variable used to generate individual corporate group effects), , , , and . 
Figure A1 about here
	The simulation results are presented in Figure A1. The ANOVA upper and lower bound industry and corporate group effect estimates diverge significantly, both from each other, and from the true values. The contribution of the correlated effect that is introduced first into the model is over-estimated, as expected from the discussion in the previous section. The ANOVA method also appears to underestimate firm effects compared to their true contribution to variance when these are introduced last into the model. The HLM upper and lower bound estimates of industry and corporate group effects again diverge substantially from one another and from the true values with increasing correlation between the two effects, while producing more accurate estimates of firm effects than ANOVA. The VCA method produces estimates of industry and corporate group effect importance that are closer to the theoretical values than the HLM results, but this approach over-estimates the influence of industry effects, and under-estimates that of corporate group effects as the covariance between the two increases. In contrast to these approaches, the Shapley Value regression estimates of the proportions of total variance attributable to each effect appear to stay consistently close to the true value for every effect. The minor deviations can be explained, following the derivations in Appendix 1, by the difference between the variances of industry and corporate group effects. 
Table A1 about here
	The performance of the different methods can also be compared using root mean square error (RMSE) to measure how much the effects estimates of each method deviate from the true values on average. The RMSE of each method for each effect is reported in Table A1. The Shapley Value regression estimates have the lowest RMSE of all four methods for all effects with the exception of firm effects, for which the HLM model with corporate group fixed effects nested in industry random effects (contrary to the methodological guidance followed by Misangyi et al., 2006), produces a marginally smaller RMSE. For the year effect alone, ANOVA does equally well as the Shapley Value, however for industry and corporate group effects ANOVA upper and lower bound estimate RMSEs are substantially higher than those of the Shapley Value approach. The upper and lower bound ANOVA RMSEs are also very different from each other.  While the difference between upper and lower bound HLM RMSEs is less pronounced than those of the ANOVA approach, again these are substantially higher than the Shapley Value RMSEs for industry and corporate group effects. VCA RMSEs for industry and corporate group effects are the second-lowest after those produced by the Shapley Value approach, but the RMSE for firm effects of the VCA approach is higher than that of either HLM model (as well as of the Shapley Value).[footnoteRef:19] [19:  Brush and Bromiley (1997) suggest that corporate effects may only affect a subset of affiliates and show that VCA severely underestimates the size of corporate effects in such situations. To evaluate whether the Shapley Value approach is subject to a similar bias, we re-ran the Monte Carlo simulations by recalculating the profitability measure with the coefficient on corporate parent effects set to zero for half of the business units of each corporate parent (corresponding to model 2, scale 1.0 in Table 1 of Brush and Bromiley (1997: 831)). The results, available on request from the authors, are very similar to those presented in figure 1 and table 1. The Shapley Value approach does therefore appear to perform well even in cases when corporate group effects affect only a subset of the group’s affiliates.] 


Figure A1: Comparison of variance decomposition methods


	Table A1: Simulation: Root Mean Square Errors

	
	Year
	Industry
	Corporate Network
	Firm

	ANOVA Upper Bound
	0.0029
	0.1660
	0.2872
	0.2524

	ANOVA Lower Bound
	0.0029
	0.0433
	0.1020
	0.2524

	HLM Upper
	0.0032
	0.1252
	0.1361
	0.0372

	HLM Lower
	0.0032
	0.0906
	0.1009
	0.0297

	VCA
	0.0032
	0.0635
	0.0531
	0.0451

	Shapley value
	0.0029
	0.0190
	0.0457
	0.0327


VCA

size	1	2	3	4	5	6	7	8	9	10	Affiliates	9.6097946166992188E-2	9.6097946166992188E-2	0.1005653589963913	0.10429375618696213	0.10951210558414459	0.11480069905519485	0.10595739632844925	0.10784386843442917	0.10719002038240433	0.10277660191059113	Affiliates CIL	5.573669821023941E-2	5.573669821023941E-2	5.8574430644512177E-2	5.0521556288003922E-2	6.0691211372613907E-2	6.0714073479175568E-2	3.7642639130353928E-2	4.2281277477741241E-2	3.9157286286354065E-2	3.3884856849908829E-2	Affiliates CIU	0.13762445747852325	0.13762445747852325	0.13926216959953308	0.14920055866241455	0.16258908808231354	0.17332476377487183	0.16340692341327667	0.16347287595272064	0.16413137316703796	0.1596994549036026	Rel inds	9.6097946166992188E-2	9.3564979732036591E-2	0.12392183393239975	0.1403103768825531	5.9620916843414307E-2	1.3939277268946171E-2	2.6573546230792999E-2	Rel inds CIL	5.573669821023941E-2	6.5674126148223877E-2	7.8126512467861176E-2	6.8637296557426453E-2	1.182407253705331E-12	3.7761977831321664E-18	1.3378941204261279E-20	Rel inds CIU	0.13762445747852325	0.12074223160743713	0.17768782377243042	0.22812046110630035	0.17055079340934753	7.5430653989315033E-2	0.17358402907848358	Unrel inds	9.6097946166992188E-2	9.4077058136463165E-2	8.8889442384243011E-2	9.2372722923755646E-2	3.9540842175483704E-2	4.728689044713974E-2	4.9736969172954559E-2	5.8343298733234406E-2	5.3524564951658249E-2	5.9256028383970261E-2	Unrel inds CIL	5.573669821023941E-2	4.0746979415416718E-2	5.9451363980770111E-2	5.3436730057001114E-2	5.5883508175611496E-3	3.5254510585218668E-3	6.4525909060364484E-9	1.7418100739519105E-8	9.72675898158748E-11	2.9727171855305386E-15	Unrel inds CIU	0.13762445747852325	0.13104557991027832	0.11906228214502335	0.13513900339603424	0.11205259710550308	0.12395381927490234	0.15856793522834778	0.15245690941810608	0.14507268369197845	0.17127376794815063	Countries	9.6097946166992188E-2	7.4178971350193024E-2	5.0666384398937225E-2	6.0655973851680756E-2	5.6353975087404251E-2	5.9689134359359741E-2	6.8885922431945801E-2	6.6670231521129608E-2	6.6167764365673065E-2	7.7056393027305603E-2	Countries CIL	5.573669821023941E-2	2.9854742810130119E-2	1.0797048918902874E-2	1.2928235344588757E-2	6.8096448667347431E-3	1.0004437528550625E-2	1.2728598900139332E-2	5.9957024641335011E-3	5.4122176952660084E-3	2.6375984307378531E-3	Countries CIU	0.13762445747852325	0.11317490786314011	9.4409935176372528E-2	0.11390472948551178	0.11034300923347473	0.11659823358058929	0.14541801810264587	0.13720335066318512	0.14450652897357941	0.17370146512985229	Corporate group span


Corporate group proportion of total variance


36

image1.png

image2.png

image3.png

image4.png

image5.png

image6.png

image7.png