Cambridge Working Papers in Economics: 1906 SEMIPARAMETRIC NONLINEAR PANEL DATA MODELS WITH MEASUREMENT ERROR Oliver Linton Ji-Liang Shiu 18 January 2019 This paper develops identification and estimation of the parameters of a nonlinear semi-parametric panel data model with mismeasured variables as well as the corresponding average partial effects using only three periods of data. The past observables are used as instruments to control the measurement error problem, and the time averages of perfectly observed variables are used to restrict the unobserved individual-specific effect by a correlated random effects specification. The proposed approach relies on the Fourier transforms of several conditional expectations of observable variables. We estimate the model via the semi-parametric sieve minimum distance estimator. The finite-sample properties of the estimator are investigated through Monte Carlo simulations. We use our method to estimate the effect of the wage rate on labor supply using PSID data. Cambridge Working Papers in Economics Faculty of Economics Semiparametric Nonlinear Panel Data Models with Measurement Error Oliver Linton∗ Ji-Liang Shiu† January 12, 2019 Abstract This paper develops identification and estimation of the parameters of a nonlin- ear semi-parametric panel data model with mismeasured variables as well as the corresponding average partial effects using only three periods of data. The past observables are used as instruments to control the measurement error problem, and the time averages of perfectly observed variables are used to restrict the un- observed individual-specific effect by a correlated random effects specification. The proposed approach relies on the Fourier transforms of several conditional expec- tations of observable variables. We estimate the model via the semi-parametric sieve minimum distance estimator. The finite-sample properties of the estimator are investigated through Monte Carlo simulations. We use our method to estimate the effect of the wage rate on labor supply using PSID data. Keywords: Correlated random effects, Measurement error, Nonlinear panel data models, Semi-parametric identification ∗Department of Economics, University of Cambridge, Email: obl20@cam.ac.uk. †Institute for Economic and Social Research, Jinan University. Email: jishiu.econ@gmail.com. 1. Introduction The availability of panel data allows economists to control for unobservable individual- specific characteristics that may be correlated with explanatory variables. Substantial progress has been made on how to handle linear or nonlinear models ignoring the potential presence of measurement error. However, many economic quantities such as work hours, earnings, fringe benefits, employment, and health in surveys are frequent- ly measured with errors, especially when longitudinal information is collected through one-time retrospective surveys.1 This concern has been heightened by the increased use of longitudinal data sets; mismeasurement of the panel data may lead to false re- sults or obscure the true economic relationships. The estimation problems caused by the mismeasurement of economic data may be greatly exacerbated when one tries to control for the consequences of unobserved individual effects by using standard fixed effects or first-differenced estimators. We consider the following semi-parametric nonlinear panel data model with un- known finite-dimensional parameter β0 (1) Yit =m ( Wit, X∗it,Ci;β0 )+Uit, i = 1, . . . ,n, t= 1, . . . ,T. In this model, Yit is an observed scalar dependent variable, Wit are perfectly observed explanatory variables, X∗it is a latent continuously distributed mismeasured variable, Ci is an unobserved individual-specific effect, and Uit is an unobserved random vari- able. The function m may not be separable with regard to Wit, X∗it, and Ci, but it belongs to a known, finite-dimensional, parametric family. We focus on the case where the data consists of a large number of individuals observed through a small (fixed) number of time periods. The variable X it is a proxy or measure of the unobserved true regressor X∗it. The model described in Eq. (1) has two aspects that are new in the literature of 1The problems of the measurement error have raised great concern in a number of practical applica- tions. Studies in Bollinger (1998), Bound, Brown, Duncan, and Rodgers (1994), and Bound, Brown, and Mathiowetz (2001) provide evidences of the measurement errors in economics data sets. 2 panel data models with measurement errors. First, the unobserved heterogeneity en- ters the structural regression function nonseparably and without imposing a linear index structure. Second, the potentially nonlinear regression function also contains a mismeasured variable (nonseparably) along with other explanatory variables. This proposed regression model is consistent with a structural function derived from a dy- namic utility maximization problem with flexible preferences. For example, models of this form can arise in the study of life cycle labor supply with individual preference. See e.g., Koebel, Laisney, Pohlmeier, and Staat (2008).2 Linear panel data models with measurement error problems have been widely stud- ied in the literature including: Griliches and Hausman (1986), Wansbeek and Koning (1991), Biørn (1992), and Wansbeek (2001). Their approaches involved first applying an appropriate transformation to handle the unobserved effect and then using instru- ments in a generalized method of moments (GMM) framework. On the other hand, if we ignore the measurement error problem in Eq. (1), then the model belong to the class of nonseparable panel data models, which have been studied in: Evdokimov (2011), Chernozhukov, Fernández-Val, Hahn, and Newey (2013), Hoderlein and White (2012), Chen and Swanson (2012), Hoderlein and Mammen (2007), Altonji and Matzkin (2005), and Chernozhukov, Fernandez-Val, Hoderlein, Holzmann, and Newey (2015). In par- ticular, Chernozhukov, Fernández-Val, Hahn, and Newey (2013), Graham and Powell (2012), and Hoderlein and White (2012) use changes over time in x to obtain the ce- teris paribus effect of x on y for identification and estimation of nonseparable models. Wilhelm (2015) considers nonlinear panel data models with measurement error where fixed effects are additively separable. He differences out the fixed effects and provides a nonparametric identification result without requiring any extra variable other than outcomes and observed regressors. However, in nonseparable panel data models it is not clear how to remove the unobserved heterogeneity and address measurement error problems simultaneously (first differencing does not work), so there is a fundamental difference between additively separable models and nonseparable models. 2Our model could accommodate Eq. (23.13) in Koebel, Laisney, Pohlmeier, and Staat (2008) with δ= δi which depends on individual i and thus the equation is a special case of our formula provided. 3 Besides the short panel data setting considered here, there is a lot of closely related work in the large panel literature, but not allowing for measurement error. Alvarez and Arellano (2003) investigate the linear panel regression models with fixed effects for large n,T, and they find that their GMM estimator has an asymptotic bias of an order 1/n and does not cause bias for large T. Akashi and Kunitomo (2012) use the approach in Alvarez and Arellano (2003) to study panel dynamic simultaneous equation models. Hahn and Kuersteiner (2002) characterize the bias of the fixed effect estimator by allowing both n and T to approach infinity and the ratio n/T to approach a constant. We develop an identification technique that builds on previous work of Schennach (2007), concerning the identification and estimation of nonlinear measurement error models with instruments. The identification strategy is to employ Fourier transforms of conditional expectations of observable variables and to provide a closed form solu- tion to the regression function based on these transforms. We generalize the method of Schennach (2007) by allowing for a measurement error term in the regression func- tion with an additional unobserved individual-specific effect in a panel data setting. The proposed method works in a way that panel data contains enough information on observables to identify the mismeasured variable X∗it, and the unobserved individual- specific effect Ci. While the past observables are used as instruments to control the measurement error problem, the time averages of perfectly observed variables are used to restrict the unobserved individual-specific effect by a correlated random effects specification. Altonji and Matzkin (2005) and Wooldridge (2005) have used correlated random effects (CRE) approaches to nonlinear panel data models to control the unob- served individual-specific effect. Thus, the nonseparable regression function of interest also admits a similar representation of the closed form solution in Schennach (2007) under a mild regularity condition. We propose an estimation method that closely follows the identification result, in particular it builds on knowledge of the three conditional expectations. We propose a sieve minimum distance (hereafter SMD) estimator for the parameters of interest. Then, estimating the parameters of interest by implementing the methods of series or 4 sieve estimation developed in Ai and Chen (2007), which is an extension of SMD esti- mation in Ai and Chen (2003) and Newey and Powell (2003). The estimation procedure consists of applying the SMD method to a vector of moment conditions with different conditioning variables related to the identification result. It follows that the SMD esti- mator for the finite-dimensional parameters of the structural function is p n-consistent and asymptotically normally distributed. The rest of the paper is organized as follows. Section 2 describes the identification assumptions and strategy for nonlinear panel data models with measurement errors. Section 3 covers the SMD estimation procedure based on the identification restric- tions in Section 2. Section 4 discusses the implementation of the SMD estimator and presents its Monte Carlo simulation. Section 5 presents our empirical application, the elasticity of labor supply. Section 6 concludes. All proofs are collected in the Appendix. 2. Semiparametric Identification Without loss of generality, we consider both Wit and X∗it to be scalar quantities (a multi- variate case can be straightforwardly provided). To avoid confusion, upper case letters are used exclusively for random variables and lower case letters are used exclusively for non-random quantities corresponding to its upper case random variables. The data {yit,wit, xit} is independently and identically distributed across i for each t and it is an observable random sample for {Yit,Wit, X it} for i = 1,2, . . . ,n and t= 1, . . . ,T ≥ 2. Assumption 2.1. (Correlated Random Effects (CRE)) There exists a nonzero coefficient λ0 such that Ci =λ0W i+ηi, where W i = 1T ∑T t=1 Wit is denoted as the time average of the perfectly observed explana- tory variables. The remainder term ηi is independent of W i. Assumption 2.1 can be generalized to include more perfectly observed explanatory variables. For example, if there exists another time-invariant variable Z i, we can 5 consider the following CRE specification Ci =λ01W i+λ02Z i+ηi. Including more control variables in the specification may make the independent as- sumption of the projection error ηi more reasonable. Assumption 2.2. (Classical measurement error): (i)(Past variables as IV) There exists an unknown function ht at time t satisfying X∗it = ht(G i, 0. Assumption 2.3 ensures that the Fourier transforms of the conditional expectations to be well defined members of a subclass of locally integrable functions. Define the Fourier transforms of the function m and the conditional expectations Rt(g,w;w) and St(g,w;w) defined in Assumption 2.3: Fy(w,ξ1,ξ2)= ∫ ∫ Rt(g,w;w)eiξ1 geiξ2wdgdw(2) Fxy(w,ξ1,ξ2)= ∫ ∫ St(g,w;w)eiξ1 geiξ2wdgdw(3) Fm(w,ξ1,ξ2;β0)= ∫ ∫ m ( w, x, c;β0 ) eiξ1xeiξ2cdxdc,(4) 7 where i=p−1. Define also φv(ξ1)= ∫ eiξ1 v˜ fV˜it(v˜)dv˜ and φη(ξ2)= ∫ eiξ2η˜ fη˜i (η˜)dη˜, where fV˜it(v˜) and fη˜i (η˜) are the density functions of V˜it and η˜i, respectively. Lemma 2.1. Suppose that Assumptions 2.1, 2.2, and 2.3 hold. Then, Fy(w,ξ1,ξ2)= 1 λ0 Fm(w,ξ1, ξ2 λ0 )φv(ξ1)φη( ξ2 λ0 ),(5) Fxy(w,ξ1,ξ2)= 1 λ0 − i ∂Fm(w,ξ1, ξ2 λ0 ) ∂ξ1 φv(ξ1)φη( ξ2 λ0 ).(6) Proof. See the appendix. Assumption 2.4. Suppose that: (i) ∫ |v˜| fV˜it(v˜)dv˜ ≤ A <∞, ∫ |η˜| fη˜i (η˜)dη˜ ≤ A <∞; and (ii) the characteristic functions φv(ξ1) 6= 0, and φη(ξ2) 6= 0 are continuously differentiable for all ξ1,ξ2 ∈R. Assumption 2.5. Set Θ as a parameter space containing β0. There exists a finite or infinite constant ζ¯ > 0 and some wit such that for all β ∈ Θ : (i) Fm(wit,ξ1,ξ2;β) 6= 0 almost everywhere in [−ζ¯, ζ¯]2 and (ii) Fm(wit,ξ1,ξ2;β)= 0 for all |ζ1|, |ζ2| > ζ¯. Assumptions 2.4 and 2.5 are standard in the deconvolution literature. Assumption 2.4(ii) requires that the characteristic functions of V and η˜ are non-vanishing, which excludes uniform or triangular distributions, for example. Exploiting the conditional mean function in Eq. (A.5) by replacing fη˜i (η˜) by fη˜i ;γ(η˜), we have the following result. Denote γ= (β,λ) and γ is a (dβ+2)×1-dimensional vector. Consider the parametric conditional mean function in Eq. (A.16): Rt(g,w;w,γ)=E[Yit|Wit =w,G˜ i,