
The Annals of Statistics
2010, Vol. 38, No. 5, 2857–2883
DOI: 10.1214/10-AOS808
© Institute of Mathematical Statistics, 2010
arXiv:1011.2592v1 [math.ST] 11 Nov 2010

BACKFITTING AND SMOOTH BACKFITTING FOR ADDITIVE QUANTILE MODELS

By Young Kyung Lee¹, Enno Mammen² and Byeong U. Park³

Kangwon National University, University of Mannheim and Seoul National University

In this paper, we study the ordinary backfitting and smooth backfitting as methods of fitting additive quantile models. We show that these backfitting quantile estimators are asymptotically equivalent to the corresponding backfitting estimators of the additive components in a specially-designed additive mean regression model. This implies that the theoretical properties of the backfitting quantile estimators are not unlike those of backfitting mean regression estimators. We also assess the finite sample properties of the two backfitting quantile estimators.

Received October 2009; revised January 2010.
¹ Supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2009-0058380).
² Supported by DFG-project MA 1026/9-2 of the Deutsche Forschungsgemeinschaft.
³ Supported by Mid-career Researcher Program through NRF grant funded by the MEST (No. 2010-0017437).
AMS 2000 subject classifications. Primary 62G08; secondary 62G20.
Key words and phrases. Backfitting, nonparametric regression, quantile estimation, additive models.
This is an electronic reprint of the original article published by the Institute of Mathematical Statistics in The Annals of Statistics, 2010, Vol. 38, No. 5, 2857–2883. This reprint differs from the original in pagination and typographic detail.

1. Introduction. Nonparametric additive models are powerful techniques for high-dimensional data. They enable us to avoid the curse of dimensionality and estimate the unknown functions in high-dimensional settings at the same accuracy as in univariate cases. In the mean regression setting, there have been many proposals for fitting additive models. These include the ordinary backfitting procedure of Buja, Hastie and Tibshirani (1989), whose theoretical properties were studied later by Opsomer and Ruppert (1997) and Opsomer (2000), the marginal integration technique of Linton and Nielsen (1995), and the smooth backfitting of Mammen, Linton and Nielsen (1999), Mammen and Park (2006) and Yu, Park and Mammen (2008). It is widely accepted that the marginal integration method still suffers from the curse of dimensionality since it does not produce rate-optimal estimates unless smoothness of the regression function increases with the number of additive components. On the contrary, the ordinary backfitting and smooth backfitting are known to achieve the univariate optimal rate of convergence under certain regularity conditions.

In this paper, we are concerned with nonparametric estimation of additive conditional quantile functions. Conditional quantile estimation is also a very useful tool for exploring the structure of the conditional distribution of a response given a predictor. A collection of conditional quantiles, when graphed, gives a picture of the entire conditional distribution. It can be used directly to construct conditional prediction intervals. Also, it may be a basis for verifying the presence of conditional heteroscedasticity; see Furno (2004), for example. Various other applications of conditional quantile estimation may be found in Yu, Lu and Stander (2003). In the nonadditive setting, there have been many proposals for this problem, which include the work by Jones and Hall (1990), Chaudhuri (1991), Yu and Jones (1998) and Lee, Lee and Park (2006).
There have also been some proposals for additive quantile regression. Fan and Gijbels (1996) provided a direct extension of the ordinary backfitting method to quantile regression, but without discussing its statistical properties. Lu and Yu (2004) gave a heuristic discussion of the asymptotic limit of a backfitting local linear quantile estimator. Horowitz and Lee (2005) studied an extension of the two-stage procedure of Horowitz and Mammen (2004) to quantile regression. Their estimator is a one-step kernel smoothing iteration of an orthogonal series estimator.

The main theme of this paper is to discuss the statistical properties of the ordinary and smooth backfitting methods in additive quantile regression. The methods are difficult to analyze since there exists no explicit definition for the ordinary backfitting estimator and, for both estimators, the objective functions defining the estimators are not differentiable. We borrow empirical process techniques to tackle the problem. In particular, we devise a theoretical mean regression model by using a Bahadur representation for the sample quantiles. We show that the least squares ordinary and smooth backfitting estimators in this theoretical mean regression model are asymptotically equivalent to the corresponding quantile estimators in the original model. This makes the theoretical properties of the two backfitting quantile estimators well understood from the existing theory for the corresponding least squares backfitting mean regression estimators. The theory was confirmed by a simulation study. Also, it was observed in the simulation study that the smooth backfitting estimator outperformed the ordinary backfitting estimator in additive quantile regression.

The paper is organized as follows. In the next section, the ordinary and smooth backfitting methods for additive quantile regression are introduced and their theoretical properties are provided.
In Section 3, some computational aspects of the smooth backfitting method are discussed. The simulation results for the finite sample properties of the two backfitting methods are presented in Section 4. Technical details are given in Section 5.

2. Main results. It is assumed for one-dimensional response variables $Y^1, \ldots, Y^n$ that

\[
Y^i = m_0 + m_1(X_1^i) + \cdots + m_d(X_d^i) + \varepsilon^i, \qquad 1 \le i \le n. \tag{2.1}
\]

Here, $\varepsilon^i$ are error variables, $m_1, \ldots, m_d$ are unknown functions from $\mathbb{R}$ to $\mathbb{R}$ satisfying $\int m_j(x_j) w_j(x_j)\,dx_j = 0$ for some weight functions $w_j$, $m_0$ is an unknown constant, and $X^i = (X_1^i, \ldots, X_d^i)$ are random design points in $\mathbb{R}^d$. Throughout the paper, we assume that $(X^i, \varepsilon^i)$ are i.i.d. and that $X_j^i$ takes its values in a bounded interval $I_j$. Furthermore, it is assumed that the conditional $\alpha$-quantile of $\varepsilon^i$ given $X^i$ equals zero. This model excludes interesting auto-regression models, but it simplifies our asymptotic analysis. We expect that our results can be extended to dependent observations under mixing conditions.

The ordinary backfitting estimator is based on an iterative algorithm. The estimate of $m_j$ is updated by the following equation:

\[
\hat m_j^{\mathrm{BF}}(x_j) = \arg\min_{\theta \in \Theta} \sum_{i=1}^n \tau_\alpha\Bigl( Y^i - \theta - \hat m_0^{\mathrm{BF}} - \sum_{\ell=1, \ne j}^d \hat m_\ell^{\mathrm{BF}}(X_\ell^i) \Bigr) K_{j,h_j}(x_j, X_j^i). \tag{2.2}
\]

Here, $\tau_\alpha$ is the so-called "check function" defined by $\tau_\alpha(u) = u\{\alpha - I(u < 0)\}$, and $K_{j,g}$ are kernel functions with bandwidth $g$; see the assumptions below. To simplify the mathematical argumentation, the minimization in (2.2) runs over a compact set $\Theta$. It is assumed that all values of the function $m_j$ lie in the interior of $\Theta$. As in the case of mean regression, the ordinary backfitting estimator is not defined as a solution of a global minimization problem.

The smooth backfitting estimator is also based on an iterative algorithm. The estimate of $m_j$ is updated by the following integral equation:

\[
\hat m_j^{\mathrm{SBF}}(x_j) = \arg\min_{\theta \in \Theta} \sum_{i=1}^n \int \tau_\alpha\Bigl( Y^i - \theta - \hat m_0^{\mathrm{SBF}} - \sum_{\ell=1, \ne j}^d \hat m_\ell^{\mathrm{SBF}}(x_\ell) \Bigr) \prod_{\ell=1, \ne j}^d K_{\ell,h_\ell}(x_\ell, X_\ell^i)\,dx_\ell \cdot K_{j,h_j}(x_j, X_j^i), \tag{2.3}
\]

where the integration is over the support of $(X_1^i, \ldots, X_{j-1}^i, X_{j+1}^i, \ldots, X_d^i)$. This is an iterative scheme for obtaining $\hat m_j^{\mathrm{SBF}}$, $j = 0, 1, \ldots, d$, which minimize

\[
\sum_{i=1}^n \int \tau_\alpha\Bigl( Y^i - \hat m_0^{\mathrm{SBF}} - \sum_{j=1}^d \hat m_j^{\mathrm{SBF}}(x_j) \Bigr) K_{1,h_1}(x_1, X_1^i) \cdots K_{d,h_d}(x_d, X_d^i)\,dx_1 \cdots dx_d, \tag{2.4}
\]

where the integration is over the support of $X^i$. The minimizations or iterations are done under the constraints

\[
\int_{I_j} \hat m_j^{l}(x_j) w_j(x_j)\,dx_j = 0, \qquad j = 1, \ldots, d \text{ and } l = \mathrm{BF}, \mathrm{SBF} \tag{2.5}
\]

for some weight functions $w_j$. One may take unknown weight functions such as the marginal densities of $X_j$ and use consistent estimators of them as the weight functions $w_j$ in the integrals (2.5). But this would lead to more complicated bias calculation.

We compare our model (2.1) with the following theoretical model. For $i = 1, \ldots, n$, let $Z^1, \ldots, Z^n$ be one-dimensional variables such that

\[
Z^i = m_0 + m_1(X_1^i) + \cdots + m_d(X_d^i) + \eta^i. \tag{2.6}
\]

Here, the constant $m_0$, the functions $m_1, \ldots, m_d$ and the covariates $X_1^i, \ldots, X_d^i$ are those in (2.1). The error variables $\eta^i$ are defined by

\[
\eta^i = -\frac{I(\varepsilon^i \le 0) - \alpha}{f_{\varepsilon|X}(0 \mid X^i)},
\]

where $f_{\varepsilon|X}$ is the conditional density of $\varepsilon$ given $X$. This definition is motivated by the Bahadur representation of sample quantiles [Bahadur (1966)]. For an independent sample of $\varepsilon^1, \ldots, \varepsilon^n$ with densities $f_i$ and $\alpha$-quantiles being equal to 0, the Bahadur expansion states that the $\alpha$th sample quantile $\hat\theta_\alpha$ of $\varepsilon^1, \ldots, \varepsilon^n$ is asymptotically equivalent to the weighted average

\[
\frac{\sum_{i=1}^n f_i(0)\,\eta^i}{\sum_{i=1}^n f_i(0)},
\]

where $\eta^i = -\{I(\varepsilon^i \le 0) - \alpha\} f_i(0)^{-1}$. Thus, the estimator $\hat\theta_\alpha$ is asymptotically equivalent to the minimizer of

\[
\theta \to \sum_{i=1}^n f_i(0)(\eta^i - \theta)^2.
\]

This consideration suggests that the ordinary and smooth backfitting estimators defined at (2.2) and (2.3), respectively, may be approximated well by the corresponding weighted local least squares estimators in the model (2.6).
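The Bahadur-type approximation described above can be checked numerically. The following is a minimal sketch, not taken from the paper: the heteroscedastic normal design, the sample size, the seed and all function names are assumptions chosen for illustration. It compares the $\alpha$th sample quantile with the weighted average $\sum_i f_i(0)\eta^i / \sum_i f_i(0)$ and confirms that the sample quantile minimizes the summed check loss.

```python
import numpy as np
from statistics import NormalDist


def check_loss(u, alpha):
    """Check function tau_alpha(u) = u * (alpha - 1{u < 0})."""
    u = np.asarray(u, dtype=float)
    return u * (alpha - (u < 0))


def bahadur_average(eps, f0, alpha):
    """Weighted average sum_i f_i(0) * eta^i / sum_i f_i(0)."""
    eta = -((eps <= 0).astype(float) - alpha) / f0
    return np.sum(f0 * eta) / np.sum(f0)


def simulate(n, alpha, seed=0):
    """Hypothetical design: eps^i = sigma_i * (N(0,1) - z_alpha), so every
    eps^i has alpha-quantile 0 and density f_i(0) = phi(z_alpha) / sigma_i."""
    rng = np.random.default_rng(seed)
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(alpha)
    sigma = rng.uniform(0.5, 1.5, size=n)
    eps = sigma * (rng.standard_normal(n) - z_alpha)
    f0 = std_normal.pdf(z_alpha) / sigma
    return eps, f0


alpha = 0.25
eps, f0 = simulate(n=20000, alpha=alpha)

theta_hat = np.quantile(eps, alpha)            # alpha-th sample quantile
theta_star = bahadur_average(eps, f0, alpha)   # weighted average of the eta^i


def objective(theta):
    """Summed check loss, which the sample quantile minimizes."""
    return check_loss(eps - theta, alpha).sum()


print(f"sample quantile : {theta_hat:.5f}")
print(f"weighted average: {theta_star:.5f}")
```

In this sketch both quantities fluctuate around zero at the $n^{-1/2}$ scale, while their difference is of smaller order, which is the sense in which the text calls them asymptotically equivalent.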
Note that the model (2.6) is an additive model with errors $\eta^i$ having conditional mean zero given the covariates $X^i$. Thus, the weighted ordinary backfitting estimators $\hat m_j^{*,\mathrm{BF}}$ in this model are defined by the following iterations:

\[
\begin{aligned}
\hat m_j^{*,\mathrm{BF}}(x_j)
&= \arg\min_{\theta \in \Theta} \sum_{i=1}^n \Bigl\{ Z^i - \theta - \hat m_0^{*,\mathrm{BF}} - \sum_{\ell=1, \ne j}^d \hat m_\ell^{*,\mathrm{BF}}(X_\ell^i) \Bigr\}^2 f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i) \\
&= \sum_{i=1}^n \Bigl\{ Z^i - \hat m_0^{*,\mathrm{BF}} - \sum_{\ell=1, \ne j}^d \hat m_\ell^{*,\mathrm{BF}}(X_\ell^i) \Bigr\} f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i) \\
&\qquad \times \Bigl\{ \sum_{i=1}^n f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i) \Bigr\}^{-1}.
\end{aligned} \tag{2.7}
\]

Also, the weighted smooth backfitting estimators $\hat m_j^{*,\mathrm{SBF}}$ in the model (2.6) are defined by

\[
\hat m_j^{*,\mathrm{SBF}}(x_j) = \tilde m_j^{*,\mathrm{SBF}}(x_j) - \hat m_0^{*,\mathrm{SBF}} - \sum_{\ell=1, \ne j}^d \int \hat m_\ell^{*,\mathrm{SBF}}(x_\ell) \frac{\hat f^w_{X_j,X_\ell}(x_j, x_\ell)}{\hat f^w_{X_j}(x_j)}\,dx_\ell, \tag{2.8}
\]

where

\[
\begin{aligned}
\tilde m_j^{*,\mathrm{SBF}}(x_j) &= n^{-1} \sum_{i=1}^n Z^i f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i)\, \hat f^w_{X_j}(x_j)^{-1}, \\
\hat f^w_{X_j}(x_j) &= n^{-1} \sum_{i=1}^n f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i), \\
\hat f^w_{X_j,X_\ell}(x_j, x_\ell) &= n^{-1} \sum_{i=1}^n f_{\varepsilon|X}(0 \mid X^i) K_{j,h_j}(x_j, X_j^i) K_{\ell,h_\ell}(x_\ell, X_\ell^i)
\end{aligned}
\]

are weighted modifications of the marginal Nadaraya–Watson estimator and the kernel estimators of the one- and two-dimensional marginal densities of $X$, respectively. The latter two are in fact kernel estimators of

\[
\begin{aligned}
f^w_{X_j}(x_j) &= \int f_{\varepsilon|X}(0 \mid x) f_X(x)\,dx_{-j} = f_{\varepsilon,X_j}(0, x_j), \\
f^w_{X_j,X_\ell}(x_j, x_\ell) &= \int f_{\varepsilon|X}(0 \mid x) f_X(x)\,dx_{-(j,\ell)} = f_{\varepsilon,X_j,X_\ell}(0, x_j, x_\ell),
\end{aligned}
\]

respectively, where $x_{-j} = (x_1, \ldots, x_{j-1}, x_{j+1}, \ldots, x_d)^\top$ and $x_{-(j,\ell)}$ is a vector that has elements $x_l$ with $1 \le l \le d$ and $l \ne j, \ell$.

Our first result (Proposition 2.1) shows that each application of the updating equations (2.7) and (2.8) in the theoretical model (2.6) leads to results that are asymptotically equivalent to those of (2.2) and (2.3), respectively, in the original model (2.1). In the next step, we will apply Proposition 2.1 for iterative applications of the backfitting updates. We will show that the asymptotic equivalence continues to hold for iterative applications of the backfitting procedures as long as the number of iterations is small enough.
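To illustrate the kind of comparison that Proposition 2.1 formalizes, the sketch below runs one update of the first component with (2.2) on simulated responses $Y^i$ and one update with the closed form in (2.7) on the matching pseudo-responses $Z^i$. Everything in it is an assumption made for illustration and not the paper's simulation design: $d = 2$, $\alpha = 0.5$, a heteroscedastic normal error with known $f_{\varepsilon|X}(0 \mid X^i)$, an Epanechnikov kernel without boundary correction, and the true $m_0$ and $m_2$ plugged in for the other components. The compact set $\Theta$ and the norming (2.5) are omitted.

```python
import numpy as np


def epanechnikov(x, xi, h):
    """Convolution kernel K_h(x, xi) = K((x - xi) / h) / h (no boundary correction)."""
    u = (x - xi) / h
    return 0.75 * np.maximum(1.0 - u**2, 0.0) / h


def weighted_quantile(values, weights, alpha):
    """A minimizer over theta of sum_i weights_i * tau_alpha(values_i - theta)."""
    order = np.argsort(values)
    v, w = values[order], weights[order]
    cum = np.cumsum(w)
    # Smallest sorted value whose cumulative weight reaches alpha * total weight.
    k = np.searchsorted(cum, alpha * cum[-1], side="left")
    return v[min(k, len(v) - 1)]


def quantile_update(grid, xj, resid_y, h, alpha):
    """Update as in (2.2): kernel-weighted alpha-quantile of partial residuals."""
    return np.array([weighted_quantile(resid_y, epanechnikov(x, xj, h), alpha)
                     for x in grid])


def mean_update(grid, xj, resid_z, f0, h):
    """Update as in (2.7): Nadaraya-Watson smooth with weights f0 * kernel."""
    out = []
    for x in grid:
        w = f0 * epanechnikov(x, xj, h)
        out.append(np.sum(w * resid_z) / np.sum(w))
    return np.array(out)


# Hypothetical simulation design.
rng = np.random.default_rng(1)
n, alpha, h = 20000, 0.5, 0.1
X = rng.uniform(0.0, 1.0, size=(n, 2))
m0 = 1.0
m1 = lambda x: np.sin(2 * np.pi * x)       # integrates to zero on [0, 1]
m2 = lambda x: x**2 - 1.0 / 3.0            # integrates to zero on [0, 1]
sigma = 0.5 + X[:, 0]
eps = sigma * rng.standard_normal(n)       # conditional median zero
f0 = 1.0 / (np.sqrt(2 * np.pi) * sigma)    # f_{eps|X}(0 | X^i), known here
eta = -((eps <= 0) - alpha) / f0           # errors of the theoretical model (2.6)
signal = m0 + m1(X[:, 0]) + m2(X[:, 1])
Y, Z = signal + eps, signal + eta

grid = np.linspace(0.1, 0.9, 17)           # interior points only
resid_y = Y - m0 - m2(X[:, 1])
resid_z = Z - m0 - m2(X[:, 1])
m1_quantile = quantile_update(grid, X[:, 0], resid_y, h, alpha)
m1_mean = mean_update(grid, X[:, 0], resid_z, f0, h)
max_gap = np.max(np.abs(m1_quantile - m1_mean))
print(f"max |quantile update - mean update| on the grid: {max_gap:.4f}")
```

In practice $f_{\varepsilon|X}(0 \mid X^i)$ is unknown, which is why the text treats (2.6) to (2.8) as a theoretical device rather than a computable estimator. The sketch only shows that, when the density is available, the two updates stay close to each other in the interior.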
By extending the results for backfitting and smooth backfitting estimators in mean regression, we will use this fact to get our main result (Theorem 2.2). The latter states an asymptotic normality result for the ordinary and smooth backfitting quantile estimators in additive models. Its proof is based on an argument that carries an asymptotic normality result in mean regression over to quantile regression.

We now introduce assumptions that guarantee asymptotic equivalence between the mean and the quantile backfitting estimators after one cycle of update. Further assumptions that are needed for iterative updates will be given after Proposition 2.1. For simplicity, we state Proposition 2.1 and its conditions only for the updates of the first additive component. In abuse of notation, we denote the estimators of the components $m_j$, $2 \le j \le d$, at the preceding iteration step, by $\hat m_2^{l}, \ldots, \hat m_d^{l}$, where $l$ stands for BF, SBF, $*$,BF or $*$,SBF. The updates of the first component that are obtained by plugging these estimators into (2.2), (2.3), (2.7) and (2.8), respectively, are denoted by $\hat m_1^{\mathrm{BF}}$, $\hat m_1^{\mathrm{SBF}}$, $\hat m_1^{*,\mathrm{BF}}$ and $\hat m_1^{*,\mathrm{SBF}}$. Thus, for simplicity of notation, we use the same kind of symbol for the updates ($j = 1$) and for the inputs of the backfitting algorithms ($2 \le j \le d$).

We make the following assumptions:

(A1) The $d$-dimensional vector $X^i$ has compact support $I = I_1 \times \cdots \times I_d$ for bounded intervals $I_j = [a_j, b_j]$ and its density $f_X$ is continuous and strictly positive on $I$.
(A2) There exist constants $C_K, C_S > 0$ such that for all $x_j \in I_j$, $1 \le j \le d$, the kernels $K_{j,g}(x_j, \cdot)$ are positive, bounded by $C_K g^{-1}$, have bounded support $\subset [x_j - C_S g, x_j + C_S g]$, and are Lipschitz continuous with Lipschitz constant bounded by $C_K g^{-2}$. The weight functions $w_j$ are bounded functions with $w_j(x_j) \ge 0$ for $x_j \in I_j$ and $\int w_j(x_j)\,dx_j > 0$.
(A3) The conditional density $f_{\varepsilon|X}(0 \mid x)$ of $\varepsilon$ given $X = x$ is bounded away from zero and infinity for $x \in I$.
Furthermore, it satisfies the following uniform Lipschitz condition:
\[
|f_{\varepsilon|X}(e \mid x) - f_{\varepsilon|X}(0 \mid x)| \le C_1 |e|
\]
for $x \in I$ and for $e$ in a neighborhood of 0 with a constant $C_1 > 0$ that does not depend on $x$.
(A4) The bandwidths $h_1, \ldots, h_d$ are of order $n^{-1/5}$.

Assumptions (A1)–(A4) are standard smoothing assumptions. In particular, (A2) is fulfilled for convolution kernels with an appropriate boundary correction.

For the properties of the updated estimators, the estimators at the preceding iteration step need to fulfill certain regularity conditions. We will proceed with the following assumptions that are stated for some constants $0 < \rho \le 1$, $\Delta_1, \Delta_2, \Delta_3 > 0$ and $0 \le \xi \le (1 + \rho)\Delta_1$.

(A5) For $j = 2, \ldots, d$, it holds for $l = \mathrm{BF}$ and $l = \mathrm{SBF}$ that
\[
\begin{aligned}
\sup_{a_j + C_S h_j \le x_j \le b_j - C_S h_j} |\hat m_j^{l}(x_j) - m_j(x_j)| &= O_P\bigl(n^{-(4+4\rho)/(10+15\rho) - \Delta_1}\bigr), \\
\sup_{a_j \le x_j \le b_j} |\hat m_j^{l}(x_j) - m_j(x_j)| &= O_P\bigl(n^{-[(4+4\rho)/(10+15\rho) - \Delta_1]/2}\bigr).
\end{aligned}
\]
(A6) There exist random functions $g_2, \ldots, g_d$ with derivatives that fulfill the Lipschitz condition
\[
|g_j'(x_j) - g_j'(x_j^*)| \le C |x_j - x_j^*|^{\rho} n^{\xi}
\]
for $j = 2, \ldots, d$ and $x_j, x_j^* \in I_j$. Furthermore, these functions satisfy
\[
\sup_{a_j \le x_j \le b_j} |\hat m_j^{l}(x_j) - g_j(x_j)| = O_P\bigl(n^{-2/5 - \Delta_2}\bigr)
\]
for $l = \mathrm{BF}$ and $l = \mathrm{SBF}$.
(A7) For $j = 2, \ldots, d$, it holds for $l = \mathrm{BF}$ and $l = \mathrm{SBF}$ that
\[
\begin{aligned}
\sup_{a_j + C_S h_j \le x_j \le b_j - C_S h_j} |\hat m_j^{l}(x_j) - \hat m_j^{*,l}(x_j)| &= O_P\bigl(n^{-2/5 - \Delta_3}\bigr), \\
\sup_{a_j \le x_j \le b_j} |\hat m_j^{l}(x_j) - \hat m_j^{*,l}(x_j)| &= O_P\bigl(n^{-1/5 - \Delta_3}\bigr).
\end{aligned}
\]

We briefly comment on the assumptions (A5)–(A7). A more detailed discussion is given after Theorem 2.2. Assumption (A5) requires suboptimal rates for the preceding estimators that are plugged in for the update of the first component. Assumption (A6) states that the class of possible realizations of the preceding estimators is not too rich. We assume that the preceding estimators are in a neighborhood of the class of functions with Lipschitz continuous derivatives. Other classes could be used but for a Lipschitz class it is relatively easy to check if a function belongs to it.
Note that we do not assume that the quantile estimator itself has a smooth derivative. In general, such an assumption does not hold because quantile kernel estimators are not smooth. Assumption (A7) is very natural. It states that the estimators that are plugged into the updating equation of the quantile model and of the mean regression model differ only by second order terms. Without this assumption, it cannot be expected that the updated estimators differ also only by second order terms. We will see below that this assumption is automatically fulfilled if we apply Proposition 2.1 for an analysis of iterative applications of the backfitting algorithms. In the assumptions (A5) and (A7), if one replaces the interior region $[a_j + C_S h_j, b_j - C_S h_j]$ by the whole range $[a_j, b_j]$ and if one uses boundary corrected kernels, then one can also replace in Proposition 2.1 the suprema over the interior region by those over the whole range, and the estimators achieve the rate $n^{-2/5}$ at the boundary, too.

Proposition 2.1. Under the assumptions (A1)–(A7), it holds for the updated estimators with $l = \mathrm{BF}$ and with $l = \mathrm{SBF}$ that for some $\delta > 0$
\[
\begin{aligned}
\sup_{a_1 + C_S h_1 \le x_1 \le b_1 - C_S h_1} |\hat m_1^{l}(x_1) - \hat m_1^{*,l}(x_1)| &= O_P\bigl(n^{-2/5 - \delta}\bigr), \\
\sup_{a_1 \le x_1 \le b_1} |\hat m_1^{l}(x_1) - \hat m_1^{*,l}(x_1)| &= O_P\bigl(n^{-1/5 - \delta}\bigr).
\end{aligned}
\]

The additional factor $n^{-\delta}$ allows an iterative application of the proposition. This has an important implication. We recall that the backfitting algorithms for mean regression have a geometric rate of convergence. In particular, in the case of smooth backfitting, only square integrability for the initial estimator is required for the algorithm to achieve the geometric rate of convergence, see Theorem 1 of Mammen, Linton and Nielsen (1999). Suppose one chooses square integrable functions, say $\hat m_2^{\mathrm{BF},[0]}, \ldots, \hat m_d^{\mathrm{BF},[0]}$ as the starting value in the algorithm for the backfitting quantile estimator and that one runs a cycle of backfitting iterations (2.2) for $j = 1, \ldots, d$.
Then we get updates $\hat m_2^{\mathrm{BF},[l]}, \ldots, \hat m_d^{\mathrm{BF},[l]}$ with $l = 1$ and after further cycles with $l > 1$. (Note that by construction of the backfitting estimator we do not need a pilot version of $\hat m_1^{\mathrm{BF},[0]}$.) Then, one can think of running the backfitting mean regression algorithm (2.7) with the same initial estimators $\hat m_2^{\mathrm{BF},[0]}, \ldots, \hat m_d^{\mathrm{BF},[0]}$ in parallel with the backfitting quantile regression algorithm (2.2). This results in updates $\hat m_2^{*,\mathrm{BF},[l]}, \ldots, \hat m_d^{*,\mathrm{BF},[l]}$ for $l \ge 1$. In the proof of our next theorem, we will see that after $l$ cycles of the two parallel iterations, the difference $\hat m_j^{\mathrm{BF},[l]} - \hat m_j^{*,\mathrm{BF},[l]}$ is of order $O_P(n^{-2/5-\delta})$ in the interior, and of order $O_P(n^{-1/5-\delta})$ at the boundaries. This holds as long as $l \le C_{\mathrm{iter}} \log n$ with $C_{\mathrm{iter}}$ small enough. On the other hand, we will show that $\hat m_j^{*,\mathrm{BF},[C_{\mathrm{iter}} \log n]}$ is asymptotically equivalent to the limit of the backfitting algorithm $\hat m_j^{*,\mathrm{BF},[\infty]}$, if $C_{\mathrm{iter}}$ is large enough. If the pilot estimators $\hat m_2^{\mathrm{BF},[0]}, \ldots, \hat m_d^{\mathrm{BF},[0]}$ are accurate enough, then the constant $C_{\mathrm{iter}}$ can be chosen such that both requirements are fulfilled. This will allow us to get the asymptotic limit distribution of $\hat m_j^{*,\mathrm{BF},[C_{\mathrm{iter}} \log n]}$, and thus that of $\hat m_j^{\mathrm{BF},[C_{\mathrm{iter}} \log n]}$.

Similar findings also hold for the smooth backfitting estimator. We denote the starting values by $\hat m_2^{\mathrm{SBF},[0]}, \ldots, \hat m_d^{\mathrm{SBF},[0]}$ and the updates by $\hat m_2^{\mathrm{SBF},[l]}, \ldots, \hat m_d^{\mathrm{SBF},[l]}$ or $\hat m_2^{*,\mathrm{SBF},[l]}, \ldots, \hat m_d^{*,\mathrm{SBF},[l]}$, respectively.

The following theorem summarizes our discussion. For the theorem, we need the following additional assumptions:

(A8) There exist constants $c_K, C_D > 0$, $C_S' \ge 0$ such that for $a_j + C_S' h_j \le x_j, u_j \le b_j - C_S' h_j$ it holds that $K_{j,h_j}(x_j, u_j) = h_j^{-1} K[h_j^{-1}(x_j - u_j)]$ for a function $K$ with $\int K(v)\,dv = 1$ and $\int v K(v)\,dv = 0$. For all $x_j, u_j \in I_j$, $1 \le j \le d$, the kernels $K_{j,g}(x_j, u_j)$ have a second derivative w.r.t. $x_j$ that is bounded by $C_D g^{-3}$ and they fulfill $\int K_{j,g}(x_j, v_j)\,dv_j \ge c_K$ and $\int K_{j,g}(v_j, u_j)\,dv_j = 1$.
(A9) The function $f^w_{X_k|X_j}(x_k \mid x_j) \equiv f^w_{X_j,X_k}(x_j, x_k) / f^w_{X_j}(x_j)$ has a second derivative w.r.t. $x_j$ that is bounded over $x_j \in I_j$, $x_k \in I_k$, $1 \le j, k \le d$, $k \ne j$.

The last condition in (A8) implies that the one-dimensional kernel density estimators integrate to one and that they are equal to the corresponding marginalization of higher-dimensional product-kernel density estimators. This assumption simplifies bias calculation of the backfitting estimators.

Theorem 2.2. Assume that (A1)–(A4), (A8) and (A9) hold, and that (A5) and (A6) are satisfied by $\hat m_j^{\mathrm{BF}} = \hat m_j^{\mathrm{BF},[0]}$ and $\hat m_j^{\mathrm{SBF}} = \hat m_j^{\mathrm{SBF},[0]}$ ($j = 2, \ldots, d$) with $\xi$, $\Delta_2$, $\Delta_3$, $\frac{2}{5} - \frac{1+\rho}{2+3\rho}\cdot\frac{4}{5} - \Delta_1 > 0$ small enough. Then, we get for $\hat m_j^{l,\mathrm{iter}} = \hat m_j^{l,[C_{\mathrm{iter}} \log n]}$ with an appropriate choice of $C_{\mathrm{iter}} = C_{\mathrm{iter},l}$ ($l = \mathrm{BF}$ and $l = \mathrm{SBF}$) that for $a_j < x_j < b_j$

\[
\sqrt{n h_j}\,\bigl[\hat m_j^{l,\mathrm{iter}}(x_j) - m_j(x_j) - h_j^2 \beta_j(x_j)\bigr] \to N\Bigl(0, \frac{\alpha(1-\alpha)}{f_{\varepsilon,X_j}(0, x_j)^2}\, f_{X_j}(x_j) \int K^2(u)\,du\Bigr)
\]

in distribution, where $\beta_j(x_j) = \beta_j^*(x_j) - \int \beta_j^*(u_j) w_j(u_j)\,du_j$,

\[
\beta_j^*(x_j) = h_j^{-2}\, m_j'(x_j) \int (u_j - x_j) K_{j,h_j}(x_j, u_j)\,du_j + \mu_{2,K}\,\tfrac{1}{2} m_j''(x_j) + \mu_{2,K}\,\beta_j^{**}(x_j),
\]

$\mu_{2,K} = \int v^2 K(v)\,dv$ and $(\beta_1^{**}, \ldots, \beta_d^{**})$ is a tuple of functions that minimizes

\[
\int \Bigl[ \sum_{j=1}^d \Bigl( \frac{\partial f_{\varepsilon,X}(0, x)/\partial x_j}{f_{\varepsilon,X}(0, x)}\, m_j'(x_j) - \beta_j^{**}(x_j) \Bigr) \Bigr]^2 f_{\varepsilon,X}(0, x)\,dx.
\]

Note that the first term in the definition of $\beta_j^*$ is of order $n^{1/5}$ at the boundary but vanishes in the interior of $I_j$. Because of the norming with the weight function $w_j$, the bias function $\beta_j$ is shifted from $\beta_j^*$ by $\int \beta_j^*(u_j) w_j(u_j)\,du_j$. One can estimate the bias and the variance terms because they only require two-dimensional objects if one calculates them with the backfitting algorithms.

We now come back to discussion of the assumptions (A5)–(A7). Assumption (A5) allows that the starting estimators have a suboptimal rate. In particular, it requires that the starting estimators are consistent.
For example, one could use here orthogonal series estimators, smoothing splines or sieve estimators. In the simulations, we got good results by using constant functions as starting values, that is, functions that are not consistent. For backfitting mean regression, it is known that every starting value works. Because of the nonlinearity of quantile regression, we do not expect that such a result can be proved for quantile regression. In our result, we did not specify the required rate for the pilot estimator. But, if one does this, we conjecture that one can get the statement of Theorem 2.2 with pilot estimators that have much slower rates. For such a theorem, one has to prove a modification of Proposition 2.1 with the following statement: for the estimators at the preceding stage of the backfitting algorithms, less accurate error bounds would suffice to get that the difference between the backfitting estimators $\hat m_1$ and $\hat m_1^*$ at the current stage of the algorithm is of higher order than the accuracy of the preceding estimators. This would allow one to weaken the assumptions on the rate of the starting estimators.

Assumption (A7) is not required for Theorem 2.2. This is because running the iterative algorithms (2.7) and (2.8) is only imaginary and in the proof we choose to use the same starting values as in the real iterative algorithms (2.2) and (2.3), respectively. Thus, (A7) is automatically satisfied at the beginning of the iterations. Proposition 2.1 tells us that the updated estimators also fulfill (A7). This holds with the same rate but with multiplicative factors. For this reason, after $L$ backfitting cycles the difference between the mean regression and the quantile regression estimators is not of order $(C \times L)\,n^{-2/5-\delta}$, but of order $C^L n^{-2/5-\delta}$, for some $\delta > 0$, $C > 1$. For a number of iterations $C_{\mathrm{iter}} \log n$ such that $C_{\mathrm{iter}} \log C < \delta$, this is of order $o(n^{-2/5})$.
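The last step can be spelled out in one line. The display below is a short worked derivation rather than part of the source; it assumes that $L = C_{\mathrm{iter}} \log n$ with the natural logarithm and ignores rounding $L$ to an integer.

```latex
\[
C^{L}\, n^{-2/5-\delta}
  = \exp\bigl(C_{\mathrm{iter}} \log n \cdot \log C\bigr)\, n^{-2/5-\delta}
  = n^{C_{\mathrm{iter}} \log C}\, n^{-2/5-\delta}
  = n^{-2/5 - (\delta - C_{\mathrm{iter}} \log C)} .
\]
```

The exponent is strictly smaller than $-2/5$ exactly when $C_{\mathrm{iter}} \log C < \delta$, which gives the stated order $o(n^{-2/5})$. This is also why the number of cycles may grow only logarithmically in $n$: each cycle costs a constant factor $C$, and $\log n$ cycles turn that into a power of $n$ that the margin $n^{-\delta}$ must absorb.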
Compared with the results for mean regression backfitting estimators, our results for quantile estimation are weaker in two aspects. First, we need initial estimators that are consistent, whereas in mean regression one can start with arbitrary initial values. This restriction comes from the nonlinearity of the quantile functional. Second, we put restrictions on the number of iteration steps. It must be of logarithmic order with a factor that is not too small and not too large. When letting run the two parallel backfitting procedures
