ebook img

Smooth Kernel Estimation of a Circular Density Function: A Connection to Orthogonal Polynomials on the Unit Circle PDF

0.11 MB·
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Smooth Kernel Estimation of a Circular Density Function: A Connection to Orthogonal Polynomials on the Unit Circle

Smooth Kernel Estimation of a Circular Density Function: A 6 1 Connection to Orthogonal 0 2 Polynomials on the Unit Circle n a J Yogendra P. Chaubey 9 1 Department of Mathematics and Statistics, Concordia University, Montr´eal, QC H3G 1M8 ] e-mail: [email protected] T S Abstract: Inthisnoteweprovideasimpleapproximationtheorymoti- . h vation for the circular kerneldensity estimation and further explore the t usefulness of the wrapped Cauchy kernel in this context. It is seen that a m thewrappedCauchykernelappearsasanaturalcandidateinconnection to orthogonal series density estimation on a unit circle. This adds fur- [ ther weight to the considerable role of the wrapped Cauchy in circular 1 statistics. v 3 AMS2000subjectclassifications:Primary62G07;secondary62G20. 5 Keywordsandphrases:Circularkerneldensityestimator,Orthogonal 0 series density. 5 0 . 1 1. Introduction 0 6 1 Consider an absolutely continuous (with respect to the Lebesgue measure) v: circular density f(θ),θ [ π,π], i.e f(θ) is 2π periodic, ∈ − − i X π r f(θ) 0 for θ R and f(θ)dθ = 1. (1.1) a ≥ ∈ Z −π In the literature on modeling circular data, starting from the classical text of Mardia (1972), there appear many standard texts such as Fisher (1993), Jammalamadaka and SenGupta (2001) and Mardia and Jupp (1972) that cover parametric models along with many inference problems. More recently various alternatives to these classical parametric models, exhibiting asymme- try and multimodality have been investigated with respect to their mathe- maticalpropertiesandgoodnessoffittosomerealdata;seeAbe and Pewsey Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 2 (2011),Jones and Pewsey (2012)),Kato and Jones (2015),Kato and Jones (2010), Minh and Farnum (2003) and Shimizu and Iida (2002). In cases where multimodal andor asymmetric models may be appropriate, semiparametricr or nonparametric modelling may be considered more appro- priate. Fern¨andez-Dur¨an (2004) and Mooney et al. (2003) considered semi- parametric analysis based on mixture of circular normal and von Mises dis- tributions and Hall et al. (1987), Bai et al. (1988), Fisher (1989), Taylor (2008) and Klemela¨ (2000) have considered nonparametric approaches. Given a random sample (θ ,....θ ) from the density (1.1), the circular 1 N kernel density estimator is given by N 1 fˆ(θ;h) = k (θ θ ) (1.2) h j N − Xj=1 where k (θ φ) is a circular kernel density function that is concentrated h − around φ as h h for some known h . As motivated in Taylor (2008), a 0 0 → natural choice for the kernel function is one of the commonly used circular probability densities, such as the wrapped normal distribution, or the von Mises distribution. Taylor (2008) investigated the use of von Mises kernel, in which case the density estimator is given by N 1 fˆ (θ;ν) = exp ν(θ θ ) (1.3) VM j N(2π)I (ν) { − } 0 Xj=1 where I (ν) is the Bessel function of order r and ν is the concentration pa- 0 rameter. Di Marzio et al. (2009) considered the use of circular kernels to circular regression while extending the use of von Mises kernels to more gen- eral circular kernels. In the present note I demonstrate that the wrapped Cauchy kernel presents itself as the kernel of choice by considering an esti- mation problem on the unit circle. We also show that this approach leads to orthogonalseries density estimation, however notruncationoftheseries isre- quired. It may be noted that the wrapped Cauchy distribution with location parameter µ and concentration parameter ρ is given by 1 1 ρ2 f (θ;µ,ρ) = − , π θ < π, (1.4) WC 2π1+ρ2 2ρcos(θ µ) − ≤ − − that becomes degenerate at θ = µ as ρ 1. The estimator of f(θ) based on → Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 3 the above kernel is given by N 1 fˆ (θ;ρ) = f (θ;θ ,ρ). (1.5) WC WC j N Xj=1 In Section 2, we provide a simple approximation theory argument behind thenonparametricdensityestimatorofthetypeintroducedin(1.2)and(1.3). In Section 3, first we present some basic results from the literature on or- thogonal polynomials on the unit circle and then introduce the strategy of estimating f(θ) by estimating an expectation of a specific complex function, that in turn produces the non-parametric circular kernel density estimator in (1.5). The next section shows that the circular kernel density estimator is equivalent to the orthogonal series estimation in a limiting sense. This equiv- alence establishes a kind of qualitative superiority of the kernel estimator over the orthogonal series estimator that requires the series to be truncated, however the kernel estimator does not have such a restriction. 2. Motivation for the Circular Kernel Density Estimator The starting point of the nonparametric density estimation is the theorem given below from approximation theory (see Mhaskar and Pai (2000)). Be- fore giving the theorem we will need the following definition: Definition 2.1. Let K C∗ where C∗ denotes the set of periodic analytic n { } ⊂ functions with a period 2π. We say that K is an approximate identity if n { } A. K (θ) 0 θ [ π,π]; n B. 1 π K≥ (θ∀) =∈1;− 2π −π n C. limR max K (θ) = 0 for every δ > 0. n→∞ |θ|≥δ n Thedefinitionaboveismotivatedfromthefollowingtheoremwhichissimi- lar tothe oneused inthetheory of linear kernel estimation (see Prakasa Rao (1983)). Theorem 2.1. Let f C∗, K be approximate identity and for n = 1,2,... n ∈ { } set 1 π f∗(θ) = f(η)K (η θ)dη. (2.1) n 2π Z − −π Then we have lim sup f∗(x) f(x) = 0. (2.2) n→∞x∈[−π,π]| − | Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 4 Note that taking the sequence of concentration coefficients ρ ρ such n ≡ that ρ 1, the density function of the Wrapped Cauchy will satisfy the n → conditions in the definition in place of 2πK . The integral in the above the- n orem. In general 2πK may be replaced by a sequence of periodic densities n on [ π,π], that converge to to a degenerate distribution at θ = 0. − For a given random sample of θ ,...,θ from the circular density f, the 1 N Monte-Carlo estimate of f∗ is given by N 1 f˜(θ) = K (θ θ), (2.3) n j (2π)N − jX=11 the suffix n for the kernel K may be a function of the sample size N. The kernel given by the wrapped Cauchy density satisfies the assumptions in the above theorem that provides the estimator proposed in (1.5). 3. Some Preliminary Results from Complex Analysis Let D be the open unit disk, z z < 1 , in Z and let µ be a continuous { | | | } measure defined on the boundary ∂D, i.e. the circle z z = 1 . The point { | | | } z D will be represented by z = reiθ for r [0,1),θ [0,2π) and i = √ 1. ∈ ∈ ∈ − A standard result in complex analysis involves the Poisson representation that involves the real and complex Poisson kernels that are defined as 1 r2 P (θ,ϕ) = − (3.1) r 1+r2 2rcos(θ ϕ) − − for θ,ϕ [0,2π) and r [0,1) and by ∈ ∈ ω +z C(z,ω) = (3.2) ω z − for ω ∂D and z D. The connection between these kernels is given by the ∈ ∈ fact that P (θ,ϕ) = Re C(reiθ,eiϕ) = (2π)f (θ;ϕ,ρ). (3.3) r WC The Poisson representation says that if g is analytic in a neighborhood of D¯ with g(0) real, then for z D, ∈ eiθ +z dθ g(z) = Re(g(eiθ)) (3.4) Z (cid:18)eiθ z(cid:19) 2π − Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 5 (see (Simon , 2005, p. 27)). This representation leads to the result (see (ii) in 5 of Simon (2005)) that for Lebesgue a.e. θ, § limF(reiθ) F(eiθ) (3.5) r↑1 ≡ exists and if dµ = w(θ)dθ +dµ with dµ singular, then 2π s s w(θ) = ReF(eiθ), (3.6) where eiθ +z F(z) = dµ(θ). (3.7) Z (cid:18)eiθ z(cid:19) − Our strategy for smooth estimation is the fact that for dµ = 0 we have s 1 f(θ) = limRe F(reiθ), (3.8) 2π r↑1 where eiθ +z F(z) = f(θ)dθ. (3.9) Z (cid:18)eiθ z(cid:19) − We define the estimator of f(θ) motivated by considering an estimator of F(z), the identity (3.6) and (3.8), i.e. 1 fˆ(θ) = Re F (reiθ) (3.10) r N 2π where 1 N eiθj +reiθ F (reiθ) = , (3.11) N N (cid:18)eiθj reiθ(cid:19) Xj=1 − where r has to be chosen appropriately. Recognize that N 1 F (reiθ) = C(z,ω ), (3.12) N j N Xj=1 where ω = eiθj, then using (3.4), we have j N 1 Re F (reiθ) = P (θ,θ ), (3.13) N r j N Xj=1 Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 6 and therefore N 1 fˆ(θ) = P (θ,θ ) r r j (2π)N Xj=1 N 1 = f (θ;θ ,r), (3.14) WC j N Xj=1 that is of the same form as in (1.5). 4. Orthogonal Series Estimation WegettheorthogonalexpansionofF(z)withrespecttothebasis 1,z,z2,... { } as ∞ F(z) = 1+2 c zn (4.1) n Xn=1 where c = e−inθf(θ)dθ, n Z is the jth trigonometric moment. The series is truncated at some term N∗ so that the the error is negligible. However, we show below that estimating the trigonometric moment c ,n = 1,2,... as n N 1 cˆ = e−inθj, n N Xj=1 Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 7 the estimator of F(z) is the same as given in the previous section. This can be shown by writing N ∞ 2 Fˆ(z) = 1+ e−inθjzn N { } Xj=1 Xn=1 N ∞ 2 = 1+ (ω¯ z)n ;ω = einθj j j N { } Xj=1 Xn=1 N 2 ω¯ z j = 1+ N (cid:18)1 ω¯ z(cid:19) Xj=1 − j N 2 1 ω¯ z j = + N (cid:18)2 1 ω¯ z(cid:19) Xj=1 − j N 1 1+ω¯ z j = N (cid:18)1 ω¯ z(cid:19) Xj=1 − j N 1 = C(z,ω ), j N Xj=1 which is the same as F (z) given in (3.12). This ensures that the orthogonal N series estimator of the density coincides with the circular kernel estimator. The determination of the smoothing constant may be handled based on the cross validation method outlined in Taylor (2008). Remark: Note that the simplification used in the above formulae does not work for r = 1. Even though, the limiting form of (4.1) is used to define an orthogonal series estimator as given by N n∗ 1 1 fˆ (θ) = + cosn(θ θ ), (4.2) S j 2π πN − Xj=1 Xn=1 where n∗ is chosen according to some criterion, for example to minimize the integrated squared error. Thus the above discussion presents two contrasting situations: in one we have to determine the number of terms in the series and in the other number of terms in the series is allowed to be infinite, however, we choose to evaluate Re F(reiθ) for some r close to 1 as an approximation to Re F(eiθ). Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 8 References Abe, T. and Pewsey, A. (2011). Symmetric Circular Models Through Dupli- cationandCosinePerturbation.Computational Statistics & Data Analysis, 55(12), 3271–3282. Bai, Z.D., Rao, C.R., Zhao, L.C. (1988). Kernel Estimators of Density Func- tion of Directional Data. Journal of Multivariate Analysis, 27(1), 24–39. DiMarzio,M.,Panzera,A.,Taylor,C.C.(2009).LocalPolynomialRegression forCircular Predictors. Statistics & Probability Letters,79(19),2066–2075. Fern¨andez-Dur¨an, J.J. (2004). Circular distributions based on nonnegative trigonometric sums, Biometrics, 60, 499–503. Fisher, N.I. (1989). Smoothing a sample of circular data, J. Structural Geol- ogy, 11, 775–778. Fisher, N.I. (1993). Statistical Analysis of Circular Data. Cambridge Univer- sity Press, Cambridge. Hall P, Watson GP, Cabrera J (1987). Kernel Density Estimation for Spher- ical Data. Biometrika, 74(4), 751–762. Jammalamadaka, S. R., SenGupta, A. (2001). Topics in Circular Statistics. World Scientific, Singapore. Jones, M.C. and Pewsey, A (2012). Inverse Batschelet Distributions for Cir- cular Data. Biometrics, 68(1), 183–193. Kato, S. and Jones, M.C. (2010). A family of distributions on the circle with links to, and applications arising from, M¨obius transformation. J. Amer. Statist. Assoc. 105, 249-262. Kato, S. and Jones, M. C. (2015). A tractable and interpretable four- parameter family of unimodal distributions on the circle. Biometrika, 102(9), 181–190. Klemela¨, J. (2000). Estimation of Densities and Derivatives of Densities with Directional Data. Journal of Multivariate Analysis, 73(1), 18–40. Mardia, K.V. (1972). Statistics of Directional Data. Academic Press, New York. Mardia, K.V. and Jupp, P. E. (2000). Directional Statistics. John Wiley & Sons, New York, NY, USA. Mhaskar,H.N.andPai,D.V.(2000).Fundamentalsof approximation Theory. Narosa Publishing House, New Delhi, India. Minh, D. and Farnum, N. (2003). Using bilinear transformations to induce probability distributions. Commun. Stat.-Theory Meth., 32, 1–9. Mooney A., Helms P.J. and Jolliffe, I.T. (2003). Fitting mixtures of von Y.P. Chaubey/Smooth Circular Density: A Connection to OPUC 9 Mises distributions: a case study involving sudden infant death syndrome, Comput. Stat. Data Anal., 41, 505– 513. Prakasa Rao, B.L.S. (1983). Non Parametric Functional Estimation. Aca- demic Press, Orlando, Florida. Simon, B. (2005). Orthogonal Polynomials on the Unit Circle, Part 1: Clas- sical Theory. American Mathematical Society, Providence, Rhode Island. Shimizu, K. and Iida, K. (2002). Pearson type vii distributions on spheres. Commun. Stat.–Theory Meth., 31, 513–526. Taylor, C.C. (2008). Automatic Bandwidth Selection for Circular Density Estimation. Computational Statistics & Data Analysis, 52(7), 3493–3500.

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.