Optimal bandwidth estimators of kernel density functionals for contaminated data


Gündüz N., Aydın C.

JOURNAL OF APPLIED STATISTICS, cilt.48, ss.2239-2258, 2021 (SCI-Expanded) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 48
  • Basım Tarihi: 2021
  • Doi Numarası: 10.1080/02664763.2021.1944999
  • Dergi Adı: JOURNAL OF APPLIED STATISTICS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, ABI/INFORM, Aerospace Database, Business Source Elite, Business Source Premier, CAB Abstracts, Veterinary Science Database, zbMATH
  • Sayfa Sayıları: ss.2239-2258
  • Anahtar Kelimeler: Bandwidth, density functionals, density estimation, kernel smoothing, contaminated data
  • Gazi Üniversitesi Adresli: Evet

Özet

In this study, we provide simulation-based exploration and characterization of the two most crucial kernel density functionals that play a central role in kernel density estimation, considering the probability density functions that are members of the location-scale family. Kernel density functional estimates are known to rely on the choice of preliminary bandwidth. Normal-scale estimators are commonly used to obtain preliminary bandwidth estimates, with the assumption that the data come from normal distribution. Here, we present an alternative approach, called the Cauchy-scale estimators, to obtain preliminary bandwidth estimates. In this approach, data are assumed to come from a Cauchy distribution. Furthermore, analysis results related to the sampling distribution of bandwidth estimators based on the normal- and Cauchy-scale approaches are presented. As a case study, we provide a comprehensive characterization of different contamination levels with a simulation study constructed for the random samples from normal distributions with various parameters and various contamination levels. The proposed preliminary bandwidth selection shows lower variance in both mixture and contaminated data in our simulations. Besides, functional bandwidth presents results similar to the simulation results in the applications we made on the real data set.