Detecting atypical data in air pollution studies by using shorth intervals for regression
ESAIM: Probability and Statistics, Tome 9 (2005), pp. 230-240.

To validate pollution data, subject-matter experts in Airpl (an organization that maintains a network of air pollution monitoring stations in western France) daily perform visual examinations of the data and check their consistency. In this paper, we describe these visual examinations and propose a formalization for this problem. The examinations consist in comparisons of so-called shorth intervals so we build a statistical test that compares such intervals in a nonparametric regression model. This allows to detect atypical data. A practical application of the test is given.

DOI : 10.1051/ps:2005013
Classification : 62G08, 62G09, 62G10, 62P12
Mots-clés : air pollution, validation, regression, bootstrap, shorth
@article{PS_2005__9__230_0,
     author = {Durot, C\'ecile and Thi\'ebot, Karelle},
     title = {Detecting atypical data in air pollution studies by using shorth intervals for regression},
     journal = {ESAIM: Probability and Statistics},
     pages = {230--240},
     publisher = {EDP-Sciences},
     volume = {9},
     year = {2005},
     doi = {10.1051/ps:2005013},
     mrnumber = {2167325},
     zbl = {1137.62408},
     language = {en},
     url = {http://www.numdam.org/articles/10.1051/ps:2005013/}
}
TY  - JOUR
AU  - Durot, Cécile
AU  - Thiébot, Karelle
TI  - Detecting atypical data in air pollution studies by using shorth intervals for regression
JO  - ESAIM: Probability and Statistics
PY  - 2005
SP  - 230
EP  - 240
VL  - 9
PB  - EDP-Sciences
UR  - http://www.numdam.org/articles/10.1051/ps:2005013/
DO  - 10.1051/ps:2005013
LA  - en
ID  - PS_2005__9__230_0
ER  - 
%0 Journal Article
%A Durot, Cécile
%A Thiébot, Karelle
%T Detecting atypical data in air pollution studies by using shorth intervals for regression
%J ESAIM: Probability and Statistics
%D 2005
%P 230-240
%V 9
%I EDP-Sciences
%U http://www.numdam.org/articles/10.1051/ps:2005013/
%R 10.1051/ps:2005013
%G en
%F PS_2005__9__230_0
Durot, Cécile; Thiébot, Karelle. Detecting atypical data in air pollution studies by using shorth intervals for regression. ESAIM: Probability and Statistics, Tome 9 (2005), pp. 230-240. doi : 10.1051/ps:2005013. http://www.numdam.org/articles/10.1051/ps:2005013/

[1] L. Bel, L. Bellanger, V. Bonneau, G. Ciuperca, D. Dacunha-Castelle, C. Deniau, B. Ghattas, Y. Misiti and G. Oppenheim, Éléments de comparaison de prévisions statistiques des pics d'ozone. Rev. Statist. App. 3 (1999) 7-25. | Numdam

[2] C. Durot and K. Thiébot. Bootstrapping the shorth for regression. Submitted (2003). | Numdam

[3] P. Hall, J.W. Kay and D.M. Titterington, Asymptotically optimal difference-based estimation of variance in nonparametric regression. Biometrika 77 (1990) 521-529.

[4] K. Thiébot, Synthèse de l'enquête sur la procédure de validation de données dans les résaux de surveillance de pollution athmosphérique. Technical report, Air Pays de la Loire (1998).

Cité par Sources :