To validate pollution data, subject-matter experts in Airpl (an organization that maintains a network of air pollution monitoring stations in western France) daily perform visual examinations of the data and check their consistency. In this paper, we describe these visual examinations and propose a formalization for this problem. The examinations consist in comparisons of so-called shorth intervals so we build a statistical test that compares such intervals in a nonparametric regression model. This allows to detect atypical data. A practical application of the test is given.
Mots clés : air pollution, validation, regression, bootstrap, shorth
@article{PS_2005__9__230_0, author = {Durot, C\'ecile and Thi\'ebot, Karelle}, title = {Detecting atypical data in air pollution studies by using shorth intervals for regression}, journal = {ESAIM: Probability and Statistics}, pages = {230--240}, publisher = {EDP-Sciences}, volume = {9}, year = {2005}, doi = {10.1051/ps:2005013}, mrnumber = {2167325}, zbl = {1137.62408}, language = {en}, url = {http://www.numdam.org/articles/10.1051/ps:2005013/} }
TY - JOUR AU - Durot, Cécile AU - Thiébot, Karelle TI - Detecting atypical data in air pollution studies by using shorth intervals for regression JO - ESAIM: Probability and Statistics PY - 2005 SP - 230 EP - 240 VL - 9 PB - EDP-Sciences UR - http://www.numdam.org/articles/10.1051/ps:2005013/ DO - 10.1051/ps:2005013 LA - en ID - PS_2005__9__230_0 ER -
%0 Journal Article %A Durot, Cécile %A Thiébot, Karelle %T Detecting atypical data in air pollution studies by using shorth intervals for regression %J ESAIM: Probability and Statistics %D 2005 %P 230-240 %V 9 %I EDP-Sciences %U http://www.numdam.org/articles/10.1051/ps:2005013/ %R 10.1051/ps:2005013 %G en %F PS_2005__9__230_0
Durot, Cécile; Thiébot, Karelle. Detecting atypical data in air pollution studies by using shorth intervals for regression. ESAIM: Probability and Statistics, Tome 9 (2005), pp. 230-240. doi : 10.1051/ps:2005013. http://www.numdam.org/articles/10.1051/ps:2005013/
[1] Éléments de comparaison de prévisions statistiques des pics d'ozone. Rev. Statist. App. 3 (1999) 7-25. | Numdam
, , , , , , , and ,[2] Bootstrapping the shorth for regression. Submitted (2003). | Numdam
and .[3] Asymptotically optimal difference-based estimation of variance in nonparametric regression. Biometrika 77 (1990) 521-529.
, and ,[4] Synthèse de l'enquête sur la procédure de validation de données dans les résaux de surveillance de pollution athmosphérique. Technical report, Air Pays de la Loire (1998).
,Cité par Sources :