We investigate the optimality for model selection of the so-called slope heuristics, V-fold cross-validation and V-fold penalization in a heteroscedastic regression context with random design. We consider a new class of linear models that we call strongly localized bases and that generalize histograms, piecewise polynomials and compactly supported wavelets. We derive sharp oracle inequalities that prove the asymptotic optimality of the slope heuristics (when the optimal penalty shape is known) and of V-fold penalization. Furthermore, V-fold cross-validation seems to be suboptimal for a fixed value of V, since it asymptotically recovers the oracle learned from a sample size equal to 1 − 1/V of the original amount of data. Our results are based on genuine concentration inequalities for the true and empirical excess risks that are of independent interest. Our experiments show the good behavior of the slope heuristics for the selection of linear wavelet models. Furthermore, V-fold cross-validation and V-fold penalization have comparable efficiency.
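As an illustration of the V-fold cross-validation procedure discussed above, here is a minimal sketch that selects the number of bins of a regular histogram model, the simplest example of the models the abstract mentions. It is not the authors' experimental setup (which uses linear wavelet models): the regression function, the noise level `0.1 + 0.4 * x`, the candidate list and all function names are assumptions made for this example. Each fit uses only the V − 1 retained folds, that is, a fraction 1 − 1/V of the data.

```python
import math
import random


def fit_histogram(xs, ys, n_bins):
    """Least-squares fit on a regular histogram model over [0, 1]: bin-wise means."""
    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for x, y in zip(xs, ys):
        b = min(int(x * n_bins), n_bins - 1)
        sums[b] += y
        counts[b] += 1
    # Empty bins predict 0 (an arbitrary convention for this sketch).
    return [s / c if c > 0 else 0.0 for s, c in zip(sums, counts)]


def predict(coefs, x):
    """Evaluate the fitted piecewise-constant function at x."""
    return coefs[min(int(x * len(coefs)), len(coefs) - 1)]


def v_fold_cv_select(xs, ys, candidate_bins, V=5, seed=0):
    """Return the bin count minimizing the V-fold cross-validated squared error."""
    order = list(range(len(xs)))
    random.Random(seed).shuffle(order)
    folds = [order[j::V] for j in range(V)]  # V disjoint blocks of indices
    cv_risk = {}
    for d in candidate_bins:
        fold_errors = []
        for j in range(V):
            held_out = set(folds[j])
            # Training set: all folds except the j-th one.
            train = [i for i in order if i not in held_out]
            coefs = fit_histogram([xs[i] for i in train], [ys[i] for i in train], d)
            err = sum((ys[i] - predict(coefs, xs[i])) ** 2 for i in folds[j])
            fold_errors.append(err / len(folds[j]))
        cv_risk[d] = sum(fold_errors) / V
    best = min(cv_risk, key=cv_risk.get)
    return best, cv_risk


# Synthetic heteroscedastic data with random design (assumed for illustration).
rng = random.Random(1)
n = 1000
xs = [rng.random() for _ in range(n)]
ys = [math.sin(2 * math.pi * x) + (0.1 + 0.4 * x) * rng.gauss(0.0, 1.0) for x in xs]

candidates = [1, 2, 4, 8, 16, 32, 64]
best, cv_risk = v_fold_cv_select(xs, ys, candidates, V=5)
print(best, round(cv_risk[best], 3))
```

With V = 5 and n = 1000, every fit in the loop sees 800 observations, which is the reduced sample size that the abstract's remark on suboptimality refers to.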
DOI : 10.1051/ps/2017005
Keywords: Nonparametric regression, heteroscedastic noise, random design, model selection, cross-validation, wavelets
@article{PS_2017__21__412_0, author = {Navarro, Fabien and Saumard, Adrien}, title = {Slope heuristics and {V-Fold} model selection in heteroscedastic regression using strongly localized bases}, journal = {ESAIM: Probability and Statistics}, pages = {412--451}, publisher = {EDP-Sciences}, volume = {21}, year = {2017}, doi = {10.1051/ps/2017005}, mrnumber = {3743921}, zbl = {1395.62093}, language = {en}, url = {http://www.numdam.org/articles/10.1051/ps/2017005/} }
Navarro, Fabien; Saumard, Adrien. Slope heuristics and V-Fold model selection in heteroscedastic regression using strongly localized bases. ESAIM: Probability and Statistics, Volume 21 (2017), pp. 412-451. doi: 10.1051/ps/2017005. http://www.numdam.org/articles/10.1051/ps/2017005/