Estimating Shape Parameters of Piecewise Linear-Quadratic Problems

Zheng, Peng; Ramamurthy, Karthikeyan Natesan; Aravkin, Aleksandr Y.

doi:10.5802/ojmo.10

Zheng, Peng ¹ ; Ramamurthy, Karthikeyan Natesan ² ; Aravkin, Aleksandr Y.³

¹ Department of Health Metrics Sciences, University of Washington Seattle, WA, USA
² IBM Thomas J. Watson Research Center Yorktown Heights, NY, USA
³ Department of Applied Mathematics, University of Washington Seattle, WA, USA

Open Journal of Mathematical Optimization, Tome 2 (2021), article no. 8, 18 p.

Résumé

Piecewise Linear-Quadratic (PLQ) penalties are widely used to develop models in statistical inference, signal processing, and machine learning. Common examples of PLQ penalties include least squares, Huber, Vapnik, 1-norm, and their asymmetric generalizations. Properties of these estimators depend on the choice of penalty and its shape parameters, such as degree of asymmetry for the quantile loss, and transition point between linear and quadratic pieces for the Huber function.

In this paper, we develop a statistical framework that can help the modeler to automatically tune shape parameters once the shape of the penalty has been chosen. The choice of the parameter is informed by the basic notion that each PLQ penalty should correspond to a true statistical density. The normalization constant inherent in this requirement helps to inform the optimization over shape parameters, giving a joint optimization problem over these as well as primary parameters of interest. A second contribution is to consider optimization methods for these joint problems. We show that basic first-order methods can be immediately brought to bear, and design specialized extensions of interior point (IP) methods for PLQ problems that can quickly and efficiently solve the joint problem. Synthetic problems and larger-scale practical examples illustrate the utility of the approach. Code for the new IP method is implemented using the Julia language (https://github.com/UW-AMO/shape-parameters/tree/ojmo).

Reçu le : 2020-12-30
Accepté le : 2021-08-30
Accepté après révision le : 2021-08-31
Publié le : 2021-09-23

DOI : 10.5802/ojmo.10

Affiliations des auteurs :

Zheng, Peng ¹ ; Ramamurthy, Karthikeyan Natesan ² ; Aravkin, Aleksandr Y. ³

@article{OJMO_2021__2__A8_0,
     author = {Zheng, Peng and Ramamurthy, Karthikeyan Natesan and Aravkin, Aleksandr Y.},
     title = {Estimating {Shape} {Parameters} of {Piecewise} {Linear-Quadratic} {Problems}},
     journal = {Open Journal of Mathematical Optimization},
     eid = {8},
     pages = {1--18},
     publisher = {Universit\'e de Montpellier},
     volume = {2},
     year = {2021},
     doi = {10.5802/ojmo.10},
     language = {en},
     url = {http://www.numdam.org/articles/10.5802/ojmo.10/}
}

TY  - JOUR
AU  - Zheng, Peng
AU  - Ramamurthy, Karthikeyan Natesan
AU  - Aravkin, Aleksandr Y.
TI  - Estimating Shape Parameters of Piecewise Linear-Quadratic Problems
JO  - Open Journal of Mathematical Optimization
PY  - 2021
SP  - 1
EP  - 18
VL  - 2
PB  - Université de Montpellier
UR  - http://www.numdam.org/articles/10.5802/ojmo.10/
DO  - 10.5802/ojmo.10
LA  - en
ID  - OJMO_2021__2__A8_0
ER  -

%0 Journal Article
%A Zheng, Peng
%A Ramamurthy, Karthikeyan Natesan
%A Aravkin, Aleksandr Y.
%T Estimating Shape Parameters of Piecewise Linear-Quadratic Problems
%J Open Journal of Mathematical Optimization
%D 2021
%P 1-18
%V 2
%I Université de Montpellier
%U http://www.numdam.org/articles/10.5802/ojmo.10/
%R 10.5802/ojmo.10
%G en
%F OJMO_2021__2__A8_0

Zheng, Peng; Ramamurthy, Karthikeyan Natesan; Aravkin, Aleksandr Y. Estimating Shape Parameters of Piecewise Linear-Quadratic Problems. Open Journal of Mathematical Optimization, Tome 2 (2021), article  no. 8, 18 p. doi : 10.5802/ojmo.10. http://www.numdam.org/articles/10.5802/ojmo.10/

Bibliographie
Cité par

[1] Aravkin, Aleksandr Y.; Burke, James V.; Pillonetto, Gianluigi Sparse/Robust Estimation and Kalman Smoothing with Nonsmooth Log-Concave Densities: Modeling, Computation, and Theory, J. Mach. Learn. Res., Volume 14 (2013), pp. 2689-2728 | MR | Zbl

[2] Aravkin, Aleksandr Y.; Burke, James V.; Pillonetto, Gianluigi Generalized system identification with stable spline kernels, SIAM J. Sci. Comput., Volume 40 (2018) no. 5, p. B1419-B1443 | DOI | MR | Zbl

[3] Aravkin, Aleksandr Y.; Drusvyatskiy, Dmitriy; van Leeuwen, Tristan Efficient quadratic penalization through the partial minimization technique, IEEE Trans. Autom. Control, Volume 63 (2017) no. 7, pp. 2131-2138 | DOI | MR | Zbl

[4] Aravkin, Aleksandr Y.; Van Leeuwen, Tristan Estimating nuisance parameters in inverse problems, Inverse Probl., Volume 28 (2012) no. 11, 115016, 13 pages | MR | Zbl

[5] Arlot, Sylvain; Celisse, Alain et al. A survey of cross-validation procedures for model selection, Stat. Surv., Volume 4 (2010), pp. 40-79 | MR | Zbl

[6] Attouch, Hédy; Bolte, Jérôme; Redont, Patrick; Soubeyran, Antoine Proximal alternating minimization and projection methods for nonconvex problems: An approach based on the Kurdyka–Łojasiewicz inequality, Math. Oper. Res., Volume 35 (2010) no. 2, pp. 438-457 | DOI | Zbl

[7] Bell, Bradley M.; Burke, James V.; Schumitzky, Alan A relative weighting method for estimating parameters and variances in multiple data sets, Comput. Stat. Data Anal., Volume 22 (1996) no. 2, pp. 119-135 | DOI | MR | Zbl

[8] Bera, Anil K.; Galvao, Antonio F.; Montes-Rojas, Gabriel V.; Park, Sung Y. Asymmetric Laplace Regression: Maximum Likelihood, Maximum Entropy and Quantile Regression, J. Econom. Methods, Volume 5 (2016) no. 1, pp. 79-101 | MR | Zbl

[9] Bergstra, James S.; Bardenet, Rémi; Bengio, Yoshua; Kégl, Balázs Algorithms for hyper-parameter optimization, Advances in Neural Information Processing Systems, MIT Press (2011), pp. 2546-2554

[10] Bergstra, James S.; Bengio, Yoshua Random search for hyper-parameter optimization, J. Mach. Learn. Res., Volume 13 (2012), pp. 281-305 | MR | Zbl

[11] Bolte, Jérôme; Sabach, Shoham; Teboulle, Marc Proximal alternating linearized minimization for nonconvex and nonsmooth problems, Math. Program., Volume 146 (2014) no. 1-2, pp. 459-494 | DOI | MR | Zbl

[12] Boyd, Stephen; Vandenberghe, Lieven Convex optimization, Cambridge University Press, 2004 | Zbl

[13] Candès, Emmanuel J; Li, Xiaodong; Ma, Yi; Wright, John Robust principal component analysis?, J. ACM, Volume 58 (2011) no. 3, 11, 37 pages | MR | Zbl

[14] Chandrasekaran, Venkat; Sanghavi, Sujay; Parrilo, Pablo A.; Willsky, Alan S. Sparse and low-rank matrix decompositions, IFAC Proceedings Volumes, Volume 42 (2009) no. 10, pp. 1493-1498 | DOI

[15] Donoho, David L Compressed sensing, IEEE Trans. Inf. Theory, Volume 52 (2006) no. 4, pp. 1289-1306 | DOI | MR | Zbl

[16] Driggs, Derek; Becker, Stephen; Aravkin, Aleksandr Y. Adapting regularized low-rank models for parallel architectures, SIAM J. Sci. Comput., Volume 41 (2019) no. 1, p. A163-A189 | DOI | MR | Zbl

[17] Huber, Peter J. Robust statistics, 523, John Wiley & Sons, 2004

[18] Hutter, Frank; Hoos, Holger H.; Leyton-Brown, Kevin Sequential model-based optimization for general algorithm configuration, International Conference on Learning and Intelligent Optimization, Springer (2011), pp. 507-523 | DOI

[19] Klein, Aaron; Falkner, Stefan; Bartels, Simon; Hennig, Philipp; Hutter, Frank Fast Laplace optimization of machine learning hyperparameters on large datasets (2016) (https://arxiv.org/abs/1605.07079)

[20] Koenker, Roger; Hallock, Kevin F. Quantile regression, J. Econom. Perspect., Volume 15 (2001) no. 4, pp. 143-156 | DOI

[21] Kojima, Masakazu; Megiddo, Nimrod; Noma, Toshihito; Yoshise, Akiko A Unified Approach to Interior Point Algorithms for Linear Complementarity Problems, Lecture Notes in Computer Science, 538, Springer, 1991 | MR

[22] Li, Lisha; Jamieson, Kevin; DeSalvo, Giulia; Rostamizadeh, Afshin; Talwalkar, Ameet Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization (2016) (https://arxiv.org/abs/1603.06560)

[23] Li, Liyuan; Huang, Weimin; Gu, Irene Yu-Hua; Tian, Qi Statistical modeling of complex backgrounds for foreground object detection, IEEE Trans. Image Process., Volume 13 (2004) no. 11, pp. 1459-1472

[24] Nemirovski, Arkadi; Nesterov, Yurii Interior-Point Polynomial Algorithms in Convex Programming, Studies in Applied Mathematics, 13, Society for Industrial and Applied Mathematics, 1994 | MR

[25] Orban, Dominique; Arioli, Mario Iterative Solution of Symmetric Quasi-Definite Linear Systems, SIAM Spotlights, 3, Society for Industrial and Applied Mathematics, 2017 | MR | Zbl

[26] Peng, Yigang; Ganesh, Arvind; Wright, John; Xu, Wenli; Ma, Yi RASL: Robust alignment by sparse and low-rank decomposition for linearly correlated images, IEEE Trans. Pattern Anal. Mach. Intell., Volume 34 (2012) no. 11, pp. 2233-2246 | DOI

[27] Rockafellar, R. Tyrrell; Wets, Roger J.-B. Variational analysis, 317, Springer, 2009

[28] Snoek, Jasper; Larochelle, Hugo; Adams, Ryan P. Practical Bayesian optimization of machine learning algorithms, Advances in neural information processing systems, Curran Associates Inc. (2012), pp. 2951-2959

[29] Tibshirani, Robert Regression shrinkage and selection via the lasso, J. R. Stat. Soc., Ser. B, Stat. Methodol., Volume 58 (1996) no. 1, pp. 267-288 | MR | Zbl

[30] Tu, Shiyi; Wang, Min; Sun, Xiaoqian Bayesian variable selection and estimation in maximum entropy quantile regression, J. Appl. Stat., Volume 44 (2017) no. 2, pp. 253-269 | DOI | MR | Zbl

[31] Turk, Matthew A.; Pentland, Alex P. Face recognition using eigenfaces, Computer Vision and Pattern Recognition, IEEE (1991), pp. 586-591 | DOI

[32] Wright, Stephen J. Primal-Dual Interior-Point Methods, Society for Industrial and Applied Mathematics, 1997 | Zbl

[33] Yu, Keming; Moyeed, Rana A. Bayesian quantile regression, Stat. Probab. Lett., Volume 54 (2001) no. 4, pp. 437-447 | MR | Zbl

[34] Zhang, Zhengdong; Ganesh, Arvind; Liang, Xiao; Ma, Yi Tilt: Transform invariant low-rank textures, Int. J. Comput. Vision, Volume 99 (2012) no. 1, pp. 1-24 | DOI | MR | Zbl

Cité par Sources :