Stochastic filtering and optimal control of pure jump Markov processes with noise-free partial observation

Calvia, Alessandro

doi:10.1051/cocv/2019020

Calvia, Alessandro

ESAIM: Control, Optimisation and Calculus of Variations, Tome 26 (2020), article no. 25.

Suite au passage du modèle économique de la revue en S20, le texte intégral des articles des années 2019 et 2020 est accessible uniquement sur le site de l'éditeur et est réservé aux abonnés.

Résumé

We consider an infinite horizon optimal control problem for a pure jump Markov process X, taking values in a complete and separable metric space I, with noise-free partial observation. The observation process is defined as Y$$ = h(X$$), t ≥ 0, where h is a given map defined on I. The observation is noise-free in the sense that the only source of randomness is the process X itself. The aim is to minimize a discounted cost functional. In the first part of the paper we write down an explicit filtering equation and characterize the filtering process as a Piecewise Deterministic Process. In the second part, after transforming the original control problem with partial observation into one with complete observation (the separated problem) using filtering equations, we prove the equivalence of the original and separated problems through an explicit formula linking their respective value functions. The value function of the separated problem is also characterized as the unique fixed point of a suitably defined contraction mapping.

Reçu le : 2018-03-15
Accepté le : 2019-04-02
Première publication : 2020-11-26
Publié le : 2020-03-03

MR Zbl

DOI : 10.1051/cocv/2019020

Classification : 93E11, 93E20, 60J25, 60J75
Mots-clés : Stochastic filtering, partial observation control problem, pure jump processes, piecewise-deterministic Markov processes, Markov decision processes

@article{COCV_2020__26_1_A25_0,
     author = {Calvia, Alessandro},
     title = {Stochastic filtering and optimal control of pure jump {Markov} processes with noise-free partial observation},
     journal = {ESAIM: Control, Optimisation and Calculus of Variations},
     publisher = {EDP-Sciences},
     volume = {26},
     year = {2020},
     doi = {10.1051/cocv/2019020},
     mrnumber = {4071313},
     zbl = {1441.93309},
     language = {en},
     url = {http://www.numdam.org/articles/10.1051/cocv/2019020/}
}

TY  - JOUR
AU  - Calvia, Alessandro
TI  - Stochastic filtering and optimal control of pure jump Markov processes with noise-free partial observation
JO  - ESAIM: Control, Optimisation and Calculus of Variations
PY  - 2020
VL  - 26
PB  - EDP-Sciences
UR  - http://www.numdam.org/articles/10.1051/cocv/2019020/
DO  - 10.1051/cocv/2019020
LA  - en
ID  - COCV_2020__26_1_A25_0
ER  -

%0 Journal Article
%A Calvia, Alessandro
%T Stochastic filtering and optimal control of pure jump Markov processes with noise-free partial observation
%J ESAIM: Control, Optimisation and Calculus of Variations
%D 2020
%V 26
%I EDP-Sciences
%U http://www.numdam.org/articles/10.1051/cocv/2019020/
%R 10.1051/cocv/2019020
%G en
%F COCV_2020__26_1_A25_0

Calvia, Alessandro. Stochastic filtering and optimal control of pure jump Markov processes with noise-free partial observation. ESAIM: Control, Optimisation and Calculus of Variations, Tome 26 (2020), article no. 25. doi : 10.1051/cocv/2019020. http://www.numdam.org/articles/10.1051/cocv/2019020/

Bibliographie
Cité par

[1] A. Almudevar, A dynamic programming algorithm for the optimal control of piecewise deterministic Markov processes. SIAM J. Control Opti. 40 (2001) 525–539. | DOI | MR | Zbl

[2] S. Altay, K. Colaneri and Z. Eksi, Portfolio optimization for a large investor controlling market sentiment under partial information. SIAM J. Financ. Mat. 10 (2019) 512–546. | DOI | MR | Zbl

[3] S. Asmussen, Applied Probability and Queues (Stochastic Modelling and Applied Probability). Vol. 51 of Applications of Mathematics, 2nd edn. Springer-Verlag, New York (2003). | MR | Zbl

[4] A. Bain and D. Crisan, Fundamentals of Stochastic Filtering. Springer, New York (2009). | DOI | Zbl

[5] E. Bandini, Constrained BSDEs driven by a non quasi-left-continuous random measure and optimal control of PDMPs on bounded domains. Preprint (2017). | arXiv | MR

[6] E. Bandini, Optimal control of piecewise deterministic Markov processes: a BSDE representation of the value function. ESAIM: COCV 24 (2018) 311–354. | Numdam | MR | Zbl

[7] E. Bandini and M. Fuhrman, Constrained BSDEs representation of the value function in optimal control of pure jump Markov processes. Stoch. Process. Appl. 127 (2017) 1441–1474. | DOI | MR | Zbl

[8] E. Bandini, A. Cosso, M. Fuhrman and H. Pham, Randomized filtering and Bellman equation in Wasserstein space for partial observation control problem. Stoch. Process. Appl. 129 (2019) 674–711. | DOI | MR | Zbl

[9] E. Bandini, F. Confortola and A. Cosso, BSDE representation and randomized dynamic programming principle for stochastic control problems of infinite-dimensional jump-diffusions. Preprint (2018). | arXiv | MR

[10] E. Bandini, A. Cosso, M. Fuhrman and H. Pham, Backward SDEs for optimal control of partially observed path-dependent stochastic systems: a control randomization approach. Ann. Appl. Probab. 28 (2018) 1634–1678. | DOI | MR | Zbl

[11] G. Barles, Solutions de viscosité des équations de Hamilton-Jacobi. Vol. 17 of Mathematiques & Applications. Springer-Verlag, Paris (1994). | MR | Zbl

[12] A. Bensoussan, M. Çakanyıldırım and S.P. Sethi, On the optimal control of partially observed inventory systems. C. R. Math. Acad. Sci. Paris 341 (2005) 419–426. | DOI | MR | Zbl

[13] A. Bensoussan, J. Frehse and P. Yam, Mean Field Games and Mean Field Type Control Theory. Springer Briefs in Mathematics. Springer, New York (2013). | DOI | MR | Zbl

[14] D.P. Bertsekas and S.E. Shreve, Stochastic Optimal Control: The Discrete Time Case. Vol. 139 of Mathematics in Science and Engineering. Academic Press, Inc., New York, London (1978). | MR | Zbl

[15] V.I. Bogachev, Measure Theory, Vol. I, II. Springer-Verlag, Berlin (2007). | DOI | MR | Zbl

[16] P. Brémaud, Point Processes and Queues. Springer Series in Statistics. Springer-Verlag, New York (1981). | MR | Zbl

[17] A.E. Bryson, Jr. and D.E. Johansen, Linear filtering for time-varying systems using measurements containing colored noise. IEEE Trans. Automat. Contr. AC-10 (1965) 4–10. | DOI | MR

[18] E. Buckwar and M.G. Riedler, An exact stochastic hybrid model of excitable membranes including spatio-temporal evolution. J. Math. Biol. 63 (2011) 1051–1093. | DOI | MR | Zbl

[19] A. Calvia, Optimal control of continuous-time Markov chains with noise-free observation. SIAM J. Control Optim. 56 (2018) 2000–2035. | DOI | MR | Zbl

[20] C. Ceci and A. Gerardi, Filtering of a Markov jump process with counting observations. Appl. Math. Optim. 42 (2000) 1–18. | DOI | MR | Zbl

[21] C. Ceci and A. Gerardi, Nonlinear filtering equation of a jump process with counting observations. Acta Appl. Math. 66 (2001) 139–154. | DOI | MR | Zbl

[22] C. Ceci and A. Gerardi, Controlled partially observed jump processes: dynamics dependent on the observed history. In Vol 47 of Proceedings of the Third World Congress of Nonlinear Analysts, Part 4 Catania, 2000 (2001) 2449–2460. | MR | Zbl

[23] C. Ceci, A. Gerardi and P. Tardelli, Existence of optimal controls for partially observed jump processes. Acta Appl. Math. 74 (2002) 155–175. | DOI | MR | Zbl

[24] K. Colaneri, Z. Eksi, F. Rüdiger and M. Szölgyenyi, Optimal liquidation under partial information with price impact. Preprint (2019). | arXiv | MR

[25] F. Confortola and M. Fuhrman, Filtering of continuous-time Markov chains with noise-free observation and applications. Stochastics 85 (2013) 216–251. | DOI | MR | Zbl

[26] O.L.V. Costa and F. Dufour, Continuous Average Control of Piecewise Deterministic Markov Processes. Springer Briefs in Mathematics. Springer, New York (2013). | MR | Zbl

[27] O.L.V. Costa, F. Dufour and A.B. Piunovskiy, Constrained and unconstrained optimal discounted control of piecewise deterministic Markov processes. SIAM J. Control Optim. 54 (2016) 1444–1474. | DOI | MR | Zbl

[28] M.G. Crandall, H. Ishii and P.-L. Lions, User’s guide to viscosity solutions of second order partial differential equations. Bull. Am. Math. Soc. (N.S.) 27 (1992) 1–67. | DOI | MR | Zbl

[29] D. Crisan, M. Kouritzin and J. Xiong, Nonlinear filtering with signal dependent observation noise. Electron. J. Probab. 14 (2009) 1863–1883. | DOI | MR | Zbl

[30] M.H.A. Davis, Control of piecewise-deterministic processes via discrete-time dynamic programming, in Stochastic Differential Systems (Bad Honnef, 1985). Vol. 78 of Lecture Notes in Control and Information Sciences. Springer, Berlin (1986) 140–150. | DOI | MR | Zbl

[31] M.H.A. Davis and M. Farid, Piecewise-deterministic processes and viscosity solutions, in Stochastic Analysis, Control, Optimization and Applications. Systems Control Foundations and Applications. Birkhäuser Boston, Boston, MA (1999) 249–268. | MR | Zbl

[32] M.H.A. Davis, Markov Models and Optimization. Vol. 49 of Monographs on Statistics and Applied Probability. Chapman and Hall, London (1993). | MR | Zbl

[33] M.A.H. Dempster, Optimal control of piecewise deterministic Markov processes, in Applied Stochastic Analysis (London, 1989). Vol. 5 of Stochastics Monographs, Gordon and Breach, New York (1991) 303–325. | MR | Zbl

[34] R.J. Elliott, L. Aggoun and J.B. Moore, Hidden Markov Models: Estimation and Control. Vol. 29 of Applications of Mathematics (New York). Springer-Verlag, New York (1995). | MR | Zbl

[35] G. Fabbri, F. Gozzi and A. Swiech, Stochastic Optimal Control in Infinite Dimension: Dynamic Programming and HJB Equations, With a Contribution by Marco Fuhrman and Gianmario Tessitore. Vol. 82 of Probability Theory and Stochastic Modelling. Springer, Cham (2017). | DOI | MR | Zbl

[36] W.H. Fleming and H.M. Soner, Controlled Markov Processes and Viscosity Solutions. Vol. 25 of Stochastic Modelling and Applied Probability, 2nd edn. Springer, New York (2006). | MR | Zbl

[37] L. Forwick, M. Schäl and M. Schmitz, Piecewise deterministic Markov control processes with feedback controls and unbounded costs. Acta Appl. Math. 82 (2004) 239–267. | DOI | MR | Zbl

[38] M. Jacobsen, Point Process Theory and Applications: Marked Point and Piecewise Deterministic Processes. Probability and Its Applications. Birkhäuser Boston, Inc., Boston, MA (2006). | MR | Zbl

[39] J. Jacod, Multivariate point processes: predictable projection, Radon-Nikodým derivatives, representation of martingales. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete 31 (1974) 235–253. | DOI | MR | Zbl

[40] M. Joannides and F. Legland, Nonlinear filtering with continuous time perfect observations and noninformative quadratic variation, in Proceeding of the 36th IEEE Conference on Decision and Control (1997) 1645–1650. | DOI

[41] I. Kharroubi and H. Pham, Feynman-Kac representation for Hamilton-Jacobi-Bellman IPDE. Ann. Probab. 43 (2015) 1823–1865. | DOI | MR | Zbl

[42] H. Körezlioğlu and W.J. Runggaldier, Filtering for nonlinear systems driven by nonwhite noises: an approximation scheme. Stoch. Stoch. Rep. 44 (1993) 65–102. | DOI | MR | Zbl

[43] G. Last and A. Brandt, Marked Point Processes on the Real Line: The Dynamic Approach. Probability and Its Applications (New York). Springer-Verlag, New York (1995). | MR | Zbl

[44] R.H. Martin, Jr. Differential equations on closed subsets of a Banach space. Trans. Am. Math. Soc. 179 (1973) 399–414. | DOI | MR | Zbl

[45] J.R. Norris, Markov Chains. Vol. 2 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge (1998). | MR | Zbl

[46] V. Renault, M. Thieullen and E. Trélat, Optimal control of infinite-dimensional piecewise deterministic Markov processes and application to the control of neuronal dynamics via Optogenetics. Netw. Heterog. Media 12 (2017) 417–459. | DOI | MR | Zbl

[47] L.C.G. Rogers and D. Williams, Diffusions, Markov processes, and Martingales (Foundations). Vol. 1 of Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics, 2nd edn. John Wiley & Sons, Ltd., Chichester (1994). | MR | Zbl

[48] Y. Takeuchi and H. Akashi, Least-squares state estimation of systems with state-dependent observation noise. Automatica J. IFAC 21 (1985) 303–313. | DOI | MR | Zbl

[49] D. Vermes, Optimal control of piecewise deterministic Markov process. Stochastics 14 (1985) 165–207. | DOI | MR | Zbl

[50] J.T. Winter, Optimal Control of Markovian Jump Processes with Different Information Structures. Ph.D. thesis, Universität Ulm (2008).

[51] J. Xiong, An Introduction to Stochastic Filtering Theory. Oxford University Press, New York (2008). | DOI | MR | Zbl

[52] A.A. Yushkevich, On reducing a jump controllable Markov model to a model with discrete time. Theory Probab. Appl. 25 (1980) 58–69. | DOI | Zbl

Cité par Sources :

This research was partially supported by three GNAMPA-INdAM projects in 2015, 2016 and 2017 and by MIUR-PRIN 2015 project Deterministic and stochastic evolution equations.