References
- Carmon Y, Shwartz A. Markov decision processes with exponentially representable discounting. Oper Res Lett. 2009;37:51–55.
- Feinberg EA, Shwartz A. Markov decision models with weighted discounted criteria. Math Oper Res. 1994;19:152–168.
- Hinderer K. Foundations of non-stationary dynamic programming with discrete-time parameter. Berlin: Springer-Verlag; 1970. (Lecture Notes Oper. Res. Math. Syst.; 33).
- Jasso-Fuentes H, Menaldi JL, Prieto-Rumeau T. Discrete-time control with non-constant discount factor. Math Meth Oper Res. 2020;92:377–399.
- Minjárez-Sosa JA. Markov control models with unknown random state-action-dependent discount factors. TOP. 2015;23:743–772.
- Schäl M. Conditions for optimality and for the limit of n-stage optimal policies to be optimal. Z Wahrs Verw Gerb. 1975;32:179–196.
- Wei Q, Guo X. Markov decision processes with state-dependent discount factors and unbounded rewards/costs. Oper Res Lett. 2011;39:369–374.
- González-Hernández J, López-Martínez RR, Pérez-Hernández R. Markov control processes with randomized discounted cost in Borel space. Math Meth Oper Res. 2007;65:27–44.
- González-Hernández J, López-Martínez RR, Minjárez-Sosa JA. Adaptive policies for stochastic systems under a randomized discounted criterion. Bol Soc Mat Mex. 2008;14:149–163.
- González-Hernández J, López-Martínez RR, Minjárez-Sosa JA. Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion. Kybernetika. 2009;45:737–754.
- González-Hernández J, López-Martínez RR, Minjárez-Sosa JA, et al. Constrained Markov control processes with randomized discounted cost criteria: occupation measures and extremal points. Risk and Decision Analysis. 2013;4:163–176.
- González-Hernández J, López-Martínez RR, Minjárez-Sosa JA, et al. Constrained Markov control processes with randomized discounted rate: infinite linear programming approach. Optim Control Appl Meth. 2014;35:575–591.
- Altman E. Constrained Markov decision processes with total cost criteria: occupation measures and primal LP. Math Meth Oper Res. 1996;43:45–72.
- Alvarez-Mena J, Hernández-Lerma O. Convergence of the optimal values of constrained Markov control processes. Math Meth Oper Res. 2002;55(3):461–484.
- Guo XP. Constrained nonhomogeneous Markov decision processes with expected total reward criterion. Acta Appl Math Sin (English Ser). 2000;23:230–235.
- Hernández-Lerma O, González-Herná ndez J, López-Martínez RR. Constrained average cost Markov control processes in Borel spaces. SIAM J Control Optim. 2003;42:442–468.
- Jasso-Fuentes H, Menndoza-Pérez AF de-la-Cruz-Courtois OA. Constrained Markov decision processes in Borel spaces: from discounted to average optimality. Math Meth Oper Res. 2016;84:489–525.
- Piunovskiy AB. Optimal control of random sequences in problems with constraints. Dordrecht: Kluwer; 1997.
- Hernández-Lerma O, Lasserre JB. Discrete-time Markov control processes: basic optimality criteria. New York: Springer-Verlag; 1996.
- Hernández-Lerma O, Lasserre JB. Further topics on discrete-time Markov control processes. New York: Springer-Verlag; 1999.
- Bertsekas DP, Shreve SE. Stochastic optimal control. New York: Academic Press; 1978.
- Dynkin EB, Yushkevich AA. Controlled Markov processes. Berlin: Springer-Verlag; 1979.
- Billingsley P. Convergence of probability measures. New York: Wileym; 1968.
- Bourbaki N. Integration, chap. IX. Paris: Hermann; 1969.
- Luenberger DG. Optimization by vector space methods. New York: Wiley; 1969.
- Hernández-Lerma O, Romera R. The scalarization approach to multiobjective Markov control problems: why does it work? Appl Math Optim. 2004;50:279–293.