bibtype C - Conference Paper (international conference)
ARLID 0448938
utime 20240111140909.1
mtime 20151023235959.9
WOS 000387898900125
title (primary) (eng) Second Order Optimality in Transient and Discounted Markov Decision Chains
specification
page_count 6 s.
media_type P
serial
ARLID cav_un_epca*0447280
ISBN 978-80-261-0539-8
title Procedings of the 33rd International Conference Mathematical Methods in Economics MME 2015
page_num 731-736
publisher
place Plzeň
name University of West Bohemia, Plzeň
year 2015
keyword dynamic programming
keyword discounted and transient Markov reward chains
keyword reward-variance optimality
author (primary)
ARLID cav_un_auth*0101196
full_dept (cz) Ekonometrie
full_dept (eng) Department of Econometrics
department (cz) E
department (eng) E
full_dept Department of Econometrics
name1 Sladký
name2 Karel
institution UTIA-B
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
source
source_type pdf soubor
url http://library.utia.cas.cz/separaty/2015/E/sladky-0448938.pdf
cas_special
project
ARLID cav_un_auth*0292652
project_id GA13-14445S
agency GA ČR
project
ARLID cav_un_auth*0321097
project_id GA15-10331S
agency GA ČR
abstract (eng) The article is devoted to second order optimality in Markov decision processes. Attention is primarily focused on the reward variance for discounted models and undiscounted transient models (i.e. where the spectral radius of the transition probability matrix is less than unity). Considering the second order optimality criteria means that in the class of policies maximizing (or minimizing) total expected discounted reward (or undiscounted reward for the transient model) we choose the policy minimizing the total variance. Explicit formulae for calculating the variances for transient and discounted models are reported along with sketches of algoritmic procedures for finding second order optimal policies.
action
ARLID cav_un_auth*0320824
name Mathematical Methods in Economics 2015 /33./
dates 09.09.2015-11.09.2015
place Cheb
country CZ
RIV BC
reportyear 2016
num_of_auth 1
presentation_type PR
inst_support RVO:67985556
permalink http://hdl.handle.net/11104/0250633
confidential S
arlyear 2015
mrcbU34 000387898900125 WOS
mrcbU56 pdf soubor
mrcbU63 cav_un_epca*0447280 Procedings of the 33rd International Conference Mathematical Methods in Economics MME 2015 978-80-261-0539-8 731 736 Plzeň University of West Bohemia, Plzeň 2015