UTIA - Library

bibtype

J - Journal Article

ARLID

0449029

utime

20240103210937.7

mtime

20151023235959.9

WOS

000358811700008

SCOPUS

84938521589

DOI

10.1239/jap/1437658607

title (primary) (eng)

Sample-Path Optimal Stationary Policies in Stable Markov Decision Chains with Average Reward Criterion

specification

page_count	20 s.
media_type	P

serial

ARLID

cav_un_epca*0256875

ISSN

0021-9002

title

Journal of Applied Probability

volume_id

volume

2 (2015)

page_num

419-440

publisher

name	Cambridge University Press

keyword

Dominated Convergence theorem for the expected average criterion

keyword

Discrepancy function

keyword

Kolmogorov inequality

keyword

Innovations

keyword

Strong sample-path optimality

author (primary)

ARLID	cav_un_auth*0307645
name1	Cavazos-Cadena
name2	R.
country	MX
share	50

author

ARLID	cav_un_auth*0238984
name1	Montes-de-Oca
name2	R.
country	MX
share	25

author

ARLID	cav_un_auth*0101196
name1	Sladký
name2	Karel
full_dept (cz)	Ekonometrie
full_dept	Department of Econometrics
department (cz)	E
department	E
institution	UTIA-B
full_dept	Department of Econometrics
share	25
fullinstit	Ústav teorie informace a automatizace AV ČR, v. v. i.

source

url	http://library.utia.cas.cz/separaty/2015/E/sladky-0449029.pdf

cas_special

project

project_id	171396
agency	GA AV ČR
country	CZ
ARLID	cav_un_auth*0307567

abstract (eng)

This work concerns discrete-time Markov decision chains with denumerable state and compact action sets. Besides standard continuity requirements, the main assumption on the model is that it admits a Lyapunov function m. In this context the average reward criterion is analyzed from the sample-path point of view. The main conclusion is that, if the expected average reward associated to m^2 is finite under any policy, then a stationary policy obtained from the optimality equation in the standard way is sample-path average optimal in a strong sense.

reportyear

2016

RIV

num_of_auth

inst_support

RVO:67985556

permalink

http://hdl.handle.net/11104/0250631

cooperation

ARLID	cav_un_auth*0320900
name	Universidad Aut onoma Metropolitana, Campus Iztapalapa, Avenida San Rafael Atlixco #186, Colonia Vicentina, M exico 09340,
country	MX

confidential

mrcbT16-e

STATISTICS&PROBABILITY

mrcbT16-f

0.864

mrcbT16-g

0.131

mrcbT16-h

999.9

mrcbT16-i

0.00534

mrcbT16-j

0.784

mrcbT16-k

2677

mrcbT16-s

0.724

mrcbT16-4

mrcbT16-5

0.618

mrcbT16-6

mrcbT16-7

mrcbT16-B

50.157

mrcbT16-C

34.6

mrcbT16-D

mrcbT16-E

mrcbT16-P

34.553

arlyear

2015

mrcbU14

84938521589 SCOPUS

mrcbU34

000358811700008 WOS

mrcbU63

cav_un_epca*0256875 Journal of Applied Probability 0021-9002 1475-6072 Roč. 52 č. 2 2015 419 440 Cambridge University Press