bibtype C - Conference Paper (international conference)
ARLID 0510771
utime 20240103222900.4
mtime 20191111235959.9
DOI 10.18267/pr.2019.los.186.61
title (primary) (eng) A Robustified Metalearning Procedure for Regression Estimators
specification
page_count 10 s.
media_type E
serial
ARLID cav_un_epca*0510552
ISBN 978-80-87990-18-6
title The 13th International Days of Statistics and Economics Conference Proceedings
page_num 617-626
publisher
place Slaný
name Melandrium
year 2019
editor
name1 Löster
name2 T.
editor
name1 Pavelka
name2 T.
keyword model choice
keyword computational statistics
keyword robustness
keyword variable selection
author (primary)
ARLID cav_un_auth*0345793
name1 Kalina
name2 Jan
institution UTIA-B
full_dept (cz) Stochastická informatika
full_dept (eng) Department of Stochastic Informatics
department (cz) SI
department (eng) SI
full_dept Department of Stochastic Informatics
fullinstit Ústav teorie informace a automatizace AV ČR, v. v. i.
author
ARLID cav_un_auth*0382123
name1 Neoral
name2 A.
country CZ
source
url http://library.utia.cas.cz/separaty/2019/SI/kalina-0510771.pdf
cas_special
project
project_id GA17-07384S
agency GA ČR
ARLID cav_un_auth*0345381
abstract (eng) Metalearning represents a useful methodology for selecting and recommending a suitable algorithm or method for a new dataset exploiting a database of training datasets. While metalearning is potentially beneficial for the analysis of economic data, we must be aware of its instability and sensitivity to outlying measurements (outliers) as well as measurement errors. The aim of this paper is to robustify the metalearning process. First, we prepare some useful theoretical tools exploiting the idea of implicit weighting, inspired by the least weighted squares estimator. These include a robust coefficient of determination, a robust version of mean square error, and a simple rule for outlier detection in linear regression. We perform a metalearning study for recommending the best linear regression estimator for a new dataset (not included in the training database). The prediction of the optimal estimator is learned over a set of 20 real datasets with economic motivation, while the least squares are compared with several (highly) robust estimators. We investigate the effect of variable selection on the metalearning results. If the training as well as validation data are considered after a proper robust variable selection, the metalearning performance is improved remarkably, especially if a robust prediction error is used.
action
ARLID cav_un_auth*0382124
name International Days of Statistics and Economics /13./
dates 20190905
mrcbC20-s 20190907
place Prague
country CZ
RIV BA
FORD0 10000
FORD1 10100
FORD2 10101
reportyear 2020
presentation_type PR
inst_support RVO:67985556
permalink http://hdl.handle.net/11104/0301154
confidential S
arlyear 2019
mrcbU14 SCOPUS
mrcbU24 PUBMED
mrcbU34 WOS
mrcbU63 cav_un_epca*0510552 The 13th International Days of Statistics and Economics Conference Proceedings 978-80-87990-18-6 617 626 Slaný Melandrium 2019
mrcbU67 340 Löster T.
mrcbU67 340 Pavelka T.