UTIA - Library

bibtype

C - Conference Paper (international conference)

ARLID

0510771

utime

20240103222900.4

mtime

20191111235959.9

DOI

10.18267/pr.2019.los.186.61

title (primary) (eng)

A Robustified Metalearning Procedure for Regression Estimators

specification

page_count	10 s.
media_type	E

serial

ARLID

cav_un_epca*0510552

ISBN

978-80-87990-18-6

title

The 13th International Days of Statistics and Economics Conference Proceedings

page_num

617-626

publisher

place	Slaný
name	Melandrium
year	2019

editor

name1	Löster
name2	T.

editor

name1	Pavelka
name2	T.

keyword

model choice

keyword

computational statistics

keyword

robustness

keyword

variable selection

author (primary)

ARLID	cav_un_auth*0345793
name1	Kalina
name2	Jan
institution	UTIA-B
full_dept (cz)	Stochastická informatika
full_dept (eng)	Department of Stochastic Informatics
department (cz)	SI
department (eng)	SI
full_dept	Department of Stochastic Informatics
fullinstit	Ústav teorie informace a automatizace AV ČR, v. v. i.

author

ARLID	cav_un_auth*0382123
name1	Neoral
name2	A.
country	CZ

source

url	http://library.utia.cas.cz/separaty/2019/SI/kalina-0510771.pdf

cas_special

project

project_id	GA17-07384S
agency	GA ČR
ARLID	cav_un_auth*0345381

abstract (eng)

Metalearning represents a useful methodology for selecting and recommending a suitable algorithm or method for a new dataset exploiting a database of training datasets. While metalearning is potentially beneficial for the analysis of economic data, we must be aware of its instability and sensitivity to outlying measurements (outliers) as well as measurement errors. The aim of this paper is to robustify the metalearning process. First, we prepare some useful theoretical tools exploiting the idea of implicit weighting, inspired by the least weighted squares estimator. These include a robust coefficient of determination, a robust version of mean square error, and a simple rule for outlier detection in linear regression. We perform a metalearning study for recommending the best linear regression estimator for a new dataset (not included in the training database). The prediction of the optimal estimator is learned over a set of 20 real datasets with economic motivation, while the least squares are compared with several (highly) robust estimators. We investigate the effect of variable selection on the metalearning results. If the training as well as validation data are considered after a proper robust variable selection, the metalearning performance is improved remarkably, especially if a robust prediction error is used.

action

ARLID	cav_un_auth*0382124
name	International Days of Statistics and Economics /13./
dates	20190905
mrcbC20-s	20190907
place	Prague
country	CZ

RIV

FORD0

10000

FORD1

10100

FORD2

10101

reportyear

2020

presentation_type

inst_support

RVO:67985556

permalink

http://hdl.handle.net/11104/0301154

confidential

arlyear

2019

mrcbU14

SCOPUS

mrcbU24

PUBMED

mrcbU34

WOS

mrcbU63

cav_un_epca*0510552 The 13th International Days of Statistics and Economics Conference Proceedings 978-80-87990-18-6 617 626 Slaný Melandrium 2019

mrcbU67

340 Löster T.

mrcbU67

340 Pavelka T.