UTIA - Library

bibtype

V - Research Report

ARLID

0357265

utime

20240103194938.1

mtime

20110404235959.9

title (primary) (eng)

Fast Dependency-Aware Feature Selection in Very-High-Dimensional Pattern Recognition Problems

publisher

place	Praha
name	ÚTIA AV ČR, v.v.i
pub_time	2011

specification

page_count	9 s.

edition

name	Research Report
volume_id	2295

keyword

feature selection,

keyword

high dimensionality

keyword

ranking

keyword

generalization

keyword

over-fitting

keyword

stability

keyword

classification

keyword

pattern recognition

keyword

machine learning

author (primary)

ARLID	cav_un_auth*0101197
name1	Somol
name2	Petr
full_dept (cz)	Rozpoznávání obrazu
full_dept (eng)	Department of Pattern Recognition
department (cz)	RO
department (eng)	RO
institution	UTIA-B
full_dept	Department of Pattern Recognition
fullinstit	Ústav teorie informace a automatizace AV ČR, v. v. i.

author

ARLID	cav_un_auth*0101091
name1	Grim
name2	Jiří
full_dept (cz)	Rozpoznávání obrazu
full_dept	Department of Pattern Recognition
department (cz)	RO
department	RO
institution	UTIA-B
full_dept	Department of Pattern Recognition
fullinstit	Ústav teorie informace a automatizace AV ČR, v. v. i.

source

url	http://library.utia.cas.cz/separaty/2011/RO/somol-fast dependency-aware feature selection in very-high-dimensional pattern recognition problems.pdf

cas_special

project

project_id	1M0572
agency	GA MŠk
ARLID	cav_un_auth*0001814

project

project_id	2C06019
agency	GA MŠk
ARLID	cav_un_auth*0216518

research

CEZ:AV0Z10750506

abstract (eng)

The paper addresses the problem of making dependency-aware feature selection feasible in pattern recognition problems of very high dimensionality. The idea of individually best ranking is generalized to evaluate the contextual quality of each feature in a series of randomly generated feature subsets. Each random subset is evaluated by a criterion function of arbitrary choice (permitting functions of high complexity). Eventually, the novel dependency-aware feature rank is computed, expressing the average benefit of including a feature into feature subsets. The method is efficient and generalizes well especially in very-high-dimensional problems, where traditional context-aware feature selection methods fail due to prohibitive computational complexity or to over-fitting. The method is shown well capable of over-performing the commonly applied individual ranking which ignores important contextual information contained in data.

reportyear

2012

RIV

mrcbC52

4 O 4o 20231122134453.4

permalink

http://hdl.handle.net/11104/0195583

arlyear

2011

mrcbTft

\nSoubory v repozitáři: 0357265.pdf

mrcbU10

2011

mrcbU10

Praha ÚTIA AV ČR, v.v.i