Established in 2005 under support of MŠMT ČR (project 1M0572)

Publications

Oscillating feature subset search algorithm for text categorization

Typ:
Jornal article
Authors:
Name of journal:
Lecture Notes in Computer Science
Year:
2006
Number:
4225
Pages:
578-587
ISSN:
0302-9743
Anotation:
The usability of the Oscillating Search algorithm for feature/word selection (FS) in text categorization is explored. The multiclass Bhattacharyya distance for multinomial model as the global feature subset selection criterion for reducing the dimensionality of the bag of words vector document representation is used.This criterion takes into consideration inter-feature relationships.The experiments illustrate that using a non-trivial FS algorithm brings substantial improvement in classification accuracy.
 
Copyright 2005 DAR XHTML CSS