Publikace
Structural Poisson Mixtures for Classification of Documents
Typ:
Konferenční příspěvek
Název sborniku:
Proceedings of the 19th International Conference on Pattern Recognition
Místo vydání:
Los Alamitos
Klíčová slova:
classification of documents, Poisson mixtures, Structural ap
Anotace:
Considering the statistical text classification problem we approximate class-conditional probability distributions by structurally modified Poisson mixtures. By introducing the structural model we can use different subsets of input variables to evaluate conditional probabilities of different classes in the Bayes formula. The method is applicable to document vectors of arbitrary dimension without any preprocessing. The structural optimization can be included into the EM algorithm in a statistically correct way.