Keywords:
interactive statistical model, EM algorithm, data modelling
Annotation:
This paper describes the application of a recently developed method of interactive statistical database presentation to the 2001 Czech Census. The method is based on estimating the multivariate probability distribution of the original microdata. The estimated statistical model in the form of a distribution mixture of product components can be used as a knowledge base of a probabilistic expert system. In this way we can derive the statistical properties of data interactively without any further access to the source database. The statistical model does not contain the original data and therefore can be distributed without any confidentiality concerns. The accuracy achievable by the statistical model is comparable with that of the anonymised subsets of microdata.
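
As a rough illustration of the idea, the following minimal sketch fits a mixture of product components to categorical data with the EM algorithm and then answers a conditional query from the fitted model alone, without touching the data again. It is not the implementation applied to the census: it assumes integer-coded categorical variables and NumPy, and the names fit_product_mixture and conditional, the number of components, and the synthetic two-variable data are all hypothetical choices made for this example.

```python
import numpy as np

def fit_product_mixture(data, n_categories, n_components, n_iter=50, seed=0):
    """EM for a mixture of products of categorical distributions (sketch).

    data: integer array of shape (n_samples, n_vars); column v takes
    values 0 .. n_categories[v] - 1.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_vars = data.shape
    weights = np.full(n_components, 1.0 / n_components)
    # tables[v][m, k] approximates P(x_v = k | component m)
    tables = [rng.dirichlet(np.ones(k), size=n_components) for k in n_categories]
    for _ in range(n_iter):
        # E-step: posterior component probabilities for every record
        log_joint = np.log(weights)[None, :] + sum(
            np.log(tables[v][:, data[:, v]]).T for v in range(n_vars)
        )
        log_joint -= log_joint.max(axis=1, keepdims=True)
        resp = np.exp(log_joint)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: re-estimate the weights and the univariate tables
        weights = np.maximum(resp.mean(axis=0), 1e-12)
        weights /= weights.sum()
        for v, k in enumerate(n_categories):
            onehot = np.eye(k)[data[:, v]]       # (n_samples, k)
            counts = resp.T @ onehot + 1e-9      # (n_components, k)
            tables[v] = counts / counts.sum(axis=1, keepdims=True)
    return weights, tables

def conditional(weights, tables, target, evidence):
    """P(x_target | evidence), computed from the model parameters only.

    evidence: dict mapping a variable index to its observed value.
    """
    comp = weights.copy()
    for v, value in evidence.items():
        comp = comp * tables[v][:, value]
    comp = comp / comp.sum()                     # P(component | evidence)
    return comp @ tables[target]

# Synthetic stand-in for microdata: x1 copies x0 about 90% of the time.
rng = np.random.default_rng(1)
x0 = rng.integers(0, 2, size=500)
x1 = np.where(rng.random(500) < 0.9, x0, 1 - x0)
data = np.column_stack([x0, x1])

weights, tables = fit_product_mixture(data, n_categories=[2, 2], n_components=2)
query = conditional(weights, tables, target=1, evidence={0: 0})
print(query)
```

Once the weights and tables have been estimated, only these parameters are needed to answer further queries, which is the property that lets such a model be distributed in place of the microdata.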