publications – andreas karwath

2018

Geilke, Michael; Karwath, Andreas; Frank, Eibe; Kramer, Stefan

Online estimation of discrete, continuous, and conditional joint densities using classifier chains Journal Article

In: Data Mining and Knowledge Discovery, vol. 32, no. 3, pp. 561-603, 2018, ISSN: 1384-5810.

Abstract | Links | BibTeX | Tags: artificial intelligence, data mining, density estimation, machine learning, stream mining

@article{geilke2018a,

title = {Online estimation of discrete, continuous, and conditional joint densities using classifier chains},

author = {Michael Geilke and Andreas Karwath and Eibe Frank and Stefan Kramer},

url = {https://doi.org/10.1007/s10618-017-0546-6},

doi = {10.1007/s10618-017-0546-6},

issn = {1384-5810},

year  = {2018},

date = {2018-05-01},

urldate = {2018-05-01},

journal = {Data Mining and Knowledge Discovery},

volume = {32},

number = {3},

pages = {561-603},

publisher = {Springer US},

abstract = {We address the problem of estimating discrete, continuous, and conditional joint densities online, i.e., the algorithm is only provided the current example and its current estimate for its update. The family of proposed online density estimators, estimation of densities online (EDO), uses classifier chains to model dependencies among features, where each classifier in the chain estimates the probability of one particular feature. Because a single chain may not provide a reliable estimate, we also consider ensembles of classifier chains and ensembles of weighted classifier chains. For all density estimators, we provide consistency proofs and propose algorithms to perform certain inference tasks. The empirical evaluation of the estimators is conducted in several experiments and on datasets of up to several millions of instances. In the discrete case, we compare our estimators to density estimates computed by Bayesian structure learners. In the continuous case, we compare them to a state-of-the-art online density estimator. Our experiments demonstrate that, even though designed to work online, EDO delivers estimators of competitive accuracy compared to other density estimators (batch Bayesian structure learners on discrete datasets and the state-of-the-art online density estimator on continuous datasets). Besides achieving similar performance in these cases, EDO is also able to estimate densities with mixed types of variables, i.e., discrete and continuous random variables.},

keywords = {artificial intelligence, data mining, density estimation, machine learning, stream mining},

pubstate = {published},

tppubtype = {article}

}

2016

Geilke, Michael; Karwath, Andreas; Kramer, Stefan

Online density estimation of heterogeneous data streams in higher dimensions Conference

Machine learning and knowledge discovery in databases : European Conference, ECML PKDD 2016, Riva del Garda, Italy, September 19-23, 2016 : Proceedings Part 1, 2016.

Abstract | Links | BibTeX | Tags: data mining, density estimation, stream mining

2015

Geilke, Michael; Karwath, Andreas; Kramer, Stefan

Modeling recurrent distributions in streams using possible worlds Conference

2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, IEEE, 2015, ISBN: 978-1-4673-8272-4.

Abstract | Links | BibTeX | Tags: density estimation, machine learning, possible worlds, stream mining

2014

Geilke, Michael; Karwath, Andreas; Kramer, Stefan

A probabilistic condensed representation of data for stream mining Conference

International Conference on Data Science and Advanced Analytics, DSAA 2014, IEEE, 2014.

Abstract | Links | BibTeX | Tags: density estimation, machine learning, stream mining

2013

Geilke, Michael; Frank, Eibe; Karwath, Andreas; Kramer, Stefan

Online Estimation of Discrete Densities Conference

IEEE 13th International Conference on Data Mining, ICDM 2013, IEEE, 2013, ISSN: 1550-4786.

Abstract | Links | BibTeX | Tags: density estimation, machine learning, stream mining