# Machine Learning Methods for Diease Prediction with Claims Data

IEEE International Conference on Healthcare Informatics (ICHI), 2018

## Abstract

One of the primary challenges of healthcare delivery is aggregating disparate, asynchronous data sources into meaningful indicators of individual health. We combine natural language word embedding and network modeling techniques to learn meaningful representations of medical concepts by using the weighted network adjacency matrix in the GloVe algorithm, which we call Code2Vec. We demonstrate that using our learned embeddings improve neural network performance for disease prediction. However, we also demonstrate that popular deep learning models for disease prediction are not meaningfully better than simpler, more interpretable classifiers such as XGBoost. Additionally, our work adds to the current literature by providing a comprehensive survey of various machine learning algorithms on disease prediction tasks.

## BibTeX


@inproceedings{christensen2018machine,
title={Machine learning methods for disease prediction with claims data},
author={Christensen, Tanner and Frandsen, Abraham and Glazier, Seth and Humpherys, Jeffrey and Kartchner, David},
booktitle={2018 IEEE International Conference on Healthcare Informatics (ICHI)},
pages={467--4674},
year={2018},
organization={IEEE}
}