David Kartchner

Researcher + Entrepreneur (ML + Healthcare)

I develop large language models for clinical applications, particularly focused on building AI that can generate rigorous medical insights from real-world data. I combine knowledge graphs and large language models to generate hypotheses, and extract structured clinical data to evaluate them.

Industry Experience

March 2024 - Present
Truveta, Bellevue, WA
Machine Learning Researcher, Clinical LLMs
Mentors: Mehraveh Salehi, Saman Zarandioon
Training agent-based LLM systems to perform end-to-end medical research with real-world data. Designed and implemented an LLM evaluation suite to automatically evaluate the quality of LLM outputs. Designed multi-agent LLM systems to automatically identify and correct erroneous training data. Current projects focus on automatically evaluating LLM performance on medical tasks and using LLMs to systematically probe for areas of model weakness.
Sept 2022 - April 2024
Glassbox Health, Atlanta, GA
Co-Founder, CTO
Built an LLM-based assistant to provide personalized navigation of medical bills and healthcare costs. Our service reduced medical bills by 67% on average across all uses.
Summer 2022
Enveda Biosciences, Boulder, CO
Data Science Intern, Knowledge Graph
Mentors: Daniel Domingo-Fernández, David Healey, Joe Davison
Performed systematic survey + implementation of 20+ entity linking NLP models to improve the accuracy of evidence-based compound prioritization
Summer 2021
Facebook, Menlo Park, CA
Applied Research Science Intern, Enterprise Product Applied Research
Mentor: Minhazul Islam Sk
Designed and trained transformer-based semantic search document retrieval and ranking system to improve efficiency of customer support agents
Summer 2020
GlaxoSmithKline, Philadelphia, PA
Research Intern, AI/ML Engineering
Mentor: Anne Cocos
Built model to jointly embed free-text entity mentions with structured entity knowledge graph for 30M research articles/abstracts and KG with 5M edges. Developed end-to-end pipeline to download, preprocess, and identify high-quality entity links for biomedical entities in 30M research articles. Engineered parallel model training workflow on distributed supercomputing cluster utilizing 10,000+ CPU cores and dozens of GPUs.
Nov 2018 - Aug 2019
Padsplit, Atlanta, GA
Data Science Consultant, Data Research
Created credit scoring model and interactive job density visualizations to move into new domestic markets.
Summer 2018
Recursion Pharmaceuticals, Salt Lake City, UT
Data Science Intern, Machine Learning
Mentor: Andrew Blevins
Developed and deployed recommender system to infer biological mechanism of action and repurposing potential of 1M+ compounds
May 2016 - May 2018
Intermountain Healthcare, Salt Lake City, UT
Data Science Intern, Population Health Analytics
Mentor: Andy Merrill
Built and deployed models to forecast individual patient risk of chronic disease onset and long-term complex care from EHR and environmental data. Published in IEEE ICHI (2017) and AJRCCM (2018).

Education

2018 - 2023
Ph.D. in Computational Science & Engineering
Georgia Institute of Technology, Atlanta, GA
Advisor: Cassie Mitchell
Thesis: Extracting and Structuring Information for Clinical Meta-Analysis and Drug Repurposing
Committee: Cassie Mitchell, Chao Zhang, Duen Horng "Polo" Chau, Jon Duke, Daniel Domingo-Fernández
2017 - 2018
M.S. in Mathematics
Brigham Young University, Provo, UT
Thesis: ActuarAI: Machine Learning Models for Patient Disease Forecasting and Representation
Committee: Jeffrey Humpherys, Tyler Jarvis, David Wingate
GPA: 4.00/4.00
2010 - 2016
B.S. in Applied & Computational Mathematics
Brigham Young University, Provo, UT
Thesis: Walking the Walk: An Exploratory Analysis in Biometric Gait Recognition
Magna Cum Laude, University Honors. Overall GPA: 3.96/4.00. Applied and Computational Mathematics Emphasis (ACME).

Honors and Awards

2022
1st Place and People's Choice, Georgia Tech Startup Exchange Pitch Competition
Medical billing startup to identify and correct errors in patient medical bills
2018
National Science Foundation GRFP Honorable Mention
Learning to Prescribe Optimal Disease Treatment via Machine Learning
2015
Dean and Helen Robinson Scholarship
Scholarship given to outstanding undergraduates in mathematics for Putnam Mathematics competition
2016
BYU University Honors
Awarded to undergraduates who write a thesis and complete requirements in leadership, service, and cross-disciplinary scholarship.
2010-2016
BYU Heritage Scholarship
Full-tuition merit-based scholarship for incoming students
2011
Amberly Rupp "Circle of Honor" Essay Contest Award
1st place in university-wide essay contest
2010
National Merit Scholarship
Merit-based scholarship awarded to top <1% of incoming university students

Selected Publications

A Comprehensive Evaluation of Biomedical Entity Linking Models
David Kartchner, Jennifer Deng, Shubham Lohiya, Tejasri Kopparthi, Prasanth Bathala, Daniel Domingo-Fernández, Cassie Mitchell
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP). Singapore, 2023.
Literature-Based Discovery to Elucidate the Biological Links between Resistant Hypertension and COVID-19
David Kartchner, Kevin McCoy, Janhvi Dubey, Dongyu Zhang, Kevin Zheng, Rushda Umrani, James Kim, Cassie Mitchell
Biology. 2023.
Zero-Shot Information Extraction for Clinical Meta-Analysis using Large Language Models
David Kartchner, Irfan Al-Hussaini, Selvi Ramalingam, Olivia Kronick, Cassie Mitchell
22nd Workshop on Biomedical Natural Language Processing (BioNLP). Toronto, Canada, 2023.
BioSift: A Dataset for Filtering Biomedical Abstracts for Drug Repurposing and Clinical Meta-Analysis
David Kartchner, Irfan Al-Hussaini, Haydn Turner, Jennifer Deng, Shubham Lohiya, Prasanth Bathala, Cassie Mitchell
46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR). Taipei, Taiwan, 2023.
Rule-Enhanced Active Learning for Semi-Automated Weak Supervision
David Kartchner, Davi Nakajima An, Wendi Ren, Chao Zhang, Cassie Mitchell
AI. Online, 2022.
Machine Learning Methods for Disease Prediction with Claims Data
Tanner Christensen, Abraham Frandsen, Seth Glazier, Jeff Humpherys, David Kartchner
IEEE International Conference on Healthcare Informatics (ICHI). New York City, NY, USA, 2018.
Short-Term Elevation of Fine Particulate Matter Air Pollution and Acute Lower Respiratory Infection
Benjamin D. Horne, Elizabeth A. Joy, Michelle G. Hofmann, Per H. Gesteland, John B. Cannon, Jacob S. Lefler, Denitza P. Blagev, E. Kent Korgenski, Natalie Torosyan, Grant I. Hansen, David Kartchner, C. Arden Pope III
American Journal of Respiratory and Critical Care Medicine (AJRCCM). New York, NY, USA, 2018.

Volunteer & Leadership Experience

2019 - 2022
Youth Mentor
Church of Jesus Christ of Latter-day Saints, Atlanta, GA
Organized community service projects and taught leadership & life skills to youth ages 8-17
Fall 2019
English Teacher
Catholic Charities Atlanta, Atlanta, GA
Taught semester-long English as a second language course for immigrants to the United States
2015-2018
Volunteer Translator
Brigham Young University, Provo, UT
Provided occasional translation services to Tagalog-speaking visitors to BYU. Translation services provided for visiting dignitaries at the International Law and Religion Symposium and for Filipino missionaries receiving training prior to full-time service.
2017-2018
Student Alumni Relations Representative
College of Physical and Mathematical Sciences, Brigham Young University, Provo, UT
Organized college-wide student-alumni networking dinner. Organized fundraising event for student-to-student need-based scholarship program. Met regularly with dean to discuss and address student needs.
Nov 2011 - Nov 2013
Full-time Missionary and Representative
Church of Jesus Christ of Latter-day Saints, San Pablo, Philippines
Taught lessons in Tagalog language designed to strengthen families and communities. Organized quarterly conference and trainings for volunteers across six cities. Gathered and analyzed organizational data for regional leadership. Organized and coordinated community service projects with local leaders.

Technical Skills

Mathematics: Matrix Analysis, Complex Analysis, Functional Analysis, Numerical Linear Algebra, Control Theory, Probability Theory, Parallel Computing, Algorithm Design, Linear & Nonlinear Optimization, Active Learning, Advanced Econometrics, Abstract Algebra, Differential Equations

Machine Learning: Natural Language Processing (NLP), Large Language Models (LLMs), Knowledge Graphs, Deep Learning, Bayesian Statistics, Computer Vision, Semi-Supervised Learning, Weak Supervision, Information Retrieval

Packages: PyTorch, Pandas, spaCy, NLTK, RDKit, Hugging Face, LangChain, OpenAI

Programming: Python, R, Stata, Mathematica

Web: HTML, Web scraping, SQL, Cypher, LaTeX, Markdown, Jekyll, Git, Google API suite

Visualization: Figma, Seaborn, Bokeh, Draw.io

Languages: English (Native), Tagalog (Professional), Spanish (Intermediate), German (Intermediate)