Skip to main content

Dataset / Long Term Horizon Predictions and Feature Explainability of Time Series Continuous Glucose …

Have a question about this item?

Item information. View source record on contributor's website.

Title
Long Term Horizon Predictions and Feature Explainability of Time Series Continuous Glucose Monitor Data
Contributor
Joe, Leslie R.
Kanjaria, Karina J.
Monsivias, Carlos S.
O’Laughlin, Kate B.
Date Created and/or Issued
2023-01-07 to 2023-06-09
Contributing Institution
UC San Diego, Research Data Curation Program
Collection
Data Science & Engineering Master of Advanced Study (DSE MAS) Capstone Projects
Rights Information
Under copyright
Constraint(s) on Use: This work is protected by the U.S. Copyright Law (Title 17, U.S.C.). Use of this work beyond that allowed by "fair use" or any license applied to this work requires written permission of the copyright holder(s). Responsibility for obtaining permissions and any use and distribution of this work rests exclusively with the user and not the UC San Diego Library. Inquiries can be made to the UC San Diego Library program having custody of the work.
Use: This work is available from the UC San Diego Library. This digital copy of the work is intended to support research, teaching, and private study.
Rights Holder and Contact
O'Laughlin, Kate B.; Monsivias, Carlos S.; Joe, Leslie R.; Kanjaria, Karina J.
Description
Completed as a Capstone project for the Data Science & Engineering MAS program (DSE260 Capstone), this project utilizes Dexcom's data from their Continuous Glucose Monitors (CGM) to predict and describe Type 2 diabetic patients' future time above or below healthy glucose levels. The time series data, recorded in 5 minute intervals over 365 days for 8000 patients, is used to extract features such as entropy (predictability), variance (from Poincaré plots), among others. These features—alongside demographic data such as treatment type, patient age, and patient sex—are used in XGBoost classification models to predict if the amount of time a patient will spend out of range the next day is more, less, or the same as the amount of time a patient spent out of range the day before. The XGBoost classification models also provide feature importance, which informs the patient of which features affect their outcome the most. This project used pre-existing data from Dexcom. However, this data is not available for sharing due to Dexcom's licensing terms. Also, we did not receive permission to share the Dexcom model.
Research Data Curation Program, UC San Diego, La Jolla, 92093-0175 (https://lib.ucsd.edu/rdcp)
O’Laughlin, Kate B.; Monsivias, Carlos S.; Joe, Leslie R.; Kanjaria, Karina J.; Burks, Jamie H.; Smarr, Benjamin L. (2024). Long Term Horizon Predictions and Feature Explainability of Time Series Continuous Glucose Monitor Data. In Data Science & Engineering Master of Advanced Study (DSE MAS) Capstone Projects. UC San Diego Library Digital Collections. https://doi.org/10.6075/J0XW4K0B
Type
dataset
Identifier
ark:/20775/bb9329453v
Language
English
Subject
Continuous glucose monitoring
Time series
Data science
Machine learning
Capstone projects
XGBoost
Dexcom
Feature extraction
PySpark
Type 2 Diabetes Mellitus
Data Science & Engineering Master of Advanced Study (DSE MAS)
DSE MAS - 2023 Cohort

About the collections in Calisphere

Learn more about the collections in Calisphere. View our statement on digital primary resources.

Copyright, permissions, and use

If you're wondering about permissions and what you can do with this item, a good starting point is the "rights information" on this page. See our terms of use for more tips.

Share your story

Has Calisphere helped you advance your research, complete a project, or find something meaningful? We'd love to hear about it; please send us a message.

Explore related content on Calisphere: