We sought to address limited access to real-world datasets by generating large, population-level, de-identified datasets from electronic health records across more than 100 hospitals. We created educational exercises that use these datasets to teach data science skills such as data wrangling and data management. These educational materials are designed to be shared widely across informatics, computer science, information science, and other programs.


Brian Dixon (Presenter)
Indiana University Fairbanks School of Public Health

Saurabh Rahurkar, Regenstrief Institute
Titus Schleyer, Regenstrief Institute

