9:15: Opening remarks + program overview
10:15: Academic advisor presentations (Glenn)
11:30: Lunch + meet the department staff (Baker Hall 129)
1:30: Lab
Health (Scaife 234)
Sports (Scaife 236) + 3:00: Guest speaker
Department of Statistics & Data Science, Carnegie Mellon University
Lead instructor: Quang Nguyen (preferred form of address: Quang)
Teaching assistants
Health: Princess Allotey, Julian Braganza, Hao Lee, James Leiner
Sports: Yuchen Chen, Sara Colando, Erin Franke, Leigh Preimesberger
Explore cutting-edge statistics and data science methodology with applications in
Heathcare: UnitedHealth Group Bridges to Healthcare Technology
Sports: Carnegie Mellon Sports Analytics Camp (CMSACamp)
“The best thing about being a statistician…is that you get to play in everyone’s backyard.” — John W. Tukey
Develop fundamentals research skills: data wrangling, visualization, modeling, communication
Become familiar with R
, tidyverse
, Quarto (Markdown syntax), Git/GitHub
Become familiar with cutting-edge statistical machine learning techniques
Create a portfolio of projects and practice reproducible research
Network with academic researchers and industry professionals
Check these frequently!
Lectures
Speaker/webinar sessions
Labs
Mon–Fri, 9:15–10:45am, Scaife 234
A few scheduling notes:
Mon–Fri, 1:30–3pm
Scaife 234 (Health) or Scaife 236 (Sports)
Demo labs
Project labs
will begin with a mini EDA project
then shift to focus on main capstone project
Either mid-day (in between lecture and lab) or after lab
Scaife 234 / MS Teams (Health) or Scaife 236 / Zoom (Sports)
Note: dates/times may vary; check calendar
Health: UHG webinars, individual meetings with mentors
Sports: project pitches, guest speakers
Dates/times may vary; check calendar
Location: Baker Hall 129
Don’t hesitate to take more food with you!
Practice understanding the structure of a dataset and perform basic EDA tasks (e.g., data wrangling, data visualization) in R
, and using GitHub for collaboration
Work in groups of 2–3
Timeline
Release date: Thursday, June 5
6-minute presentation (no notes/scripts) on Tuesday, June 17 during lab
Analyze a dataset in health or sports analytics to answer a research question that is important to people in your respective field
Work in groups of 2–3
Presentation checkpoint(s) (no notes/scripts)
Deliverables (more details will be provided later on)
Wednesday–Friday
Quang out of town
Lectures and labs as usual
Guest lectures by TA experts
TAs will run labs as usual
Thursday: EDA project released during lab
Fill out the survey forms (Communication and Data Science Background)
Reset CMU wifi password (for non-CMU students)
Check Calendar, Slack, email often
In-person attendance
Be on time. PLEASE.
This applies to lectures, labs, other sessions (e.g., webinars, guest speakers, other activities)
This is part of the Code of Conduct
Participate and ask questions
Work together. Help and support each other.
Enjoy, learn and grow