layout: true background-image: url(SAC RGB Logo EST 1899.jpg) background-position: 50% 100%; background-size: 10% --- name: title class: inverse, center, middle # Chris Papalia ## Teaching High School Statistics .pull-left[ ###
[@PappyPapalia](https://twitter.com/PappyPapalia) ###
[Personal Website](https://highschoolteach-r.netlify.app/) ###
[chris.papalia@sac.on.ca](mailto:chris.papalia@sac.on.ca) ###
[https://highschoolteach-r-tdw.netlify.com/](https://highschoolteach-r-tdw.netlify.com/) ] .pull-right[ <img src="https://rohanalexander.com/images/tdw_cn_tower.png" width="50%" /> ] --- layout: true background-image: url(SAC_Logo_1899.jpg) background-position: 5% 100% background-size: 15% --- # Chris Papalia #### Education Background - B.A. Physics - [Macalester College](https://www.macalester.edu/), 2007 - B.Ed. Mathematics and Physics - [University of Toronto/OISE](https://www.utoronto.ca/), 2009 - M.Ed. Curriculum, Teaching, Learning - [University of Toronto/OISE](https://www.utoronto.ca/), 2018 #### Professional Background - [Durham District School Board](https://www.ddsb.ca/en/index.aspx) 2009 – 2012 - [St. Andrew’s College](www.sac.on.ca) 2012 – Present #### Professional Interests - Teaching Statistics - Data Science and Analytics in R - Tech Integration into Teaching & Learning - Coaching Hockey, Baseball, and Squash --- layout: false class: inverse # Focus .pull-left[ ## This is a "show and tell" to describe what I've done to develop a better stats experience in high school and explore some of the next steps for high school statistics. ### I hope something about this talk can resonate with you and puts into perspective what you see from your students ] .pull-right[ <!-- --> ] --- class: inverse, center, middle # 1. My Context at SAC # 2. Sharing what I do # 3. The Future of High School Statistics? --- name: Context class: center, middle, inverse # My Context --- layout: true background-image: url(SAC_Logo_1899.jpg) background-position: 5% 100% background-size: 15% --- # Statistics at St. Andrew's College .pull-left[ ## Grade 12 - MDM4U: Mathematics of Data Management * Three sections, n ~ 15 ## Grade 12 - AP - Statistics * Two sections, n ~ 15 ### ~ 60% of senior students ] .pull-right[ <img src="SAC_campus.jpg" width="125%" /> ] ??? Since my colleague and I arrived in 2012 the program has grown considerably - picked up an extra section of AP Stats and full sections of MDM --- # Units of Study: MDM4U .pull-left[ 1. One-Variable Data (categorical/numerical) 2. Individuals, Z-Scores, Comparing Individuals 3. Two Variable Data - Stacked/dodged bar plots and scatter plots 4. Study and Experimental Design Principles 5. Probability and Counting (Independence, Conditional) 6. Probability Distributions (binomial, geometric, discrete random variables) 7. Project/Investigation - Using RStudio (hopefully); Data Visualizations ] .pull-right[ ### Big Ideas: See Data. Ask Questions. Try to find answers. Lots of emphasis on using software like **Excel** and **Fathom** to: * explore data * simulate outcomes * visualize data Most of the data we look at is already in **"nice, tidy, rectangles"** that I provide to students There is emphasis on **projects that are meaningful to students** where student find data they're interested in using. ] ??? This is their opportunity to use the software tools we have access to.. Excel, Fathom, R and finding data in several sources. An interesting conflict is we really get to see the gap in reproducibility and data cleaning/tidying and the frustrations in using software like excel with dates, times, and then getting the data to a point where it's usable. Story of spending one hour trying to automate something that would take you 10 minutes... trade-off when student see this as a one-off. Especially with folks who are doing simulations etc. --- # Units of Study: AP-Stats .pull-left[ 1. Exploring One-Variable Data 2. Exploring Two-Variable Data 3. Collecting Data 4. Probability, Random Variables, and Probability Distributions 5. Sampling Distributions 6. Inference for Categorical Data 7. Inference for Numerical Data 8. Inference with Chi-Squares 9. Inference with Linear Regression ] .pull-right[ ### Big Ideas: See Data. Ask Questions. Try to find answers. Prepare for Exam The bulk of the year is spent working towards AP Exam preparation -> **fewer projects** The end of the year project conflicts with their other AP - Exams -> **optional investigation** For those who are in the AP - Capstone, I am their "capstone advisor" and encourage them to use RStudio for their analysis. * **{patentr}** for analysis on video game patents * **{tidytext}** for analysis on Drake's lyrics * **Excel** for others... pivot tables; grouping; **reproducibility** is really a problem I've discovered. ] ??? Caden - manually entering pdf data with characters when he has access to the csv. Ben and Marshall vs Caden and Paul... Sample code and comments on a test file vs videos and screen clips and file structure changes. It's old-hat for you folks, but it was a huge moment for me to truly understand the value of reproducibility. Similarly, the students really "got it" when I suggested doing it on a small file and then apply the code to other features. In both classes I try to use R coding to explore data before I give it to them to look at and ask questions about... This is particularly useful in the one and two-variable --- # Day-to-day... .pull-left[ #### 75-minute periods (every other day - full year) #### Textbook: OpenIntro Stats Reading and Example Problems #### AP Free-Response Problems and OpenIntro Problems during class and homework #### Data analysis projects to apply their learning to data of their choice #### Data, Simulations, Spreadsheets as often as possible ] .pull-right[ <img src="class_photo.jpg" width="150%" /> ] ??? Themes: sports, nhl-reference, mlbstatcast, kaggle imdb, tidytuesday, statcan Videos of me entering data into excel and then filtering it into other sheets and looking using pivot tables. --- layout: false name: Sharing class: center, middle, inverse # Sharing what I do --- layout: true background-image: url(SAC_Logo_1899.jpg) background-position: 5% 100% background-size: 15% --- # Me teaching stats 12 years ago .pull-left[ ## Calculations by hand ## TI 84 was a luxury ## Reserve a computer lab once/month ] .pull-right[ <img src="https://media0.giphy.com/media/3ARqfaxs6Km88/giphy.webp?cid=ecf05e475w1v94op2b42h3uvhl0xgpr3bxxew01k2yj0tynk&rid=giphy.webp&ct=g" width="100%" /> ] --- class: inverse, middle, center background-image: url("https://images.unsplash.com/photo-1559827291-72ee739d0d9a?ixid=MnwxMjA3fDB8MHxzZWFyY2h8MTB8fGxhbmRzY2FwZXxlbnwwfHwwfHw%3D&ixlib=rb-1.2.1&auto=format&fit=crop&w=500&q=60") background-size: cover # Journey...2009 - 2019 ### Part I - Unit Restructure ### Part II - Software ### Part III - Confidence --- #
[#Rstats](https://twitter.com/search?q=%23rstats&src=typed_query) .pull-left[ ### R for Data Science ### Dr. Mine Çetinkaya-Rundel (DS in a Box) ### RStudio Community (Webinars and more) ### Tidy Tuesday ### Arts + Program at SAC ] .pull-right[ <img src="https://media4.giphy.com/media/3og0IH0HMb7Itm7uiA/200w.webp?cid=ecf05e47kki6onyk3jh3mhbh8ts0dk3ts2ipjjs5b2g65o9z&rid=200w.webp&ct=g" width="100%" /> ] --- # Learning Activities ### 1. Everything is framed in describing one-variable statistics **shape, centre, spread** (foundation of the course) ### 2. Emphasis on working with data to illustrate concepts ### 3. Refer to AP Exam questions and scoring guidlines ### 4. Project Prep - Interpret (don't report) the statistics and visualizations --- # Everything is framed by describing one-variable statistics .pull-left[ We try to use simulations and visualizations with every topic <div class="figure"> <img src="fathom_clip.jpg" alt="This is the software called Fathom" width="150%" /> <p class="caption">This is the software called Fathom</p> </div> ] .pull-right[ <div class="figure"> <img src="coke_pepsi_trials.jpg" alt="This is a beginning to simulation-based inference in the experimental design unit" width="150%" /> <p class="caption">This is a beginning to simulation-based inference in the experimental design unit</p> </div> ] --- # Emphasis on data and applets to illustrate concepts .pull-left[ <iframe src="https://rpsychologist.com/viz/" width="150%" height="400px"></iframe> ] .pull-right[ <div class="figure"> <img src="art_of_stat.jpg" alt="Art of Stat - Web Apps" width="150%" /> <p class="caption">Art of Stat - Web Apps</p> </div> * **{palmerpenguins}, {gapminder}, {mpg}, collect our own, sampling, [skewthescript.org](https://skewthescript.org/)** * Applets: [**Art of Stats - Web Applets**](https://artofstat.com/web-apps), [**Kristoffer Magnusson - Visualzations**](https://rpsychologist.com/viz/) ] --- # Refer to AP Exam questions and scoring guidlines .pull-left[  ] .pull-right[ ## MS Teams Collaboration Space/Shared Space/Board ### Group work, presentations, and grading rubrics ### Pass the problem, create the statistics/data [The AP Statistics Exam Page](https://apcentral.collegeboard.org/courses/ap-statistics/exam/past-exam-questions?course=ap-statistics) * This page offers all of the previous AP Free-Response Questions and Solution Rubrics ] --- # Projects and Readings ## [hockey-reference](https://www.hockey-reference.com/) ## [gapminder](https://www.gapminder.org/) ## [fivethirtyeight data](https://data.fivethirtyeight.com/) ## [Freakonomics Podcasts](https://freakonomics.com/podcast/what-can-uber-teach-us-about-the-gender-pay-gap/) --- # More Information - My talk from OAME May 2017/May 2019 <iframe src="https://highschoolteach-r-beyond-the-basics.netlify.app/#1" width="100%" height="400px"></iframe> --- layout: false name: Future class: center, middle, inverse # The Future? --- layout: true background-image: url(SAC_Logo_1899.jpg) background-position: 5% 100% background-size: 15% --- class: inverse background-image: url(https://images.unsplash.com/photo-1617478140791-f3034c29a6ae?ixid=MnwxMjA3fDB8MHxzZWFyY2h8NjJ8fGJyaWRnZSUyMHRoZSUyMGdhcHxlbnwwfHwwfGJsYWNrfA%3D%3D&ixlib=rb-1.2.1&auto=format&fit=crop&w=500&q=60) background-size: cover # Gaps ## 1. Ontario's curricular focus/elective course ## 2. Calculus remains the "Crystal Palace" in math ## 3. No coherent computing background ??? MDM4U lives in the intersection of folks who dislike math, and folks who are interested for a "good grade" Admissions officers differentiate based on calculus Teachers lack the knowledge-base to support a scaled statistics course * Software remains limited for teachers/students/AP programs and so there is very limited ability for students to look at an examine models, regression, and prediction. On-boarding with the RStudio Cloud program was great but then it's been monetized Teaching teachers to trouble-shoot basic problems is a mess... --- # The Future - Data Science in High School? .pull-left[  ] .pull-right[  ] ??? Some Social Justice Data - skew the script Some sports data - Michael Lopez Five Thirty Eight Tidy Tuesday --- # More coding...?
[Kareem Carr](https://twitter.com/kareem_carr/status/1374407093919223819) .pull-left[  ] .pull-right[  ] --- # Data Science/Statistics in High School looks like this... <!-- --> ## Tidy, Transform, and Model require technical skills that most students don't have... --- layout: false class: inverse, center, middle # Strategies ####(Good and Bad) --- layout: true background-image: url(SAC_Logo_1899.jpg) background-position: 5% 100% background-size: 15% --- # Live Coding EDA
.pull-left[ ## I explore the data and discuss questions and answers with the class ## [David Robinson Screencasts](https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ) ] .pull-right[ <img src="peng_test.gif" width="150%" /> ] ??? Students and I try to ask questions and find answers within the data I include this process on a test and have them "explore" it --- # Jumping Into OpenIntro Labs
.pull-left[ ## Last year I used RStudio Cloud to have students work through the labs associated with the [**`tidyverse`**](http://openintrostat.github.io/oilabs-tidy/01_intro_to_r/intro_to_r.html) versions of the OpenIntro labs in November/December. ] .pull-right[ <img src="https://media4.giphy.com/media/jBBRs81dGWHIY/200w.webp?cid=ecf05e475v8tdtslt159f7k9yzlykc5pcly0sg5tthncu27v&rid=200w.webp&ct=g" width="150%" /> ] --- # Co-Curricular Programming - [R4DV Club](https://highschoolteach-r-club.netlify.app/)
.pull-left[ ## Fall 2018 - Core group of 6 * ### Tidy Tuesday and Shoeless Joe's ## Spring 2020 - Core Group of 6 * ### Launch website using [{distill}](https://rstudio.github.io/distill/website.html) and use RStudio.Cloud for the activities ] .pull-right[ <iframe src="https://highschoolteach-r-club.netlify.app/week02-saving-a-plot" width="100%" height="400px"></iframe> ] --- # My Experiment This Year...
.pull-left[ ### I will offer several opportunities for students to use RStudio with links to helpful resouces for students to work through data ### [Rstudio Primers](https://rstudio.cloud/learn/primers) ### [DoSS Toolkit](https://www.dosstoolkit.com/) ### Allow Excel as another option ] .pull-right[ <img src="https://media2.giphy.com/media/RLnTaVfBJ7Sxv5mIVg/giphy.gif?cid=ecf05e47zpda4qp2ixmt8ptzqsmg1x97lt8jyla2jk9b1sn3&rid=giphy.gif&ct=g" width="150%" /> ] --- # The Future - Projects and Partnerships
### Great relationship with our CS department * We want to collaborate with CS and eventually work towards creating a cohort of students who are using computing (and version control?) to solve problems in several fields. ### Finding ways for students to apply statistical analysis and computing through the AP Capstone Program and the R for Data Visualization Club * Marshall used {patentsview} to extract patents and use tidyverse to model patents over time * Ben used {tidytext} and references to do sentiment analysis with Drake lyrics * Select group of student sign-up for the club ([See the R Club website from last year](https://highschoolteach-r-club.netlify.app/)) --- layout: false class: inverse, center, middle # Thank You! .right[.footnote[[Thank You!](#title)]]