# Materials

• Class Notes

Slides

• Activity

## Cross-validation activity

Activity on building and cross-validating models that predict the total price totalPr of copies of Mario Kart Wii sold on eBay.

• Class Notes

Slides

• Class Notes

Slides

• Class Notes

Slides

• Class Notes

Slides

• Class Notes

Slides

• Activity

## infer activity

Activity using a dataset from the Mythbusters television show to practice building hypothesis tests with the infer package

• Class Notes

Slides

## An advanced example of a PMF visualization

Reading showing an example for how to use PMFs to compare the difference between two data distributions.

A practical example that shows how to use PMFs to resolve a paradox.

• Guide

## Comparing percentile rank

An example for how to use the CDF to compare measurements across different groups.

• Class Notes

Slides

## Cumulative distribution functions

Reading about how to use R to compute, visualize, and apply percentiles of a dataset.

• Class Notes

Slides

## Beginner’s Guide on Web Scraping in R (using rvest) with hands-on example

A beginner’s guide on how to perform Web Scraping in R.

• Guide

Selectorgadget is a javascript bookmarklet that allows you to interactively figure out what css selector you need to extract desired components from a page.

• Class Notes

Slides

• Activity

## Web scraping

Interactive demo on how to use the rvest web-scraping tools.

• Class Notes

Slides

• Activity

## Exploring the Medicare dataset II

Continuation of the instructor-led exploration of the Medicare inpatient payments dataset.

• Activity

## Exploring the Medicare dataset I

Instructor-led exploration of the Medicare inpatient payments dataset.

• Class Notes

## Class 11: Data wrangling IV

Slides

• Activity

Interactive demonstration showing how to apply the Tidy Data principles to a typical classroom gradebook.

• Class Notes

Slides

• Class Notes

Slides

• Activity

## dplyr demos II

Continuation of the interactive demonstration of the major features found in the dplyr package.

• Class Notes

Slides

• Activity

## dplyr demos I

An interactive demonstration of the major features found in the dplyr package.

• Guide

## Describing univariate and bivariate data

How to write about visualizations of univariate (one variable) and bivariate (two variables) data.

• Class Notes

Slides

• Class Notes

Slides

• Class Notes

Slides

• Class Notes

Slides

• Activity

## Can Twitter predict election results?

An introductory activity about a data science study that used Twitter data to predict election outcomes.