Materials

  • Class Notes

    Class 28: Modeling IV

  • Activity

    Cross-validation activity

    Activity on building and cross-validating models that predict the total price totalPr of copies of Mario Kart Wii sold on eBay.

  • Class Notes

    Class 27: Modeling III

  • Class Notes

    Class 26: Modeling II

  • Class Notes

    Class 25: Inference and simulations IV / Modeling I

  • Class Notes

    Class 24: Inference and simulations III

  • Class Notes

    Class 23: Inference and simulations II

  • Activity

    infer activity

    Activity using a dataset from the Mythbusters television show to practice building hypothesis tests with the infer package

  • Class Notes

    Class 22: Inference and simulations I

  • Reading

    An advanced example of a PMF visualization

    Reading showing an example for how to use PMFs to compare the difference between two data distributions.

  • Reading

    Class size paradox

    A practical example that shows how to use PMFs to resolve a paradox.

  • Guide

    Comparing percentile rank

    An example for how to use the CDF to compare measurements across different groups.

  • Class Notes

    Class 21: Statistical distributions II

  • Reading

    Cumulative distribution functions

    Reading about how to use R to compute, visualize, and apply percentiles of a dataset.

  • Reading

    Probability mass functions

    Reading about how to connect probabilities with values in a dataset.

  • Class Notes

    Class 20: Statistical distributions I

  • Reading

    Beginner’s Guide on Web Scraping in R (using rvest) with hands-on example

    A beginner’s guide on how to perform Web Scraping in R.

    Download


      LINK

  • Guide

    SelectorGadget
    Vignette

    Selectorgadget is a javascript bookmarklet that allows you to interactively figure out what css selector you need to extract desired components from a page.

    Download

  • Class Notes

    Class 19: Introduction to Web Scraping II/Principles of Data Collection

  • Activity

    Web scraping

    Interactive demo on how to use the rvest web-scraping tools.

  • Class Notes

    Class 16: Introduction to Web Scraping I

  • Activity

    Exploring the Medicare dataset II

    Continuation of the instructor-led exploration of the Medicare inpatient payments dataset.

  • Activity

    Exploring the Medicare dataset I

    Instructor-led exploration of the Medicare inpatient payments dataset.

  • Class Notes

    Class 11: Data wrangling IV

  • Activity

    Tidy gradebook

    Interactive demonstration showing how to apply the Tidy Data principles to a typical classroom gradebook.

  • Class Notes

    Class 10: Data wrangling III

  • Class Notes

    Class 9: Data wrangling II

  • Activity

    dplyr demos II

    Continuation of the interactive demonstration of the major features found in the dplyr package.

  • Class Notes

    Class 8: Data wrangling I

  • Activity

    dplyr demos I

    An interactive demonstration of the major features found in the dplyr package.

  • Guide

    Describing univariate and bivariate data

    How to write about visualizations of univariate (one variable) and bivariate (two variables) data.

  • Class Notes

    Class 7: Data visualization III

  • Class Notes

    Class 6: Data visualization II

  • Class Notes

    Class 5: Introduction to data / Data visualization I

  • Class Notes

    Class 2: The data scientist’s toolbox I

  • Activity

    Can Twitter predict election results?

    An introductory activity about a data science study that used Twitter data to predict election outcomes.