Lecture 5

Overview

  • Define and use factors in R
  • Review the fundamental dplyr “verbs” for data manipulation (see Lecture 3) and learn new ones
  • Data cleaning: recoding and renaming variables; difference between syntactic vs. non-syntactic variable names
  • Data cleaning: missing data

Readings

Required:

Optional/Skim:

  • “Chapter 18 Missing values” from “R for Data Science” 2nd Edition. Not immediately relevant, but offers several tips that could be beneficial for your future work.

Class materials

Run the code below in your console to download today’s materials: usethis::use_course("css-materials/dplyr-more-and-cleaning")