Lecture 6
Duke University
STA 199 - Fall 2023
2022-09-14
– Clone ae-05
– HW-2 Due Tuesday (19th) on Gradescope
– Lab-1 Due Friday (16th) on Gradescope
– Reminder: Commit and push AEs at end of class
– Keep up with Slack
– Are you still in the GitHub organization?
– Continue practicing with dplyr
functions
– Join datasets
Identify which dplyr
functions chooses rows or changes columns of an existing data set. Also, define what each function does.
– select()
– slice()
– arrange()
– select()
- column
– slice()
– arrange()
– select()
- column
– slice()
- row
– arrange()
– select()
- column
– slice()
- row
– arrange()
- row
Data merging is the process of combining two or more data sets into a single data set. Most often, this process is necessary when you have raw data stored in multiple files, worksheets, or data tables, that you want to analyze together.