Getting and Cleaning Data

Offered by Johns Hopkins University. Before you can work with data you have to get some. This course will cover the basic ways that data can ...

Taught by
Jeff Leek, PhD
Associate Professor, Biostatistics
and 2 more instructors

Offered by
Johns Hopkins University

Reddit Posts and Comments

r/rstats • comment
1 point • mCmurphyX

The Johns Hopkins University R courses on Coursera by Roger Peng, Jeff Leek, and Brian Caffo are fantastic and mostly in base R:

  • R Programming
  • Advanced R Programming
  • Getting & Cleaning Data
  • Exploratory Data Analysis

Winston Chang's Cookbook for R:

An Introduction to Data Cleaning with R free ebook by Edwin de Jonge & Mark van der Loo

r/datascience • comment
1 point • HailSeitan999

Because DataCamp is known for welcoming sexually harassing behavior and their CEO seems like a child, here are some alternative recs about cleaning and the other stuff that makes up 95% of data jobs:

here's what to look for, every time:

here's a coursera course on it:

here's how SQL works:

here's how tidyverse works, including various read/write libraries + opinions on data types + code reusability that are generally applicable:

hopefully a helpful (+ free) set of alternatives!

I'd also say material on the hard choices around interpolation vs. exclusion seems hard to find, probably because those choices are so data-, resource-, and context-specific, but they're also good to be aware of.
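
To make the interpolation vs. exclusion trade-off a bit more concrete, here is a minimal sketch in plain Python. It is not from the comment above: the function names (`exclude_missing`, `interpolate_linear`) and the sample readings are made up for illustration, and linear interpolation is only one of many possible fill strategies.

```python
from typing import List, Optional


def exclude_missing(values: List[Optional[float]]) -> List[float]:
    """Exclusion: drop missing observations and keep only the known ones."""
    return [v for v in values if v is not None]


def interpolate_linear(values: List[Optional[float]]) -> List[Optional[float]]:
    """Interpolation: fill interior gaps on a straight line between the
    nearest known neighbours. Leading and trailing gaps are left as None,
    because there is nothing on one side to interpolate from."""
    filled = list(values)
    known = [i for i, v in enumerate(values) if v is not None]
    for left, right in zip(known, known[1:]):
        span = right - left
        for i in range(left + 1, right):
            fraction = (i - left) / span
            filled[i] = values[left] + fraction * (values[right] - values[left])
    return filled


# Hypothetical, evenly spaced sensor readings with three missing values.
readings = [10.0, None, 14.0, None, None, 20.0]

kept = exclude_missing(readings)
filled = interpolate_linear(readings)

# The two strategies give different sample sizes and different summaries.
print(len(kept), sum(kept) / len(kept))
print(len(filled), sum(filled) / len(filled))
```

Which one is appropriate depends on the data: exclusion shrinks the sample and can bias results if values are not missing at random, while interpolation assumes the series changes smoothly between observations, which may not hold.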