Getting and Cleaning Data

share ›
‹ links

Below are the top discussions from Reddit that mention this online Coursera course from Johns Hopkins University.

Offered by Johns Hopkins University. Before you can work with data you have to get some. This course will cover the basic ways that data can ... Enroll for free.

Reddsera may receive an affiliate commission if you enroll in a paid course after using these buttons to visit Coursera. Thank you for using these buttons to support Reddsera.

Taught by
Jeff Leek, PhD
Associate Professor, Biostatistics
and 2 more instructors

Offered by
Johns Hopkins University

Reddit Posts and Comments

2 posts • 7 mentions • top 2 shown below

r/rstats • comment
1 points • mCmurphyX

The Johns Hopkins University R courses on Coursera by Roger Peng, Jeff Leek, and Brian Caffo are fantastic and mostly in base R

  • R Programming https://www.coursera.org/learn/r-programming
  • Advanced R Programming https://www.coursera.org/learn/advanced-r
  • Getting & Cleaning Data https://www.coursera.org/learn/data-cleaning
  • Exploratory Data Analysis https://www.coursera.org/learn/exploratory-data-analysis

Winston Chang's Cookbook for R: http://www.cookbook-r.com/

An Introduction to Data Cleaning with R free ebook by Edwin de Jonge & Mark van der Loo https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf

r/datascience • comment
1 points • HailSeitan999

Because DataCamp is known for welcoming sexual harrassy behavior and their CEO seems like a child, here are some alternative recs about cleaning and the other stuff that makes up 95% of data jobs:

here's what to look for, every time: https://twitter.com/b0rk/status/1182288624018247685

here's a coursera course on it: https://www.coursera.org/learn/data-cleaning

here's how SQL works: https://twitter.com/b0rk/status/1184571894722449409

here's how tidyverse works, including various read/write libraries + opinions on data types + code reusability that are generally applicable: http://r4ds.had.co.nz/

hopefully a helpful (+ free) set of alternatives!

I'd also say hard choices around interpolation vs exclusion seem hard to find material on, probably because so data/resource/context specific, but also good to be aware of.