#
Data Mining

Below are the **top discussions** from
Reddit that mention this
online Coursera specialization from
**University of Illinois at Urbana-Champaign**.

The Data Mining Specialization teaches data mining techniques for both structured data which conform to a clearly defined schema, and unstructured data which exist in the form of natural language text.

Data Clustering Algorithms Text Mining Data Visualization (DataViz) Data Mining Data Visualization Software Tableau Software Data Virtualization Information Retrieval (IR) Document Retrieval Machine Learning Recommender Systems Probabilistic Models

Accessible **for free**.
Completion certificates **are offered**.

Affiliate disclosure: ** Please** use the
blue and
green

**buttons**to visit Coursera if you plan on enrolling in a course. Commissions Reddsera receives from using these links will keep this site

**online**and

**ad-free**. Reddsera

*will not*receive commissions if you

*only*use course links found in the below Reddit discussions.

Taught by

**John C. Hart**

Professor of Computer Science

and 2 more instructors

Offered by

**University of Illinois at Urbana-Champaign**

This specialization includes these
**6 courses**.

Learn the general concepts of data mining along with basic methodologies and applications.

John C. Hart

13 mentions

Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets.

ChengXiang Zhai

8 mentions

This course will cover the major techniques for mining and analyzing text data to discover interesting patterns, extract useful knowledge, and support decision making, with an emphasis on statistical approaches that can be generally applied to arbitrary text data in any natural language with no or m...

ChengXiang Zhai

9 mentions

Learn the general concepts of data mining along with basic methodologies and applications.

Jiawei Han

4 mentions

Discover the basic concepts of cluster analysis, and then study a set of typical clustering methodologies, algorithms, and applications.

Jiawei Han

2 mentions

Note: You should complete all the other courses in this Specialization before beginning this course.

Jiawei Han

0 mentions

#### Reddit Posts and Comments

0 posts • 45 mentions • top 7 shown below

**r/datascience**• post

7 points • complexgunner

##### How is Data Mining Specialization by Illinois on Coursera?

Coursera link: https://www.coursera.org/specializations/data-mining

How is it compared to other Data Mining MOOCs?

**r/datamining**• comment

2 points • TheRedHawk88

Coursera is my go-to.

https://www.coursera.org/specializations/data-mining

**r/datascience**• post

98 points • Executer13

##### What do you think of the learning path I built?

Hi,

I am interested in data science, but I am studying a completely unrelated subject (related with biology & health). Dropping out my current course is not an option, as I like it; however, I think that if I learned a little bit about data science, perhaps I could find job opportunities in my area which require a great biology/health knowledge and some data science knowledge too.

The curriculum I built based on my internet research is at the end of this post. I would like to know if you would change anything! Since I don't know much (only basic statistics & basic Python), it was not easy to find the best online resources. And if anyone else wants to follow this learning plan, I invite you to do so!

Here it is (note: content between square brackers, [], is optional, and should be used mostly as a reference or to fill in the gaps):

*MATHEMATICS*

**LINEAR ALGEBRA**

Khan Academy: https://www.khanacademy.org/math/linear-algebra.

Book *The Manga Guide to Linear Algebra*

[MIT OpenCourseWare: https://ocw.mit.edu/courses/mathematics/18-06-linear-algebra-spring-2010/

Book *Linear Algebra / Levandosky*

Book *Strang, Gilbert. Introduction to Linear Algebra.*]

**MULTIVARIABLE CALCULUS**

Khan Academy: https://www.khanacademy.org/math/multivariable-calculus

[MIT OpenCourseWare: https://ocw.mit.edu/courses/mathematics/18-02-multivariable-calculus-fall-2007/index.htm]

*STATISTICS*

**INFERENTIAL & DESCRIPTIVE STATISTICS**

Udacity’s Intro to Descriptive Statistics: https://www.udacity.com/course/intro-to-descriptive-statistics--ud827

Udacity’s Intro to Inferential Statistics: https://www.udacity.com/course/intro-to-inferential-statistics--ud201

Book *OpenIntro Statistics*

[Book *Think Stats*]

**PROBABILITY**

edX’s Introduction to Probability - The Science of Uncertainty: https://www.edx.org/course/introduction-probability-science-mitx-6-041x-2

*INTRODUCTION TO COMPUTERS & PROGRAMMING*

**COMPUTER SCIENCE**

edX’s Intro to Computer Science: https://www.edx.org/course/introduction-computer-science-harvardx-cs50x

**INTRODUCTION TO PYTHON**

How to Think Like a Computer Scientist: http://interactivepython.org/runestone/static/thinkcspy/index.html

Udacity’s Design of Computer Programs: https://www.udacity.com/course/design-of-computer-programs--cs212

**INTRODUCTION TO GIT & COMMAND LINE**

Better Explained Blog Posts’ Series: https://betterexplained.com/articles/a-visual-guide-to-version-control/#Other_Posts_In_This_Series

Code School’s Git Course: https://www.codeschool.com/courses/try-git

Learn Enough Command Line to be Dangerous: https://www.learnenough.com/command-line-tutorial

*INTRODUCTION TO DATA*

**SPREADSHEETS & MODELS**

Coursera’s Introduction to Spreadsheets and Models: https://www.coursera.org/learn/wharton-introduction-spreadsheets-models

**DATABASES**

Stanford’s Introduction to Databases: https://lagunita.stanford.edu/courses/Engineering/db/2014_1/about

[Khan Academy’s Intro to SQL: Querying and Managing Data: https://www.khanacademy.org/computing/computer-programming/sql]

**INTRODUCTION TO DATA MANAGEMENT**

Coursera’s Getting and Cleaning Data: https://www.coursera.org/learn/data-cleaning

**INTRODUCTION TO DATA VISUALISATION**

Book *The Visual Display of Quantitative Information*

Introduction to Infographics & Data Visualization Course: https://www.youtube.com/playlist?list=PLa4VFIBUKrgLao-DalwedOCiq9RV6MPk9

*TOOLS APPLIED TO DATA*

**PYTHON LIBRARIES (NUMPY, PANDAS, MATPLOTLIB & SEABORN)**

Book *Python for Data Analysis* / Book *Python Data Science Handbook.*

Udacity’s Intro to Data Analysis: https://www.udacity.com/course/intro-to-data-analysis--ud170

NumPy Beginner: https://www.youtube.com/watch?v=gtejJ3RCddE

Awesome Data Science. 2.0 Introduction to Pandas and Exploratory Data Analysis: https://www.youtube.com/watch?v=ZrRpN_IrcBA

MatPlotLib Tutorial Series - Graphing in Python: https://www.youtube.com/playlist?list=PLQVvvaa0QuDfefDfXb9Yf0la1fPDKluPF

The Ultimate Python Seaborn Tutorial: Gotta Catch ‘Em All: https://elitedatascience.com/python-seaborn-tutorial

[Udemy’s Python for Data Science & Machine Learning Bootcamp: https://www.udemy.com/python-for-data-science-and-machine-learning-bootcamp/]

*DATA WRANGLING WITH PYTHON & OPENREFINE*

Book *Data Wrangling with Python*

BigDataUniversity’s Open Refine 101: https://cognitiveclass.ai/courses/introduction-to-openrefine/

*DATA VISUALISATION WITH TABLEAU & D3.JS*

Coursera’s Data Visualization and Communication with Tableau: https://www.coursera.org/learn/analytics-tableau

Your Guide to Becoming a Tableau Expert: https://www.analyticsvidhya.com/learning-paths-data-science-business-analytics-business-intelligence-big-data/tableau-learning-path/

Udacity’s Data Visualization and D3.js: https://www.udacity.com/course/data-visualization-and-d3js--ud507

Book *Interactive Data Visualization*

*MACHINE LEARNING*

**INTRODUCTION TO MACHINE LEARNING**

Coursera’s Machine Learning: https://www.coursera.org/learn/machine-learning (plus unofficial notes: http://www.holehouse.org/mlclass/)

[Book *An Introduction to Statistical Learning*]

**APPLIED MACHINE LEARNING WITH PYTHON: SCIKIT LEARN**

Andreas Mueller Jupyter Notebooks: https://github.com/amueller/scipy_2015_sklearn_tutorial/tree/master/notebooks

BigDataUniversity’s Machine Learning with Python: https://cognitiveclass.ai/courses/machine-learning-with-python/

**INTRODUCTION TO DEEP LEARNING**

Udacity’s Deep Learning: https://www.udacity.com/course/deep-learning--ud730

[DL for Computer Vision: http://cs231n.stanford.edu/syllabus.html]

*BIG DATA*

**INTRODUCTION TO BIG DATA**

Coursera’s Introduction to Big Data: https://www.coursera.org/learn/big-data-introduction

**HADOOP**

BigDataUniversity’s Hadoop Fundamentals: https://cognitiveclass.ai/learn/hadoop/

**SPARK**

BigDataUniversity’s Spark Fundamentals: https://cognitiveclass.ai/learn/spark/

**DATA MINING**

Coursera’s Data Mining: https://www.coursera.org/specializations/data-mining

**r/Python**• comment

1 points • SimpleScribbler

You can audit the courses in this Coursera data mining specialization for free. It's all about text analysis/NLP.

**r/artificial**• post

2 points • seducer4real

##### Course recommendation for pursuing a career in AI. Looking for feedback and advice.

Hello, I'm currently following a machine learning course in order to learn more about AI and was wondering which other course(s) I should add to this in order to further my knowledge of AI. Machine learning course: https://www.coursera.org/learn/machine-learning I'd prefer if the course was on Coursera but I'm open to other options. Here are other courses I thought i should add to my list: Neural networks: https://www.coursera.org/learn/neural-networks# Discrete optimization: https://www.coursera.org/learn/discrete-optimization Data mining: https://www.coursera.org/specializations/data-mining Thanks for your help. Any other feedback is appreciated.

**r/UIUC**• post

1 points • macoit18

##### Online MCS - Classes that can be taken on Coursera

Hi everyone,

​

I have been admitted to the program but it seemed too expensive to me and at the end I had decided not to accept the offer of admission. However, I thought of something and probably someone here knows about that.

I would be possible to take as many classes as possible on Coursera using financial aid. I mean, how many classes of the program can be taken as independent courses on Coursera right now?

​

Until now I am aware of the following two specialization:

https://www.coursera.org/specializations/data-mining

https://www.coursera.org/specializations/cloud-computing

​

Do you know if the courses in each specialization correspond to the ones done in the degree or can be converted into credits? Do you have a full list of the courses for the MCS?

Has anyone tried to get the degree in this way?

​

Thanks

**r/agi**• post

1 points • seducer4real

##### Course recommendation for pursuing a career in AI. Looking for feedback and advice.

Hello,

I'm currently following a machine learning course in order to learn more about AI and was wondering which other course(s) I should add to this in order to further my knowledge of AI.

Machine learning course: https://www.coursera.org/learn/machine-learning

I'd prefer if the course was on Coursera but I'm open to other options.

Here are other courses I thought i should add to my list:

Neural networks: https://www.coursera.org/learn/neural-networks#

Discrete optimization: https://www.coursera.org/learn/discrete-optimization

Data mining: https://www.coursera.org/specializations/data-mining

Thanks for your help. Any other feedback is appreciated.