Chevron Left
Back to Getting and Cleaning Data

Learner Reviews & Feedback for Getting and Cleaning Data by Johns Hopkins University

4.5
stars
8,062 ratings

About the Course

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data....

Top reviews

HS

May 2, 2020

This course provides an introduction of some important concepts and tools on a very important aspect of data science: cleaning and organizing data before any analysis. A must for any data scientist.

DH

Feb 1, 2016

Easy, mostly instructive Course. The Assignments and quizzes are quite good, and illustrates the lessons very well.

See the videos for general presentation, but use the energy on the excersizes.

Filter by:

1151 - 1175 of 1,311 Reviews for Getting and Cleaning Data

By Rigoberto Á

Nov 26, 2017

The professor Leak is not as gifted (in terms of teaching skills) as R. Peng. In some of the lectures he just reads what it's in the presentations but he does not go very deep into them.

By Deleted A

Aug 12, 2016

Contents in first half weeks are very superficial, have low depth so that do not help me do some meaningful studies. But later ones are good for understanding the structure of data.

By Shuwen Y

May 28, 2016

less hands-on exercises and this course covers too much topics without details. More like general intro to each tool and data sources. Swirl is still a great package for practice.

By Christoph G

Jun 12, 2016

I liked it, but I had the impression it wasn't as prepared as the other courses. Especially with the course assignment I had a bit trouble to understand, what was wanted.

By Angela L

Jan 19, 2016

This is not a beginner's course, so a decent grasp of the R language is necessary. It is best to take this course after some stints with Data Camp, Swirl, or Code School.

By Juan D P M

Oct 13, 2021

Some lessons are pretty good, but there is a gap between the lessons and the assigments. You're evaluated in some aspects that are not very explained in the lessons.

By Sahil S

Jan 16, 2021

Assiignments are out of date, some commands are deprecated. Also, the quizzes and projects require more in depth lessons or practice problems to complete. Thank you

By Yuqi J

Jan 14, 2019

Some of the lectures on loading data were very dry, but I guess that can't really be helped. Also the final course project's requirements were on the vague side.

By Buddsalakhum R

Mar 1, 2021

I think this one is easier than the R programming but still, more exercises to practice would be good. Also, the assignment's explanation should be more simple.

By Samantha H

May 30, 2018

It would be better if we had practice problems along the way. This course seemed to have a lot of commands that didn't stick until I put them into practice.

By András H

Mar 4, 2018

This is an important course, but many updates will be needed. There are only a few exercise task, there could be more. The swirl part of the course is good.

By Andrew C

Sep 20, 2016

The best course so far (though that's not saying much). This course would be better as a follow up to an example workflow showing an end to end analysis.

By Amir A S

Mar 8, 2020

Not on par with the course sets before it, could have been a bit more explaining as cleaning data is one of the most important parts of data science.

By Ranto R

Jul 6, 2020

The content of the course is very interesting and useful. However, I found very challenging the difference between lectures and quizzes/assignements

By Bijan S

Jan 30, 2016

The selection of topics is great. However, the course is too abstract. I think some of the materials deserve to be discussed more comprehensively.

By Paymon H

Apr 14, 2016

Toughest class in the offering. There really could've been 2 classes for cleaning data. I struggled with the lecturer's style (spoke too fast).

By nicolas r

Sep 15, 2021

Some assessments are not well related with topics developed during classes. Then, it is difficult to understand what it is the goal to achieve.

By Daniel D

Sep 3, 2019

The project description could have been a lot more descriptive for what we were supposed to do. Otherwise, I had a lot of fun with this section.

By Teodor I

Jan 9, 2018

The non-clickable links in the pdf are a major problem. You need to figure it out. Create a link reference field for each video or something.

By Nishi G

Apr 27, 2017

To get value from the class, I used other on-line materials to understand the topic. Thus, I spent about 20 hours/week on this class.

By Paul M

May 15, 2018

I felt I had to teach myself in this course. I feel I would have had the same learning experience if I had purchased a good R book.

By Richard C

May 10, 2018

A lot of things covered in this course in a short time. Probably should have been 8 weeks with more homework done step by step.

By Sergey K

Mar 16, 2021

great materials but unrelated to assignments. great real life experience "here is link. make it nice. deadline yesterday"

By Diana L G F

May 18, 2020

Sometimes I felt the were not evaluating my learning or ability to do things but my ability to read instructor's mind.

By Hassan D

Apr 17, 2020

it gives us good information, but the information sometimes are incomplete or need to be updated( created in 2014!)