Chevron Left
Back to Introduction to Data Science in Python

Learner Reviews & Feedback for Introduction to Data Science in Python by University of Michigan

4.5
stars
27,057 ratings

About the Course

This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses. This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python, Applied Machine Learning in Python, Applied Text Mining in Python, Applied Social Network Analysis in Python....

Top reviews

YH

Sep 28, 2021

This is the practical course.There is some concepts and assignments like: pandas, data-frame, merge and time. The asg 3 and asg4 are difficult but I think that it's very useful and improve my ability.

CB

Feb 6, 2023

The assessments, quizzes, and course coverage are quite good. The main points are covered, although it does not cover everything. Additionally, it provides opportunities to learn and conduct research.

Filter by:

3601 - 3625 of 5,948 Reviews for Introduction to Data Science in Python

By JME

Oct 3, 2017

r

o

b

By Sk M u

Oct 22, 2024

no

By Nirav N

Mar 10, 2023

NA

By afraj m [ T ]

Jul 13, 2020

no

By Aditi N

Dec 31, 2019

ok

By Ankit S

Jun 6, 2022

.

By CHARISHMA S

Jun 13, 2021

By Raj R

Sep 4, 2020

d

By Shrikant S T

Aug 20, 2020

.

By Vaibhav D K

Aug 8, 2020

.

By SHREEKHAA V K M

Jul 13, 2020

.

By Prathmesh M

Jul 9, 2020

.

By 장동희

Jun 10, 2020

By Priya G K

Jun 1, 2020

.

By Junaid L S

May 14, 2019

G

By merve s

Mar 11, 2019

t

By Srinivasarao M

Feb 4, 2019

V

By David A D V

Jan 22, 2019

I

By Anyi L

Aug 20, 2018

a

By Khushpreet S

Apr 23, 2018

G

By Thomas

Dec 29, 2017

g

By James R

Feb 5, 2017

.

By Howard C

Nov 6, 2017

Very good content. One downside for me was that being new to Python, Pandas, Numpy, Scipy etc, I found the amount of new information being thrown at me to be a bit overwhelming. Each of these languages/packages could be a separate course even before you start talking about Data Analysis concepts. I was able to complete all the assignments, but I feel like I know "just enough to be dangerous".

Speaking of the assignments, if you're a newbie like me, give yourself plenty of time to complete to work on them. My rule of thumb was to multiply the "estimated time" for each assignment by a factor of 4. The assignment that was supposed to take 2 hours ended up taking my whole Saturday and the 4 hour project at the end of the course pretty much consumed an entire weekend. This might not apply if you have previous experience in this development environment or are just smarter than me ;-)

Not everything that you need to know to do the homework is provided in the lecture, so expect to spend a lot of time in StackOverflow. The discussion forums are also very useful. Sometimes a teaching assistant will offer some hints that make all the difference.

One gripe I have is with the automated grader. It's a great idea, but sometimes you can submit a fairly complicated bit of code and the only feedback you get from the grader is: "Wrong!". My suggestion: have two data sets, one for testing and another for grading. Then students could openly discuss and debug their test results in the discussion forums without violating the Honor Code. They would still have to submit a valid algorithm to pass against the test data.

By Dionyssios M

Nov 19, 2017

I am a PhD scientist and heavy user of matlab, R, Stata, bash scripting, and some more esoteric computer languages. I took this course with the idea of covering some background in python skills in a structured manner, the goal being to move many of my data science and some of my data processing code to python.

I found the exercises useful. The lectures are not bad, I just felt they were an overview that either didn't connect much with some of the minutiae of the assignments or they were not always key to me given my background. Eg I found the week 2 videos more interesting; week 4 videos far less so especially the video about running a t-test in python (my statistical skillset is far more advanced).

The real point of frustration is the grader which is extremely sensitive to slight variations. I feel there should be a feedback system where users/students document such cases that could then become a FAQ. Examples:

Grader chokes on type but won't tell me: Submitting string 'True' instead of Boolean True.

Grader chokes on useless (non)significant digits: using round(*,2) at one point crashes the submitted work.

These "errors" are so slight that are almost beyond the human ability to catch them. The result is that, in part, the course turns from 'learning python skills' to 'getting to understand minutiae of what the grader does' which can be really frustrating.

In sum, I believe there is value in this course but the grader is fairly broken and needs a FAQ or similar to warn re choke points generated from trivial differences. I am subtracting stars in the review for that particular reason.

By Declan C

Sep 18, 2017

Overall I would certainly recommend this course, I've found it immediately relevant in my field of science/engineering. It is fast-paced and difficult yes, especially for those of us with limited python experience, but that kick it gives leaves you with some solid, immediately-applicable skills.

Where it goes well: Strong content and excellent delivery.

Fosters independence. The course starts from the basics but accelerates at a fast pace, introducing you to roots of concepts but expecting you to expand on them yourself with outside resources rather than rote-learning. In this manner it leaves you VERY prepared to tackle unscripted challenges.

Concise content. The lectures videos themselves contain almost zero fluff. The lecturer conveys relevant information in a very smooth and efficient manner. Replaying specific parts to revise or solidify understanding becomes a pleasure due to this.

Where it could be improved: Could be a more polished.

Time estimates for the assignments were WAY off. I do not mind a challenging assignment, however if it advertises that it will take 3 hours, I would hope not to expect to spend closer to 20 hours, however this certainly was the case. Multiply the estimated time by 4-5 to get a more realistic time.

The assignment wording can sometimes be a little ambiguous. It's almost mandatory to go through the forum posts for clarification. I realise some things were noticed after the publication of the course, and contained by pinned posts in the forum, but perhaps if the next installment of the course could be updated, ironing out some of these wrinkles.