Chevron Left
返回到 Introduction to Data Science in Python

学生对 University of Michigan 提供的 Introduction to Data Science in Python 的评价和反馈

4.5
27,271 个评分

课程概述

This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses. This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python, Applied Machine Learning in Python, Applied Text Mining in Python, Applied Social Network Analysis in Python....

热门审阅

AN

Aug 15, 2020

I found this course appealing because it was more practical based.it helped me alot in getting hands on experience and most of all I have learned how to solve real world problem with python libraries

ME

Jul 26, 2020

Quizzes were very challenging and interesting. I learned alot. The main drawback in this course is that the materials are insufficient to answer the assignments.And some questions were not so clear.

筛选依据:

3626 - Introduction to Data Science in Python 的 3650 个评论(共 5,994 个)

创建者 Jan-Jaap G d S

Nov 18, 2017

n/a

创建者 JME

Oct 3, 2017

r

o

b

创建者 Sk M u

Oct 22, 2024

no

创建者 Nirav N

Mar 9, 2023

NA

创建者 afraj m [ T ]

Jul 13, 2020

no

创建者 Aditi N

Dec 31, 2019

ok

创建者 Ankit S

Jun 6, 2022

.

创建者 CHARISHMA S

Jun 13, 2021

创建者 Raj R

Sep 4, 2020

d

创建者 Shrikant S T

Aug 20, 2020

.

创建者 Vaibhav D K

Aug 8, 2020

.

创建者 SHREEKHAA V K M

Jul 13, 2020

.

创建者 Prathmesh M

Jul 9, 2020

.

创建者 장동희

Jun 10, 2020

创建者 Priya G K

Jun 1, 2020

.

创建者 Junaid L S

May 13, 2019

G

创建者 merve s

Mar 11, 2019

t

创建者 Srinivasarao M

Feb 4, 2019

V

创建者 David A D V

Jan 22, 2019

I

创建者 Anyi L

Aug 19, 2018

a

创建者 Khushpreet S

Apr 23, 2018

G

创建者 Thomas

Dec 28, 2017

g

创建者 James R

Feb 5, 2017

.

创建者 Howard C

Nov 6, 2017

Very good content. One downside for me was that being new to Python, Pandas, Numpy, Scipy etc, I found the amount of new information being thrown at me to be a bit overwhelming. Each of these languages/packages could be a separate course even before you start talking about Data Analysis concepts. I was able to complete all the assignments, but I feel like I know "just enough to be dangerous".

Speaking of the assignments, if you're a newbie like me, give yourself plenty of time to complete to work on them. My rule of thumb was to multiply the "estimated time" for each assignment by a factor of 4. The assignment that was supposed to take 2 hours ended up taking my whole Saturday and the 4 hour project at the end of the course pretty much consumed an entire weekend. This might not apply if you have previous experience in this development environment or are just smarter than me ;-)

Not everything that you need to know to do the homework is provided in the lecture, so expect to spend a lot of time in StackOverflow. The discussion forums are also very useful. Sometimes a teaching assistant will offer some hints that make all the difference.

One gripe I have is with the automated grader. It's a great idea, but sometimes you can submit a fairly complicated bit of code and the only feedback you get from the grader is: "Wrong!". My suggestion: have two data sets, one for testing and another for grading. Then students could openly discuss and debug their test results in the discussion forums without violating the Honor Code. They would still have to submit a valid algorithm to pass against the test data.

创建者 Dionyssios M

Nov 19, 2017

I am a PhD scientist and heavy user of matlab, R, Stata, bash scripting, and some more esoteric computer languages. I took this course with the idea of covering some background in python skills in a structured manner, the goal being to move many of my data science and some of my data processing code to python.

I found the exercises useful. The lectures are not bad, I just felt they were an overview that either didn't connect much with some of the minutiae of the assignments or they were not always key to me given my background. Eg I found the week 2 videos more interesting; week 4 videos far less so especially the video about running a t-test in python (my statistical skillset is far more advanced).

The real point of frustration is the grader which is extremely sensitive to slight variations. I feel there should be a feedback system where users/students document such cases that could then become a FAQ. Examples:

Grader chokes on type but won't tell me: Submitting string 'True' instead of Boolean True.

Grader chokes on useless (non)significant digits: using round(*,2) at one point crashes the submitted work.

These "errors" are so slight that are almost beyond the human ability to catch them. The result is that, in part, the course turns from 'learning python skills' to 'getting to understand minutiae of what the grader does' which can be really frustrating.

In sum, I believe there is value in this course but the grader is fairly broken and needs a FAQ or similar to warn re choke points generated from trivial differences. I am subtracting stars in the review for that particular reason.