The "Data Processing and Manipulation" course provides students with a comprehensive understanding of various data processing and manipulation concepts and tools. Participants will learn how to handle missing values, detect outliers, perform sampling and dimension reduction, apply scaling and discretization techniques, and explore data cube and pivot table operations. This course equips students with essential skills for efficiently preparing and transforming data for analysis and decision-making.

Data Processing and Manipulation
本课程是 Data Wrangling with Python 专项课程 的一部分

位教师:Di Wu
访问权限由 New York State Department of Labor 提供
您将学到什么
Understand the importance of data processing and manipulation in the data analysis pipeline.
Learn techniques to handle missing values and outliers, data reduction, and data scaling and discretization.
Understand the concept of data cube and perform multidimensional aggregation for exploratory analysis.
您将获得的技能
要了解的详细信息

添加到您的领英档案
6 项作业
了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识
- 向行业专家学习新概念
- 获得对主题或工具的基础理解
- 通过实践项目培养工作相关技能
- 获得可共享的职业证书

该课程共有4个模块
The "Missing Values and Outliers" week focuses on how to handle missing values and detect outliers using the Pandas library. You will learn essential techniques to identify and address missing data effectively, as well as methods to detect and manage outliers in datasets.
涵盖的内容
3个视频5篇阅读材料2个作业1个讨论话题
The "Data Reduction" week focuses on how to reduce data through sampling and dimensionality reduction using the Pandas library. You will learn essential techniques to obtain manageable subsets of data while preserving meaningful information for analysis and visualization.
涵盖的内容
2个视频3篇阅读材料1个作业1个讨论话题
The "Scaling and Discretization" week focuses on the importance of data scaling and discretization in the data preprocessing process. You will learn why and how to perform data scaling to normalize variables and handle data with different scales. Additionally, you will explore the concept of data discretization and its application in transforming continuous data into categorical representations.
涵盖的内容
2个视频3篇阅读材料1个作业1个讨论话题
The "Data Warehouse" week focuses on the concepts and methodologies of organizing data using data cubes and pivot tables in Pandas. You will learn the importance of data warehousing for efficient data management and analysis, as well as how to construct data cubes and pivot tables to facilitate multidimensional data exploration.
涵盖的内容
2个视频3篇阅读材料2个作业1个讨论话题
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
位教师

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容

Microsoft

University of California San Diego




