Packt
Intermediate Data Analysis Techniques with Pandas

通过 Coursera Plus 解锁访问 10,000 多门课程。开始 7 天免费试用

Packt

Intermediate Data Analysis Techniques with Pandas

包含在 Coursera Plus

深入了解一个主题并学习基础知识。
4.9

(12 条评论)

中级 等级

推荐体验

1 周 完成
在 10 小时 一周
灵活的计划
自行安排学习进度
深入了解一个主题并学习基础知识。
4.9

(12 条评论)

中级 等级

推荐体验

1 周 完成
在 10 小时 一周
灵活的计划
自行安排学习进度

您将学到什么

  • Utilize advanced data selection and column operations techniques in Pandas.

  • Employ various filtering techniques to enhance data extraction precision.

  • Apply Pandas methods proficiently to clean and prepare data effectively.

  • Manage and manipulate MultiIndex and text data within Pandas for comprehensive data handling.

要了解的详细信息

可分享的证书

添加到您的领英档案

作业

8 项作业

授课语言:英语(English)

了解顶级公司的员工如何掌握热门技能

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

积累特定领域的专业知识

本课程是 Data Analysis with Pandas and Python 专项课程 专项课程的一部分
在注册此课程时,您还会同时注册此专项课程。
  • 向行业专家学习新概念
  • 获得对主题或工具的基础理解
  • 通过实践项目培养工作相关技能
  • 获得可共享的职业证书

该课程共有7个模块

In this module, we will explore the foundational concepts of working with DataFrames in Pandas, starting with a comparison of Series and DataFrame methods and attributes. You will learn to select and manipulate both single and multiple columns, and add new columns to your DataFrames. We will cover the use of value_counts for column analysis and strategies for handling missing values. Additionally, you'll master data type conversions using the astype method, sorting DataFrames with sort_values and sort_index, and ranking values within columns using the rank method.

涵盖的内容

14个视频2篇阅读材料1个作业

In this module, we will dive into filtering data within DataFrames. You'll be introduced to the dataset and learn memory optimization techniques. We will cover filtering rows based on conditions and using logical operators like AND (&) and OR (|). Advanced filtering methods such as isin, isnull, and notnull will be explored. You'll also learn to filter data within a range using the between method, identify and handle duplicates with duplicated and drop_duplicates, and find and count unique values using unique and nunique methods.

涵盖的内容

10个视频1个作业

In this module, we will explore essential data extraction techniques in Pandas. You'll start with an introduction to the dataset and learn to set and reset indices using set_index and reset_index methods. We will cover retrieving rows by index positions with iloc and by labels with loc, and understand the second arguments for precise data retrieval. You'll learn to overwrite individual and multiple values, rename index labels or columns, and delete rows or columns. Advanced extraction techniques like sampling with the sample method, extracting specific rows with nsmallest and nlargest, conditional filtering with where, and executing functions across DataFrame rows or columns with apply, will also be covered.

涵盖的内容

13个视频1个作业

In this module, we will focus on working with text data in Pandas. You'll start with an introduction to the dataset and learn to use common string methods for text data manipulation. We will cover filtering DataFrame rows using string methods and applying these methods to DataFrame indices and columns. You'll master the split method to divide text data into multiple parts and enhance your skills with additional practice exercises. Finally, you'll learn to customize text splitting using the expand and n parameters of the split method for more detailed analysis.

涵盖的内容

7个视频1个作业

In this module, we will explore the advanced capabilities of MultiIndex in Pandas, starting with an introduction to its concepts. You'll learn to create and manage MultiIndex DataFrames for complex data grouping and analysis. We will cover techniques to extract and rename index level values for clarity, and how to sort and extract specific rows for better data organization. Additionally, you'll master methods like transpose, stack, and unstack to reshape DataFrames, and apply pivot, melt, and pivot_table methods to reorganize and transform data efficiently.

涵盖的内容

12个视频1个作业

In this module, we will delve into the GroupBy functionality in Pandas, starting with an introduction to its essential concepts for data aggregation. You'll learn to use the groupby method to group data and retrieve specific groups with the get_group method. We will explore various aggregation methods available on GroupBy objects and cover techniques for grouping data by multiple columns. Additionally, you'll master the agg method to apply multiple operations on grouped data and learn to iterate through groups for individual data processing.

涵盖的内容

7个视频1个作业

In this module, we will explore essential techniques for merging DataFrames in Pandas. You'll begin with an introduction to various merging methods, followed by a detailed look at using the pd.concat function to concatenate DataFrames along a specified axis. We will cover left joins and the use of left_on and right_on parameters for specific column matching, as well as inner joins to combine DataFrames based on intersecting keys. Additionally, you'll learn about full-outer joins to merge DataFrames including all keys from both frames, and how to merge by indexes using left_index and right_index parameters. Finally, you'll be introduced to the join method as a simpler alternative for merging DataFrames.

涵盖的内容

10个视频1篇阅读材料2个作业

获得职业证书

将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。

位教师

Packt - Course Instructors
Packt
1,186 门课程292,430 名学生

提供方

Packt

从 Data Analysis 浏览更多内容

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.
自 2018开始学习的学生
''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情,我就可以学习。'
Jennifer J.
自 2020开始学习的学生
''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'
Larry W.
自 2021开始学习的学生
''如果我的大学不提供我需要的主题课程,Coursera 便是最好的去处之一。'
Chaitanya A.
''学习不仅仅是在工作中做的更好:它远不止于此。Coursera 让我无限制地学习。'
Coursera Plus

通过 Coursera Plus 开启新生涯

无限制访问 10,000+ 世界一流的课程、实践项目和就业就绪证书课程 - 所有这些都包含在您的订阅中

通过在线学位推动您的职业生涯

获取世界一流大学的学位 - 100% 在线

加入超过 3400 家选择 Coursera for Business 的全球公司

提升员工的技能,使其在数字经济中脱颖而出

常见问题