Foundations for Data Analytics Part 2

Instructor: Qurat-ul-Ain Azim

Access provided by New York State Department of Labor

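7 modules

Gain insight into a topic and learn the fundamentals.

Beginner level

No prior experience required

1 week to complete

at 10 hours a week

Flexible schedule

Learn at your own pace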

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

13 assignments

Taught in English

There are 7 modules in this course

This course teaches the fundamentals of computation needed to understand and analyze real-world data. Students work with modern data structures and apply data cleaning and data wrangling operations. The course covers the concepts and practical applications of probability and probability distributions, cluster analysis, text analysis, and time series analysis.

In this module, you will explore the realm of time series data, gaining a comprehensive understanding of its characteristics, components (trend, seasonality, and noise), and prevalent sources across diverse domains. Through effective visualization techniques and descriptive statistics, you will acquire the skills to recognize patterns and trends within time series data.
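As a rough illustration of the three components named above, the following sketch builds a synthetic daily series from a linear trend, a weekly seasonal pattern, and Gaussian noise, then estimates the trend with a centered moving average. The function names, the weekly period, and all numeric values are assumptions chosen for illustration; they are not taken from the course materials.

```python
import random

# Hypothetical weekly pattern; the values sum to zero over one period.
WEEKLY_PATTERN = [3.0, 1.0, -2.0, -4.0, -1.0, 1.0, 2.0]


def make_series(n, slope=0.5, noise_sd=1.0, seed=0):
    """Return trend + seasonality + noise for t = 0..n-1."""
    rng = random.Random(seed)
    return [
        slope * t + WEEKLY_PATTERN[t % 7] + rng.gauss(0.0, noise_sd)
        for t in range(n)
    ]


def moving_average(values, window):
    """Centered moving average for an odd window; None where it does not fit."""
    if window % 2 == 0:
        raise ValueError("window must be odd")
    half = window // 2
    out = [None] * len(values)
    for i in range(half, len(values) - half):
        out[i] = sum(values[i - half:i + half + 1]) / window
    return out


series = make_series(28)
# Averaging over exactly one week cancels the seasonal term,
# leaving an estimate of the trend (plus smoothed noise).
trend = moving_average(series, 7)
# What remains after subtracting the trend is seasonality plus noise.
detrended = [v - t for v, t in zip(series, trend) if t is not None]
```

The window length matches the seasonal period on purpose: a shorter or longer window would leave part of the weekly pattern in the trend estimate.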

What's included

5 videos, 5 readings, 2 assignments

5 videos (28 minutes total)

Meet Your Faculty (1 minute)
Course Overview (2 minutes)
Time Series Feature Extraction (11 minutes)
Permutation Entropy and Complexity Method (11 minutes)
CECP Example (3 minutes)

5 readings (49 minutes total)

Course Introduction (2 minutes)
Syllabus - Foundations of Data Analytics Part 2 (5 minutes)
Academic Integrity (1 minute)
Time Series Feature Extraction (1 minute)
Permutation Entropy and Complexity Method (40 minutes)

2 assignments (20 minutes total)

Module 8 Assess Your Learning: Time Series Feature Extraction (10 minutes)
Module 8 Assess Your Learning: Time Series Features (10 minutes)

This module focuses on feature extraction in time series data analysis, emphasizing the identification and utilization of diverse features. We will explore how these features capture essential information, enabling a comprehensive understanding of time series data. You will gain practical insights into the application of various feature types, enhancing your ability to extract meaningful patterns and make informed analyses in the dynamic field of time series data analysis.
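The description does not name specific features, but the lecture list includes "Permutation Entropy and Complexity Method". As one possible example of a time series feature, here is a minimal sketch of normalized permutation entropy. The function name, parameter names, and tie-breaking rule are assumptions for illustration, not the course's implementation.

```python
import math
from collections import Counter


def permutation_entropy(values, order=3, delay=1, normalize=True):
    """Shannon entropy of the ordinal patterns found in `values`.

    Each window of `order` points (spaced `delay` apart) is mapped to the
    permutation that sorts it; ties are broken by position (an assumption).
    """
    n_windows = len(values) - (order - 1) * delay
    if order < 2 or n_windows <= 0:
        raise ValueError("series too short for the chosen order and delay")

    patterns = Counter()
    for start in range(n_windows):
        window = [values[start + k * delay] for k in range(order)]
        pattern = tuple(sorted(range(order), key=lambda k: window[k]))
        patterns[pattern] += 1

    entropy = -sum(
        (count / n_windows) * math.log(count / n_windows)
        for count in patterns.values()
    )
    if normalize:
        # Divide by the maximum possible entropy, log(order!), to land in [0, 1].
        entropy /= math.log(math.factorial(order))
    return entropy


rising = permutation_entropy(list(range(20)))              # one pattern only
mixed = permutation_entropy([4, 7, 9, 10, 6, 11, 3], order=2)
```

A strictly increasing series produces a single ordinal pattern and therefore an entropy of zero, while less regular series score closer to one.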

What's included

5 videos, 5 readings, 3 assignments

5 videos (23 minutes total)

Text Processing (4 minutes)
Text Processing Basics: Tokenization and Stemming (3 minutes)
Bag of Words (BoW) (2 minutes)
TF-IDF and Word Embeddings (9 minutes)
Text Analysis Techniques (5 minutes)

5 readings (36 minutes total)

Text Processing (3 minutes)
Text Processing Basics: Tokenization and Stemming (26 minutes)
Bag of Words (BoW) (1 minute)
TF-IDF and Word Embeddings (3 minutes)
Text Analysis Techniques (3 minutes)

3 assignments (55 minutes total)

Module 9 Assess Your Learning: Text Processing Basics (20 minutes)
Module 9 Assess Your Learning: BoW and TF-IDF (20 minutes)
Module 9 Assess Your Learning: Text Analysis Techniques (15 minutes)

This module focuses on preprocessing and analyzing textual data. You will acquire practical skills in text data preprocessing, including tokenization, stemming, and stopword removal. We will discuss several methods for representing text data, including bag-of-words (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), and word embeddings. We will also explore text analysis techniques such as sentiment analysis, topic modeling, and named entity recognition. Applying these techniques lets you extract insights and patterns from textual data.
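To make the preprocessing and representation steps concrete, here is a minimal standard-library sketch of tokenization, stopword removal, bag-of-words counts, and TF-IDF weights. The stopword list, function names, and sample sentences are assumptions for illustration; stemming is omitted, and the unsmoothed formula tf * log(N / df) is only one of several TF-IDF variants in use.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"the", "a", "is", "of", "and", "on"}


def tokenize(text):
    """Lowercase, split into alphabetic tokens, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def bag_of_words(tokens):
    """Map each term to its raw count in one document."""
    return Counter(tokens)


def tf_idf(docs_tokens):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    n_docs = len(docs_tokens)
    doc_freq = Counter()
    for tokens in docs_tokens:
        doc_freq.update(set(tokens))  # count each term once per document

    weights = []
    for tokens in docs_tokens:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


docs = ["The cat sat on the mat", "The dog sat on the log"]
tokens = [tokenize(d) for d in docs]
bows = [bag_of_words(t) for t in tokens]
weights = tf_idf(tokens)
```

In this example "sat" appears in both documents, so its weight is zero, while terms unique to one document receive a positive weight.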

What's included

2 videos, 2 readings, 2 assignments

In this module, we examine network theory, equipping you with a foundational understanding of nodes, edges, and graphs. We will explore various network types, from social networks to keyword co-occurrence networks, learning to discern their relevance in diverse domains. Practical application includes extracting and creating keyword co-occurrence networks from text data through preprocessing, keyword identification, and relationship construction. You will then analyze these networks, employing measures like centrality and community detection, enhancing your ability to interpret results. This module culminates in the extraction of meaningful insights, enabling you to identify keywords and thematic clusters within textual data through the lens of network analysis.
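The pipeline described above (keywords per document, co-occurrence edges, then a centrality measure) can be sketched with the standard library alone. The keyword lists and function names below are hypothetical, and degree centrality stands in for the broader family of measures the module mentions; community detection is not shown.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_edges(documents):
    """Count how many documents each unordered keyword pair shares."""
    edges = Counter()
    for keywords in documents:
        # sorted(set(...)) gives each pair one canonical (a, b) ordering.
        for a, b in combinations(sorted(set(keywords)), 2):
            edges[(a, b)] += 1
    return edges


def degree_centrality(edges):
    """Fraction of the other nodes that each node is directly linked to."""
    neighbors = {}
    for a, b in edges:
        neighbors.setdefault(a, set()).add(b)
        neighbors.setdefault(b, set()).add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


# Hypothetical keyword sets, one list per document.
documents = [
    ["data", "network", "graph"],
    ["data", "text"],
    ["network", "graph"],
]
edges = cooccurrence_edges(documents)
centrality = degree_centrality(edges)
```

Here the nodes are keywords and an edge weight is the number of documents in which two keywords appear together; "data" is linked to every other keyword and so has the highest degree centrality.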

What's included

3 videos, 3 readings, 2 assignments

3 videos (29 minutes total)

Fundamentals of Complex Network (11 minutes)
Text Analysis Using Keyword Co-Occurrence Network (12 minutes)
Keyword Co-occurrences Networks (5 minutes)

3 readings (9 minutes total)

Fundamentals of Complex Network (3 minutes)
Text Analysis Using Keyword Co-Occurrence Network (3 minutes)
Keyword Co-occurrences Networks (3 minutes)

2 assignments (40 minutes total)

Module 11 Assess Your Learning: Fundamentals of Complex Networks (20 minutes)
Module 11 Assess Your Learning: Keyword Co-Occurrence Networks (20 minutes)

What's included

2 videos, 4 readings, 2 assignments

2 videos (21 minutes total)

Statistics in Data Analysis: Random Variables (13 minutes)
Statistics in Data Analysis: Probability Distribution Functions (8 minutes)

4 readings (121 minutes total)

Random Variables (80 minutes)
Examples: Random Variables (20 minutes)
Probability Distribution Functions (1 minute)
Examples: Probability Distribution Functions (20 minutes)

2 assignments (25 minutes total)

Module 12 Assess Your Learning: Random Variables (10 minutes)
Module 12 Assess Your Learning: Probability Distribution Functions (15 minutes)

In this module, you will study joint probability distributions. You will learn to identify and interpret these distributions, using probability mass functions (PMFs) for discrete variables and probability density functions (PDFs) for continuous variables. You will also calculate and interpret marginal probability distributions, obtained by summing (discrete case) or integrating (continuous case) a joint distribution over the other variables. Together, the theory and the calculations clarify how variables relate to one another through joint, marginal, and conditional probability distributions.
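For the discrete case, the marginal and conditional calculations can be sketched as follows. The joint PMF values are made up for illustration, and the function names are assumptions; for continuous variables the sums would be replaced by integrals, which this sketch does not cover.

```python
from collections import defaultdict
from fractions import Fraction

# Hypothetical joint PMF p(x, y) for two binary variables; values sum to 1.
JOINT = {
    (0, 0): Fraction(1, 4),
    (0, 1): Fraction(1, 4),
    (1, 0): Fraction(1, 8),
    (1, 1): Fraction(3, 8),
}


def marginal(joint, axis):
    """Sum the joint PMF over the other variable (axis 0 = X, axis 1 = Y)."""
    out = defaultdict(Fraction)
    for outcome, p in joint.items():
        out[outcome[axis]] += p
    return dict(out)


def conditional_y_given_x(joint, x):
    """Return p(y | X = x) = p(x, y) / p_X(x)."""
    p_x = marginal(joint, 0)[x]
    return {y: p / p_x for (xi, y), p in joint.items() if xi == x}


p_x = marginal(JOINT, 0)                         # {0: 1/2, 1: 1/2}
p_y = marginal(JOINT, 1)                         # {0: 3/8, 1: 5/8}
p_y_given_x1 = conditional_y_given_x(JOINT, 1)   # {0: 1/4, 1: 3/4}
```

Using `Fraction` keeps the arithmetic exact, which makes it easy to check that each marginal and conditional distribution sums to one.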

What's included

1 video, 2 readings, 1 assignment

In this module, you will explore the fundamental concept of mathematical expectation, or expected value, in probability theory. Through theory and practice, you will calculate the expected value of both discrete and continuous random variables and see why it serves as a measure of central tendency. We will also cover covariance and correlation, and you will calculate both to quantify the relationship between a pair of random variables. The sign and size of the result indicate the direction and strength of the association: positive, negative, or zero covariance/correlation. Finally, the module addresses independence and its relationship to zero covariance and correlation: independent variables have zero covariance, but zero covariance alone does not imply independence.
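A small discrete sketch of these quantities follows. The two joint PMFs and all function names are assumptions for illustration. The second example takes X uniform on {-1, 0, 1} with Y = X squared, a standard case where covariance is zero even though Y is fully determined by X.

```python
import math


def expectation(joint, func):
    """E[func(X, Y)] for a discrete joint PMF given as {(x, y): p}."""
    return sum(p * func(x, y) for (x, y), p in joint.items())


def covariance(joint):
    """Cov(X, Y) = E[(X - E[X]) (Y - E[Y])]."""
    ex = expectation(joint, lambda x, y: x)
    ey = expectation(joint, lambda x, y: y)
    return expectation(joint, lambda x, y: (x - ex) * (y - ey))


def correlation(joint):
    """Pearson correlation: covariance scaled by both standard deviations."""
    ex = expectation(joint, lambda x, y: x)
    ey = expectation(joint, lambda x, y: y)
    var_x = expectation(joint, lambda x, y: (x - ex) ** 2)
    var_y = expectation(joint, lambda x, y: (y - ey) ** 2)
    return covariance(joint) / math.sqrt(var_x * var_y)


def is_independent(joint, tol=1e-12):
    """Check p(x, y) == p_X(x) * p_Y(y) for every pair of values."""
    px, py = {}, {}
    for (x, y), p in joint.items():
        px[x] = px.get(x, 0.0) + p
        py[y] = py.get(y, 0.0) + p
    return all(
        abs(joint.get((x, y), 0.0) - px[x] * py[y]) <= tol
        for x in px for y in py
    )


# Hypothetical example 1: X and Y always agree (perfect positive association).
AGREE = {(0, 0): 0.5, (1, 1): 0.5}
# Hypothetical example 2: X uniform on {-1, 0, 1}, Y = X ** 2.
SQUARE = {(-1, 1): 1 / 3, (0, 0): 1 / 3, (1, 1): 1 / 3}

cov_square = covariance(SQUARE)
dependent = not is_independent(SQUARE)
```

The second example shows why a zero covariance should be read as "no linear association" rather than as evidence of independence.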