This course offers students an opportunity to learn fundamentals of computation required to understand and analyze real world data. The course helps students to work with modern data structures, apply data cleaning and data wrangling operations. The course covers conceptual and practical applications of probability and distribution, cluster analysis, text analysis and time series analysis.

Foundations for Data Analytics Part 2
访问权限由 New York State Department of Labor 提供
您将获得的技能
要了解的详细信息

添加到您的领英档案
13 项作业
了解顶级公司的员工如何掌握热门技能

该课程共有7个模块
In this module, you will explore the realm of time series data, gaining a comprehensive understanding of its characteristics, components (trend, seasonality, and noise), and prevalent sources across diverse domains. Through effective visualization techniques and descriptive statistics, you will acquire the skills to recognize patterns and trends within time series data.
涵盖的内容
5个视频5篇阅读材料2个作业
5个视频• 总计28分钟
- Meet Your Faculty• 1分钟
- Course Overview• 2分钟
- Time Series Feature Extraction• 11分钟
- Permutation Entropy and Complexity Method• 11分钟
- CECP Example• 3分钟
5篇阅读材料• 总计49分钟
- Course Introduction• 2分钟
- Syllabus - Foundations of Data Analytics Part 2• 5分钟
- Academic Integrity• 1分钟
- Time Series Feature Extraction• 1分钟
- Permutation Entropy and Complexity Method• 40分钟
2个作业• 总计20分钟
- Module 8 Assess Your Learning: Time Series Feature Extraction• 10分钟
- Module 8 Assess Your Learning: Time Series Features• 10分钟
This module focuses on feature extraction in time series data analysis, emphasizing the identification and utilization of diverse features. We will explore how these features capture essential information, enabling a comprehensive understanding of time series data. You will gain practical insights into the application of various feature types, enhancing your ability to extract meaningful patterns and make informed analyses in the dynamic field of time series data analysis.
涵盖的内容
5个视频5篇阅读材料3个作业
5个视频• 总计23分钟
- Text Processing• 4分钟
- Text Processing Basics: Tokenization and Stemming• 3分钟
- Bag of Words (BoW)• 2分钟
- TF-IDF and Word Embeddings• 9分钟
- Text Analysis Techniques• 5分钟
5篇阅读材料• 总计36分钟
- Text Processing• 3分钟
- Text Processing Basics: Tokenization and Stemming• 26分钟
- Bag of Words (BoW)• 1分钟
- TF-IDF and Word Embeddings• 3分钟
- Text Analysis Techniques• 3分钟
3个作业• 总计55分钟
- Module 9 Assess Your Learning: Text Processing Basics• 20分钟
- Module 9 Assess Your Learning: BoW and TF-IDF• 20分钟
- Module 9 Assess Your Learning: Text Analysis Techniques• 15分钟
This module focuses on the comprehensive preprocessing and analysis of textual data. You will acquire practical skills in text data preprocessing, encompassing tasks such as tokenization, stemming, and stopword removal. We will discuss diverse methods for representing text data, including bag-of-words (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), and word embeddings. We will also explore various text analysis techniques such as sentiment analysis, topic modeling, and named entity recognition. The practical application of these techniques enables you to extract meaningful insights, patterns, and nuanced meanings from textual data, empowering you to navigate and derive value from the intricate landscape of text analysis.
涵盖的内容
2个视频2篇阅读材料2个作业
2个视频• 总计8分钟
- N-Grams and Bi-Grams• 4分钟
- Word Correlation Using PMI• 3分钟
2篇阅读材料• 总计6分钟
- N-Grams and Bigrams• 3分钟
- Word Correlation Using PMI• 3分钟
2个作业• 总计20分钟
- Module 10 Assess Your Learning: N-Grams and Bigrams• 10分钟
- Module 10 Assess Your Learning: Word Correlations Using PMI• 10分钟
In this module, we examine network theory, equipping you with a foundational understanding of nodes, edges, and graphs. We will explore various network types, from social networks to keyword co-occurrence networks, learning to discern their relevance in diverse domains. Practical application includes extracting and creating keyword co-occurrence networks from text data through preprocessing, keyword identification, and relationship construction. You will then analyze these networks, employing measures like centrality and community detection, enhancing your ability to interpret results. This module culminates in the extraction of meaningful insights, enabling you to identify keywords and thematic clusters within textual data through the lens of network analysis.
涵盖的内容
3个视频3篇阅读材料2个作业
3个视频• 总计29分钟
- Fundamentals of Complex Network• 11分钟
- Text Analysis Using Keyword Co-Occurrence Network• 12分钟
- Keyword Co-occurrences Networks• 5分钟
3篇阅读材料• 总计9分钟
- Fundamentals of Complex Network• 3分钟
- Text Analysis Using Keyword Co-Occurrence Network• 3分钟
- Keyword Co-occurrences Networks• 3分钟
2个作业• 总计40分钟
- Module 11 Assess Your Learning: Fundamentals of Complex Networks• 20分钟
- Module 11 Assess Your Learning: Keyword Co-Occurrence Networks• 20分钟
In this module, we examine network theory, equipping you with a foundational understanding of nodes, edges, and graphs. We will explore various network types, from social networks to keyword co-occurrence networks, learning to discern their relevance in diverse domains. Practical application includes extracting and creating keyword co-occurrence networks from text data through preprocessing, keyword identification, and relationship construction. You will then analyze these networks, employing measures like centrality and community detection, enhancing your ability to interpret results. This module culminates in the extraction of meaningful insights, enabling you to identify keywords and thematic clusters within textual data through the lens of network analysis.
涵盖的内容
2个视频4篇阅读材料2个作业
2个视频• 总计21分钟
- Statistics in Data Analysis: Random Variables• 13分钟
- Statistics in Data Analysis: Probability Distribution Functions• 8分钟
4篇阅读材料• 总计121分钟
- Random Variables• 80分钟
- Examples: Random Variables• 20分钟
- Probability Distribution Functions• 1分钟
- Examples: Probability Distribution Functions• 20分钟
2个作业• 总计25分钟
- Module 12 Assess Your Learning: Random Variables• 10分钟
- Module 12 Assess Your Learning: Probability Distribution Functions• 15分钟
In this module, you will inspect the intricate world of joint probability distributions. You will develop the skill to identify and interpret these distributions, employing probability mass functions (PMFs) for discrete variables and probability density functions (PDFs) for continuous variables. This module will further equip you with the capability to calculate and interpret marginal probability distributions, involving the summing or integrating of variables within a joint distribution. The theoretical insights and practical calculations will help you gain a complete understanding of the relationships between variables and the nuanced exploration of joint, marginal, and conditional probability distributions.
涵盖的内容
1个视频2篇阅读材料1个作业
1个视频• 总计5分钟
- Joint, Marginal and Conditional Distributions• 5分钟
2篇阅读材料• 总计41分钟
- Joint, Marginal and Conditional Distributions• 1分钟
- Examples: Joint, Marginal and Conditional Distributions• 40分钟
1个作业• 总计10分钟
- Module 13 Assess Your Learning: Joint, Marginal, and Conditional Distributions• 10分钟
In this module, you will explore the fundamental concept of mathematical expectation, or expected value, in probability theory. Through theory and practice, you will calculate the expected value for both discrete and continuous random variables, gaining insights into its significance as a measure of central tendency. We will also explore the statistical concepts of covariance and correlation, guiding participants in the calculation of coefficients to quantify relationships between pairs of random variables. Interpretation of these results allows you to classify the degree and direction of association through positive, negative, or zero covariance/correlation values. Additionally, the module addresses the concept of independence, elucidating its relationship with zero covariance and correlation.
涵盖的内容
3个视频5篇阅读材料1个作业
3个视频• 总计13分钟
- Mathematical Expectation• 5分钟
- Mathematical Expectation Pt 2• 4分钟
- Covariance and Correlation• 4分钟
5篇阅读材料• 总计15分钟
- Mathematical Expectation• 1分钟
- Example: Mathematical Expectation• 10分钟
- Covariance and Correlation• 1分钟
- Course Conclusion• 1分钟
- Congratulations! • 2分钟
1个作业• 总计10分钟
- Module 14 Assess Your Learning: Covariance and Correlation• 10分钟
位教师

提供方

提供方

Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容
NNortheastern University
课程

课程

课程

课程
