This course offers students an opportunity to learn fundamentals of computation required to understand and analyze real world data. The course helps students to work with modern data structures, apply data cleaning and data wrangling operations. The course covers conceptual and practical applications of probability and distribution, cluster analysis, text analysis and time series analysis.

Foundations for Data Analytics Part 1
访问权限由 New York State Department of Labor 提供
您将获得的技能
要了解的详细信息

添加到您的领英档案
14 项作业
了解顶级公司的员工如何掌握热门技能

该课程共有7个模块
In this module, we will focus on Python programming fundamentals. The aim is to help you understand Python's basic syntax, data types, and operators, enabling the creation of simple programs. Additionally, we will cover the use of if statements, loops, and proper indentation to control program flow, fostering a foundational understanding of essential control structures in Python programming.
涵盖的内容
5个视频6篇阅读材料2个作业1个讨论话题
5个视频• 总计19分钟
- Course Overview• 2分钟
- Meet Your Faculty• 1分钟
- Python Fundamentals• 8分钟
- Control Structures Pt 1• 3分钟
- Control Structures Pt 2• 6分钟
6篇阅读材料• 总计83分钟
- Course Introduction• 2分钟
- Syllabus - Foundations of Data Analytics Part 1• 5分钟
- Academic Integrity• 1分钟
- Python Fundamentals• 40分钟
- Control Structures Pt 1• 10分钟
- Control Structures Pt 2• 25分钟
2个作业• 总计20分钟
- Module 1 Assess Your Learning: Python Fundamentals• 10分钟
- Module 1 Assess Your Learning: Control Structures• 10分钟
1个讨论话题• 总计20分钟
- Meet Your Fellow Learners• 20分钟
In this module we will dive into the diverse landscape of Python data structures, including lists, dictionaries, sets, tuples, and arrays. By exploring real-world use cases, you will uncover the unique strengths and weaknesses of each data structure. You will gain insights into recognizing and understanding the characteristics of these structures, empowering you to make informed choices when tackling programming challenges. Through hands-on practice, you will develop the skills to select and apply the most suitable data structure to efficiently solve a wide range of problems, enhancing their proficiency in Python programming.
涵盖的内容
2个视频2篇阅读材料1个作业
2个视频• 总计13分钟
- List, Tuples, and Arrays• 8分钟
- Dictionaries• 5分钟
2篇阅读材料• 总计200分钟
- List, Tuples, and Arrays• 170分钟
- Dictionaries• 30分钟
1个作业• 总计10分钟
- Module 2 Assess Your Learning: Lists, Tuples, Arrays & Dictionaries• 10分钟
In this module we will introduce DataFrames, a pivotal tool in data manipulation and analysis. You will grasp the fundamental concepts of DataFrames, learning how to create, manipulate, and access data efficiently. You will gain essential skills for basic data exploration–including summarizing data, indexing, and slicing, enabling them to extract meaningful insights. Furthermore, this module equips learners with the expertise to clean and preprocess data, covering handling missing values, filtering data, merging/joining datasets, and transforming data for analysis readiness. By the end of this module, you will harness DataFrames for advanced data analysis, mastering group-wise operations, aggregation, and statistical analysis.
涵盖的内容
3个视频3篇阅读材料2个作业
3个视频• 总计19分钟
- Introduction to DataFrames• 9分钟
- Data Cleaning and Transformation• 5分钟
- Data Aggregation• 5分钟
3篇阅读材料• 总计195分钟
- Introduction to DataFrames• 160分钟
- Data Cleaning and Transformation• 10分钟
- Data Aggregation• 25分钟
2个作业• 总计20分钟
- Module 3 Assess Your Learning: DataFrames• 10分钟
- Module 3 Assess Your Learning: Data Cleaning, Transformation and Aggregation• 10分钟
This module will equip you with a comprehensive toolkit for proficient data exploration and analysis. It covers the essential techniques and tools for effectively summarizing data sets, encompassing statistical summaries, data visualization, and data cleaning methods. You will learn how to identify and assess missing data, outliers, and anomalies, vital tasks during the initial exploratory phase of data analysis. Furthermore, you will develop the ability to uncover patterns, relationships, and trends within the data using various visualizations, including scatter plots, histograms, and correlation matrices, enabling you to extract valuable insights and make informed decisions from the data.
涵盖的内容
2个视频2篇阅读材料1个作业
2个视频• 总计14分钟
- Data Exploration Techniques• 5分钟
- Visualization Methods for Pattern Recognition• 9分钟
2篇阅读材料• 总计102分钟
- Data Exploration Techniques• 100分钟
- Visualization Methods for Pattern Recognition• 2分钟
1个作业• 总计10分钟
- Module 4 Assess Your Learning: Visualization Methods for Pattern Recognition• 10分钟
In this module, we will delve into the fundamental concepts of clustering, a critical component of data analysis and pattern recognition. You will learn to recognize the importance of clustering and its role in identifying meaningful groups within data. You will explore key concepts, including data similarity, distance metrics, and the objective of grouping similar data points together. Additionally, the module equips you with the skills to assess the quality of clustering results through evaluation metrics like silhouette score and Dunn index, as well as visual inspection of clustering plots. By the end of this module, you will be proficient in understanding, applying, and evaluating clustering techniques for effective data analysis and pattern recognition.
涵盖的内容
4个视频4篇阅读材料2个作业
4个视频• 总计24分钟
- Clustering Techniques• 4分钟
- Pattern Recognition using Clustering Techniques• 3分钟
- Understanding Clustering Fundamentals• 7分钟
- Cluster Validity• 11分钟
4篇阅读材料• 总计135分钟
- Clustering Techniques• 30分钟
- Pattern Recognition using Clustering Techniques• 30分钟
- Understanding Clustering Fundamentals• 15分钟
- Cluster Validity• 60分钟
2个作业• 总计20分钟
- Module 5 Assess Your Learning: Clustering Techniques• 10分钟
- Module 5 Assess Your Learning: Cluster Validity• 10分钟
This module will provide you with a comprehensive exploration of clustering algorithms, enabling proficiency in this crucial data analysis technique. You will identify various clustering algorithms, such as k-means, hierarchical clustering, and DBSCAN, along with their underlying principles and assumptions. The expertise you will gain will help you determine the most suitable clustering algorithm based on data characteristics and objectives, and you will learn to implement these algorithms using programming languages like Python and tools such as scikit-learn. The clustering quality can be evaluated using internal and external validation methods, as discussed in Week 5.
涵盖的内容
4个视频4篇阅读材料3个作业
4个视频• 总计14分钟
- K-Means Clustering• 6分钟
- Hierarchical Clustering• 4分钟
- DBSCAN• 2分钟
- Choosing Appropriate Clustering Algorithms• 2分钟
4篇阅读材料• 总计4分钟
- K-Means Clustering• 1分钟
- Hierarchical Clustering• 1分钟
- DBSCAN• 1分钟
- Choosing Appropriate Clustering Algorithms• 1分钟
3个作业• 总计30分钟
- Module 6 Assess Your Learning: K-Means Clustering• 10分钟
- Module 6 Assess Your Learning: Hierarchical Clustering• 10分钟
- Module 6 Assess Your Learning: DBSCAN• 10分钟
In this module, you will explore the realm of time series data, gaining a comprehensive understanding of its characteristics, components (trend, seasonality, and noise), and prevalent sources across diverse domains. Through effective visualization techniques and descriptive statistics, you will acquire the skills to recognize patterns and trends within time series data.
涵盖的内容
4个视频5篇阅读材料3个作业
4个视频• 总计17分钟
- Time Series Analysis• 2分钟
- Components of Time Series• 3分钟
- Visual Analytics of Time Series• 5分钟
- Time Series Data Techniques• 7分钟
5篇阅读材料• 总计20分钟
- Time Series Analysis• 1分钟
- Components of Time Series• 3分钟
- Visual Analytics of Time Series• 3分钟
- Time Series Data Techniques• 3分钟
- Congratulations!• 10分钟
3个作业• 总计30分钟
- Module 7 Assess Your Learning: Time Series Analysis• 10分钟
- Module 7 Assess Your Learning: Visual Analytics of Time Series• 10分钟
- Module 7 Assess Your Learning: Time Series Data Techniques• 10分钟
位教师

提供方

提供方

Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容
NNortheastern University
课程

课程
LLearnQuest
课程

课程
