This course delves into both the theoretical aspects and practical applications of data mining within the field of engineering. It provides a comprehensive review of the essential fundamentals and central concepts underpinning data mining. Additionally, it introduces pivotal data mining methodologies and offers a guide to executing these techniques through various algorithms. Students will be introduced to a range of data mining techniques, such as clustering, the extraction of association rules, support vector machines, neural networks, and the exploration of other complex techniques. Additionally, we will use case studies to explore the application of data mining across diverse sectors, including but not limited to manufacturing, healthcare, medicine, business, and various service industries.

Machine Learning and Data Analytics Part 2
访问权限由 Coursera Learning Team 提供
您将获得的技能
您将学习的工具
要了解的详细信息

添加到您的领英档案
7 项作业
了解顶级公司的员工如何掌握热门技能

该课程共有7个模块
In this module, we'll learn two powerful techniques for refining predictive models: Ridge regression and Lasso regression. These methods address the challenge of overfitting in linear regression by introducing regularization techniques. Ridge regression employs L2 regularization to control the magnitude of coefficients, while Lasso regression utilizes L1 regularization to perform feature selection. Throughout this module, we'll explore the principles behind Ridge and Lasso regression, examine their mathematical foundations, understand how they tackle overfitting, learn how to implement them in practical scenarios, and discuss the intricacies of these essential regression techniques.
涵盖的内容
3个视频9篇阅读材料1个作业
3个视频•总计35分钟
- Ridge Regression•15分钟
- Lasso Regression •11分钟
- Feature Selection Machine Learning Model•9分钟
9篇阅读材料•总计74分钟
- Course Introduction•10分钟
- Meet Your Faculty: Chinthaka Pathum "Dinesh" Herath Gedara•10分钟
- Machine Learning and Data Analytics Part 2 Syllabus•10分钟
- Academic Integrity•10分钟
- Ridge Regression•15分钟
- Advantages and Limitations of Ridge Regression•3分钟
- Lasso Regression •5分钟
- Advantages and Limitations of Lasso Regression•3分钟
- Feature Selection Machine Learning Model•8分钟
1个作业•总计10分钟
- Module 8: Assess Your Learning: Key Regression Techniques•10分钟
In this module, we'll dive into the core principles and algorithms of clustering in data mining. You'll learn about key techniques such as K-Means, hierarchical clustering, and DBSCAN. Through hands-on activities and real-world datasets, you'll learn to identify patterns and groupings effectively. With K-Means clustering, we’ll explore how to partition data into distinct groups based on similarity. Hierarchical clustering will help us dive into creating dendrograms to visualize relationships between data points. Finally, DBSCAN will introduce you to density-based clustering, ideal for detecting outliers and noise in your data. Get ready to unlock the power of clustering algorithms!
涵盖的内容
3个视频7篇阅读材料1个作业
3个视频•总计39分钟
- K-Means Clustering•16分钟
- Hierarchical Clustering (Distance Matrices)•13分钟
- DBSCAN•10分钟
7篇阅读材料•总计243分钟
- K-Means Clustering•95分钟
- K-Means Clustering: Advantages and Limitations•3分钟
- Hierarchical Clustering (Distance Matrices)•95分钟
- Hierarchical Clustering Advantages and Limitations•2分钟
- DBSCAN•25分钟
- Silhouette Coefficient and Variance Ratio Criterion•20分钟
- DBSCAN Advantages and Limitations•3分钟
1个作业•总计10分钟
- Module 9: Assess Your Learning: Clustering Algorithms•10分钟
In this module, we discuss the fundamental concepts and algorithms of association rule mining, including Apriori and FP-Growth. Through the Association Rule Mining lesson, you'll grasp the essence of discovering meaningful patterns and relationships in large datasets. The FP-Growth (Frequent Pattern Growth) Algorithm lesson will equip you with the skills to create and implement efficient algorithms for identifying frequent itemsets and strong association rules. Additionally, you'll learn how collaborative filtering, a technique widely used in recommendation systems, leverages association rule mining to provide personalized recommendations. By the end of the module, you'll be adept at evaluating the effectiveness of association rules using key metrics such as support, confidence, and lift.
涵盖的内容
3个视频6篇阅读材料1个作业
3个视频•总计37分钟
- Association Rule Mining•21分钟
- FP-Growth •6分钟
- Collaborative Filtering•11分钟
6篇阅读材料•总计125分钟
- Association Rule Mining•70分钟
- Apriori Algorithm Advantages and Limitations•3分钟
- FP-Growth •45分钟
- FP-Growth: Advantages and Limitations•3分钟
- Collaborative Filtering•2分钟
- Collaborative Filtering: Advantages and Limitations•2分钟
1个作业•总计10分钟
- Module 10: Assess Your Learning: Association Rule Mining•10分钟
In this module, you'll master the application of support vector machines (SVMs) for classification tasks, learning to leverage this powerful discriminative algorithm effectively. We'll explore the significance of support vectors in defining the margin that separates different classes, enhancing the model's generalization capabilities. You'll gain insights into the differences between a hard margin SVM and soft margin SVM, with a focus on handling real-world, noisy data. Additionally, we'll delve into the mathematical formulation of the soft margin SVM, emphasizing the objective function and its critical role in balancing margin width and classification accuracy.
涵盖的内容
3个视频5篇阅读材料1个作业
3个视频•总计20分钟
- SVM Hard Margins•7分钟
- SVM Soft Margins•9分钟
- Kernels•4分钟
5篇阅读材料•总计150分钟
- SVM Hard Margins•70分钟
- SVM Soft Margins•40分钟
- Kernels•20分钟
- Kernel Trick Example•10分钟
- Kernels Advantages and Limitations•10分钟
1个作业•总计30分钟
- Module 11: Assess Your Learning: Support Vector Machines•30分钟
This module is designed to provide you with a comprehensive understanding of neural network architectures and their functionality. You will explore the intricacies of feedforward networks, the mechanics of backpropagation, and foundational concepts in deep learning. Through practical examples and hands-on exercises, you'll learn how to build and train neural networks to solve complex problems. By the end of this module, you will have a solid grasp of how neural networks operate and be prepared to apply deep learning techniques in various real-world scenarios.
涵盖的内容
3个视频3篇阅读材料1个作业
3个视频•总计27分钟
- Neural Networks•16分钟
- Implementing Neural Networks•4分钟
- Chain Rule•7分钟
3篇阅读材料•总计85分钟
- Neural Networks•15分钟
- Implementing Neural Networks•60分钟
- Advantages and Limitations of Neural Networks•10分钟
1个作业•总计10分钟
- Module 12: Assess Your Learning: Neural Networks•10分钟
Welcome to the Text Mining Module! This module will equip you with essential skills and knowledge in text mining, covering key concepts, techniques, and the related challenges you may encounter. You will learn about text preprocessing, tokenization, and feature extraction, all crucial for transforming raw text into valuable data. We will delve into applying various text mining algorithms for practical applications, such as information retrieval, sentiment analysis, and topic modeling. By the end of this module, you will be adept at leveraging text mining tools to uncover insights and patterns from textual data, enhancing your data analysis capabilities.
涵盖的内容
2个视频3篇阅读材料1个作业
2个视频•总计26分钟
- Text Mining•13分钟
- Text Preprocessing and Feature Extraction•13分钟
3篇阅读材料•总计48分钟
- Text Mining•45分钟
- Text Preprocessing and Feature Extraction•1分钟
- Text Preprocessing Advantages and Limitations•2分钟
1个作业•总计10分钟
- Module 13: Assess Your Learning: Text Mining•10分钟
In this module, you will explore the realm of time series analysis, mastering fundamental concepts and techniques essential for understanding and analyzing temporal data patterns. You will gain an understanding of the triad of trend, seasonality, and noise, deciphering their influence on time series behavior. Through hands-on exercises, you will learn to discern patterns, identify trends, and isolate seasonal fluctuations within time series data. Building upon this foundation, you will then embark on a journey through advanced time series modeling techniques. You will wield powerful tools such as ARIMA, exponential smoothing, and LSTM networks to forecast future trends and detect anomalies within temporal data streams. By the module's conclusion, you will emerge equipped with the skills and knowledge necessary to harness the predictive power of time series analysis in diverse domains.
涵盖的内容
3个视频4篇阅读材料1个作业
3个视频•总计43分钟
- Time Series Analysis•11分钟
- ARIMA Model•18分钟
- Key Time Series Models and Techniques•15分钟
4篇阅读材料•总计63分钟
- Time Series Analysis•40分钟
- ARIMA Models•10分钟
- ARIMA Models Advantages and Limitations•3分钟
- Congratulations!•10分钟
1个作业•总计10分钟
- Module 14: Assess Your Learning: Time Series Analysis•10分钟
位教师

提供方

提供方

Founded in 1898, Northeastern is a global research university with a distinctive, experience-driven approach to education and discovery. The university is a leader in experiential learning, powered by the world’s most far-reaching cooperative education program. The spirit of collaboration guides a use-inspired research enterprise focused on solving global challenges in health, security, and sustainability.
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容
NNortheastern University
课程
NNortheastern University
课程

课程

