This course delves into both the theoretical aspects and practical applications of data mining within the field of engineering. It provides a comprehensive review of the essential fundamentals and central concepts underpinning data mining. Additionally, it introduces pivotal data mining methodologies and offers a guide to executing these techniques through various algorithms. Students will be introduced to a range of data mining techniques, such as data preprocessing, the extraction of association rules, classification, prediction, clustering, and the exploration of complex data, and will implement a capstone project exploring the same. Additionally, we will use case studies to explore the application of data mining across diverse sectors, including but not limited to manufacturing, healthcare, medicine, business, and various service industries.

Practical Engineering Data Mining: Techniques and Uses
访问权限由 Coursera Learning Team 提供
您将获得的技能
- Forecasting
- Applied Machine Learning
- Data Preprocessing
- Data Visualization Software
- Data Mining
- Data Cleansing
- Data Analysis
- Data Transformation
- Big Data
- Data-Driven Decision-Making
- Model Evaluation
- Dimensionality Reduction
- Exploratory Data Analysis
- Statistical Analysis
- Unsupervised Learning
- Regression Analysis
- Classification Algorithms
- Predictive Analytics
- 技能部分已折叠。显示 10 项技能,共 18 项。
要了解的详细信息

添加到您的领英档案
5 项作业
了解顶级公司的员工如何掌握热门技能

该课程共有4个模块
In this module, participants will explore essential data concepts across domains, understanding diverse data types, attributes, and features. They will grasp the fundamental principles, methodologies, and scope of data mining, enabling them to effectively analyze data and extract valuable insights. Through this comprehensive approach, learners will gain proficiency in utilizing key data concepts, facilitating informed decision-making and innovation across various domains.
涵盖的内容
5个视频8篇阅读材料2个作业2个讨论话题
This module aims to impart a comprehensive understanding of data concepts, spanning various domains. Participants will learn to differentiate between different data types, attributes, and features. They will explore fundamental principles and methodologies of data mining, enabling them to extract meaningful insights from datasets. By mastering these objectives, learners will be equipped with the knowledge and skills necessary to analyze data effectively and make informed decisions in diverse professional settings.
涵盖的内容
3个视频13篇阅读材料1个作业1个讨论话题
Throughout this module, we will jump into the realm of dimensionality reduction, a technique for simplifying complex datasets to facilitate efficient analysis and visualization. By implementing dimensionality reduction methods such as Principal Component Analysis (PCA) and t-Distributed Stochastic Neighbor Embedding (t-SNE), we will gain insight into how to effectively reduce the number of features while preserving essential information. We'll learn to select and apply the most suitable dimensionality reduction techniques based on data types and analytical goals, thereby enhancing model performance and interpretability. This module shares the tools to navigate and extract meaningful insights from high-dimensional datasets, paving the way for more effective data analysis and decision-making.
涵盖的内容
4个视频9篇阅读材料1个作业1个讨论话题
In this module, we learn the concept of the Bias-Variance Trade-off in machine learning. Striving for models that generalize well requires navigating the delicate balance between bias and variance to avoid underfitting and overfitting. Bias represents the error from oversimplifying a complex problem, while variance quantifies the model's sensitivity to different training data subsets. We explore strategies to combat bias and variance in developing models that strike the right balance between accuracy and generalization. Transitioning to regression metrics, we look at practical tools used to measure and evaluate model performance in regression tasks, focusing on metrics like Root Mean Squared Error (RMSE). Finally, we navigate the landscape of assessing model performance in binary classification tasks, exploring advanced measures like the F1-Score, Matthews Correlation Coefficient (MCC), propensity scores, and the AUC-ROC curve.
涵盖的内容
5个视频10篇阅读材料1个作业1个讨论话题
位教师

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容

Northeastern University

Northeastern University

Northeastern University

Northeastern University


