Welcome to Advanced Machine Learning Techniques, where you'll dive deep into sophisticated approaches that power modern AI applications. We'll explore five key areas of advanced ML: ensemble methods for combining models, dimensionality reduction techniques for handling complex data, natural language processing for text analysis, reinforcement learning for decision-making systems, and automated machine learning for optimization. You'll work hands-on with industry-standard tools including Scikit-learn, XGBoost, NLTK, PyTorch, and MLflow, learning how to implement and optimize advanced algorithms in real-world scenarios.

Advanced Machine Learning Techniques
本课程是多个项目的一部分。
访问权限由 Coursera Learning Team 提供
推荐体验
推荐体验
中级
Basic familiarity with Python syntax, Data structures and Linear Algebra concepts like vectors, matrices, dot products, and Eigenvalues.
推荐体验
推荐体验
中级
Basic familiarity with Python syntax, Data structures and Linear Algebra concepts like vectors, matrices, dot products, and Eigenvalues.
要了解的详细信息

添加到您的领英档案
22 项作业
了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识
- 向行业专家学习新概念
- 获得对主题或工具的基础理解
- 通过实践项目培养工作相关技能
- 获得可共享的职业证书

该课程共有5个模块
In this module, you will establish ensemble learning techniques including bagging, boosting, and stacking. You'll learn how to combine multiple models to improve predictive performance and implement them using popular libraries like Scikit-learn, XGBoost, and LightGBM. Through hands-on practice, you'll evaluate ensemble models using cross-validation and learn to optimize their hyperparameters.
涵盖的内容
16个视频8篇阅读材料5个作业4个非评分实验室
16个视频• 总计48分钟
- Welcome to Advanced Machine Learning Techniques• 2分钟
- Why Single Decision Trees Can Overfit: A Visual Primer• 3分钟
- How Bagging Stabilizes Predictions and Reduces Variance• 2分钟
- Random Forest for Classification: Iris Dataset Walkthrough• 4分钟
- Random Forest for Regression: Predicting House Prices• 3分钟
- Why Weak Learners Fail — And What Boosting Tries to Fix• 2分钟
- How Boosting Learns from Mistakes — One Model at a Time• 3分钟
- Implementing XGBoost and LightGBM for Boosted Classification• 3分钟
- What Is Stacking? A Simple Visual Explanation• 3分钟
- How to Train a Stacking Model (Without Leaking Data)• 4分钟
- Hands-On: Setting Up Base Models for Stacking in Scikit-learn• 5分钟
- Hands-On: Training and Evaluating a Stacked Ensemble in Python• 3分钟
- Cross-Validation Basics: How It Works, Why It Matters, and Why a Single Data Split Can Mislead You• 3分钟
- How Cross-Validation Makes Model Comparison More Reliable• 3分钟
- Cross-Validation with cross_val_score: Comparing Ensemble Models• 2分钟
- Hyperparameter Tuning with GridSearchCV: Optimizing XGBoost• 3分钟
8篇阅读材料• 总计74分钟
- Understanding Bagging and Random Forests • 8分钟
- Understanding Hyperparameters in Random Forests• 10分钟
- Boosting Algorithms Explained: From AdaBoost to XGBoost & LightGBM• 10分钟
- Tuning Boosting Models: Key Hyperparameters Explained• 10分钟
- When and How to Use Stacking Effectively• 8分钟
- Stacking in Practice: Understanding the StackingClassifier Structure• 8分钟
- Implementing Cross-Validation• 10分钟
- Cross-Validation and the Bias-Variance Trade-Off in Ensemble Models• 10分钟
5个作业• 总计90分钟
- Ensemble Learning Mastery• 30分钟
- Knowledge Check: Bagging and Random Forests• 15分钟
- Knowledge Check: Boosting and Its Applications• 15分钟
- Knowledge Check: StackingClassifier in Action• 15分钟
- Knowledge Check: Model Evaluation for Ensembles• 15分钟
4个非评分实验室• 总计240分钟
- Bagging in Action: Predicting Customer Churn with Random Forest• 60分钟
- Using Boosting Models to Predict Heart Disease• 60分钟
- Building and Evaluating a StackingClassifier on Loan Default Data• 60分钟
- Comparing Ensemble Models with Cross-Validation• 60分钟
This module will help you master dimensionality reduction techniques to handle high-dimensional data effectively. You'll learn to apply Principal Component Analysis (PCA) to reduce dimensionality while retaining key features, use t-distributed Stochastic Neighbor Embedding (t-SNE) to visualize high-dimensional data in 2D/3D space for clustering and pattern recognition, and implement Uniform Manifold Approximation and Projection (UMAP) for efficient dimensionality reduction, leveraging its speed and structure-preserving properties.
涵盖的内容
8个视频7篇阅读材料4个作业3个非评分实验室
8个视频• 总计16分钟
- Why Reducing Dimensions Makes Your Models Work Better• 2分钟
- Implementing PCA Step-by-Step in Python-ASSE• 2分钟
- How PCA Reduces Dimensions and Visualizes Patterns• 2分钟
- Why PCA Isn't Always Enough: Enter t-SNE• 2分钟
- Hands-On with t-SNE: Visualizing Complex Patterns in 2D• 2分钟
- Why UMAP Is a Game-Changer for Visualizing and Modeling Complex Data• 2分钟
- Visualizing Digits with UMAP in Python• 2分钟
- Using UMAP-Transformed Features for Classification• 2分钟
7篇阅读材料• 总计52分钟
- Why We Use PCA: Dimensionality Reduction & Variance• 8分钟
- How PCA Works: Eigenvectors, Projection & Explained Variance• 8分钟
- What Is t-SNE and How Is It Different from PCA?• 6分钟
- How to Use t-SNE Effectively: Parameters, Best Practices, and Pitfalls• 6分钟
- Visualizing High-Dimensional Data: Why PCA and t-SNE Aren't Always Enough• 6分钟
- UMAP Demystified: What It Is—and What It Isn't• 8分钟
- Using UMAP Effectively: Parameters, Use Cases, and Cautions• 10分钟
4个作业• 总计75分钟
- Dimensionality Reduction Mastery• 30分钟
- Knowledge Check: Principal Component Analysis (PCA)• 15分钟
- Knowledge Check: t-SNE Concepts & Use Cases• 15分钟
- Knowledge Check: UMAP Essentials• 15分钟
3个非评分实验室• 总计180分钟
- Reducing Dimensionality with PCA: From 64 Features to 2• 60分钟
- Visualizing Handwritten Digit Clusters with t-SNE• 60分钟
- Exploring UMAP for Visualization and Modeling• 60分钟
In this module, you'll focus on natural language processing techniques from basic text preprocessing to advanced sentiment analysis. You'll learn how to preprocess text data using tokenization, stopword removal, and stemming/lemmatization with Natural Language Toolkit (NLTK) and spaCy. Through implementation of text classification using various techniques like Bag-of-Words, TF-IDF, and word embeddings, you'll gain practical experience in NLP tasks. You'll also train sentiment analysis models using Hugging Face Transformers and Scikit-learn.
涵盖的内容
13个视频6篇阅读材料5个作业4个非评分实验室
13个视频• 总计27分钟
- Understanding Natural Language Processing: Why It Matters Today• 2分钟
- Cleaning Raw Text Step by Step – From Noise to Tokens• 2分钟
- Stemming vs. Lemmatization – What's the Difference?• 2分钟
- From Text to Bag-of-Words – Your First Text Vectorizer• 1分钟
- Going Beyond Counts – TF-IDF in Action• 2分钟
- Extracting Token Embeddings with Hugging Face Transformers• 2分钟
- Sentence-Level Embeddings and Similarity Scoring• 3分钟
- How Tokenization Works: Words, Subwords, and Transformers• 2分钟
- Getting Word Vectors and Token Similarity with spaCy• 2分钟
- Creating Sentence Embeddings with Hugging Face Transformers• 2分钟
- TF-IDF Vectorization for Sentiment Data• 2分钟
- Training and Evaluating a Sentiment Classifier• 1分钟
- Fine-Tuning BERT for Sentiment Analysis with Hugging Face Transformers• 3分钟
6篇阅读材料• 总计47分钟
- Why Preprocessing Text Is the First Step to Better Models• 8分钟
- Stemming, Lemmatization, and Tools to Preprocess• 8分钟
- From Words to Counts – Understanding BoW and TF-IDF• 8分钟
- From Vectors to Meaning – Embeddings and When to Use Them• 6分钟
- Tokenizers and Embeddings: How Modern NLP Models Understand Language• 10分钟
- Text Classification: From Features to Predictions• 7分钟
5个作业• 总计90分钟
- NLP Mastery – From Text to Classification• 30分钟
- Knowledge Check: Text Preprocessing Techniques• 15分钟
- Knowledge Check: Word Representations• 15分钟
- Knowledge Check: Tokenization & Embeddings • 15分钟
- Knowledge Check: Sentiment Classification Workflows• 15分钟
4个非评分实验室• 总计240分钟
- Clean Your First NLP Dataset: News Headlines Edition• 60分钟
- Comparing Sparse and Dense Text Representations in Practice• 60分钟
- Compare Static vs. Contextual Embeddings for Sentence Similarity• 60分钟
- Classical vs. Transformer Sentiment Models: A Head-to-Head Comparison• 60分钟
Reinforcement Learning Description: In this module, you'll explore the fundamentals of reinforcement learning (RL), including Markov Decision Processes (MDPs) and reward-based learning. You'll understand the key components of RL systems and implement both policy-based and value-based learning techniques. Through practical examples and hands-on implementation, you'll discover how RL is applied in real-world scenarios like robotics, gaming, and finance.
涵盖的内容
7个视频5篇阅读材料4个作业3个非评分实验室
7个视频• 总计17分钟
- What Makes Reinforcement Learning Different• 2分钟
- Getting Started with Reinforcement Learning: Agents, Actions, and Rewards• 4分钟
- Simulating a Reinforcement Learning Loop in Python• 2分钟
- Understanding Q-Learning and the Bellman Update• 2分钟
- Implementing Q-Learning in GridWorld• 2分钟
- Building a Policy Network and Sampling Actions• 2分钟
- Training with the REINFORCE Algorithm• 3分钟
5篇阅读材料• 总计40分钟
- Key Concepts of Reinforcement Learning• 8分钟
- The Markov Decision Process and RL Terminology• 8分钟
- Value vs Policy: Two Ways to Train an RL Agent• 10分钟
- How RL Powers Robots, Games, and Financial Decisions• 6分钟
- Challenges and Frontiers of Real-World RL• 8分钟
4个作业• 总计75分钟
- Reinforcement Learning Mastery• 30分钟
- Knowledge Check: RL Fundamentals• 15分钟
- Knowledge Check: Q-Learning vs. REINFORCE• 15分钟
- Knowledge Check: RL in the Real World• 15分钟
3个非评分实验室• 总计180分钟
- Simulate Your First RL Environment with an Agent in GridWorld• 60分钟
- Train Your First Q-Learning and REINFORCE Agents• 60分钟
- Simulating a Real-World Decision Task Using RL Concepts• 60分钟
This module focuses on automated machine learning techniques and model optimization. You'll learn to automate model selection and hyperparameter tuning using Auto-sklearn and GridSearchCV, and optimize models using MLflow for experiment tracking and reproducibility. You'll also explore Bayesian optimization techniques to improve model accuracy. The module concludes with a comprehensive capstone project that combines multiple techniques from throughout the course.
涵盖的内容
10个视频6篇阅读材料4个作业1个编程作业3个非评分实验室
10个视频• 总计20分钟
- Rapid Model Benchmarking with LazyPredict• 2分钟
- Prototyping Classification Pipelines with PyCaret• 2分钟
- Getting Started with Auto-sklearn for Model Selection• 2分钟
- Feature Engineering and Pipeline Analysis with Auto-sklearn• 2分钟
- Hyperparameter Tuning with GridSearchCV• 3分钟
- Efficient Hyperparameter Tuning with RandomizedSearchCV• 3分钟
- What Is Bayesian Optimization and How Does It Work?• 2分钟
- Hands-On: Hyperparameter Tuning with Optuna• 2分钟
- Tracking ML Experiments with MLflow• 2分钟
- Registering and Managing Models with MLflow• 2分钟
6篇阅读材料• 总计56分钟
- The Power and Pitfalls of Automated Machine Learning• 10分钟
- What Are Hyperparameters and Why They Matter• 10分钟
- Search Strategies and Tips for Effective Hyperparameter Tuning• 10分钟
- Why Experiment Tracking Matters in ML Projects• 8分钟
- Introduction to MLflow for Model Tracking and Versioning• 8分钟
- How to Think Like an ML Engineer During Your Final Project• 10分钟
4个作业• 总计75分钟
- AutoML and Model Optimization Mastery• 30分钟
- Knowledge Check: Automated Model Selection Tools• 15分钟
- Knowledge Check: Hyperparameter Tuning• 15分钟
- Knowledge Check: Experiment Tracking & Deployment• 15分钟
1个编程作业• 总计150分钟
- Capstone Project: Multi-Domain Machine Learning Challenge: From Classification to Optimization• 150分钟
3个非评分实验室• 总计180分钟
- AutoML vs. Manual Modeling: Which One Wins?• 60分钟
- Grid, Random, or Bayesian? Tune and Compare Your Models• 60分钟
- Track and Compare Multiple Model Runs with MLflow• 60分钟
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
位教师

提供方

提供方

Coursera brings together a diverse network of subject matter experts who have demonstrated their expertise through professional industry experience or strong academic backgrounds. These instructors design and teach courses that make practical, career-relevant skills accessible to learners worldwide.
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容
FFractal Analytics
课程
JJohns Hopkins University
课程


