When will I receive my Course Certificate?

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

Why can’t I audit this course?

This course is currently available only to learners who have paid or received financial aid, when available.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Cutting-Edge Topics in Deep Reinforcement Learning

Cutting-Edge Topics in Deep Reinforcement Learning

本课程是 Deep Reinforcement Learning Hands-On 专项课程的一部分

位教师：Packt - Course Instructors

包含在中

了解更多

8个模块

深入了解一个主题并学习基础知识。

高级设置等级

推荐体验

7 小时完成

灵活的计划

自行安排学习进度

8个模块

深入了解一个主题并学习基础知识。

高级设置等级

推荐体验

7 小时完成

灵活的计划

自行安排学习进度

您将学到什么

Understand continuous action spaces and their applications in deep reinforcement learning
Master trust region methods for stable policy optimization in RL
Explore black-box optimization techniques to solve complex RL problems

要了解的详细信息

可分享的证书

添加到您的领英档案

了解顶级公司的员工如何掌握热门技能

了解关于 Coursera for Business 的更多信息

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

积累特定领域的专业知识

本课程是 Deep Reinforcement Learning Hands-On 专项课程专项课程的一部分

在注册此课程时，您还会同时注册此专项课程。

向行业专家学习新概念
获得对主题或工具的基础理解
通过实践项目培养工作相关技能
获得可共享的职业证书

该课程共有8个模块

Master the latest advancements in deep reinforcement learning, including continuous action spaces, trust region methods, black-box optimization, and multi-agent systems. Explore innovative approaches and real-world case studies at the frontier of RL research.

This course explores cutting-edge topics such as continuous control, trust region policy optimization, advanced exploration strategies, and reinforcement learning with human feedback. Learners will investigate high-profile applications like AlphaGo Zero and MuZero, as well as RL for discrete optimization and multi-agent environments. By engaging with these advanced topics, you will gain a comprehensive understanding of the current landscape and future directions of deep RL. The course presents complex concepts through accessible explanations and practical examples, guiding learners through the latest research and its implementation. Emphasis is placed on understanding the motivations and mechanics behind each technique, fostering both depth and breadth of knowledge. Designed for learners with a foundational understanding of RL, this course will deepen your expertise and prepare you for practical implementation in cutting-edge research and industry applications. This course is part three of a three-course Specialization designed to provide a comprehensive learning pathway in Reinforcement Learning. While it delivers standalone value, learners seeking an in-depth progression may benefit from completing the full Specialization.

This module introduces advanced reinforcement learning techniques for environments with continuous action spaces. Learners will explore the A2C method, analyze its performance, and implement practical solutions for training agents in such domains. Hands-on coding examples and experimental results will deepen understanding of policy gradient methods in continuous settings.

涵盖的内容

1个视频5篇阅读材料1个作业

This module explores advanced techniques for stabilizing policy gradient methods in deep reinforcement learning. Learners will compare and contrast Proximal Policy Optimization (PPO), Trust Region Policy Optimization (TRPO), and ACKTR, examining their theoretical foundations and practical performance. By the end, you will understand how these methods improve training stability and efficiency.

涵盖的内容

1个视频4篇阅读材料1个作业

This module introduces black-box optimization techniques in reinforcement learning, highlighting their principles and recent applications to complex environments. Learners will explore practical implementations using evolutionary strategies and genetic algorithms, and analyze performance results on benchmark tasks such as CartPole and HalfCheetah.

涵盖的内容

1个视频4篇阅读材料1个作业

This module delves into advanced exploration strategies in reinforcement learning, highlighting the exploration/exploitation dilemma and presenting alternative methods such as random exploration, noisy networks, and network distillation. Learners will experiment with these techniques in the MountainCar environment and compare their effectiveness using both DQN and PPO algorithms.

涵盖的内容

1个视频6篇阅读材料1个作业

This module introduces reinforcement learning with human feedback (RLHF), a technique for training agents when explicit reward functions are difficult to define. Learners will explore the RLHF pipeline, including data labeling, reward model training, and integration with reinforcement learning algorithms. Real-world applications, such as training large language models, are also discussed.

涵盖的内容

1个视频6篇阅读材料1个作业

This module explores advanced model-based reinforcement learning techniques through the lens of AlphaGo Zero and MuZero. Learners will examine Monte Carlo Tree Search (MCTS), neural network architectures, and the process of training agents for board games like Connect 4. Practical implementation details and evaluation strategies are also covered.

涵盖的内容

1个视频11篇阅读材料1个作业

1个视频总计1分钟

Overview1分钟

11篇阅读材料总计63分钟

Introduction5分钟
Model-Based Methods for Board Games6分钟
MCTS6分钟
Training and Evaluation7分钟
Implementing MCTS7分钟
The Model5分钟
Results4分钟
MuZero6分钟
Connect 4 with MuZero5分钟
Models7分钟
Training Data and Gameplay5分钟

1个作业总计16分钟

Reinforcement Learning in AI Systems16分钟

This module explores how deep reinforcement learning techniques can be applied to discrete optimization problems, using the example of solving cubes. Learners will examine neural network architectures, training processes, and experimental results, gaining insight into both implementation and evaluation of RL-based solvers.

涵盖的内容

1个视频5篇阅读材料1个作业

This module introduces the fundamentals of multi-agent reinforcement learning (MARL), exploring how multiple agents interact and learn within shared environments. Learners will examine the application of deep Q-networks to groups of agents and analyze the resulting behaviors. Practical examples illustrate how agent strategies evolve in multi-agent scenarios.

涵盖的内容

1个视频2篇阅读材料1个作业

获得职业证书

将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。

位教师

Packt - Course Instructors

Packt

1,749 门课程492,078 名学生

提供方

Packt

从 Software Development 浏览更多内容

IBM
Deep Learning and Reinforcement Learning
课程
状态：免费试用
类别：提供的学分
University of Alberta
Fundamentals of Reinforcement Learning
课程
状态：免费试用
类别：提供的学分
New York University
Reinforcement Learning in Finance
课程
状态：免费试用
类别：提供的学分
University of Alberta
Reinforcement Learning
专项课程
状态：免费试用
类别：提供的学分

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

自 2018开始学习的学生

''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情，我就可以学习。'

Jennifer J.

自 2020开始学习的学生

''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'

Larry W.

自 2021开始学习的学生

''如果我的大学不提供我需要的主题课程，Coursera 便是最好的去处之一。'

Chaitanya A.

''学习不仅仅是在工作中做的更好：它远不止于此。Coursera 让我无限制地学习。'

通过 Coursera Plus 开启新生涯

无限制访问 10,000+ 世界一流的课程、实践项目和就业就绪证书课程 - 所有这些都包含在您的订阅中

了解更多

通过在线学位推动您的职业生涯

获取世界一流大学的学位 - 100% 在线

探索学位

加入超过 3400 家选择 Coursera for Business 的全球公司

提升员工的技能，使其在数字经济中脱颖而出

了解更多

常见问题

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

Cutting-Edge Topics in Deep Reinforcement Learning

Cutting-Edge Topics in Deep Reinforcement Learning

您将学到什么

要了解的详细信息

了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识

该课程共有8个模块

Continuous Action Space

涵盖的内容

Trust Region Methods

涵盖的内容

Black-Box Optimizations in RL

涵盖的内容

Advanced Exploration

涵盖的内容

Reinforcement Learning with Human Feedback

涵盖的内容

AlphaGo Zero and MuZero

涵盖的内容

RL in Discrete Optimization

涵盖的内容

Multi-Agent RL

涵盖的内容

获得职业证书

位教师

提供方

从 Software Development 浏览更多内容

Deep Learning and Reinforcement Learning

Fundamentals of Reinforcement Learning

Reinforcement Learning in Finance

Reinforcement Learning

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.

通过 Coursera Plus 开启新生涯

通过在线学位推动您的职业生涯

加入超过 3400 家选择 Coursera for Business 的全球公司

常见问题

Can I preview a course before enrolling?

When will I have access to the lectures and assignments?

What will I get when I enroll?

更多问题