This course is for ML engineers, solutions architects, and senior developers who build robust infrastructure powering large language models. This course teaches you how to design, deploy, and maintain the complex, interconnected systems required for scalable, resilient, and cost-effective LLM applications in the real world.

Designing Production LLM Architectures
What you will learn
Compare synchronous and asynchronous architectures and apply 12-factor principles and container orchestration to deploy scalable microservices.
Analyze multi-region deployments, pinpoint latency bottlenecks, and design resilient architecture improvements via fault analysis.
Create Airflow DAGs to automate data workflows and analyze the impact of schema evolution on downstream processes and tests.
Analyze trade-offs between self-hosting models vs. managed APIs and evaluate proposed infrastructure for fault tolerance and cost.
Skills you will gain
Details to know

Build subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills through hands-on projects
- Earn a shareable career certificate

There are 5 modules in this course
This module empowers engineers and architects to master the "build vs. buy" decision for LLM applications through a structured, strategic lens. You will learn to design complex system architectures using sequence diagrams to evaluate synchronous and asynchronous processing, while comparing the trade-offs of self-hosted open-source models against managed APIs. By focusing on critical metrics like Total Cost of Ownership (TCO), latency, and data privacy, you will develop the expertise to justify architectural choices. Ultimately, you'll gain the confidence to document and defend high-performance, business-aligned AI solutions to any stakeholder.
What's included
4 videos, 2 readings, 3 assignments
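The synchronous-versus-asynchronous trade-off this module opens with can be sketched in a few lines. The snippet below is a minimal illustration, not course material: `call_llm` is a hypothetical stand-in for a network call to a model endpoint, with latency simulated by `asyncio.sleep`.

```python
import asyncio

async def call_llm(prompt: str) -> str:
    # Hypothetical stand-in for a network round-trip to an LLM endpoint.
    await asyncio.sleep(0.1)  # simulated latency
    return f"response to {prompt!r}"

def serve_sync(prompts):
    # Synchronous processing: calls run one after another, so total
    # latency grows linearly with the number of requests.
    return [asyncio.run(call_llm(p)) for p in prompts]

async def serve_async(prompts):
    # Asynchronous processing: calls overlap on the event loop, so total
    # latency stays close to that of a single call.
    return await asyncio.gather(*(call_llm(p) for p in prompts))
```

With three prompts, the synchronous path takes roughly three times the simulated latency while the asynchronous path takes roughly one; the same reasoning drives the architecture decisions covered in this module.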
This module explores building resilient, scalable architectures for LLM applications. You will apply 12-factor app methodology to design portable, cloud-native microservices, mastering stateless design and dependency management. The curriculum bridges theory and practice by evaluating multi-region deployment strategies for fault tolerance and high availability. You'll learn to analyze failover mechanisms and mitigate architectural risks before production. By the end, you’ll be equipped to document reliable, future-proof AI systems. Prerequisites include a foundational understanding of cloud concepts (regions/zones) and microservice basics (containers/APIs).
What's included
1 video, 1 reading, 3 assignments
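One concrete 12-factor practice mentioned above is storing configuration in the environment rather than in code, so the same container image runs unchanged in every region. A minimal sketch, with hypothetical variable names and defaults:

```python
import os

def load_config(env=None):
    # Factor III of the 12-factor methodology: read config from the
    # environment so the artifact stays identical across deployments.
    # Variable names and defaults here are illustrative, not prescriptive.
    env = os.environ if env is None else env
    return {
        "model_endpoint": env.get("MODEL_ENDPOINT", "http://localhost:8000"),
        "request_timeout_s": float(env.get("REQUEST_TIMEOUT_S", "30")),
        "region": env.get("DEPLOY_REGION", "us-east-1"),
    }
```

Passing a mapping instead of reading `os.environ` directly also keeps the function easy to unit-test, which is itself a stateless-design habit this module encourages.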
This module teaches how to transition LLM prototypes into production-grade services. You will learn to analyze multi-stage architectures like RAG to identify and quantify performance bottlenecks using evidence-based metrics. The curriculum focuses on mastering Kubernetes deployment through declarative Helm charts and implementing Horizontal Pod Autoscaling (HPA) to manage unpredictable traffic. By studying deployment lifecycles, including controlled rollouts and rapid rollbacks, you will gain the skills to transform fragile prototypes into resilient, scalable, and reliable production systems capable of handling real-world loads.
What's included
5 videos, 5 readings, 6 assignments
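The Horizontal Pod Autoscaling behavior this module covers follows a simple published rule: desired replicas equal the current replica count scaled by the ratio of the observed metric to its target, rounded up and clamped to configured bounds. A sketch of that arithmetic:

```python
import math

def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10):
    # Core HPA scaling rule: desired = ceil(current * observed / target),
    # then clamped into [min_replicas, max_replicas].
    desired = math.ceil(current_replicas * current_metric / target_metric)
    return max(min_replicas, min(max_replicas, desired))
```

For example, 4 pods averaging 90% CPU against a 60% target yields ceil(4 × 90 / 60) = 6 replicas; real controllers add stabilization windows and tolerance bands on top of this core formula.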
In today's dynamic data landscape, pipelines often break when source data structures change unexpectedly—a problem known as schema drift. This module tackles that challenge head-on, teaching you how to design and automate data pipelines that can gracefully handle schema evolution using Apache Airflow. By the end, you will be equipped to create resilient, scalable, and fully automated data pipelines that are built to withstand the complexities of real-world data environments.
What's included
5 videos, 5 readings, 7 assignments
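The schema-drift problem this module addresses reduces to comparing an expected column-to-type mapping against what a batch actually delivers. The sketch below uses plain Python rather than Airflow operators, purely to show the shape of the check a DAG task might run:

```python
def detect_schema_drift(expected: dict, incoming: dict) -> dict:
    # Compare an expected column -> type mapping against the schema of an
    # incoming batch, classifying drift so a pipeline task can fail fast,
    # alert, or adapt downstream processing.
    shared = set(expected) & set(incoming)
    return {
        "missing": sorted(set(expected) - set(incoming)),
        "unexpected": sorted(set(incoming) - set(expected)),
        "type_changed": sorted(c for c in shared if expected[c] != incoming[c]),
    }
```

In an Airflow pipeline, a validation task running a check like this early in the DAG lets schema drift surface as a controlled failure instead of corrupting downstream tables and tests.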
In this module, you will step into the high-stakes role of a senior systems engineer tasked with diagnosing a failing AI service. A critical Retrieval-Augmented Generation (RAG) system is plagued by high latency and intermittent outages, and you must get to the root of the problem. Using architectural diagrams, system logs, and performance metrics, you will analyze the system's design to identify the primary performance bottleneck and the most significant single point of failure. Your analysis will culminate in a concise, two-paragraph report for stakeholders, pinpointing the critical issues and recommending targeted fixes to restore stability and performance.
What's included
2 readings, 1 assignment
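The bottleneck analysis in this capstone ultimately comes down to attributing end-to-end latency to pipeline stages. A minimal sketch, assuming you already have per-stage latency measurements (the stage names below are illustrative):

```python
def find_bottleneck(stage_latencies_ms: dict) -> tuple:
    # Given per-stage latencies for a RAG pipeline (e.g. embed ->
    # retrieve -> rerank -> generate), report the dominant stage and
    # its share of total end-to-end latency.
    total = sum(stage_latencies_ms.values())
    stage = max(stage_latencies_ms, key=stage_latencies_ms.get)
    return stage, stage_latencies_ms[stage] / total
```

If retrieval accounts for well over half of end-to-end latency, that is where the two-paragraph report should point its recommendations; a single point of failure is then often the same component that dominates the latency budget.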
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor

Offered by
Explore more from Design and Product
Frequently asked questions
This course assumes hands-on experience with cloud concepts and containers. If you are new to cloud platforms or Kubernetes, first complete a foundational cloud or container course to gain the most from these modules.
You will work with sequence diagrams, Kubernetes manifests and Helm charts, Airflow DAGs, and cloud deployment patterns. Labs use common cloud and orchestration tooling; no proprietary vendor lock-in is required.
The course provides structured analysis and decision criteria—latency, total cost of ownership, data privacy, and operational complexity—so you can compare options and make informed architecture choices for your use case.
Financial aid available.
¹ Some assignments in this course are graded by AI. For those assignments, your data will be used in accordance with the Coursera Privacy Notice.




