Coursera
LLM Optimization & Evaluation 专项课程

只需 199 美元(原价 399 美元)即可通过 Coursera Plus 学习更高水平的技能。立即节省

Coursera

LLM Optimization & Evaluation 专项课程

Optimize & Deploy Production-Ready LLM Systems. Build expertise in LLM evaluation, optimization, and deployment through hands-on MLOps projects.

John Whitworth
LearningMate

位教师:John Whitworth

包含在 Coursera Plus

深入学习学科知识
中级 等级

推荐体验

4 周 完成
在 10 小时 一周
灵活的计划
自行安排学习进度
深入学习学科知识
中级 等级

推荐体验

4 周 完成
在 10 小时 一周
灵活的计划
自行安排学习进度

您将学到什么

  • Evaluate and optimize LLM performance using statistical testing, MLOps tools, and production monitoring systems.

  • Build automated pipelines for feature engineering, experiment tracking, and data processing with industry-standard tools.

  • Diagnose LLM errors, implement safety frameworks, and reduce operational costs through systematic analysis.

要了解的详细信息

可分享的证书

添加到您的领英档案

授课语言:英语(English)
最近已更新!

December 2025

了解顶级公司的员工如何掌握热门技能

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

精进特定领域的专业知识

  • 向大学和行业专家学习热门技能
  • 借助实践项目精通一门科目或一个工具
  • 培养对关键概念的深入理解
  • 通过 Coursera 获得职业证书

专业化 - 11门课程系列

您将学到什么

  • Build feature engineering pipelines and evaluate ML experiments using MLOps tools to select and deploy production-ready models.

您将获得的技能

类别:Feature Engineering
类别:MLOps (Machine Learning Operations)
类别:Performance Analysis
类别:Performance Tuning
类别:Data Transformation
类别:Model Evaluation
类别:Data Pipelines
类别:Predictive Modeling
类别:Data Preprocessing
Evaluate & Optimize LLM Performance

Evaluate & Optimize LLM Performance

第 2 门课程3小时

您将学到什么

  • Evaluate LLMs using metrics like BLEU & ROUGE run A/B tests for statistical significance, and optimize model performance with data-driven strategies.

您将获得的技能

类别:Test Script Development
类别:LLM Application
类别:Statistical Hypothesis Testing
类别:Data-Driven Decision-Making
类别:Natural Language Processing
类别:Performance Metric
类别:Statistical Analysis
类别:Prompt Engineering
类别:Model Evaluation
类别:Statistical Methods
类别:Large Language Modeling
Analyze Logs: Fix LLM Hallucinations

Analyze Logs: Fix LLM Hallucinations

第 3 门课程3小时

您将学到什么

  • Use data analysis to diagnose LLM hallucinations by correlating user behavior and system errors, and document findings to guide engineering fixes.

您将获得的技能

类别:Analysis
类别:Data Analysis
类别:Data Analysis Expressions (DAX)
类别:Artificial Intelligence
类别:Root Cause Analysis
类别:Pandas (Python Package)
类别:LLM Application
类别:Business Metrics
类别:Data Manipulation
类别:Data Processing
类别:Debugging
类别:Generative AI
类别:Anomaly Detection
类别:Customer Retention
类别:Performance Metric
类别:Technical Communication

您将学到什么

  • Rigorously evaluate LLM performance using statistical tests and confidence intervals to make data-driven deployment decisions.

您将获得的技能

类别:Jupyter
类别:Large Language Modeling
类别:Matplotlib
类别:Statistical Visualization
类别:Statistical Inference
类别:Data Storytelling
类别:Statistical Methods
类别:Statistical Hypothesis Testing
类别:Probability & Statistics
类别:Model Evaluation
类别:Statistical Analysis
类别:Performance Metric
类别:Experimentation
类别:Data Presentation
类别:Data-Driven Decision-Making

您将学到什么

  • Build and validate a robust safety testing framework for LLMs. Create behavioral test suites and use mutation testing to ensure their effectiveness.

您将获得的技能

类别:Software Testing
类别:API Testing
类别:Code Coverage
类别:Unit Testing
类别:Prompt Engineering
类别:Penetration Testing
类别:Test Script Development
类别:AI Security
类别:LLM Application
类别:Software Technical Review
类别:Verification And Validation
类别:Quality Assessment
类别:Large Language Modeling
类别:Responsible AI
类别:Maintainability
类别:Threat Modeling
类别:Security Testing
类别:Model Evaluation
类别:Test Tools
类别:Test Case

您将学到什么

  • Track, version, and evaluate ML experiments using DVC and W&B to reliably select and prepare models for production deployment.

您将获得的技能

类别:Data Management
类别:Model Evaluation
类别:Machine Learning
类别:Git (Version Control System)
类别:Performance Analysis
类别:Large Language Modeling
类别:Performance Testing
类别:Version Control
类别:Dashboard
类别:MLOps (Machine Learning Operations)
类别:Software Versioning
类别:Data Integrity

您将学到什么

  • Create automated Python scripts to manage multi-step cloud workflows, from provisioning resources to persisting data.

您将获得的技能

类别:Data Persistence
类别:Infrastructure as Code (IaC)
类别:Data Pipelines
类别:Virtual Machines
类别:Cloud Deployment
类别:Scripting
类别:Python Programming
类别:Command-Line Interface

您将学到什么

  • Build automated data pipelines with Apache Airflow, manage schema evolution to prevent failures, and implement monitoring for data integrity.

您将获得的技能

类别:Data Pipelines
类别:Apache Airflow
类别:Data Integrity
类别:Real Time Data
类别:Data Modeling
类别:Continuous Monitoring
类别:Technical Communication
类别:Data Quality
类别:Scalability
类别:Data Transformation
类别:Data Validation
类别:Extract, Transform, Load
类别:System Monitoring

您将学到什么

  • Translate an LLM product concept into a detailed PRD and create a UAT plan to validate that the delivered feature meets user requirements.

您将获得的技能

类别:User Acceptance Testing (UAT)
类别:Technical Communication
类别:Acceptance Testing
类别:Scenario Testing
类别:Functional Testing
类别:Large Language Modeling
类别:User Requirements Documents
类别:Product Requirements
类别:Key Performance Indicators (KPIs)
类别:Functional Requirement
类别:AI Product Strategy
类别:User Story
类别:Business Requirements
类别:Risk Management Framework
类别:LLM Application
类别:Requirements Analysis

您将学到什么

  • Create operational run-books for LLM systems and evaluate prompt patterns to improve performance and reduce operational costs.

您将获得的技能

类别:Performance Tuning
类别:Benchmarking
类别:Large Language Modeling
类别:Technical Writing
类别:Technical Documentation
类别:Prompt Patterns
类别:MLOps (Machine Learning Operations)
类别:Data Maintenance
类别:Configuration Management
类别:Prompt Engineering
类别:Requirements Analysis
类别:Performance Testing

您将学到什么

  • Optimize LLM costs by analyzing spend reports and streamline ML pipelines using value-stream mapping to boost efficiency and reduce cycle times.

您将获得的技能

类别:Process Improvement and Optimization
类别:Data-Driven Decision-Making
类别:Productivity Software
类别:Cost Benefit Analysis
类别:Process Optimization
类别:Miro AI
类别:Business Workflow Analysis
类别:Process Analysis
类别:Expense Management
类别:Cost Management

获得职业证书

将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。

位教师

John Whitworth
Coursera
0 门课程0 名学生
LearningMate
Coursera
62 门课程685 名学生

提供方

Coursera

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.
自 2018开始学习的学生
''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情,我就可以学习。'
Jennifer J.
自 2020开始学习的学生
''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'
Larry W.
自 2021开始学习的学生
''如果我的大学不提供我需要的主题课程,Coursera 便是最好的去处之一。'
Chaitanya A.
''学习不仅仅是在工作中做的更好:它远不止于此。Coursera 让我无限制地学习。'
Coursera Plus

通过 Coursera Plus 开启新生涯

无限制访问 10,000+ 世界一流的课程、实践项目和就业就绪证书课程 - 所有这些都包含在您的订阅中

通过在线学位推动您的职业生涯

获取世界一流大学的学位 - 100% 在线

加入超过 3400 家选择 Coursera for Business 的全球公司

提升员工的技能,使其在数字经济中脱颖而出

常见问题