Transition from theoretical concepts to production-ready engineering in this hands-on course which is the final part in "Fundamentals of Generative AI" specialization. Designed for learners ready to move beyond the theory, this course focuses entirely on construction: you won't just learn about Large Language Models (LLMs); you will build, refine, and deploy them.

Building and Deploying Generative AI Models
本课程是 Generative AI Fundamentals 专项课程 的一部分



位教师:Amreen Anbar
访问权限由 New York State Department of Labor 提供
您将学到什么
Construct and evaluate Transformer-based LLMs from scratch using PyTorch and industry metrics like ROUGE and BLEU.
Engineer Retrieval Augmented Generation (RAG) pipelines using LangChain to integrate current, domain-specific knowledge into models.
Deploy autonomous AI Agents to production environments on Google Cloud Platform (Vertex AI) using professional workflows.
您将获得的技能
- Vector Databases
- Generative AI
- Development Environment
- Google Cloud Platform
- Model Deployment
- Embeddings
- Generative AI Agents
- Model Evaluation
- Large Language Modeling
- Artificial Intelligence and Machine Learning (AI/ML)
- PyTorch (Machine Learning Library)
- Generative Model Architectures
- Natural Language Processing
- LangChain
- System Monitoring
- Deep Learning
- Prompt Engineering
- Transfer Learning
- 技能部分已折叠。显示 9 项技能,共 18 项。
要了解的详细信息

添加到您的领英档案
3 项作业
December 2025
了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识
- 向行业专家学习新概念
- 获得对主题或工具的基础理解
- 通过实践项目培养工作相关技能
- 获得可共享的职业证书

该课程共有3个模块
In this module, we dive deep into the Transformer architecture, its core mechanics, and different transformer architecture types (encoder-only, decoder-only, encoder-decoder). We gain hands-on experience by building and training a complete suite of PyTorch-based models from scratch. The module concludes with strategic deployment skills, teaching when to build custom models versus leveraging pre-trained models for efficiency and state-of-the-art results.
涵盖的内容
18个视频11篇阅读材料1个作业
Module 2 addresses the limitations of static knowledge and hallucinations in Large Language Models (LLMs) by introducing Retrieval Augmented Generation (RAG). Learners will progress from building fundamental pipelines with Ollama and LangChain to implementing production-ready systems by adding rigorous RAG evaluation and utilizing advanced techniques such as custom chunking strategies, vector stores, reranking, and query transformations to optimize context retrieval and response generation. The module concludes with an overview of another adaptation technique called finetuning and a comparison of RAG vs. finetuning.
涵盖的内容
13个视频2篇阅读材料1个作业
Module 3 marks a pivotal transition from passive information retrieval to the dynamic realm of autonomous AI Agents, anchored by the "Understand, Think, Take Action" conceptual framework. Students will critically evaluate development ecosystems before applying these concepts to build a functional Summarizer Agent. The module emphasizes professional engineering standards, guiding learners through a complete lifecycle that includes environment management with Poetry, deployment to the Vertex AI Engine, and the implementation of robust performance monitoring using Google Cloud Platform’s logging and tracing tools.
涵盖的内容
15个视频1篇阅读材料1个作业
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.







