By completing this course, learners will be able to preprocess image and text datasets, build and evaluate a deep learning model, and deploy a fully functional image captioning application. They will gain hands-on experience in applying tokenization, feature extraction, CNN-RNN architectures, and BLEU score evaluation for accurate caption generation.

您将学到什么
Preprocess image/text datasets with tokenization and feature extraction.
Build CNN-RNN models and evaluate performance with BLEU scores.
Deploy a Streamlit image captioning app on AWS EC2 for real-world use.
您将获得的技能
要了解的详细信息

添加到您的领英档案
8 项作业
October 2025
了解顶级公司的员工如何掌握热门技能

积累特定领域的专业知识
- 向行业专家学习新概念
- 获得对主题或工具的基础理解
- 通过实践项目培养工作相关技能
- 获得可共享的职业证书

该课程共有2个模块
This module introduces learners to the foundations of automatic image captioning by preparing both text and image data. Learners will explore how to access datasets, clean and preprocess captions, and extract meaningful features from images. By the end of this module, they will be able to create structured datasets that combine textual and visual inputs, ensuring data readiness for deep learning models.
涵盖的内容
9个视频4个作业
This module guides learners through the complete model-building lifecycle for automatic image captioning. They will design and train deep learning models, evaluate their performance, and integrate them into an interactive Streamlit application. Finally, learners will test and deploy their app on cloud infrastructure, making their captioning system accessible for real-world use.
涵盖的内容
9个视频4个作业
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.








