Image Captioning with TensorFlow & Streamlit

本课程是 AI Deep Learning Projects with TensorFlow 专项课程的一部分

位教师：EDUCBA

访问权限由 New York State Department of Labor 提供

2个模块

深入了解一个主题并学习基础知识。

6 小时完成

灵活的计划

自行安排学习进度

2个模块

深入了解一个主题并学习基础知识。

6 小时完成

灵活的计划

自行安排学习进度

您将学到什么

Preprocess image/text datasets with tokenization and feature extraction.
Build CNN-RNN models and evaluate performance with BLEU scores.
Deploy a Streamlit image captioning app on AWS EC2 for real-world use.

您将获得的技能

您将学习的工具

要了解的详细信息

可分享的证书

添加到您的领英档案

作业

8 项作业

授课语言：英语（English）

了解顶级公司的员工如何掌握热门技能

了解关于 Coursera for Business 的更多信息

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

积累特定领域的专业知识

本课程是 AI Deep Learning Projects with TensorFlow 专项课程专项课程的一部分

在注册此课程时，您还会同时注册此专项课程。

向行业专家学习新概念
获得对主题或工具的基础理解
通过实践项目培养工作相关技能
获得可共享的职业证书

该课程共有2个模块

By completing this course, learners will be able to preprocess image and text datasets, build and evaluate a deep learning model, and deploy a fully functional image captioning application. They will gain hands-on experience in applying tokenization, feature extraction, CNN-RNN architectures, and BLEU score evaluation for accurate caption generation.

This course uniquely bridges computer vision and natural language processing, enabling learners to generate meaningful captions for social media images. Unlike traditional AI tutorials, it not only covers dataset preparation and neural network modeling but also demonstrates how to create an interactive Streamlit app and deploy it on AWS EC2 for real-world accessibility. Learners benefit by acquiring both technical depth and practical deployment skills, preparing them for roles in AI development, machine learning engineering, and applied data science. By the end, they will confidently design, test, and launch their own automatic image captioning systems that integrate seamlessly into modern applications.

This module introduces learners to the foundations of automatic image captioning by preparing both text and image data. Learners will explore how to access datasets, clean and preprocess captions, and extract meaningful features from images. By the end of this module, they will be able to create structured datasets that combine textual and visual inputs, ensuring data readiness for deep learning models.

涵盖的内容

9个视频4个作业

9个视频总计68分钟

Introduction to Course5分钟
Import the Libraries9分钟
Accessing the Caption Dataset for Training5分钟
Accessing the Image DataSet for Training2分钟
Preprocessing the Text Data11分钟
Pre-Process and Load Captions Data11分钟
Loading the Captions for Training and Test Data4分钟
Preprocessing of Image Data11分钟
Loading Features for Train and Test Dataset9分钟

4个作业总计60分钟

Introduction and Dataset Access10分钟
Text Data Preprocessing10分钟
Image Data Preparation10分钟
Granded - Data Preparation and Preprocessing30分钟

This module guides learners through the complete model-building lifecycle for automatic image captioning. They will design and train deep learning models, evaluate their performance, and integrate them into an interactive Streamlit application. Finally, learners will test and deploy their app on cloud infrastructure, making their captioning system accessible for real-world use.