Learn to build AI that sees, hears, and understands the world in an integrated way. This course takes you beyond single-modality models, teaching you to architect applications that connect different data types like text, images, and speech.

Genießen Sie unbegrenztes Wachstum mit einem Jahr Coursera Plus für 199 $ (regulär 399 $). Jetzt sparen.

Empfohlene Erfahrung
Kompetenzen, die Sie erwerben
- Kategorie: Application Design
- Kategorie: Microsoft Azure
- Kategorie: LLM Application
- Kategorie: Natural Language Processing
- Kategorie: AI Workflows
- Kategorie: Image Analysis
- Kategorie: Multimodal Prompts
- Kategorie: Computer Vision
- Kategorie: AI Orchestration
- Kategorie: Generative AI
- Kategorie: Prompt Engineering
Wichtige Details

Zu Ihrem LinkedIn-Profil hinzufügen
24 Aufgaben
Erfahren Sie, wie Mitarbeiter führender Unternehmen gefragte Kompetenzen erwerben.

In diesem Kurs gibt es 4 Module
This module introduces the foundational concepts of multimodal AI. You will learn the architectural patterns for combining different AI components, such as text and image models, and progress from basic integration to building complex systems that can reason across multiple data types.
Das ist alles enthalten
4 Videos9 Lektüren7 Aufgaben
This module provides a deep dive into the popular and creative task of generating images from text descriptions. You will explore the models that power this technology, like DALL·E, and learn both basic and advanced prompting techniques to craft and refine specific, high-quality visual outputs.
Das ist alles enthalten
5 Videos5 Lektüren5 Aufgaben
This module focuses on practical implementation using a powerful, specialized tool. You will leverage the features of Azure AI Vision to build and optimize cross-modal applications like image captioning and visual search. You'll learn how this single service can analyze visual content to generate rich textual descriptions and extract embedded text (OCR), providing the core components for sophisticated multimodal solutions.
Das ist alles enthalten
7 Videos6 Lektüren7 Aufgaben
This capstone module builds upon your deep expertise in Azure AI Vision. You will learn to integrate your vision applications with other powerful Azure AI Services, such as Language and Speech, to create comprehensive, end-to-end solutions. The focus will be on orchestrating these distinct services to develop a sophisticated application that solves a real-world business problem, demonstrating your ability to design and build a complete multimodal system from the ground up.
Das ist alles enthalten
6 Videos5 Lektüren5 Aufgaben
Warum entscheiden sich Menschen für Coursera für ihre Karriere?




Häufig gestellte Fragen
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Weitere Fragen
Finanzielle Unterstützung verfügbar,





