Lorsque vous vous inscrivez à ce cours, vous êtes également inscrit(e) à ce Certificat Professionnel.
Apprenez de nouveaux concepts auprès d'experts du secteur
Acquérez une compréhension de base d'un sujet ou d'un outil
Développez des compétences professionnelles avec des projets pratiques
Obtenez un certificat professionnel partageable auprès de Microsoft
Il y a 5 modules dans ce cours
This course provides a comprehensive overview of data storage and management approaches for big data. Learners will explore structured, semi-structured, and unstructured data formats, compare SQL and NoSQL database technologies, and implement data lakes and data warehouses. The course includes working with various file formats and understanding the differences between batch and real-time processing approaches.
Course Learning Objectives:
By the end of this course, you will be able to:
- Compare and implement SQL and NoSQL database solutions for different big data scenarios
- Work effectively with structured, semi-structured, and unstructured data formats
- Design and implement data lakes and data warehouses for big data workloads
- Build data pipelines using ETL and ELT approaches with Azure Data Factory
- Differentiate between batch and real-time processing methodologies and implement appropriate solutions
Data Storage Technologies (SQL vs NoSQL) guides learners through the core principles of modern data storage and the trade-offs that shape today’s big data systems. The module examines how relational databases manage structured data, where they encounter limitations at scale, and how techniques such as partitioning, indexing, and lakehouse architectures mitigate performance gaps. Learners compare major NoSQL categories—including document, key-value, and column-family databases—to understand how flexible schemas and distributed designs support high-volume, high-velocity workloads. Through hands-on activities with SQL Server, Azure Synapse, and Azure Cosmos DB, learners practice essential operations, evaluate storage technologies based on workload requirements, and build the skills needed to select and implement effective database solutions for big data environments.
Inclus
6 vidéos3 lectures8 devoirs
Afficher les informations sur le contenu du module
6 vidéos•Total 22 minutes
When Relational Databases Hit Their Limits•3 minutes
Optimizing SQL Server for Big Data Workloads•2 minutes
Breaking Free from Tables•3 minutes
Mapping Data Patterns to NoSQL Types•5 minutes
Cosmos DB Success Stories•3 minutes
Building Production-Ready Cosmos DB Solutions•6 minutes
3 lectures•Total 30 minutes
Relational Databases in the Big Data Era•10 minutes
NoSQL Database Landscape•10 minutes
Azure Cosmos DB Implementation Guide•10 minutes
8 devoirs•Total 240 minutes
SQL Performance Optimization and Delta Lake Operations•30 minutes
Database Performance Analysis and Lakehouse Operations•30 minutes
Working with Data Formats (Structured, Semi-structured, Unstructured)
Module 2•5 heures à terminer
Détails du module
Working with Data Formats (Structured, Semi-structured, Unstructured) helps learners build a clear understanding of how different data formats function within big data systems and why format selection matters for performance, storage, and analytical success. The module introduces structured formats, such as CSV and TSV, and explores flexible semi-structured formats, including JSON and XML. It also examines optimized file types, including Parquet, Avro, and ORC, that support large-scale analytics. Learners practice transforming data between formats using Azure Data Factory, working with nested structures, applying schema inference, and evaluating performance trade-offs across file types. Through demonstrations, code exercises, and hands-on labs, this module equips learners to select, convert, and manage data formats effectively for diverse big data scenarios.
Inclus
6 vidéos3 lectures8 devoirs
Afficher les informations sur le contenu du module
6 vidéos•Total 20 minutes
The Foundation of Data Analytics•3 minutes
Processing Structured Data with Azure Data Factory•5 minutes
Embracing Flexible Data Structures•3 minutes
Data Format Conversions with Azure Data Factory•3 minutes
Optimizing for Scale and Performance•3 minutes
File Format Performance Analysis•4 minutes
3 lectures•Total 30 minutes
Structured Data Best Practices•10 minutes
Semi-structured Data Processing Techniques•10 minutes
Big Data File Format Optimization•10 minutes
8 devoirs•Total 240 minutes
Structured Data Pipeline•30 minutes
Structured Data Management Assessment•30 minutes
JSON Data Transformation•30 minutes
Format Transformation Pipeline•30 minutes
Semi-structured Data Processing Assessment•30 minutes
File Format Performance Comparison•30 minutes
File Format Optimization Assessment•30 minutes
Data Formats Mastery Assessment•30 minutes
Data Lakes and Data Warehouses Implementation
Module 3•4 heures à terminer
Détails du module
Data Lakes and Data Warehouses Implementation guides learners through the architectural foundations and hands-on skills needed to build modern analytical environments. The module explores the purpose and structure of data lakes, highlighting the zones of raw, cleaned, enriched, and curated data, and demonstrates how thoughtful design supports flexibility, governance, and large-scale analytics. Learners also study core data warehouse concepts, including dimensional modeling, star schemas, and data marts, to understand how structured storage enables high-performance querying. Through practical work with Azure Data Lake Storage Gen2 and Azure Synapse Analytics, learners design zone architectures, implement dimensional models, configure SQL pools, and apply best practices for partitioning, distribution, and optimization. By the end, they gain the ability to organize, govern, and integrate data across both lake and warehouse environments, supporting scalable, enterprise-ready analytics.
Inclus
6 vidéos3 lectures7 devoirs
Afficher les informations sur le contenu du module
6 vidéos•Total 25 minutes
Building the Foundation for Data-Driven Innovation•3 minutes
Implementing Data Lake Zones•6 minutes
The Art and Science of Dimensional Modeling•3 minutes
Designing Effective Data Warehouse Schemas•4 minutes
Synapse Implementation Best Practices Assessment•30 minutes
Data Storage Architecture Mastery Assessment•30 minutes
Building Data Pipelines (ETL/ELT with Azure Data Factory)
Module 4•5 heures à terminer
Détails du module
Building Data Pipelines (ETL/ELT with Azure Data Factory) equips learners with the skills to design, implement, and manage scalable data integration workflows using modern, cloud-native approaches. The module examines the differences between ETL and ELT, helping learners understand when each methodology delivers the best performance, flexibility, and cost efficiency. Learners gain hands-on experience with Azure Data Factory, configuring linked services, datasets, activities, and core orchestration components, and practice building both simple and advanced pipelines. The module also introduces transformation logic, control flow patterns, parameterization, and error handling strategies that support production-ready data engineering solutions. Through walkthroughs, labs, code exercises, and scenario-based decisions, learners learn to monitor pipelines, troubleshoot failures, and design reliable data workflows that support enterprise-scale analytics.
Inclus
6 vidéos3 lectures9 devoirs
Afficher les informations sur le contenu du module
6 vidéos•Total 22 minutes
The Great ETL vs ELT Debate•4 minutes
Architecting ETL and ELT Solutions•5 minutes
Data Factory as the Integration Hub•3 minutes
Building Your First Data Factory Pipeline•4 minutes
Production-Ready Pipeline Engineering•3 minutes
Engineering Robust Data Pipelines•4 minutes
3 lectures•Total 30 minutes
ETL vs ELT Strategic Decision Framework•10 minutes
Azure Data Factory Implementation Guide•10 minutes
Advanced Data Factory Pipeline Design•10 minutes
9 devoirs•Total 270 minutes
ETL vs ELT Analysis•30 minutes
Data Integration Strategy Assessment•30 minutes
Azure Data Factory Pipeline JSON•30 minutes
Data Factory Pipeline Creation•30 minutes
Data Factory Fundamentals Assessment•30 minutes
Advanced Data Factory Pipeline Configuration•30 minutes
Advanced Pipeline Development•30 minutes
Advanced Pipeline Design Assessment•30 minutes
Data Pipeline Engineering Mastery Assessment•30 minutes
Batch and Real-Time Processing Fundamentals
Module 5•5 heures à terminer
Détails du module
Batch and Real-Time Processing Fundamentals introduces learners to the core processing models that power modern big data systems, helping them understand when each approach delivers the most value. The module explores batch architectures, scheduling methods, and optimization strategies for large-scale historical processing, while also examining real-time stream processing concepts, including event handling, latency trade-offs, and throughput requirements. Learners gain hands-on experience implementing both models—building batch workflows with Azure Data Factory and configuring streaming pipelines using Event Hubs and Stream Analytics. Through architectural analysis, code exercises, and practical labs, learners learn to evaluate business needs, select the right processing approach, and design hybrid systems that combine batch and streaming for comprehensive analytics.
Inclus
6 vidéos3 lectures9 devoirs
Afficher les informations sur le contenu du module
Comprehensive Data Storage and Integration Solution - Project•30 minutes
Obtenez un certificat professionnel
Ajoutez ce titre à votre profil LinkedIn, à votre curriculum vitae ou à votre CV. Partagez-le sur les médias sociaux et dans votre évaluation des performances.
Our goal at Microsoft is to empower every individual and organization on the planet to achieve more.
In this next revolution of digital transformation, growth is being driven by technology. Our integrated cloud approach creates an unmatched platform for digital transformation. We address the real-world needs of customers by seamlessly integrating Microsoft 365, Dynamics 365, LinkedIn, GitHub, Microsoft Power Platform, and Azure to unlock business value for every organization—from large enterprises to family-run businesses. The backbone and foundation of this is Azure.
Pour quelles raisons les étudiants sur Coursera nous choisissent-ils pour leur carrière ?
Felipe M.
Étudiant(e) depuis 2018
’Pouvoir suivre des cours à mon rythme à été une expérience extraordinaire. Je peux apprendre chaque fois que mon emploi du temps me le permet et en fonction de mon humeur.’
Jennifer J.
Étudiant(e) depuis 2020
’J'ai directement appliqué les concepts et les compétences que j'ai appris de mes cours à un nouveau projet passionnant au travail.’
Larry W.
Étudiant(e) depuis 2021
’Lorsque j'ai besoin de cours sur des sujets que mon université ne propose pas, Coursera est l'un des meilleurs endroits où se rendre.’
Chaitanya A.
’Apprendre, ce n'est pas seulement s'améliorer dans son travail : c'est bien plus que cela. Coursera me permet d'apprendre sans limites.’
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.