When will I receive my Course Certificate?

If you complete the course successfully, your electronic Course Certificate will be added to your Accomplishments page - from there, you can print your Course Certificate or add it to your LinkedIn profile.

Why can’t I audit this course?

This course is currently available only to learners who have paid or received financial aid, when available.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your chosen learning program, you’ll find a link to apply on the description page.

Data Engineering with Scala and Spark

Instructor: Packt - Course Instructors

13 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

2 weeks to complete, at under 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

Set up a development environment for building data pipelines in Scala
Use Spark DataFrames, Datasets, and SQL with Scala for data processing
Profile and clean data using Deequ for improved data quality

Skills you'll gain

Test Driven Development (TDD)
Unit Testing
Data Integrity
Maintainability
Data Pipelines
Data Architecture
Data Store
CI/CD
Data Processing
Data Quality
Performance Tuning
Continuous Deployment
Data Validation
Continuous Integration
Data Transformation

Tools you'll learn

Apache Spark
Scala Programming
Apache Kafka
Data Lakes
Apache Airflow

Details to know

Shareable certificate: add it to your LinkedIn profile

Recently updated: March 2026

Assessments: 13 assignments

Taught in English

There are 13 modules in this course

This course is designed to equip data engineers with the skills to build scalable and efficient data pipelines using Scala and Spark. Data engineers will learn best practices for development, testing, and deployment in cloud environments, with a focus on optimizing performance and ensuring data quality. The course provides the necessary tools to transform raw data into actionable insights, making it highly relevant in today’s data-driven world.

Throughout the course, learners will improve their data engineering skills by mastering techniques for building both streaming and batch data pipelines. The content emphasizes practical outcomes such as performance tuning and data profiling. With hands-on examples and step-by-step guidance, learners will gain a solid understanding of real-time and batch processing pipelines.

What makes this course unique is its combination of foundational theory and real-world applications. By the end, you will be able to use Scala and Spark to process large datasets and optimize pipelines in cloud environments effectively.

This course is ideal for data engineers with some experience in data processing. While it assumes familiarity with data engineering concepts and cloud technologies, anyone eager to improve their skills in Scala and Spark will benefit from the practical, step-by-step approach.

Module 1: Scala Essentials for Data Engineers

In this section, we explore functional programming, higher-order functions, polymorphic functions, and pattern matching in Scala for data engineering applications.

What's included: 2 videos, 6 readings, 1 assignment

2 videos (2 minutes total)

Course Overview (1 minute)
Scala Essentials for Data Engineers - Overview Video (1 minute)

6 readings (120 minutes total)

Introduction (10 minutes)
Understanding Objects, Classes, and Traits (10 minutes)
Trait (10 minutes)
Examples of HOFs from the Scala Collection Library (30 minutes)
Understanding Polymorphic Functions (30 minutes)
Understanding Pattern Matching (30 minutes)

1 assignment (10 minutes total)

Scala Essentials for Data Engineers (10 minutes)

Module 2: Environment Setup

In this section, we explore cloud-based and local environments for data engineering pipelines, focusing on setup processes, trade-offs, and practical applications.

What's included: 1 video, 5 readings, 1 assignment

Module 3: An Introduction to Apache Spark and Its APIs: DataFrame, Dataset, and Spark SQL

In this section, we explore Apache Spark's APIs, focusing on DataFrame and Dataset for distributed data processing.

What's included: 1 video, 3 readings, 1 assignment

Module 4: Working with Databases

In this section, we explore using the Spark JDBC API for database access, designing database interfaces, and performing operations with configuration loading.

What's included: 1 video, 3 readings, 1 assignment

Module 5: Object Stores and Data Lakes

In this section, we explore object stores, data lakes, and lakehouses, focusing on their roles in managing large-scale data workflows efficiently.

What's included: 1 video, 6 readings, 1 assignment

Module 6: Understanding Data Transformation

In this section, we explore Spark transformations, aggregations, joins, and window functions to enhance data processing for BI and analytics. Key concepts include efficient data manipulation and pipeline development.

What's included: 1 video, 4 readings, 1 assignment

Module 7: Data Profiling and Data Quality

In this section, we explore Deequ for implementing data quality checks, analyzing completeness and accuracy, and defining constraints to ensure reliable data pipelines.

What's included: 1 video, 3 readings, 1 assignment

Module 8: Test-Driven Development, Code Health, and Maintainability

In this section, we explore test-driven development, static code analysis, and linting to improve code quality, maintainability, and consistency in data engineering projects.

What's included: 1 video, 4 readings, 1 assignment

1 video (1 minute total)

Test-Driven Development, Code Health, and Maintainability - Overview Video (1 minute)

4 readings (70 minutes total)

Introduction (20 minutes)
Performing Integration Testing (10 minutes)
Running Static Code Analysis (30 minutes)
Understanding Linting and Code Style (10 minutes)

1 assignment (10 minutes total)

Test-Driven Development and Code Maintainability Fundamentals (10 minutes)

Module 9: CI/CD with GitHub

In this section, we explore CI/CD practices with GitHub to automate Scala data pipeline workflows, focusing on GitHub Actions, version control, and reliable deployment processes.

What's included: 1 video, 4 readings, 1 assignment

Module 10: Data Pipeline Orchestration

In this section, we explore data pipeline orchestration using tools like Airflow, Argo, Databricks, and Azure Data Factory. We focus on workflow design, task management, and real-world implementation strategies.

What's included: 1 video, 6 readings, 1 assignment

1 video (1 minute total)

Data Pipeline Orchestration - Overview Video (1 minute)

6 readings (80 minutes total)

Introduction (10 minutes)
Monitoring and UI (10 minutes)
Working with Argo Workflows (20 minutes)
Creating an Argo Workflow (10 minutes)
Using Databricks Workflows (20 minutes)
Leveraging Azure Data Factory (10 minutes)

1 assignment (10 minutes total)

Data Pipeline Orchestration Fundamentals (10 minutes)

Module 11: Performance Tuning

In this section, we analyze Spark UI metrics to identify performance issues, optimize data shuffling, and right-size compute resources for efficient data processing.

What's included: 1 video, 4 readings, 1 assignment

Module 12: Building Batch Pipelines Using Spark and Scala

In this section, we explore building batch pipelines using Spark and Scala, focusing on medallion architecture, data ingestion, transformation, and orchestration for scalable data processing.

What's included: 1 video, 5 readings, 1 assignment

Module 13: Building Streaming Pipelines Using Spark and Scala

In this section, we explore building real-time data pipelines using Spark, Scala, and Kafka for IoT applications. Key concepts include data ingestion, transformation, and serving layer design.

What's included: 1 video, 4 readings, 1 assignment

Instructor

Packt - Course Instructors

Packt

1,749 courses, 494,468 learners

Offered by

Packt

Explore more from Data Management

Packt: Apache Spark with Scala – Hands-On with Big Data! (course, free trial)
Duke University: Spark, Hadoop, and Snowflake for Data Engineering (course, free trial)
EDUCBA: Apache Spark with Scala: Master Data Building & Analysis (course, free trial)
Coursera: Real-Time, Real Fast: Kafka & Spark for Data Engineers (specialization, free trial)

Frequently asked questions

Can I preview a course before enrolling?

Yes, you can preview the first video and view the syllabus before you enroll. You must purchase the course to access content not included in the preview.

When will I have access to the lectures and assignments?

If you decide to enroll in the course before the session start date, you will have access to all of the lecture videos and readings for the course. You’ll be able to submit assignments once the session starts.

What will I get when I enroll?

Once you enroll and your session begins, you will have access to all videos and other resources, including reading items and the course discussion forum. You’ll be able to view and submit practice assessments, and complete required graded assignments to earn a grade and a Course Certificate.

More questions? Visit the learner help center.
