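PySpark courses can help you learn data manipulation, distributed computing, and data analysis techniques. You can build skills in handling large datasets, performing transformations, and running machine learning algorithms. Many courses introduce tools such as Apache Spark and its libraries, which support efficient big data processing and integration with AI applications.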

Skills you'll gain: Model Evaluation, Data Preprocessing, Exploratory Data Analysis, Feature Engineering, Model Deployment, Data Analysis, PySpark, Model Training, Data Cleansing, Data Import/Export, Data Transformation, Apache Spark, Data-Driven Decision-Making, AI Enablement, Decision Tree Learning, Predictive Modeling, Predictive Analytics, Machine Learning
★ 4.7 (26) · Intermediate · Guided Project · Less Than 2 Hours

Skills you'll gain: Scala Programming, Data Pipelines, Test Driven Development (TDD), Apache Airflow, Data Lakes, Apache Spark, CI/CD, Apache Kafka, Data Quality, Data Architecture, Performance Tuning, Data Store, Unit Testing, Data Transformation, Data Processing, Data Validation, Maintainability, Continuous Integration, Continuous Deployment, Data Integrity
Intermediate · Course · 3-6 Months

Skills you'll gain: PySpark, Apache Spark, Data Synthesis, Data Visualization Software, Data Analysis, Exploratory Data Analysis, Data Cleansing, Data Wrangling, Data Processing, Data Manipulation, Big Data, Data Science, Jupyter, People Analytics
Intermediate · Guided Project · Less Than 2 Hours

Skills you'll gain: Databricks, Apache Spark, Microsoft Azure, Data Integration, Data Lakes, File Systems, Data Processing, Big Data, File Management
★ 4.3 (36) · Beginner · Course · 1-3 Months

Skills you'll gain: PySpark, Apache Spark, Apache Hadoop, Data Pipelines, Big Data, Data Storage Technologies, Data Processing, Distributed Computing, Data Architecture, Data Storage, Data Wrangling, Data Integration, Data Transformation, SQL, Data Manipulation, Performance Tuning
★ 2.8 (8) · Intermediate · Course · 1-3 Months

Duke University
Skills you'll gain: PySpark, Snowflake Schema, Databricks, Data Pipelines, Apache Spark, MLOps (Machine Learning Operations), Apache Hadoop, Data Architecture, Big Data, Data Warehousing, Data Quality, Data Integration, Data Processing, DevOps, Model Training, Model Deployment, Distributed Computing, Data Transformation, SQL, Python Programming
★ 3.9 (66) · Advanced · Course · 1-4 Weeks

Skills you'll gain: Model Evaluation, PySpark, Apache Spark, Logistic Regression, Predictive Modeling, Applied Machine Learning, Unsupervised Learning, Decision Tree Learning, Predictive Analytics, Advanced Analytics, Machine Learning Methods, Random Forest Algorithm, Model Training, Regression Analysis, Classification Algorithms, Machine Learning Algorithms, Data Pipelines
★ 5 (12) · Mixed · Course · 1-4 Weeks

University of Pittsburgh
Skills you'll gain: Apache Hadoop, Apache Spark, PySpark, Data Pipelines, Distributed Computing, Big Data, Apache Hive, Data Processing, Data Storage, Scikit Learn (Machine Learning Library), Predictive Modeling, Scalability, Data Management, File Systems, Data Science, Data Transformation, Information Technology, Data Analysis
Intermediate · Course · 1-4 Weeks

École Polytechnique Fédérale de Lausanne
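Skills you'll gain: Distributed Computing, Scala Programming, Big Data, Data Transformation, Query Languages, Data Manipulation, Data Import/Export, Data Persistence, Apache Spark, Performance Tuning, Apache Hadoop, Data Processing
★ 4.6 (2600) · Intermediate · Course · 1-4 Weeks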

Skills you'll gain: PySpark, Apache Spark, Data Pipelines, Data Processing, AI Personalization, Dimensionality Reduction, OpenAI API, Data Manipulation, Pandas (Python Package), Data Transformation, Unsupervised Learning, Applied Machine Learning, Embeddings, Machine Learning
Intermediate · Guided Project · Less Than 2 Hours

Duke University
Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, GitHub Copilot, Interactive Data Visualization, Plot (Graphics), Plotly, Data Pipelines, Kubernetes, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming
★ 3.8 (122) · Intermediate · Specialization · 1-3 Months

Google Cloud
Skills you'll gain: Big Data, Google Cloud Platform, Managed Services, Apache Spark, Data Management, Apache Hadoop
★ 4 (11) · Beginner · Project · Less Than 2 Hours