返回到 Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud
University of Illinois Urbana-Champaign

Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

Welcome to the Cloud Computing Applications course, the second part of a two-course series designed to give you a comprehensive view on the world of Cloud Computing and Big Data! In this second course we continue Cloud Computing Applications by exploring how the Cloud opens up data analytics of huge volumes of data that are static or streamed at high velocity and represent an enormous variety of information. Cloud applications and data analytics represent a disruptive change in the ways that society is informed by, and uses information. We start the first week by introducing some major systems for data analysis including Spark and the major frameworks and distributions of analytics applications including Hortonworks, Cloudera, and MapR. By the middle of week one we introduce the HDFS distributed and robust file system that is used in many applications like Hadoop and finish week one by exploring the powerful MapReduce programming model and how distributed operating systems like YARN and Mesos support a flexible and scalable environment for Big Data analytics. In week two, our course introduces large scale data storage and the difficulties and problems of consensus in enormous stores that use quantities of processors, memories and disks. We discuss eventual consistency, ACID, and BASE and the consensus algorithms used in data centers including Paxos and Zookeeper. Our course presents Distributed Key-Value Stores and in memory databases like Redis used in data centers for performance. Next we present NOSQL Databases. We visit HBase, the scalable, low latency database that supports database operations in applications that use Hadoop. Then again we show how Spark SQL can program SQL queries on huge data. We finish up week two with a presentation on Distributed Publish/Subscribe systems using Kafka, a distributed log messaging system that is finding wide use in connecting Big Data and streaming applications together to form complex systems. Week three moves to fast data real-time streaming and introduces Storm technology that is used widely in industries such as Yahoo. We continue with Spark Streaming, Lambda and Kappa architectures, and a presentation of the Streaming Ecosystem. Week four focuses on Graph Processing, Machine Learning, and Deep Learning. We introduce the ideas of graph processing and present Pregel, Giraph, and Spark GraphX. Then we move to machine learning with examples from Mahout and Spark. Kmeans, Naive Bayes, and fpm are given as examples. Spark ML and Mllib continue the theme of programmability and application construction. The last topic we cover in week four introduces Deep Learning technologies including Theano, Tensor Flow, CNTK, MXnet, and Caffe on Spark.

状态:Big Data
状态:Scalability
课程小时

精选评论

DT

4.0评论日期:Oct 10, 2016

The course could use a programming assignment to go along with the lectures.

SB

4.0评论日期:Mar 18, 2018

Good overview and jumping off points to go explore more. Great that a lot of tool sets were exposed to us. A list of all these tool sets in a document would be handy.

GS

4.0评论日期:May 22, 2020

Good learning about big data and real life scenarios esp. Yahoo.

BD

4.0评论日期:Feb 22, 2020

There are a lot of technologies to cover and it is a dynamically changing subject. However, it will be great adding some hands-on exercises.

JA

4.0评论日期:Jun 10, 2020

Good structure, well explained but some of the examples presented are starting to be outdated. Solid theoretical presentation.

JA

5.0评论日期:Sep 29, 2019

Very Useful Course. Course material is massive and well prepared for the modern industry demands.

MS

5.0评论日期:Nov 26, 2017

Very good introduction of application concepts of cloud data computing. Thank You!

UN

5.0评论日期:Oct 30, 2016

good things to learn about real world big problems

UN

5.0评论日期:Apr 9, 2018

My understanding of Big Data technologies was really enhanced by this course. I have decided to pursue more of these underlying technologies after this course. Good job

KG

5.0评论日期:Jul 14, 2020

Really helpful to get insights into Big Data applications

VV

4.0评论日期:May 20, 2020

I love this course. Open lots of perspectives in cloud applications.

所有审阅

显示:20/53

Ak Dayal
4.0
评论日期:Sep 2, 2017
Patrick Santos
5.0
评论日期:Jun 19, 2017
Ning Zhao
5.0
评论日期:Oct 28, 2020
Yaron Klein
5.0
评论日期:Aug 27, 2017
Uche Ngadi
5.0
评论日期:Apr 10, 2018
Fillipe de Souza Silva
5.0
评论日期:Nov 13, 2016
André Luís Damázio de Sales Júnior
5.0
评论日期:Aug 5, 2020
Javed Ahmad
5.0
评论日期:Sep 30, 2019
Mahendra Pratap Singh
5.0
评论日期:Nov 26, 2017
Kedar Gosavi
5.0
评论日期:Jul 15, 2020
uzair naroo
5.0
评论日期:Oct 31, 2016
Eduardo Barreto Lourenço
5.0
评论日期:Jun 12, 2018
Sudhanshu Sharma
5.0
评论日期:Dec 17, 2016
Rajesh Kumar Pradhan
5.0
评论日期:May 18, 2025
Nursultan Turdaliev
5.0
评论日期:Oct 29, 2016
Ruiwen Wang
5.0
评论日期:Aug 13, 2020
Sreedevi R Nagarmunoli
5.0
评论日期:Aug 6, 2020
shashank
5.0
评论日期:Nov 13, 2018
Dario Fernandez Bayure
5.0
评论日期:Nov 10, 2020
Joseph Kilonzo
5.0
评论日期:Mar 30, 2019