University of California San Diego

Big Data Modeling and Management Systems

Once you’ve identified a big data issue to analyze, how do you collect, store and organize your data using Big Data solutions? In this course, you will experience various data genres and management tools appropriate for each. You will be able to describe the reasons behind the evolving plethora of new big data platforms from the perspective of big data management systems and analytical tools. Through guided hands-on tutorials, you will become familiar with techniques using real-time and semi-structured data examples. Systems and tools discussed include: AsterixDB, HP Vertica, Impala, Neo4j, Redis, SparkSQL. This course provides techniques to extract value from existing untapped data sources and to discover new data sources.

At the end of this course, you will be able to:
* Recognize different data elements in your own work and in everyday life problems
* Explain why your team needs to design a Big Data Infrastructure Plan and Information System Design
* Identify the frequent data operations required for various types of data
* Select a data model to suit the characteristics of your data
* Apply techniques to handle streaming data
* Differentiate between a traditional Database Management System and a Big Data Management System
* Appreciate why there are so many data management systems
* Design a big data information system for an online game company

This course is for those new to data science. Completion of Intro to Big Data is recommended. No prior programming experience is needed, although the ability to install applications and utilize a virtual machine is necessary to complete the hands-on assignments. Refer to the specialization technical requirements for complete hardware and software specifications.

Hardware Requirements: (A) Quad Core Processor (VT-x or AMD-V support recommended), 64-bit; (B) 8 GB RAM; (C) 20 GB disk free.

How to find your hardware information: (Windows): Open System by clicking the Start button, right-clicking Computer, and then clicking Properties; (Mac): Open Overview by clicking on the Apple menu and clicking “About This Mac.” Most computers with 8 GB RAM purchased in the last 3 years will meet the minimum requirements. You will need a high-speed internet connection because you will be downloading files up to 4 GB in size.

Software Requirements: This course relies on several open-source software tools, including Apache Hadoop. All required software can be downloaded and installed free of charge (except for data charges from your internet provider). Software requirements include: Windows 7+, Mac OS X 10.10+, Ubuntu 14.04+ or CentOS 6+; VirtualBox 5+.

Status: Database Management Systems
Status: Data Modeling

Featured reviews

RL

5.0 Reviewed on: Apr 9, 2019

As an undergraduate data analytics student, this course was an enlightening experience that complemented my more theoretical, less applied on-campus course very well.

SM

5.0 Reviewed on: Apr 12, 2020

Gives us a very good understanding on Big data modeling and various data models like Graph Data model, Vector Space model (TF-IDF), Vertica, AsterixDB, Aerospike etc.

BR

4.0 Reviewed on: Oct 10, 2018

It was a difficult module. The trainer tried to convey the material, but it seems quite complex, and it took time for me to understand the concepts and apply them while doing my assignment.

AG

4.0 Reviewed on: Oct 8, 2017

A lot of new information, excellent delivery. I gave it a 4 because I feel the real-use-case flavor is inadequate: the exercises could be more intensive, and real case studies could be added.

DN

4.0 Reviewed on: Nov 22, 2016

Pretty good overall, although some exercises are a bit difficult to understand from the descriptions and instructions given. Some graphs and initial reference documentation for the exercises might help.

PR

4.0 Reviewed on: Mar 6, 2018

Pretty good course. The peer corrected assignment is avoidable. Instead, a little bit of programming may be introduced! The course becomes extremely boring due to only theoretical aspects and quizzes.

LR

5.0 Reviewed on: Dec 4, 2019

Great course to learn and also practice how data can be visualized, how to model data and formats, and how important it is to choose an appropriate data format and model.

JW

5.0 Reviewed on: May 7, 2019

I feel as though the assessment questions could have been more specific and the assessment criteria when marking could have been more precise. But other than that it was a great course.

RM

4.0 Reviewed on: Oct 16, 2016

Interesting. Sometimes a little bit overwhelmed by a lot of information within a single video but it gives you an overview of what is big data modeling and management systems.

VC

4.0 Reviewed on: Sep 17, 2017

The course provided me a good understanding of the tools and insights on how data could be modeled and managed. I feel confident that I can use the knowledge at work.

AC

5.0 Reviewed on: Mar 23, 2020

It's a good course to get an overview of different technologies that are out there. I would highly recommend this course to anyone who wants to build a foundation for big data.

All reviews

Showing: 20 of 519

Sergey Kondrashov
2.0
Reviewed on: Oct 4, 2019
sandeep dhankhar
3.0
Reviewed on: Jul 16, 2019
Alberto Ramirez
2.0
Reviewed on: Nov 19, 2019
David Pérez García
3.0
Reviewed on: Nov 15, 2019
Hendrik Bruns
2.0
Reviewed on: Dec 17, 2017
Massimo Marra
1.0
Reviewed on: Jan 27, 2018
Kuldeep Kumar Sondhiya
5.0
Reviewed on: Jun 4, 2020
Nishant Upadhyay
5.0
Reviewed on: Feb 18, 2019
Online Learning
3.0
Reviewed on: Sep 25, 2021
Kamran Huseynov
2.0
Reviewed on: Jun 5, 2020
Isaac Lawrence
2.0
Reviewed on: May 7, 2018
David Shapiro
2.0
Reviewed on: Feb 3, 2017
Ruben Dario Mendoza Peña
3.0
Reviewed on: Mar 30, 2022
Zaher Alhaj Hussein
2.0
Reviewed on: Jun 22, 2017
Kenneth Charles Chilcoat
1.0
Reviewed on: Feb 24, 2022
James woodhouse
5.0
Reviewed on: May 8, 2019
Aldo Bohorquez
5.0
Reviewed on: Apr 3, 2017
Deleted Account
4.0
Reviewed on: May 5, 2024
Brian Song
4.0
Reviewed on: Jul 11, 2022
Bani Chibuike Nwogbo
4.0
Reviewed on: May 11, 2020