Data Management for Analytics Part 2

Data Management for Analytics Part 2

位教师：Xuemin Jin

包含在 Coursera Plus 中

了解更多

6个模块

深入了解一个主题并学习基础知识。

9 小时完成

灵活的计划

自行安排学习进度

攻读学位

了解更多

6个模块

深入了解一个主题并学习基础知识。

9 小时完成

灵活的计划

自行安排学习进度

攻读学位

了解更多

您将获得的技能

要了解的详细信息

可分享的证书

添加到您的领英档案

了解顶级公司的员工如何掌握热门技能

了解关于 Coursera for Business 的更多信息

Petrobras, TATA, Danone, Capgemini, P&G 和 L'Oreal 的徽标

该课程共有6个模块

This course will offer you an opportunity to learn the fundamental concepts and emerging technologies in data storage and data governance. It presents a balanced theory-practice focus and covers Structured Query Language, and two flavors of NoSQL databases in MongoDB and Neo4j graph database. It also includes a brief introduction to big data management including hadoop, MapReduce, and Apache Spark. By the end of this part 2 course on data analytics, you will have a foundational understanding of the theory and applications of database management to support data analytics, data mining, machine learning, and artificial intelligence.

This module first presents an overview of the structured query language (SQL) Data Definition Language (SQL DDL) to define a relational data model. It examines the schema creation, table creation, drop command, and alter command. Various syntaxes are illustrated with explicit examples. This module also discusses the SQL Data Manipulation Language (SQL DML) used to retrieve data, update data, insert new data, and delete existing data. The focus is on SQL INSERT statements for inserting data into tables and some simple SQL SELECT statements. More complex SQL SELECT statements will be discussed in later modules along with SQL DELETE and SQL UPDATE statements.

涵盖的内容

1个视频10篇阅读材料7个作业

1个视频

Meet Your Faculty0分钟

10篇阅读材料总计128分钟

Course Introduction2分钟
Syllabus - Data Management for Analytics Part 210分钟
Academic Integrity1分钟
What is SQL?15分钟
SQL data definition language (DDL)20分钟
A DDL example20分钟
DROP and ALTER command10分钟
SQL INSERT statement15分钟
SQL SELECT statement30分钟
Module 1 Summary5分钟

7个作业总计13分钟

Check Your Prior Knowledge3分钟
Assess Your Learning: What is SQL?1分钟
Assess Your Learning: SQL Data Definition Language (DDL)2分钟
Assess Your Learning: A DDL Example2分钟
Assess Your Learning: DROP and ALTER Command2分钟
Assess Your Learning: SQL INSERT Statement1分钟
Assess Your Learning: SQL SELECT statement2分钟

This module continues the discussion of the SQL data manipulation language (DML) SELECT statement. It introduces various aggregate functions: COUNT, SUM, AVG, VARIANCE, MIN, and MAX, which are used to summarize information from database tuples. This is followed by the GROUP BY/HAVING clause, which allows the application of aggregate functions to subgroups. This module then discusses join queries that allow the user to combine or join data from multiple tables. The inner join queries feature a “where” clause that matches one or multiple columns from two tables. The left outer join, right outer join, and full outer join can be used to keep all the tuples of one or both tables in the result, regardless of whether or not they have matching tuples in the other table. All queries in this module use the Wine database in the online playground and can be executed there.

涵盖的内容

1个视频6篇阅读材料6个作业

1个视频总计4分钟

Aggregate Functions4分钟

6篇阅读材料总计85分钟

Queries with Aggregate Functions25分钟
Queries with GROUP BY/HAVING10分钟
Queries with ORDER BY10分钟
Inner Joins20分钟
Outer Joins15分钟
Module 2 Summary5分钟

6个作业总计11分钟

Check Your Prior Knowledge2分钟
Assess Your Learning: Queries with Aggregate Functions2分钟
Assess Your Learning: Queries with GROUP BY/HAVING1分钟
Assess Your Learning: Queries with ORDER BY2分钟
Assess Your Learning: Inner Joins2分钟
Assess Your Learning: Outer Joins2分钟

This module presents more complex SQL queries. It introduces nested queries where a complete SELECT FROM block appears in the WHERE clause of another query. The subquery or inner block is nested in the outer block and there can be multi-level nesting. The query optimizer usually flattens the nested query into multiple queries and executes them sequentially from the innermost to the outermost level. This module also examines the correlated nested query, where the inner block uses one or more columns of the table defined in the outer block. In this case, the query cannot be flattened, and the inner block subquery must be evaluated for each tuple of the table (also used in the inner block). The usage of the operators >= ALL and > ANY is discussed. The former can be used to find the highest or largest values whereas the latter can be used to exclude the lowest or smallest values. All queries in this module use the Wine database in the online playground and can be executed there. Finally, this module examines the DELETE and UPDATE statements that can be used to delete or modify data. It concludes with a brief discussion of SQL views.

涵盖的内容

2个视频10篇阅读材料10个作业

2个视频总计7分钟

Nested Query - Correlated Query4分钟
ALL/ANY/EXISTS/NOT EXISTS3分钟

10篇阅读材料总计135分钟

Nested Queries15分钟
Nested Correlated Queries20分钟
Queries with ALL/ANY15分钟
EXISTS/NOT EXISTS functions10分钟
Subqueries in SELECT/FROM10分钟
Set Operations15分钟
DELETE Statement15分钟
UPDATE Statement15分钟
SQL Views15分钟
Module 10 Summary5分钟

10个作业总计19分钟

Check Your Prior Knowledge3分钟
Assess Your Learning: Nested Queries2分钟
Assess Your Learning: Nested Correlated Queries2分钟
Assess Your Learning: Queries with ALL/ANY Knowledge2分钟
Assess Your Learning: EXISTS/NOT EXISTS Functions2分钟
Assess Your Learning: Subqueries in SELECT/FROM1分钟
Assess Your Learning: Set Operations2分钟
Assess Your Learning: DELETE Statement2分钟
Assess Your Learning: UPDATE Statement2分钟
Assess Your Learning: SQL Views1分钟

This module introduces a couple of extensions to the Relational Database Management Systems (RDBMSs). We will start by reviewing the core components of the relational model and its limitations. Subsequently, the module explores methods for extending relational databases, starting with a thorough review of triggers and stored procedures as pivotal mechanisms for augmenting the activity of RDBMSs. The module concludes by delving into the intricacies of recursive queries, a powerful extension to the SQL language.

涵盖的内容

4篇阅读材料4个作业

4篇阅读材料总计60分钟

Limitations of the relational model10分钟
Active Relational Database Management System Extensions: Triggers and Stored Procedures25分钟
Recursive SQL Queries20分钟
Week 11 Summary5分钟

4个作业总计8分钟

Check Your Prior Knowledge2分钟
Assess Your Learning: Limitations of the relational model3分钟
Assess Your Learning: Active Relational Database Management System Extensions: Triggers and Stored Procedures2分钟
Assess Your Learning: Recursive SQL Queries1分钟

This module presents an overview of the NoSQL movement and distributed systems. MongoDB NoSQL database is discussed at the introductory level. MongoDB is intended for storing documents such as resumes, legal documents, books, etc. It does not use any schema or data model, and stores documents as collections — which store a collection of attributes labeled and unordered that represent semi-structured items.

涵盖的内容

5篇阅读材料5个作业

5篇阅读材料总计70分钟

The NoSQL movement20分钟
Key-Value Stores and Distributed Systems10分钟
Document Stores and MongoDB20分钟
Aggregation with MapReduce15分钟
Module 5 Summary5分钟

5个作业总计7分钟

Check Your Prior Knowledge1分钟
Assess Your Learning: The NoSQL movement2分钟
Assess Your Learning: Key-Value Stores and Distributed Systems1分钟
Assess Your Learning: Document Stores and MongoDB2分钟
Assess Your Learning: Aggregation with MapReduce1分钟

This module continues the discussion of the NoSQL database. The graph theory and Neo4j graph database are discussed at the introductory level. The Neo4j is a graph database that applies graph theory to information storage. It consists of nodes and edges, both of which can store information. Graph databases are particularly useful in modeling social networks such as X (formerly known as Twitter) and Facebook. In a way, a graph database is a hyper-relational database where join tables are replaced by more interesting and semantically meaningful relationships that can be navigated (graph traversal) and/or queried, based on graph pattern matching.

涵盖的内容

5篇阅读材料4个作业

5篇阅读材料总计42分钟

A Brief Introduction to Graph Theory5分钟
Graph-based Databases10分钟
Neo4j and Cypher Query Language25分钟
Module 6 Summary1分钟
Congratulations!1分钟

4个作业总计5分钟

Check Your Prior Knowledge 1分钟
Assess Your Learning: A Brief Introduction to Graph Theory1分钟
Assess Your Learning: Graph-based Databases1分钟
Assess Your Learning: Neo4j and Cypher Query Language2分钟

攻读学位

课程是 Northeastern University 提供的以下学位课程的一部分。如果您被录取并注册，您已完成的课程可计入您的学位学习，您的学习进度也可随之转移。

位教师

Xuemin Jin

Northeastern University

4 门课程612 名学生

提供方

Northeastern University

从 Software Development 浏览更多内容

状态：免费试用
Meta
Introduction to Data Management
课程
状态：预览
Northeastern University
Database to AI: Practical Data Analytics Integration
课程
状态：免费试用
LearnKartS
Data Management, Reports, and Dashboards
课程
状态：免费试用
Whizlabs
AWS: Data Analytics
课程

人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

自 2018开始学习的学生

''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情，我就可以学习。'

Jennifer J.

自 2020开始学习的学生

''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'

Larry W.

自 2021开始学习的学生

''如果我的大学不提供我需要的主题课程，Coursera 便是最好的去处之一。'

Chaitanya A.

''学习不仅仅是在工作中做的更好：它远不止于此。Coursera 让我无限制地学习。'

通过 Coursera Plus 开启新生涯

无限制访问 10,000+ 世界一流的课程、实践项目和就业就绪证书课程 - 所有这些都包含在您的订阅中

了解更多

通过在线学位推动您的职业生涯

获取世界一流大学的学位 - 100% 在线

探索学位

加入超过 3400 家选择 Coursera for Business 的全球公司

提升员工的技能，使其在数字经济中脱颖而出

了解更多

常见问题

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you purchase a Certificate you get access to all course materials, including graded assignments. Upon completing the course, your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.