This is the fifth course in the Google Advanced Data Analytics Certificate. In this course, you’ll learn about machine learning, which uses algorithms and statistics to teach computer systems to discover patterns in data. Data professionals use machine learning to help analyze large amounts of data, solve complex problems, and make accurate predictions. You’ll focus on the two main types of machine learning: supervised and unsupervised. You'll learn how to apply different machine learning models to business problems and become familiar with specific models such as Naive Bayes, decision tree, random forest, and more.
Google employees who currently work in the field will guide you through this course by providing hands-on activities that simulate relevant tasks, sharing examples from their day-to-day work, and helping you enhance your data analytics skills to prepare for your career.
Learners who complete the eight courses in this program will have the skills needed to apply for data science and advanced data analytics jobs. This certificate assumes prior knowledge of foundational analytical principles, skills, and tools covered in the Google Data Analytics Certificate.
By the end of this course, you will:
-Apply feature engineering techniques using Python
-Construct a Naive Bayes model
-Describe how unsupervised learning differs from supervised learning
-Code a K-means algorithm in Python
-Evaluate and optimize the results of K-means model
-Explore decision tree models, how they work, and their advantages over other types of supervised machine learning
-Characterize bagging in machine learning, specifically for random forest models
-Distinguish boosting in machine learning, specifically for XGBoost models
-Explain tuning model parameters and how they affect performance and evaluation metrics
You’ll start by exploring the basic concepts of machine learning and the role of machine learning in data science. Then, you’ll review the four main types of machine learning: supervised, unsupervised, reinforcement, and deep learning.
涵盖的内容
16个视频7篇阅读材料7个作业4个插件
显示有关单元内容的信息
16个视频•总计56分钟
Introduction to Course 5•4分钟
Susheela: Delight people with data•3分钟
Welcome to module 1•1分钟
The main types of machine learning•7分钟
Determine when features are infinite•3分钟
Categorical features and classification models•4分钟
Guide user interest with recommendation systems•7分钟
Equity and fairness in machine learning•3分钟
Build ethical models•4分钟
Python for machine learning•4分钟
Different types of Python IDEs•2分钟
More about Python packages•3分钟
Resources to answer programming questions•3分钟
Your machine learning team•2分钟
Samantha: Connect to the data professional community•3分钟
Wrap-up•2分钟
7篇阅读材料•总计110分钟
Helpful resources and tips•8分钟
Course 5 overview•12分钟
Case study: The Woobles: The power of recommendation systems to drive sales•20分钟
Reference guide: Python for machine learning•20分钟
Python libraries and packages•20分钟
Find solutions online•20分钟
Glossary terms from module 1•10分钟
7个作业•总计82分钟
Test your knowledge: Introduction to machine learning•6分钟
Test your knowledge: Categorical versus continuous data types and models•4分钟
Test your knowledge: Machine learning in everyday life•6分钟
Test your knowledge: Ethics in machine learning•4分钟
Test your knowledge: Utilize the Python toolbelt for machine learning•6分钟
Test your knowledge: Machine learning resources for data professionals•6分钟
[Turkish learners ONLY] Categorize: Data science tools - Türkçe•15分钟
Workflow for building complex models
第 2 单元•小时 后完成
单元详情
You’ll learn how data professionals use a structured workflow for machine learning. You'll identify the main steps of the workflow and the importance of each step in the overall process. Then, you'll learn how to apply specific machine learning models to business problems.
涵盖的内容
12个视频6篇阅读材料3个作业6个非评分实验室
显示有关单元内容的信息
12个视频•总计46分钟
Welcome to module 2•1分钟
PACE in machine learning•1分钟
Plan for a machine learning project•2分钟
Ganesh: Overcome challenges and learn from your mistakes•3分钟
Analyze data for a machine learning model•3分钟
Introduction to feature engineering•5分钟
Solve issues that come with imbalanced datasets•4分钟
Feature engineering and class balancing•8分钟
Introduction to Naive Bayes•4分钟
Construct a Naive Bayes model with Python•10分钟
Key evaluation metrics for classification models•3分钟
Wrap-up•1分钟
6篇阅读材料•总计44分钟
More about planning a machine learning project•8分钟
Explore feature engineering•8分钟
More about imbalanced datasets•8分钟
Naive Bayes classifiers•8分钟
More about evaluation metrics for classification models•8分钟
Glossary terms from module 2•4分钟
3个作业•总计52分钟
Test your knowledge: PACE in machine learning: The plan and analyze stages•6分钟
Test your knowledge: PACE in machine learning: The construct and execute stages•6分钟
Module 2 challenge •40分钟
6个非评分实验室•总计200分钟
Annotated follow-along guide: Feature engineering with Python•20分钟
Activity: Perform feature engineering•60分钟
Exemplar: Perform feature engineering•20分钟
Annotated follow-along guide: Construct a Naive Bayes model with Python•20分钟
Activity: Build a Naive Bayes model•60分钟
Exemplar: Build a Naive Bayes model•20分钟
Unsupervised learning techniques
第 3 单元•小时 后完成
单元详情
You’ll learn more about one of the major types of machine learning: unsupervised learning. You'll begin by exploring the difference between supervised and unsupervised techniques and the benefits and uses of each approach. Then, you’ll learn how to apply two unsupervised machine learning models: clustering and K-means.
涵盖的内容
7个视频4篇阅读材料3个作业4个非评分实验室
显示有关单元内容的信息
7个视频•总计32分钟
Welcome to module 3•2分钟
Introduction to K-means•5分钟
Use K-means for color compression with Python•7分钟
Key metrics for representing K-means clustering•4分钟
Inertia and silhouette coefficient metrics•4分钟
Apply inertia and silhouette score with Python•9分钟
Wrap-up•1分钟
4篇阅读材料•总计24分钟
More about K-means•8分钟
Clustering beyond K-means•4分钟
More about inertia and silhouette coefficient metrics•8分钟
Glossary terms from module 3•4分钟
3个作业•总计52分钟
Test your knowledge: Explore unsupervised learning and K-means•6分钟
Test your knowledge: Evaluate a K-means model•6分钟
Module 3 challenge•40分钟
4个非评分实验室•总计120分钟
Annotated follow-along guide: Use K-means for color compression with Python•20分钟
Annotated follow-along resource: Apply inertia and silhouette score with Python•20分钟
Activity: Build a K-means model•60分钟
Exemplar: Build a K-means model•20分钟
Tree-based modeling
第 4 单元•小时 后完成
单元详情
Next, you’ll focus on supervised learning. You’ll learn how to test and validate the performance of supervised machine learning models such as decision tree, random forest, and gradient boosting.
涵盖的内容
17个视频11篇阅读材料5个作业10个非评分实验室2个插件
显示有关单元内容的信息
17个视频•总计77分钟
Welcome to module 4•2分钟
Daisy: Highlight both technical and people skills•3分钟
Tree-based modeling •4分钟
Build a decision tree with Python •6分钟
Tune a decision tree•5分钟
Verify performance using validation •3分钟
Tune and validate decision trees with Python •5分钟
Bootstrap aggregation•5分钟
Explore a random forest•3分钟
Tuning a random forest •4分钟
Build and cross-validate a random forest model with Python•5分钟
Build and validate a random forest model using a validation data set•8分钟
Introduction to boosting: AdaBoost •5分钟
Gradient boosting machines•5分钟
Tune a GBM model •5分钟
Build an XGBoost model with Python •7分钟
Wrap-up•2分钟
11篇阅读材料•总计84分钟
Explore decision trees•8分钟
Hyperparameter tuning•8分钟
More about validation and cross-validation•8分钟
Bagging: How it works and why to use it•8分钟
More about random forests•8分钟
Reference guide: Random forest tuning•8分钟
Reference guide: Validation and cross-validation•8分钟
Case Study: Machine learning model unearths resourcing insights for Booz Allen Hamilton•8分钟
More about gradient boosting•8分钟
Reference guide: XGBoost tuning•8分钟
Glossary terms from module 4 •4分钟
5个作业•总计80分钟
Test your knowledge: Additional supervised learning techniques•8分钟
Test your knowledge: Tune tree-based models•8分钟
Test your knowledge: Bagging •8分钟
Test your knowledge: Boosting•6分钟
Module 4 challenge•50分钟
10个非评分实验室•总计320分钟
Annotated follow-along guide: Build a decision tree•20分钟
Annotated follow-along guide: Tune and validate decision trees•20分钟
Activity: Build a decision tree•60分钟
Exemplar: Build a decision tree•20分钟
Annotated follow-along guide: Build and cross-validate a random forest model•20分钟
Activity: Build a random forest model•60分钟
Exemplar: Build a random forest model•20分钟
Annotated follow-along guide: Build an XGBoost model with Python•20分钟
Activity: Build an XGBoost model•60分钟
Exemplar: Build an XGBoost model•20分钟
2个插件•总计20分钟
Identify: Parts of the decision tree •10分钟
[Turkish learners ONLY] Identify: Parts of the decision tree - Türkçe•10分钟
Course 5 end-of-course project
第 5 单元•小时 后完成
单元详情
You’ll complete the final end-of-course project by applying different machine learning models to a workplace scenario dataset.
涵盖的内容
5个视频10篇阅读材料4个作业6个非评分实验室
显示有关单元内容的信息
5个视频•总计12分钟
Welcome to module 5•2分钟
Uri: Impress interviewers with your unique solutions•2分钟
Introduction to your Course 5 end-of-course portfolio project•2分钟
End-of-course project wrap-up and tips for ongoing career success•3分钟
Grow with Google is an initiative that draws on Google's decades-long history of building products, platforms, and services that help people and businesses grow. We aim to help everyone – those who make up the workforce of today and the students who will drive the workforce of tomorrow – access the best of Google’s training and tools to grow their skills, careers, and businesses.
确定
人们为什么选择 Coursera 来帮助自己实现职业发展
Felipe M.
自 2018开始学习的学生
''能够按照自己的速度和节奏学习课程是一次很棒的经历。只要符合自己的时间表和心情,我就可以学习。'
Jennifer J.
自 2020开始学习的学生
''我直接将从课程中学到的概念和技能应用到一个令人兴奋的新工作项目中。'
Larry W.
自 2021开始学习的学生
''如果我的大学不提供我需要的主题课程,Coursera 便是最好的去处之一。'
Chaitanya A.
''学习不仅仅是在工作中做的更好:它远不止于此。Coursera 让我无限制地学习。'
学生评论
4.8
623 条评论
5 stars
85.39%
4 stars
11.39%
3 stars
2.08%
2 stars
0.64%
1 star
0.48%
显示 3/623 个
C
CM
5·
已于 May 17, 2024审阅
This course helped me take my ML skills to another level entirely, I would certainly recommend it to anyone looking for a breakthrough in data analytics.
I
IH
5·
已于 Jan 14, 2024审阅
Very useful course! Concise overview of strengths and weaknesses of various cutting edge machine learning techniques.
M
MB
5·
已于 Jul 24, 2023审阅
A great course for anyone who wants to dive into the world of Machine Learning. The steps are easy to follow and the lectures and lengthy enough to give a complete idea of the topic.
Organizations of all types and sizes have business processes that generate massive volumes of data. Every moment, all sorts of information gets created by computers, the internet, phones, texts, streaming video, photographs, sensors, and much more. In the global digital landscape, data is increasingly imprecise, chaotic, and unstructured. As the speed and variety of data increases exponentially, organizations are struggling to keep pace.
Data science and advanced data analytics are part of a field of study that uses raw data to create new ways of modeling and understanding the unknown. To gain insights, businesses rely on data professionals to acquire, organize, and interpret data, which helps inform internal projects and processes. Data scientists and advanced data analysts rely on a combination of critical skills, including statistics, scientific methods, data analysis, and artificial intelligence.
What do data professionals do?
A data professional is a term used to describe any individual who works with data and/or has data skills. At a minimum, a data professional is capable of exploring, cleaning, selecting, analyzing, and visualizing data. They may also be comfortable with writing code and have some familiarity with the techniques used by statisticians and machine learning engineers, including building models, developing algorithmic thinking, and building machine learning models.
Data professionals are responsible for collecting, analyzing, and interpreting large amounts of data within a variety of different organizations. The role of a data professional is defined differently across companies. Generally speaking, data professionals possess technical and strategic capabilities that require more advanced analytical skills such as data manipulation, experimental design, predictive modeling, and machine learning. They perform a variety of tasks related to gathering, structuring, interpreting, monitoring, and reporting data in accessible formats, enabling stakeholders to understand and use data effectively. Ultimately, the work of data professionals helps organizations make informed, ethical decisions.
Why start a career in data science or advanced data analytics?
Large volumes of data — and the technology needed to manage and analyze it — are becoming increasingly accessible. Because of this, there has been a surge in career opportunities for people who can tell stories using data, such as senior data analysts and data scientists. These professionals collect, analyze, and interpret large amounts of data within a variety of different organizations. Their responsibilities require advanced analytical skills such as data manipulation, experimental design, predictive modeling, and machine learning.
Which jobs will this certificate help me prepare for?
The Google Advanced Data Analytics Certificate on Coursera is designed to prepare learners for roles as entry-level data scientists and advanced-level data analy
What tools and platforms are taught in the curriculum?
During this certificate program, you’ll gain knowledge of tools and platforms like Jupyter Notebook, Kaggle, Python, Stack Overflow, and Tableau.
What background is required?
This certificate program assumes prior knowledge of foundational analytical principles, skills, and tools. To succeed in this certificate program, you should already know about key foundational aspects of data analysis, such as the data analysis process and data life cycle, databases and general database elements, programming language basics, and project stakeholders.
The content in this certificate program builds upon data analytics concepts taught in the Google Data Analytics Certificate. These include key foundational aspects of data analysis such as the data analysis process and data life cycle, databases and general database elements such as primary and foreign keys, SQL and programming language basics, and project stakeholders. If you haven’t completed that program or if you’re unsure whether you have the necessary prerequisites, you can take an ungraded assessment in Course 1 Module 1 of this certificate to evaluate your readiness.
Why enroll in the Google Advanced Data Analytics Certificate?
You’ll learn job-ready skills through interactive content — like activities, quizzes, and discussion prompts — in under six months, with less than 10 hours of flexible study a week. Along the way, you’ll work through a curriculum designed by Google employees who work in the field, with input from top employers and industry leaders. You’ll even have the opportunity to complete end-of-course projects and a final capstone project that you can share with potential employers to showcase your data analysis skills. After you’ve graduated from the program, you’ll have access to career resources and be connected directly with employers hiring for open entry-level roles in data science and advanced roles in data analytics.
Do I need to take the course in a certain order?
We highly recommend completing the seven courses in the order presented because the content in each course builds on information covered in earlier lessons.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.