When it comes to using data, there are two main camps, traditional statistics and machine learning, and the two camps complement each other. Statistics remains highly relevant, irrespective of the size of data. Its role remains what it has always been, but it is even more important now. There is a need to transition from traditional statistical modeling to the machine learning world. This course introduces the statistical background necessary for machine learning. Knowledge of statistics relevant to machine learning will prepare you to become a data scientist. The course prepares you for future instruction on machine learning (including its underlying methodology that has statistical foundations) and enables you to develop a deeper understanding of machine learning models.

推荐体验
推荐体验
中级
Experience using computer software. A course in statistics covering distribution, p-values, and hypothesis testing is helpful.
推荐体验
推荐体验
中级
Experience using computer software. A course in statistics covering distribution, p-values, and hypothesis testing is helpful.
您将学到什么
Explain the relevance of statistics in machine learning.
Perform linear and logistic regression with practical troubleshooting tips.
要了解的详细信息

添加到您的领英档案
December 2025
71 项作业
了解顶级公司的员工如何掌握热门技能

积累 Probability and Statistics 领域的专业知识
- 向行业专家学习新概念
- 获得对主题或工具的基础理解
- 通过实践项目培养工作相关技能
- 通过 SAS 获得可共享的职业证书

该课程共有5个模块
In this module, you learn how to explain the relevance of statistics in big data and machine learning and relate statistical and data science terminology.
涵盖的内容
19个视频8篇阅读材料8个作业1个应用程序项目
19个视频• 总计52分钟
- Welcome to the Course• 2分钟
- Overview• 1分钟
- Data and Digital Economies• 5分钟
- Big Data Produced Smart Applications• 8分钟
- Knowing Statistics (1)• 0分钟
- Knowing Statistics (2)• 1分钟
- Relevance of Statistics in Big Data• 3分钟
- Transitioning to the Machine Learning World• 1分钟
- Analyzing Data• 1分钟
- Statistical Modeling: The Two Cultures• 2分钟
- Modeling Approaches• 4分钟
- Why Is Statistics Important to Machine Learning?• 2分钟
- Journey from Statistics to Machine Learning• 2分钟
- Basic Terminology• 5分钟
- Variable Type and Level of Measurement• 7分钟
- Modeling Vocabulary• 6分钟
- Introduction to SAS Viya• 1分钟
- SAS Viya Servers• 1分钟
- Introduction to SAS Studio and SAS Studio Flows• 3分钟
8篇阅读材料• 总计80分钟
- Data Dictionary• 10分钟
- Using SAS Viya for Learners with This Course (Required)• 10分钟
- What Is Big Data?• 10分钟
- Demo: Automated Machine Learning in SAS Studio• 10分钟
- More about the Relevance of Statistics in Big Data• 10分钟
- More about the Journey from Statistics to Machine Learning• 10分钟
- More about SAS Viya• 10分钟
- Lesson Summary• 10分钟
8个作业• 总计240分钟
- Question: Big Data• 30分钟
- Question: Computer Vision• 30分钟
- Question: Modeling Cultures• 30分钟
- Question: Basic Terminology (1)• 30分钟
- Question: Basic Terminology (2)• 30分钟
- Question: Measurement Scales• 30分钟
- Question: Modeling Vocabulary• 30分钟
- Question: SAS Viya• 30分钟
1个应用程序项目• 总计60分钟
- Access SAS Viya for Learners• 60分钟
In this module, you learn about fundamental statistical concepts.
涵盖的内容
36个视频9篇阅读材料17个作业
36个视频• 总计70分钟
- Overview• 1分钟
- Populations and Samples• 2分钟
- Process of Statistical Analysis• 2分钟
- Sampling• 1分钟
- Sampling Methods• 3分钟
- Event-Based Sampling• 7分钟
- Analysis Data• 2分钟
- Analysis Goals• 1分钟
- Describing Your Data• 2分钟
- Measures of Central Tendency• 1分钟
- Measures of Position• 1分钟
- Measures of Dispersion• 4分钟
- Visualizing Distributions• 0分钟
- Histograms• 1分钟
- Normal (Gaussian) Distribution• 2分钟
- Usefulness of Normal Distribution in Machine Learning• 1分钟
- Plots beyond Histograms• 2分钟
- Measures of Shape: Skewness• 3分钟
- Measures of Shape: Kurtosis• 5分钟
- Usefulness of Distribution Analysis in Machine Learning• 1分钟
- Making Inferences from Data• 1分钟
- Point Estimates• 0分钟
- Standard Error• 1分钟
- Sampling Distribution• 2分钟
- Confidence Intervals• 2分钟
- Statistical Hypothesis Test• 2分钟
- Performing a Hypothesis Test• 1分钟
- Statistical Hypothesis Test: Coin Example• 3分钟
- Statistical Hypothesis Test: Types of Errors• 2分钟
- Statistical Hypothesis Test: Effect Size Influence• 3分钟
- Statistical Hypothesis Test: Sample Size Influence• 2分钟
- p-Values and Statistical Significance• 1分钟
- Hypothesis Tests for Means• 1分钟
- One-Sample t Test Scenario• 0分钟
- Performing a t Test• 4分钟
- p-Values in Machine Learning• 4分钟
9篇阅读材料• 总计90分钟
- More about Sampling• 10分钟
- Oversampling and Undersampling: Advantages and Disadvantages• 10分钟
- Demo: Listing Observations• 10分钟
- Why “1.5” in IQR Method of Outlier Detection?• 10分钟
- Platykurtic and Leptokurtic Distributions• 10分钟
- Demo: Exploring Data• 10分钟
- Confidence Interval for Mean• 10分钟
- Demo: Testing a Hypothesis Using One-Sample t Test• 10分钟
- Lesson Summary• 10分钟
17个作业• 总计510分钟
- Question: Sampling Distribution• 30分钟
- Question: Samples• 30分钟
- Question: Sampling Methods• 30分钟
- Question: Event Based Sampling• 30分钟
- Question: Central Tendency• 30分钟
- Question: Dispersion• 30分钟
- Question: Summary Statistics• 30分钟
- Question: Normal (Gaussian) Distribution• 30分钟
- Question: Visualizing Distributions• 30分钟
- Question: Skewness• 30分钟
- Question: Normal Distribution• 30分钟
- Question: Variability in the Sample Statistic• 30分钟
- Question: Interval Estimates• 30分钟
- Question: Hypothesis Testing Terminology• 30分钟
- Question: Type I and Type II Errors• 30分钟
- Question: t Distribution• 30分钟
- Question: Confidence Interval• 30分钟
In this module, you learn about correlation and simple linear regression, multiple regression and model selection, and model diagnostics.
涵盖的内容
51个视频15篇阅读材料9个作业
51个视频• 总计58分钟
- Overview• 2分钟
- Explanatory Modeling• 3分钟
- Explore Your Data before Regression Modeling• 1分钟
- Correlation Coefficient• 2分钟
- Correlation Differs from Covariance• 2分钟
- Pearson Correlation Is Inappropriate for Some Data• 1分钟
- Using Correlation for Variable Screening• 1分钟
- Relevant versus Irrelevant Predictors• 1分钟
- Redundant versus Non-redundant Predictors• 1分钟
- Examples of Irrelevancy and Redundancy• 1分钟
- Correlation Does Not Imply Causation• 1分钟
- Correlation versus Regression• 1分钟
- Simple Linear Regression• 3分钟
- Least Squares Regression• 1分钟
- Linear Regression Hypothesis Tests• 1分钟
- Explained versus Unexplained Variability• 1分钟
- Coefficient of Determination• 1分钟
- Confidence and Prediction Intervals• 1分钟
- Correlations and Simple Linear Regression• 1分钟
- Multiple Regression• 2分钟
- Multiple Regression Hypothesis Test• 1分钟
- Categorical Predictors in Regression• 0分钟
- Dummy Coding of Categorical Inputs• 1分钟
- Multiple Regression with Categorical Predictors• 0分钟
- Regression and ANOVA• 2分钟
- Interaction Effects• 1分钟
- Many Possible Models!• 1分钟
- Comparing Regression Models• 0分钟
- Adjusted R Square• 1分钟
- Information Criteria• 2分钟
- Akaike's Information Criterion (AIC)• 3分钟
- Common Regression Model Selection Methods• 1分钟
- Sequential Selection: Forward• 2分钟
- Sequential Selection: Backward• 1分钟
- Sequential Selection: Stepwise• 1分钟
- Multiple Regression and Model Selection• 0分钟
- Model Diagnostics• 0分钟
- Assumptions of Linear Regression• 1分钟
- Verifying Assumptions with Residual Plots• 1分钟
- Nonrandom Patterns Indicate Problems• 1分钟
- Check Normality with Other Plots• 1分钟
- Potential Problems: Collinearity• 1分钟
- Illustration of Collinearity• 1分钟
- How to Detect Collinearity• 0分钟
- Variance Inflation Factor (VIF)• 1分钟
- Potential Problems: Extreme Values• 1分钟
- Outliers, High Leverage Points, and Influential Observations• 2分钟
- Influence Diagnostics• 1分钟
- Detecting Outliers and Influential Observations• 2分钟
- Assessing the Model• 0分钟
- What Did We Discover about the Model?• 1分钟
15篇阅读材料• 总计150分钟
- History of the Term Regression• 10分钟
- OLS Regression Parameter Estimates• 10分钟
- More about Least Square Regression• 10分钟
- Demo: Correlation and Linear Regression• 10分钟
- Interactions• 10分钟
- Most Used Information Criteria• 10分钟
- Select Variables/Choose Model• 10分钟
- Problems with Stepwise Selection Methods• 10分钟
- Shrinkage Methods• 10分钟
- Demo: Multiple Regression and Model Selection• 10分钟
- Candidate Models• 10分钟
- Which Model to Use?• 10分钟
- Removing Collinearity• 10分钟
- Demo: Model Diagnostics• 10分钟
- Lesson Summary• 10分钟
9个作业• 总计270分钟
- Question: Simple Linear Regression• 30分钟
- Question: Scatter Plots• 30分钟
- Question: Pearson Correlation• 30分钟
- Question: Relationships between Variables• 30分钟
- Question: Interaction• 30分钟
- Question: AIC• 30分钟
- Question: Sequential Selection Methods• 30分钟
- Question: Regression Assumptions• 30分钟
- Question: Collinearity• 30分钟
In this module, you learn about predictive modeling using logistic regression.
涵盖的内容
13个视频8篇阅读材料15个作业
13个视频• 总计54分钟
- Overview• 1分钟
- To Explain or to Predict?• 5分钟
- Predictive Modeling• 4分钟
- Honest Assessment• 4分钟
- Candidate Models• 2分钟
- Optimizing Model Complexity• 12分钟
- Associations between Categorical Variables• 5分钟
- Odds Ratio• 4分钟
- Logistic Regression• 5分钟
- Interpreting the Odds Ratio• 5分钟
- Assessing the Model Fit• 4分钟
- Multiple Logistic Regression Model• 2分钟
- Model Deployment• 2分钟
8篇阅读材料• 总计80分钟
- Analytical Methods for Predictive Modeling• 10分钟
- Timeframes for Modeling• 10分钟
- Decomposing Prediction Error• 10分钟
- Cramer's V Statistic• 10分钟
- Demo: Examining Categorical Association• 10分钟
- Demo: Fitting a Multiple Logistic Regression Model• 10分钟
- Demo: Scoring a Logistic Regression Model• 10分钟
- Lesson Summary• 10分钟
15个作业• 总计450分钟
- Question: Logistic Regression Model• 30分钟
- Question: Odds Ratio• 30分钟
- Question: Comparing Pairs for Model Assessment• 30分钟
- Question: Default and Income• 30分钟
- Question: Explanation versus Prediction• 30分钟
- Question: Score Code in SAS Viya• 30分钟
- Question: Honest Assessment• 30分钟
- Question: Model Diversity• 30分钟
- Question: Predictive Models• 30分钟
- Question: Categorical Variables• 30分钟
- Question: Association• 30分钟
- Question: Strength of an Association• 30分钟
- Question: Regression Models• 30分钟
- Question: Bounds for a Logit• 30分钟
- Question: Allocation Rule• 30分钟
In this module, you learn about the statistical foundations of machine learning.
涵盖的内容
41个视频15篇阅读材料22个作业
41个视频• 总计114分钟
- Overview• 1分钟
- Machine Learning• 1分钟
- Supervised Learning• 1分钟
- Unsupervised Learning• 2分钟
- Semi-supervised Learning• 2分钟
- Reinforcement Learning• 0分钟
- How Does Machine Learning Work?• 2分钟
- Neural Networks• 5分钟
- Data Preparation for Machine Learning• 3分钟
- Data Preprocessing for Machine Learning• 1分钟
- Data Difficulties and Modeling Issues• 1分钟
- Data Visualization• 1分钟
- Big Data Visualization Challenges• 3分钟
- Visualization Examples• 4分钟
- Dirty Data: Errors, Missing Values, and Outliers• 1分钟
- Errors• 0分钟
- Missing Data• 1分钟
- Missing Data Problems• 1分钟
- Analysis Strategies for Missing Data• 6分钟
- Cluster Imputation• 1分钟
- Tailored-Value Imputation• 2分钟
- Excessive Missingness• 1分钟
- Missing Value Indicators• 1分钟
- What is an Outlier?• 1分钟
- Outliers and Machine Learning Models• 2分钟
- Dealing with Outliers• 3分钟
- What Do Transformations Do?• 2分钟
- Simple Transformations• 2分钟
- Problems With Too Many Variables• 6分钟
- Types of Feature Engineering• 4分钟
- Distinctly Scaled Variables• 1分钟
- Feature Scaling• 4分钟
- Modeling Challenges with Machine Learning Data• 6分钟
- More about Modeling Challenges with Machine Learning Data• 7分钟
- Cross Validation Example• 1分钟
- Bootstrap Aggregation• 2分钟
- Model Fitting• 4分钟
- Effect of Magnitude of Coefficients• 2分钟
- Shrinking the Coefficients• 11分钟
- Learning Process• 10分钟
- Model Interpretability• 6分钟
15篇阅读材料• 总计150分钟
- More about Reinforcement Learning• 10分钟
- More Information: Common Algorithms• 10分钟
- More about Visualization Examples• 10分钟
- Demo: Modifying and Correcting Data• 10分钟
- Missing Data Causes• 10分钟
- Demo: Managing Missing Values• 10分钟
- Demo: Transforming Inputs• 10分钟
- Demo: Performing Feature Selection• 10分钟
- Think About It: Feature Scaling Method• 10分钟
- L1 versus L2• 10分钟
- Parameters versus Hyperparameters• 10分钟
- More about Estimation Criterion• 10分钟
- Demo: Running a Neural Network Model and Tuning Its Hyperparameters• 10分钟
- Why Interpretability Matters• 10分钟
- Lesson Summary• 10分钟
22个作业• 总计660分钟
- Question: Errors• 30分钟
- Question: Missing Values• 30分钟
- Question: Mean or Median Imputation• 30分钟
- Question: Outliers• 30分钟
- Question: Finding Parameter Values• 30分钟
- Question: Machine Learning Methods• 30分钟
- Question: Neural Networks• 30分钟
- Question: Data Preparation and Data Preprocessing• 30分钟
- Question: Data Visualization Challenges• 30分钟
- Question: Missing Value Representation• 30分钟
- Question: Missing Stock Exchange Price• 30分钟
- Question: Imputation Methods• 30分钟
- Question: Missing Value Imputation• 30分钟
- Question: Variable Transformations• 30分钟
- Question: Feature Engineering• 30分钟
- Question: Feature Scaling• 30分钟
- Question: Signal and Noise• 30分钟
- Question: Cross Validation• 30分钟
- Question: Model Complexity• 30分钟
- Question: Regularization Methods• 30分钟
- Question: Learning Terminology• 30分钟
- Question: Interpreting Black-Box Models• 30分钟
获得职业证书
将此证书添加到您的 LinkedIn 个人资料、简历或履历中。在社交媒体和绩效考核中分享。
位教师

提供方

提供方

Through innovative software and services, SAS empowers and inspires customers around the world to transform data into intelligence. SAS is a trusted analytics powerhouse for organizations seeking immediate value from their data. A deep bench of analytics solutions and broad industry knowledge keep our customers coming back and feeling confident. With SAS®, you can discover insights from your data and make sense of it all. Identify what’s working and fix what isn’t. Make more intelligent decisions. And drive relevant change.
从 Probability and Statistics 浏览更多内容
DDeepLearning.AI
课程
AAlberta Machine Intelligence Institute
课程
CCorporate Finance Institute
课程
OO.P. Jindal Global University
课程
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
常见问题
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
更多问题
提供助学金,

