Welcome to the Statistical Methods and Data Analysis course! This course serves as an introduction to the statistical and computation methods that have become indispensable tools for those pursuing careers in public policy. Alongside offering the necessary background in basic and applied statistics, the course will also introduce you to the powerful R programming interface.

您将学到什么
• Explain the basic statistical reasoning involved in data analysis.
• Explain the applications of data analysis with examples from published research.
• Execute the data analysis projects using R.
您将获得的技能
您将学习的工具
要了解的详细信息

添加到您的领英档案
29 项作业
了解顶级公司的员工如何掌握热门技能

该课程共有8个模块
Statistical methods, by definition, are tools for identifying patterns in large datasets. This module takes the first step towards statistical analysis by exploring various strategies for visualizing data, an increasingly important skill in today’s era of big data. This module explains the different forms of data, types of plots, and charts used to depict the different forms of data. In addition, the module focuses on different visualization techniques appropriate for big data.
涵盖的内容
10个视频3篇阅读材料4个作业
10个视频•总计92分钟
- Statistical Methods and Data Analysis•5分钟
- Nominal, Ordinal, and Interval-Ratio Measurement•12分钟
- Bar Plot, Pie Chart, and Histogram•10分钟
- Stacked Bar Chart, Multiple Bar Chart, and Scatter Plot•9分钟
- Box Plot, Violin Plot, and Ridge plot•13分钟
- Big Data•10分钟
- Word Frequency Plot and Word Cloud•8分钟
- Sentiment Analysis•7分钟
- N-Gram Plot•9分钟
- Twitter Analysis•9分钟
3篇阅读材料•总计30分钟
- Course Overview•10分钟
- Pre-reading material•10分钟
- Essential Reading Material – Data Visualization•10分钟
4个作业•总计18分钟
- Practice Quiz•2分钟
- Practice Quiz•2分钟
- Practice Quiz•4分钟
- Practice Quiz•10分钟
While data visualization gives us a ‘first cut’ in the empirical world, knowing what the data ‘looks like’ will not take us far towards identifying relationships between variables — the focal point of policymaking. At the minimum, identification requires that the researcher be able to summarize large amounts of information in the form of descriptive statistics. This module explains the measures of central tendency and dispersion for ungrouped data and for grouped data. The measures of central tendency and dispersion for ungrouped data include mean, median, mode, standard deviation, skewness, and kurtosis. The means of central tendency and dispersion for grouped data include grouped mean, grouped standard deviation, grouped mode, and grouped median.
涵盖的内容
8个视频1篇阅读材料3个作业
8个视频•总计63分钟
- Measures of Central Tendency and Dispersion: Introduction•5分钟
- Mode and Variation Ratio•9分钟
- Median and Quartile Range•8分钟
- Mean and Standard Deviation•7分钟
- Skewness and Kurtosis•12分钟
- Time-Series Data•9分钟
- Grouped Mean and Standard Deviation•7分钟
- Grouped Mode and Median•6分钟
1篇阅读材料•总计10分钟
- Essential Reading Material – Descriptive Statistics•10分钟
3个作业•总计46分钟
- Graded Quiz•30分钟
- Practice Quiz•12分钟
- Practice Quiz•4分钟
Except in the rarest of cases when data on the entire population is available for all attributes of interest to the researcher, social scientists must draw inferences about a population from a sample drawn from that population. This module focuses on the statistical reasoning involved in studying the uncertainty attached to sample statistics. For making inferences about the population from a sample, the module explains the fundamentals of probability theory. In addition, the module explains the concepts of random variables and function of random variables. Finally, the module covers the concepts and applications of the binomial and normal distributions.
涵盖的内容
9个视频1篇阅读材料5个作业
9个视频•总计65分钟
- Compound Events•11分钟
- Axioms of Probability and Addition Rule•7分钟
- Independence and Multiplication Rule•8分钟
- Bayes’ Theorem•9分钟
- Introduction to Random Variables•4分钟
- Function of Random Variables•6分钟
- Binomial Distribution•7分钟
- Normal Distribution•6分钟
- Normal Approximation•7分钟
1篇阅读材料•总计10分钟
- Essential Reading Material – Probability Distributions•10分钟
5个作业•总计78分钟
- Graded Quiz•60分钟
- Practice Quiz•4分钟
- Practice Quiz•4分钟
- Practice Quiz•4分钟
- Practice Quiz•6分钟
This module discusses the various strategies available to researchers for drawing samples from a population and the first principles involved in determining sample size. The module explains the sampling strategies for sampling from a population. In addition, the module explains how to measure the accuracy of sample estimates. Finally, the module focuses on statistical inference. The goal of statistical inference is to make a statement about something that is not observed based on something that is observed, within a certain level of uncertainty. The module will discuss the Central Limit Theorem (CLT) and the concept of the confidence interval, which allow us to make such statements.
涵盖的内容
6个视频1篇阅读材料4个作业
6个视频•总计51分钟
- Sampling Strategies•14分钟
- Parameters and Statistics•6分钟
- Accuracy of Sample Percentage•8分钟
- Accuracy of Sample Mean•6分钟
- Central Limit Theorem•8分钟
- Confidence Interval•9分钟
1篇阅读材料•总计10分钟
- Essential Reading Material – Sampling•10分钟
4个作业•总计72分钟
- Graded Quiz•60分钟
- Practice Quiz•4分钟
- Practice Quiz•4分钟
- Practice Quiz•4分钟
This module introduces the critical distinction between experimental data and observational data. In addition, the module explores statistical inference in the context of experimental data using tests of significance. You will also learn about observational data and the problem of confounding, controlled experiment, and natural experiment. The module focuses on the concepts and methods for analyzing statistical significance, including analytical framework, one sample t-test, two sample t-test, and ANOVA.
涵盖的内容
8个视频4篇阅读材料3个作业
8个视频•总计77分钟
- Observational Data and the Problem of Confounding•6分钟
- Controlled Experiment•12分钟
- Natural Experiment•12分钟
- Analytical Framework•8分钟
- One Sample T-Test•11分钟
- Two Sample T-Test•8分钟
- Anova Test•8分钟
- Experiments and Statistical Significance•12分钟
4篇阅读材料•总计40分钟
- Essential Reading Material – Tests of Significance•10分钟
- Recommended Reading Material – Observational Data and Experiments•10分钟
- Essential Reading Material – Tests of Significance•10分钟
- Recommended Reading Material – Concepts and Methods for Analyzing Statistical Significance •10分钟
3个作业•总计76分钟
- Graded Quiz•60分钟
- Practice Quiz•6分钟
- Practice Quiz•10分钟
This module introduces the foundational model for statistical inference with observational data, namely, the ordinary least squares (OLS) regression, paying particular attention to the conditions under which the OLS estimator is the best linear unbiased estimator (BLUE). You will learn about the concept of association, which helps to understand the relationship between two variables. You will also learn about the measures of association appropriate for each variable type: lambda coefficient for nominal variables, gamma coefficient for ordinal variables, and correlation coefficient for interval-ratio variables. Finally, the module focuses on regression analysis by explaining bivariate OLS and multivariate OLS.
涵盖的内容
8个视频2篇阅读材料4个作业
8个视频•总计82分钟
- Measures of Association•5分钟
- Lambda Coefficient•12分钟
- Gamma Coefficient•11分钟
- Correlation Coefficient•6分钟
- Bivariate OLS•12分钟
- Multivariate OLS•15分钟
- BLUE Assumptions•7分钟
- Extensions of Basic OLS•13分钟
2篇阅读材料•总计20分钟
- Essential Reading Material – Correlation and Regression•10分钟
- Recommended Reading Material – OLS Assumptions and Extensions•10分钟
4个作业•总计76分钟
- Graded Quiz•60分钟
- Practice Quiz•8分钟
- Practice Quiz•4分钟
- Practice Quiz•4分钟
This module focuses on advanced modeling strategies in settings where the best linear unbiased estimator (BLUE) assumptions are violated. You will learn about how to get valid ordinary least squares (OLS) estimates when one or the other key assumption on regression errors for OLS estimates to be BLUE is violated. In particular, you will learn how to detect and correct OLS estimates for reverse causality, heteroscedasticity, and serial correlation. Next, under violations of BLUE assumptions on model and variable specification, you will learn how to model nominal and ordinal dependent variables.
涵盖的内容
5个视频4篇阅读材料3个作业
5个视频•总计68分钟
- Reverse Causality•17分钟
- Heteroscedasticity•12分钟
- Serial Correlation•10分钟
- Nominal Dependent Variable•18分钟
- Ordinal Dependent Variable•10分钟
4篇阅读材料•总计40分钟
- Essential Reading Material – Measurement Error, Complex Residual Structures, and Limited Dependent Variables•10分钟
- Recommended Reading Material – Violations of BLUE Assumptions: Errors•10分钟
- Essential Reading Material – Measurement Error, Complex Residual Structures, and Limited Dependent Variables•10分钟
- Recommended Reading Material – Violations of BLUE Assumptions: Model and Variable Specification•10分钟
3个作业•总计70分钟
- Graded Quiz•60分钟
- Practice Quiz•6分钟
- Practice Quiz•4分钟
Running regression models on large-scale datasets with millions of observations and thousands of variables can be a daunting task. This module examines the strategies for building regression models when dealing with such datasets. For conducting big data regression analysis with nominal dependent variables, you will learn the concepts of decision tree, pruning, cross-validation, and random forest. You will also learn about the penalized regression approach, which is useful for running big data regressions when the dependent variable is an interval-ratio variable.
涵盖的内容
5个视频2篇阅读材料3个作业
5个视频•总计40分钟
- Decision Tree•11分钟
- Pruning•8分钟
- Cross-validation•8分钟
- Random Forest•7分钟
- Penalized Regression•7分钟
2篇阅读材料•总计20分钟
- Essential Reading Material – Variable Selection•10分钟
- Course Wrap- Up•10分钟
3个作业•总计70分钟
- Graded Quiz•60分钟
- Practice Quiz•8分钟
- Practice Quiz•2分钟
攻读学位
课程 是 O.P. Jindal Global University提供的以下学位课程的一部分。如果您被录取并注册,您已完成的课程可计入您的学位学习,您的学习进度也可随之转移。
攻读学位
课程 是 O.P. Jindal Global University提供的以下学位课程的一部分。如果您被录取并注册,您已完成的课程可计入您的学位学习,您的学习进度也可随之转移。
O.P. Jindal Global University
M.A. in Public Policy
学位 · 24 - 36 months
必须成功申请并注册。资格要求适用。各院校会根据您现有的学分情况,确定完成本课程后可计入学位要求的学分。单击特定课程了解更多信息。
位教师

提供方

提供方

O.P. Jindal Global University is recognised as an Institution of Eminence by the Ministry of Education, Government of India. It is also ranked the No. 1 Private University in India in the QS World University Rankings 2021. The university has 9000+ students across 12 schools that offer 52 degree programs. The university maintains a 1:9 faculty-student ratio. It is a research-intensive university, deeply committed to institutional values of interdisciplinary and innovative learning, pluralism and rigorous scholarship, globalism, and international engagement.
人们为什么选择 Coursera 来帮助自己实现职业发展

Felipe M.

Jennifer J.

Larry W.

Chaitanya A.
从 Data Science 浏览更多内容
UUniversity of Colorado Boulder
课程
UUniversity of Pittsburgh
课程
MMacquarie University
课程

