Build a job-ready portfolio with these five beginner-friendly data analysis projects.
![[Featured Image] A person wearing a pink al-amira sits on a grey sofa, working on a laptop computer on their data analytics projects.](https://d3njjcbhbojbot.cloudfront.net/api/utilities/v1/imageproxy/https://images.ctfassets.net/wp1lcwdav1p1/b2mlsGI3LyFqNoenyNpGD/f1ec8b37b56da9905ee774c6b6f18ec4/data_analyst_projects.png?w=1500&h=680&q=60&fit=fill&f=faces&fm=jpg&fl=progressive&auto=format%2Ccompress&dpr=1&w=1000)
Data analytics projects allow you to develop content for your portfolio that demonstrates your skills and experience to hiring managers.
Data analytics portfolio projects can focus on web scraping, data cleaning, exploratory data analysis, sentiment analysis, or data visualization.
Access public data sets and free data visualization tools as you embark on a portfolio project.
You can add data visualizations and mini projects to your portfolio to showcase your range of skills to potential employers.
Discover five types of projects you should include in your portfolio, especially if you’re just starting out. If you’re ready to enhance your skills, consider enrolling in the Google Data Analytics Professional Certificate. You'll have the chance to build your data cleaning, visualization, and analysis skills in as little as six months. Upon completion, you’ll have earned a career credential to add to your resume.
As an aspiring data analyst, you’ll want to demonstrate a few key skills in your portfolio. These data analytics project ideas reflect the tasks often fundamental to many data analyst roles.
While you’ll find no shortage of excellent (and free) public data sets on the internet, you might want to show prospective employers that you’re able to find and scrape your own data as well. Plus, knowing how to scrape web data means you can find and use data sets that match your interests, regardless of whether or not they’ve already been compiled.
If you know some Python, you can use tools like Beautiful Soup or Scrapy to crawl the web for interesting data. If you don’t know how to code, don’t worry. You’ll also find several tools that automate the process (many offer a free trial), like Octoparse or ParseHub.
If you’re unsure where to start, here are some websites with interesting data options to inspire your project:
Wikipedia
Job portals
Example web scraping project: Todd W. Schneider of Wedding Crunchers scraped some 60,000 New York Times wedding announcements published from 1981 to 2016 to measure the frequency of specific phrases.
Tip: Anytime you’re scraping data from the internet, remember to respect and abide by each website’s terms of service. Limit your scraping activities so as not to overwhelm a company’s servers, and always cite your sources when you present your data findings in your portfolio.
Want insight into how employers view data analysts? Learn more about how data analysts and their portfolios are viewed by hiring managers in this lecture from Google's Data Analytics Professional Certificate:
A significant part of your role as a data analyst is cleaning data to prepare it for analysis. Data cleaning (also called data scrubbing) is the process of removing incorrect and duplicate data, managing any holes in the data, and ensuring consistent formatting.
As you look for a data set to practice cleaning, look for one that includes multiple files gathered from multiple sources without much curation. Some sites where you can find “dirty” data sets to work with include:
CDC Wonder
Data.gov
World Bank
Data.world
/r/datasets
Example data cleaning project: This Medium article outlines how data analyst Raahim Khan cleaned a set of daily-updated statistics on trending YouTube videos.
Learn how to collect, clean, sort, evaluate, and visualize data with the Meta Data Analyst Professional Certificate.
Data analysis is all about answering questions with data. Exploratory data analysis, or EDA for short, helps you explore what questions to ask. This could be done separately from or in conjunction with data cleaning. Either way, you’ll want to accomplish the following during these early investigations.
Ask lots of questions about the data.
Discover the underlying structure of the data.
Look for trends, patterns, and anomalies in the data.
Test hypotheses and validate assumptions about the data.
Think about what problems you could potentially solve with the data.
Example exploratory data analysis project: This data analyst took an existing data set on American universities from Kaggle in 2013 and used it to explore what makes students prefer one university over another.
An EDA project is an excellent time to take advantage of the wealth of public datasets available online. Here are 10 fun and free datasets to get you started in your explorations.
1. National Centers for Environmental Information: Dig into the world’s largest provider of weather and climate data.
2. World Happiness Report 2025: What makes the world’s happiest countries so happy?
3. NASA: If you’re interested in space and earth science, see what you can find among the tens of thousands of public datasets made available by NASA.
4. US Census: Learn more about the people and economy of the United States with the latest Census data from 2020.
5. FBI Crime Data Explorer (CDE): Explore crime data collected by more than 18,000 law enforcement agencies.
6. World Health Organization COVID-19 Dashboard: Track the latest coronavirus numbers by country or WHO region.
7. Latest Netflix Data: This Kaggle dataset (updated in April 2021) includes movie data broken down into 26 attributes.
8. Google Books Ngram: Download the raw data from the Google Books Ngram to explore phrase trends in books published from 1800 to 2022.
9. NYC Open Data: Discover New York City through its many publicly available datasets on topics like the Central Park squirrel population and motor vehicle collisions.
10. Yelp Open Dataset: See what you can find while exploring this collection of Yelp user reviews, check-ins, and business attributes.
Sentiment analysis, typically performed on textual data, is a technique in natural language processing (NLP) for determining whether data is neutral, positive, or negative. It may also be used to detect a particular emotion based on a list of words and their corresponding emotions (known as a lexicon).
This type of analysis works well with public review sites and social media platforms, where people are likely to offer public opinions on various subjects.
To get started exploring what people feel about a certain topic, you can start with sites like:
Amazon (product reviews)
Rotten Tomato (movie reviews)
News sites
Example sentiment analysis project: This blog post on Medium details how one business analyst used Python to perform a sentiment analysis of product reviews using NLP.
Read more: What Is Sentiment Analysis in Python?
Learn how to use Google Cloud for sentiment analysis from Google itself in their short, interactive project, Entity and Sentiment Analysis with the Natural Language API.
Humans are visual creatures, which makes data visualization a powerful tool for transforming data into a compelling story to encourage action. Great visualizations are not only fun to create, but they also have the power to make your portfolio look beautiful.
Example data visualization project: Data analyst Hannah Yan Han visualizes the skill level required for 60 different sports to determine which are the toughest.
You don’t need to pay for advanced visualization software to start creating stellar visuals, either. These are just a few of the free visualization tools you can use to start telling a story with data:
1. Tableau Public: Tableau ranks among the most popular visualization tools. Use the free version to transform spreadsheets or files into interactive visualizations (here are some examples from April 2021).
2. Google Charts: This gallery of interactive charts and data visualization tools makes it easy to embed visualizations within your portfolio using HTML and JavaScript code. A robust Guides section walks you through the creation process.
3. Datawrapper: Copy and paste your data from a spreadsheet or upload a CSV file to generate charts, maps, or tables—no coding required. The free version allows you to create unlimited visualizations to export as PNG files.
4. D3 (Data-Driven Documents): With a bit of technical know-how, you can do a ton with this JavaScript library.
5. RAWGraphs: This open-source web app makes it easy to turn spreadsheets or CSV files into a range of chart types that might otherwise be difficult to produce. The app even provides sample data sets for you to experiment with.
There’s nothing wrong with populating your portfolio with mini-projects highlighting individual skills. But if you’ve scraped the web for your own data, you might also consider using that same data to complete an end-to-end project. To do this, take the data you scraped and apply the main steps of data analysis to it—clean, analyze, and interpret.
This can show a potential employer that you have the essential skills of a data analyst and know how they fit together.
We published a newsletter about becoming a data analyst. Subscribe to keep up with the latest career tips with Career Chat.
Learn more about the field of data analytics by subscribing to our YouTube channel. Then, check out these helpful resources:
Bookmark this page: Data Analysis Terms and Definitions
Build your skills: How to Use For Loops in Python: Step by Step
Learn from an expert: 8 Questions with an Expert: Google Financial Data Analyst
Accelerate your career growth with a Coursera Plus subscription. When you enroll in either the monthly or annual option, you’ll get access to over 10,000 courses.
对于刚刚开始数据解析的人来说,有很多好书。尤其是以下三本书,对该领域的关键方面提供了无障碍的介绍:Anil Maheshwari 博士所著的《Data Analytics Made Accessibility》、Numsense!Annalyn Ng 和 Kenneth Soo 合著的《Data Science for the Layman: No Math Added》和《Python for Everybody:Charles Russell Severance 博士著的《Python 3 中的数据探索》。
作为阅读的补充,初学者还可以考虑参加密歇根大学提供的、由 Severance 博士亲自讲授的在线Python for Everybody 专项课程。
数据可视化是通过可视化手段以图形表示数据的过程。常见的数据可视化形式包括使用图形、图表和图解来直观地表示原本抽象的数据 Set。如今,数据可视化已被视为数据解析领域的一项关键技能。
初学数据分析师应确保自己对结构化查询语言 (SQL)、Microsoft Excel 以及 R 或 Python 有扎实的技术理解。此外,他们还应具备批判性思维、自信的表达能力,并知道如何以 Visualization 的方式讲述数据故事。了解更多有关这些及其他关键数据分析师技能的信息。
编辑团队
Coursera 的编辑团队由经验丰富的专业编辑、作者和事实核查人员组成。我们的文章都经过深入研究和全面审核,以确保为任何主题提供值得信赖的信息和建议。我们深知,在您的教育或职业生涯中迈出下一步时可能...
此内容仅供参考。建议学生多做研究,确保所追求的课程和其他证书符合他们的个人、专业和财务目标。