Udacity Data Analyst Repository

Udacity Data Analyst projects showcase my programming skills in Python and SQL. I will utilise my ability to work with messy, complex datasets then, manipulate and prepare data for analysis, and create visualizations for data exploration. Finally, I’ll use my data skills to tell a story with data.

SQL

Exploring Weather Trends

The Exploring Weather Trends project will demonstrate using SQL and downloading data from a database. We will analyze local and global temperature data and compare the temperature trends where we live to overall global temperature trends.

PYTHON

Investigating TMDB Movies Dataset

In this project, we will choose one of Udacity’s curated datasets and investigate it using NumPy and pandas. We will complete the entire data analysis process, starting by posing a question and finishing by sharing your findings.

PYTHON

Analyze Experiment Results

In this project, we will be provided a dataset reflecting data collected from an landing page experiment. We’ll use statistical techniques to answer questions about the data and report our conclusions and recommendations in a report. We will also use machine learning and logistic regression to predict how various factors might affect conversions depending on the landing page.

PYTHON

We Rate Dogs Twitter Analysis

Real-world data rarely comes clean. Using Python, we’ll gather data from a variety of sources, assess its quality and tidiness, then clean it. We will document our wrangling efforts in a Jupyter Notebook, plus showcase them through analyses and visualizations using Python and SQL.

PYTHON

The Truth About Airline Statistics

This dataset reports flights in the United States from 1987 to 2008 including arrival and departure delays, carriers, and other data. For our analysis, we are sampling the three year period between 89′ to 91′ as our sample dataset as the whole dataset is too large to practically observe with Jupyter. Findings will be visualised in univariate, bivariate and, multivariate plots.

Udacity Data Scientist Repository

Udacity Data Scientist projects showcase the programming fundamentals required for a career in data science. I will use Python, SQL, Command-Line, and Git to complete them.

PYTHON

Predicting Airbnb Prices In Boston

Airbnb is a popular way for homeowners to make money by renting out their properties or even spare rooms in their own homes. More people are considering joining Airbnb to profit, but how will they know what to consider to make their property an attractive proposition for customers?

PYTHON

Disaster Response Pipeline

In this project, we will choose one of Udacity’s curated datasets and investigate it using NumPy and pandas. We will complete the entire data analysis process, starting by posing a question and finishing by sharing your findings.

PYTHON

IBM Recomendation Engine

In this project, we used data engineering skills to analyze disaster data from Figure Eight. This data was used to build a model for an API that classifies disaster messages. A machine learning pipeline was created to categorize these events so that the messages could be sent to an appropriate disaster relief agency. This project also includes an interactive web app.

PYTHON

Starbucks Capstone Project

This capstone project uses simulated data to provide insight into how people make purchasing decisions, and how those decisions are influenced by promotional offers. The data contains various events, including receiving offers, opening offers, and making purchases. Our task is to identify which groups of people are most responsive to each type of offer by finding traits and purchasing patterns.