Fuel Economy Data Analysis

Table of Contents

  1. Introduction
  2. Prerequisites
  3. File Description
  4. Data Overview
  5. Data Analysis Process
  6. Results
  7. Acknowledgements

Introduction

The files being utilized for analysis are: all_alpha_08.csv and all_alpha_18.csv and can be found here. The information provided is a result of vehicle testing done by the U.S. Environmental Protection Agency, Office of Mobile Sources, National Vehicle and Fuel Emissions Laboratory. I’ll also answer the following questions which I have constructed after initial assessment of the data:

  1. Are more unique models using alternative sources of fuel? By how much?
  2. How much have vehicle classes improved in fuel economy?
  3. What are the characteristics of SmartWay vehicles? Have they changed over time?
  4. What features are associated with better fuel economy?
  5. For all of the models that were produced in 2008 that are still being produced now, how much has the mpg improved and which vehicle improved the most?

Fuel-Image

(Source: Shared Mobility News)

Prerequisites

  1. Pandas (for data loading and analysis)
  2. NumPy (for computing)
  3. Matplotlib (for visualizations)
  4. Seaborn (for visualizations)
  5. Jupyter (to run notebooks)

File Description

There are three folders:

  1. Code
    • Fuel Economy Data Analysis.ipynb- covers the entire analysis process performed to investigate both datasets as well as the documentation
  2. Data
    • all_alpha_08: Fuel Exonomy Dataset for the year 2008
    • all_alpha_18: Fuel Exonomy Dataset for the year 2018
    • clean_08: 2008 Dataset after assessing and cleaning the quality issues
    • clean_18: 2018 Dataset after assessing and cleaning the quality issues
    • combined_dataset: Master Dataset containing both 2008 and 2018 cleaned datasets
    • GreenVehicleGuideDocumentation.pdf- EPA Green Vehicle Guide and SmartWay List Documentation. This guide may aid better understanding of the columns utilized in this dataset and their purpose
  3. Presentation
    • fuel_economy_data_presentation.slides.html: Slideshow providing insights on questions
    • presentation.gif: Preview of presentation Presentation-gif
  4. Images
    • fuel-image.jpg

Data Overview

Note:

Data Analysis Process

  1. Data Gathering - Fuel Economy data was gathered for the years 2008 and 2018
  2. Data Assessment - Data was assessed for quality issues
  3. Data Cleaning
    • Dropping extraneous columns
    • Renaming columns for consistency
    • Filtering
    • Droping nulls
    • Deduping
    • Inspecting and correcting the data types
  4. Exploratory Visuals - Visually assessing the distribution of features and their correlation
  5. Drawing Conclusions
    • Total 5 questions were constructed and answered using a data driven approach

Results

1. Are more unique models using alternative sources of fuel? By how much?

2. How much have vehicle classes improved in fuel economy?

3. What are the characteristics of SmartWay vehicles? Have they changed over time?

4. What features are associated with better fuel economy?

The above combination of statistics suggest a better fuel ecomomy

5. For all of the models that were produced in 2008 that are still being produced now, how much has the mpg improved and which vehicle improved the most?

Acknowledgements

The datasets can be found here.