end-to-end-internship-data-analysis

End-to-End Analysis of Internship Dataset

Table of Contents

Project Overview

This project involves a comprehensive analysis of an internship dataset obtained from Kaggle. It focuses on cleaning and preprocessing the dataset, followed by exploratory data analysis (EDA), and conducting statistical, sectoral, and geospatial analysis. The key insights are visualized using an interactive Tableau dashboard.

Objective

The aim of this project is to:

These insights are presented through an interactive Tableau dashboard, enabling better visualization and decision-making.

Dataset

The dataset was obtained from Kaggle and consists of the following fields:

Features

Technologies

Analysis Breakdown

Data Cleaning and Preprocessing

The data was cleaned to remove null values, duplicate records, and invalid entries, ensuring consistent data quality for the analysis. Furthermore, the data was preprocessed and insights were pulled out using Microsoft Excel and Google BigQuery SQL.

The original dataset had several discrepancies, which were addressed by thoroughly cleaning, adding new fields, and eliminating redundant ones. This process was carried out using Microsoft Excel. Various insights were derived using Excel pivot tables and Google BigQuery SQL. The updated dataset Excel sheet features the various details obtained in individual sheets.

Exploratory Data Analysis

EDA was conducted to identify trends, distributions, and relationships between various fields in the dataset, such as average stipend across sectors and locations.

Statistical, Sectoral, and Geospatial Analysis

The following key analyses were performed:

Results

The analysis provided key insights into:

Tableau Dashboard

The Tableau dashboard created for this project showcases the key insights:

To view the Tableau dashboard, visit: View Tableau Dashboard