This project analyzes an e-commerce database to derive insights about customer behavior, sales trends, and product performance. The analysis is built with Python, Pandas, and MySQL, using Matplotlib and Seaborn for visualization, and covers customer demographics, order trends, sales performance, and product popularity, among other insights.
- Installation
- Database Connection
- Analysis and Insights
  - Unique Cities
  - Orders in 2017
  - Sales per Category
  - Installment Payments
  - Customers by State
  - Orders per Month in 2018
  - Average Products per Order by City
  - Revenue Percentage per Category
  - Correlation Between Product Price and Order Count
  - Revenue by Seller
  - Moving Average of Order Values
  - Cumulative Sales
  - Year-over-Year Growth
  - Customer Retention Rate
  - Top Customers by Year
- Conclusion
- Clone the repository:

  ```shell
  git clone https://github.com/yourusername/ecommerce-analysis.git
  ```

- Navigate to the project directory:

  ```shell
  cd ecommerce-analysis
  ```

- Install the required dependencies:

  ```shell
  pip install -r requirements.txt
  ```
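A `requirements.txt` consistent with the imports used in the scripts would contain at least the following (versions unpinned here; pin them as needed for your environment):

```
pandas
matplotlib
seaborn
mysql-connector-python
```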
Ensure you have MySQL installed and running, then update the database connection details in the script to match your setup:

```python
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
import mysql.connector

db = mysql.connector.connect(
    host='localhost',
    user='root',
    password='your_password',  # replace with your MySQL password
    database='ecomerce'
)
cur = db.cursor()
```
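Each analysis in this project follows the same pattern: execute a SQL query with the cursor, then wrap the fetched rows in a DataFrame for further processing and plotting. A minimal sketch of the pattern, using the "Unique Cities" analysis as an example; the rows below are sample stand-ins for `cur.fetchall()`, and the `customers` table / `customer_city` column names are assumptions about the ingested schema:

```python
import pandas as pd

# In the real script the rows come from MySQL, e.g.:
# cur.execute("SELECT DISTINCT customer_city FROM customers")
# rows = cur.fetchall()
rows = [("sao paulo",), ("rio de janeiro",), ("curitiba",)]  # sample stand-in

# Wrap the result set in a DataFrame and count distinct cities
df = pd.DataFrame(rows, columns=["customer_city"])
print(df["customer_city"].nunique())  # → 3
```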
The `data_ingestion.py` script reads each CSV file, creates the corresponding MySQL table with column types inferred from the pandas dtypes, and inserts the data, replacing NaN values with SQL NULL.
Here is a brief overview of the script:
```python
import pandas as pd
import mysql.connector
import os

# Connect to the MySQL database
conn = mysql.connector.connect(
    host='localhost',
    user='root',
    password='your_password',  # replace with your MySQL password
    database='ecomerce'
)
cursor = conn.cursor()

# Folder containing the CSV files
folder_path = 'D:/Ecomerce project full data'

# Map a pandas dtype to an appropriate SQL column type
def get_sql_type(dtype):
    if pd.api.types.is_integer_dtype(dtype):
        return 'INT'
    elif pd.api.types.is_float_dtype(dtype):
        return 'FLOAT'
    elif pd.api.types.is_bool_dtype(dtype):
        return 'BOOLEAN'
    elif pd.api.types.is_datetime64_any_dtype(dtype):
        return 'DATETIME'
    else:
        return 'TEXT'

# csv_files is a list of (csv_file, table_name) pairs defined elsewhere
# in the script, e.g. [('customers.csv', 'customers'), ...]
for csv_file, table_name in csv_files:
    file_path = os.path.join(folder_path, csv_file)

    # Read the CSV file into a pandas DataFrame
    df = pd.read_csv(file_path)

    # Replace NaN with None to handle SQL NULL
    df = df.where(pd.notnull(df), None)

    # Debugging: check for NaN values
    print(f"Processing {csv_file}")
    print(f"NaN values before replacement:\n{df.isnull().sum()}\n")

    # Clean column names so they are valid SQL identifiers
    df.columns = [col.replace(' ', '_').replace('-', '_').replace('.', '_') for col in df.columns]

    # Generate the CREATE TABLE statement with appropriate data types
    columns = ', '.join([f'`{col}` {get_sql_type(df[col].dtype)}' for col in df.columns])
    create_table_query = f'CREATE TABLE IF NOT EXISTS `{table_name}` ({columns})'
    cursor.execute(create_table_query)

    # Insert DataFrame data into the MySQL table
    for _, row in df.iterrows():
        # Convert row to tuple and handle NaN/None explicitly
        values = tuple(None if pd.isna(x) else x for x in row)
        sql = f"INSERT INTO `{table_name}` ({', '.join(['`' + col + '`' for col in df.columns])}) VALUES ({', '.join(['%s'] * len(row))})"
        cursor.execute(sql, values)

    # Commit the transaction for the current CSV file
    conn.commit()

# Close the connection
conn.close()
```
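Once the data is ingested, the analyses listed above become pandas operations on query results. As one example, the "Moving Average of Order Values" can be computed with a rolling window. This is a sketch using sample data in place of the MySQL result set; the column names (`order_purchase_timestamp`, `payment_value`) are assumptions about the ingested CSVs:

```python
import pandas as pd

# Stand-in for rows fetched from MySQL, e.g.
# SELECT order_purchase_timestamp, payment_value FROM orders JOIN payments ...
data = {
    "order_purchase_timestamp": pd.to_datetime(
        ["2018-01-05", "2018-01-12", "2018-01-20", "2018-02-03", "2018-02-15"]
    ),
    "payment_value": [120.0, 80.0, 200.0, 150.0, 90.0],
}
df = pd.DataFrame(data).sort_values("order_purchase_timestamp")

# 3-order moving average of payment values; min_periods=1 keeps the
# first rows defined even before a full window is available
df["moving_avg"] = df["payment_value"].rolling(window=3, min_periods=1).mean()
print(df[["order_purchase_timestamp", "moving_avg"]])
```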