Uncovering financial trends across the United States through multi-format data exploration, SQL queries, and statistical insights.
This project analyzes U.S. household income data using multiple file formats—Excel, SQL, JSON, and CSV—to extract meaningful insights into regional income distribution and demographic patterns.
- Ingest and clean raw income data across diverse formats
- Use SQL to perform data exploration, filtering, and aggregation
- Generate insights on income trends by region and demographics
- Document a transparent, repeatable analytical workflow
📦 US-Household-Income-Analysis/ ├── 📁 raw_data_part_1/ │ ├── USHouseholdIncome.xls │ ├── USHouseholdIncome.sql │ ├── USHouseholdIncome.json │ └── USHouseholdIncome.csv ├── 📁 raw_data_part_2/ │ ├── USHouseholdIncome_Statistics.sql │ ├── USHouseholdIncome_Statistics.json │ └── USHouseholdIncome_Statistics.csv ├── 📁 analysis_queries/ │ └── US_household-income_script.sql ├── 📁 output/ │ └── US_household_income_output.csv
- Regional disparities in median household income
- Correlation between income levels, state, and county attributes
- Identification of top-performing and underserved regions
- SQL (JOINs, aggregations, filtering, transformations)
- Data wrangling & validation
- Multi-format parsing (Excel, JSON, CSV)
- Git & GitHub for project versioning
- Built experience in working with diverse datasets
- Designed and executed complex SQL queries
- Learned to translate raw data into actionable conclusions
- Strengthened documentation and version control habits
🚀 Ready to explore the data behind U.S. household income? Dive into the code and see what stories emerge from the numbers.