This project provides an interactive analysis of the Global Power Plant Database using Python and Streamlit. It enables users to explore energy generation patterns worldwide and includes predictive modeling capabilities.
- Data Overview: Examine basic statistics and initial data exploration
- Data Cleaning: Handle missing values and prepare data for analysis
- Exploratory Analysis: Interactive visualizations of global power plant distributions
- Feature Engineering: Create derived features for enhanced analysis
- Model Building: Predictive modeling for power generation
- Model Evaluation: Performance metrics and visualization tools
- Python 3.11
- Streamlit
- Pandas
- NumPy
- Plotly
- Scikit-learn
- Matplotlib
- Seaborn
- Clone the repository
- Install dependencies:
pip install -r requirements.txt
- Run the Streamlit app:
streamlit run app.py
├── app.py # Main Streamlit application
└── global_power_plant_database.csv # Dataset
- Global distribution of power plants
- Capacity and generation analysis
- Fuel type distribution
- Temporal analysis of commissioning dates
- Interactive maps of power plant locations
- Time series analysis of power generation
- Capacity factor calculations
- Performance metrics visualization
- Predictive modeling for power generation
- Feature importance analysis
- Model performance evaluation
The analysis uses the Global Power Plant Database, which contains information about power plants worldwide including their capacity, location, generation, and fuel types.
MIT License
- Fork the repository
- Create your feature branch
- Commit your changes
- Push to the branch
- Create a new Pull Request