This is the solution to PS 1 optional exercise.
Reconstruct the simulated data frame from Lecture 2, then perform the necessary data wrangling to transform the individual-level data frame into a household-level data frame.
- Recreate the initial data frame as shown in the provided screenshot. Use simulated data to generate a new data frame containing the specified information.
- Develop functions to derive the features displayed in the household-level data frame (the second screenshot).
- Adhere to Git and GitHub workflow practices:
- Create a separate branch for each new function
- Submit a Pull Request (PR) for each function
- Merge the PR into the main branch upon completion
- You may complete this assignment individually or in teams.
- Utilise Git and GitHub
- Implement a modular code structure to enhance maintainability:
- Create functions within separate modules
- Import these functions into your main orchestration script
- Use the imported functions to execute the steps "create_simulated_data()" and "generate_household_level_data()" (you are free to come up with better naming)
Question: What are some potential use cases for generating simulated data?