-
Notifications
You must be signed in to change notification settings - Fork 0
Add entity to NewsAPI source resolution table and related SQL queries #11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Add Terraform configuration and infrastructure setup for Digital Ocean
Add Spacy installation step to PR tests workflow
Develop/gold layer newsapi
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR adds new gold layer scripts to build comprehensive reporting tables by joining entity mapping data with NewsAPI and known entities data, along with several new SQL queries for enhanced insights and updated documentation.
- Adds gld_04_load_newsapi.py for mapping entities to news sources.
- Adds gld_04_load_entities.py for compiling entity affiliation data and its reverse relationships.
- Updates local_run.sh, several SQL query files, and README.md to reflect new data flows and reporting capabilities.
Reviewed Changes
Copilot reviewed 11 out of 11 changed files in this pull request and generated 2 comments.
Show a summary per file
File | Description |
---|---|
semantic_medallion_data_platform/gold/gld_04_load_newsapi.py | New script to join and write entity-to-NewsAPI source resolution table. |
semantic_medallion_data_platform/gold/gld_04_load_entities.py | New script to create a comprehensive entity affiliations table. |
local_run.sh | Updates to execute the new gold layer processes. |
data/metabase_questions/Entity_NewsAPI_Trends.sql | New SQL queries for entity mention trends and moving averages. |
data/metabase_questions/Entity_NewsAPI_Insights.sql | New SQL queries for insights on entity match quality and news source timelines. |
data/metabase_questions/Entity_NewsAPI_Analysis.sql | SQL queries for basic entity analysis and news source relationships. |
data/metabase_questions/Entity_Analysis.sql | SQL queries for retrieving basic entity details and affiliation count. |
README.md | Expanded documentation with architecture diagrams and usage instructions. |
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
No description provided.