Add entity to NewsAPI source resolution table and related SQL queries #11

Merged: 9 commits, Jun 6, 2025

ByteMeDirk
Owner

No description provided.

@ByteMeDirk added this to the Data Platform milestone Jun 6, 2025
@ByteMeDirk requested a review from Copilot June 6, 2025 09:58
@ByteMeDirk self-assigned this Jun 6, 2025
@ByteMeDirk added the enhancement label Jun 6, 2025
Contributor

@Copilot Copilot AI left a comment

Pull Request Overview

This PR adds gold-layer scripts that build reporting tables by joining entity mapping data with NewsAPI data and known-entities data. It also adds several SQL queries for reporting and updates the documentation.

  • Adds gld_04_load_newsapi.py for mapping entities to news sources.
  • Adds gld_04_load_entities.py for compiling entity affiliation data and its reverse relationships.
  • Updates local_run.sh, several SQL query files, and README.md to reflect new data flows and reporting capabilities.
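The scripts themselves are not shown in this conversation. As a rough illustration of the kind of join described above, here is a minimal sketch in plain Python. Every name in it (`entity_to_article`, `articles`, `entities`, `resolve_entity_sources`, and all field names) is hypothetical and chosen for illustration; the project's real schema and processing engine may differ.

```python
# Hypothetical sample rows; the real column names are not shown in this PR.
entity_to_article = [
    {"entity_id": "e1", "article_id": "a1"},
    {"entity_id": "e1", "article_id": "a2"},
    {"entity_id": "e2", "article_id": "a2"},
    {"entity_id": "e3", "article_id": "a9"},  # no matching article
]
articles = {
    "a1": {"source_name": "Example Wire"},
    "a2": {"source_name": "Sample Daily"},
}
entities = {"e1": "Acme Corp", "e2": "Jane Doe", "e3": "Orphan Inc"}


def resolve_entity_sources(mapping, articles, entities):
    """Inner-join mapping rows to articles and entities, one row per match."""
    resolved = []
    for row in mapping:
        article = articles.get(row["article_id"])
        name = entities.get(row["entity_id"])
        if article is None or name is None:
            continue  # inner-join semantics: drop rows with no counterpart
        resolved.append(
            {
                "entity_id": row["entity_id"],
                "entity_name": name,
                "source_name": article["source_name"],
            }
        )
    return resolved


rows = resolve_entity_sources(entity_to_article, articles, entities)
for r in rows:
    print(r["entity_name"], "->", r["source_name"])
```

Dropping unmatched rows is one possible choice; a reporting table that needs to show entities with no coverage would keep them with an empty source instead.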

Reviewed Changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| semantic_medallion_data_platform/gold/gld_04_load_newsapi.py | New script to join and write entity-to-NewsAPI source resolution table. |
| semantic_medallion_data_platform/gold/gld_04_load_entities.py | New script to create a comprehensive entity affiliations table. |
| local_run.sh | Updates to execute the new gold layer processes. |
| data/metabase_questions/Entity_NewsAPI_Trends.sql | New SQL queries for entity mention trends and moving averages. |
| data/metabase_questions/Entity_NewsAPI_Insights.sql | New SQL queries for insights on entity match quality and news source timelines. |
| data/metabase_questions/Entity_NewsAPI_Analysis.sql | SQL queries for basic entity analysis and news source relationships. |
| data/metabase_questions/Entity_Analysis.sql | SQL queries for retrieving basic entity details and affiliation count. |
| README.md | Expanded documentation with architecture diagrams and usage instructions. |
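The trends queries are described as computing moving averages over entity mentions, but the SQL is not reproduced here. The sketch below shows one common form, a trailing moving average over daily counts, in plain Python. The function name `moving_average`, the window size, and the sample counts are assumptions for illustration and are not taken from the PR.

```python
from collections import deque


def moving_average(counts, window=3):
    """Trailing moving average; early values average over the days seen so far."""
    recent = deque(maxlen=window)  # keeps only the last `window` counts
    averages = []
    for count in counts:
        recent.append(count)
        averages.append(sum(recent) / len(recent))
    return averages


# Hypothetical daily mention counts for a single entity.
daily_mentions = [2, 4, 6, 8]
trend = moving_average(daily_mentions, window=3)
print(trend)  # [2.0, 3.0, 4.0, 6.0]
```

Averaging over a partial window at the start is the behavior a SQL window frame such as `ROWS BETWEEN 2 PRECEDING AND CURRENT ROW` would also give; whether the PR's queries use that frame is not stated.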

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@ByteMeDirk merged commit a1f750b into main Jun 6, 2025
1 check passed
@ByteMeDirk deleted the feature/gold-layer-newsapi branch June 6, 2025 11:37