This project collects news articles from NewsAPI, detects their language, and prepares them for sentiment analysis using PySpark. Results are stored in Firebase Firestore, simulating a cloud-connected data pipeline.
- Fetch news articles using the NewsAPI
- Detect language of each article using
langdetect
- Store article titles and sentiment scores in Firebase Firestore
- Load and prepare the dataset using PySpark
langdetect
β for language detectionfirebase-admin
β to connect with Firebasepyspark
β to process and display big datarequests
β to access NewsAPIdatetime
,getpass
β for automation