metechmohit/data-extractor-agent

Extractor AI Agent

This tool extracts specific information about companies or other entities from the web. It combines web search with AI processing to gather and analyze information about each entity you specify.

Features

  • Multiple data source support (CSV, Google Sheets)
  • Configurable search queries
  • AI-powered information extraction
  • Batch processing capability
  • Export results to CSV or Google Sheets
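
To make the "configurable search queries" and "batch processing" features concrete, here is a minimal sketch of how one query template might be expanded over a column of entities read from a CSV. The `{entity}` placeholder, the `company` column name, and the helper names `load_entities` and `build_queries` are assumptions for illustration; they are not taken from this repository's code.

```python
import csv
import io


def load_entities(csv_text: str, column: str) -> list[str]:
    """Return the non-empty values of one column from CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    entities = []
    for row in reader:
        value = (row.get(column) or "").strip()
        if value:
            entities.append(value)
    return entities


def build_queries(entities: list[str], template: str) -> dict[str, str]:
    """Fill the hypothetical {entity} placeholder once per entity."""
    return {entity: template.format(entity=entity) for entity in entities}


# Hypothetical input: a CSV with a single "company" column.
sample_csv = "company\nAcme Corp\nGlobex\n"
entities = load_entities(sample_csv, "company")
queries = build_queries(entities, "Find the contact email of {entity}")

for entity, query in queries.items():
    print(f"{entity}: {query}")
```

Each generated query would then be sent to the search step and the results handed to the AI extraction step, one entity at a time.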

Setup

  1. Clone the repository
  2. Install dependencies:
    pip install -r requirements.txt
  3. Create a .env file with your API keys:
    SERPAPI_KEY=your_serp_api_key
    GROQ_API_KEY=your_groq_api_key
    
  4. For Google Sheets integration:
    • Place your credentials.json file in the project root
    • Run the application to authenticate and generate token.json
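
As a quick check that step 3 worked, the sketch below reads `SERPAPI_KEY` and `GROQ_API_KEY` from the environment or from a `.env` file and fails early if either is missing. It uses only the standard library and a simplified `KEY=value` parser. The application itself may load these values differently (for example through a dotenv package), so the helper names `parse_env_file` and `load_api_keys` are hypothetical.

```python
import os
from pathlib import Path

# The two key names come from the .env example in the Setup steps.
REQUIRED_KEYS = ("SERPAPI_KEY", "GROQ_API_KEY")


def parse_env_file(path: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values: dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for line in env_path.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_api_keys(path: str = ".env") -> dict[str, str]:
    """Return both API keys, preferring real environment variables."""
    file_values = parse_env_file(path)
    keys = {
        name: os.environ.get(name) or file_values.get(name, "")
        for name in REQUIRED_KEYS
    }
    missing = [name for name, value in keys.items() if not value]
    if missing:
        raise RuntimeError(f"Missing API keys: {', '.join(missing)}")
    return keys
```

Calling `load_api_keys()` from the project root before launching the app should surface a missing or misspelled key name immediately rather than partway through an extraction run.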

Usage

Run the application:

streamlit run src/app.py

Follow the UI prompts to:

  1. Select your data source
  2. Configure your queries
  3. Start the extraction process
  4. Download or export results
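
For the final step, a CSV export can be produced with the standard library alone. The sketch below assumes each extraction result is a dictionary keyed by field name, which is a guess about the data shape and not something the repository documents. The function name `results_to_csv` and the sample rows are hypothetical.

```python
import csv
import io


def results_to_csv(results: list[dict[str, str]]) -> str:
    """Serialize result rows to CSV text, tolerating missing fields."""
    # Collect column names in first-seen order across all rows.
    fieldnames: list[str] = []
    for row in results:
        for key in row:
            if key not in fieldnames:
                fieldnames.append(key)

    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(results)
    return buffer.getvalue()


# Hypothetical results: the second entity had no email extracted.
sample_results = [
    {"entity": "Acme Corp", "email": "info@acme.example"},
    {"entity": "Globex"},
]
csv_text = results_to_csv(sample_results)
print(csv_text)
```

In a Streamlit app, text like this could be passed as the `data` argument of `st.download_button` to offer the file for download.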

Project Structure

project_root/
├── requirements.txt      # Project dependencies
├── README.md             # Project documentation
└── src/
    ├── app.py            # Main Streamlit application
    ├── config.py         # Configuration and settings
    └── ...               # Other modules
