datadiver-ai is the ultimate tool for π web scraping, transforming πΈοΈ unstructured websites into β¨ clean JSON. Easily extract π paragraphs, π lists, π links, and πΌοΈ images with our π§ AI-powered processing.
Important
Extract structured data from any website with a simple API!π
DataDiver AI is an intelligent web scraping tool that transforms unstructured web pages into clean, organized JSON data. Perfect for research, data analysis, content aggregation, and more!
- π
Universal Scraping- Works with virtually any website - π§
AI-Powered- Uses Mistral AI for intelligent data processing - π§©
Structured Output- Converts messy HTML into clean, consistent JSON - π
Content Categorization- Automatically organizes content by section - π
Rich Content Support- Extracts paragraphs, lists, links, and images - π»
Simple API- Easy-to-use interface for quick integration
βοΈNext.js + ReactπTypeScriptπJSDOM for HTML parsingπ§Mistral API for optimizationπ¨Custom CSS for beautiful UI
# Clone the repository
git clone https://github.com/divyanshudhruv/datadiver-ai.git
# Navigate to project directory
cd datadiver-ai
# Install dependencies
npm install
# Set up environment variables
cp .env.example .env
# Add your Mistral API key to .env file# Start the development server
npm run dev
# Open your browser and navigate to
http://localhost:3000- Enter the
URLyou want to scrape - Click
"Scrape" - View the structured
JSONoutput
// Fetch data from a URL
const response = await fetch("/api/scrape", {
method: "POST",
headers: { "Content-Type": "application/json" },
body: JSON.stringify({ url: "https://example.com" })
});
const data = await response.json();
console.log(data);{
"success": true,
"url": "https://example.com",
"data": {
"title": "Example Website",
"meta": {
"description": "This is an example website"
},
"content": {
"about_us": {
"title": "About Us",
"items": [
{
"type": "paragraph",
"text": "We are a sample company demonstrating DataDiver AI"
},
{
"type": "list",
"listType": "unordered",
"items": ["Feature 1", "Feature 2", "Feature 3"]
}
]
}
}
}
}Contributions are welcome! Please feel free to submit a Pull Request.
# Create a new branch
git checkout -b feature/amazing-feature
# Make your changes and commit them
git commit -m 'Add some amazing feature'
# Push to the branch
git push origin feature/amazing-feature
# Open a Pull RequestThis project is licensed under the MIT License - see the LICENSE file for details.