Skip to content

Shivarora22/WebScapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web Scraper

Java Spring Boot JSoup

A web scraping application built using Java, Spring Boot, and the JSoup library. This project extracts and processes data from websites, making it available in a structured format for analysis or other applications.

Features

  • Efficient Scraping: Extract text, images, and links from web pages.
  • Dynamic Content Handling: Parse HTML and manage dynamic or complex structures.
  • Customizable: Easily modify scraping logic to suit specific use cases.
  • Spring Boot Integration: Leverages Spring Boot for scalability, easy configuration, and RESTful API exposure.
  • Data Export: Outputs data in formats such as JSON or CSV.

Requirements

To run this project, ensure the following prerequisites are met:

  • Java: Version 17 or higher.
  • Maven: To manage dependencies and build the project.
  • Spring Boot: Integrated with the application.
  • JSoup Library: For parsing and scraping web pages.

Installation

1) Clone the repository:
   git clone https://github.com/Shivarora22/WebScapper
2) Build the project using Maven:
   mvn clean install
3) Run the application:
   mvn spring-boot:run
    

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages