Skip to content

Content Scraper, simple pugin to pull prices(DATA) from one website an extract data into a .CSV file.

License

Notifications You must be signed in to change notification settings

brandonv98/Content_Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Content_Scraper

Content-Scraper is a Node.js command line application.

What does it do?

Connect to the website http://shirts4mike.com Get the latest products details (price, title, url and image url) Save the products to a spreadsheet (CSV format) inside the data folder

(will be created if not found)

Tech

Content-Scraper uses a number of open source projects to work properly:

  • Request - Simplified HTTP request client.
  • NodeJS - Running the backend.
  • Cheerio - Fast, flexible & lean implementation of core jQuery designed specifically for the server.
  • FS - Export a JSON array to CSV.
  • json2csv - Json to CVS parser package to conver json data into a csv format.
  • JSHint - Static analysis tool for JavaScript.

Those projects were chosen because they're very popular and have a great documentations.

Installation

Install the dependencies and devDependencies and run the application.

In your terminal navigate to projects root dir (./) then follow install.

npm i bthis-content-scraper
$ npm install
$ npm start

For production environments...

$ NODE_ENV=production npm install
$ npm start

Linting tool only in dev mode

Run this line to check your code for syntax errors

$ npm run error-report

Error handling

All errors will be stored into a dir called "error-logs".
Open the "scrapper-log" file, which will have more detail on the error along with the date && time of the error.

(will be created if not found)

License

MIT MIT

Free Software, Yay!!

About

Content Scraper, simple pugin to pull prices(DATA) from one website an extract data into a .CSV file.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published