Skip to content

danielpcamara/Crawler-Tareffa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

About

Tareffas is a task maneger web app. "https://web.tareffa.com.br/".

This site is able to bulk insert Companys from csv files. However, the "Características" (attributes) of this companys can't be define from the CSV file.

To overcome this limitarion, this crawler uses selenium to update this fiels base on a Excel workbook.

Workbook layout

The workbook must have a worksheet with at least 2 columns:

  • idempresa: the company ID present in the sites URL
  • Alterações: A structure list that informs for every "Grupo de Características" (attribute group), wich attribute must be enable.
    • The groups must be sorted with the same site's order
    • Every group must be separated by pipe (|)
    • If the group should not be changed, inform "N/A"
    • Otherwise informe the relative position of evrey attribute that must be enable, seperated by semicolon (;)
    • By default all items are changed, they wont be unckeck only if M (Manter, to keep in portuguese) is inform in the first space of the group.

A sample file is available in this repo with one Exemple.

Requirements

pip install pandas
pip install openpyxl
pip install selenium

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published