Real Estate - Scraper

A scraper that gathers data from real estate ads.

Currently Supported Websites

| Country | Website     |
| ------- | ----------- |
| Brazil  | ZAP Imóveis |

Installation

Requirements
Python 3.6
MongoDB

1. Clone this repository

Clone this repository using git and cd into the project folder:

git clone https://github.com/pauloromeira/realestate-scraper.git && \
cd realestate-scraper

2. Install Python requirements

Inside the project folder, install the Python requirements using pip:

pip install -r requirements.txt

Usage

First, start the MongoDB server:

mongod &
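
Before starting a crawl, it can help to confirm that the server is accepting connections. The following is a minimal sketch using only the standard library; the function name `mongodb_is_reachable` is hypothetical, and the host and port are assumptions (`localhost` and MongoDB's default port 27017), since the project's own settings may point elsewhere.

```python
import socket


def mongodb_is_reachable(host="localhost", port=27017, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Assumption: MongoDB listens on its default port (27017) on this machine.
    This only checks that something accepts connections there, not that it is MongoDB.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("MongoDB reachable:", mongodb_is_reachable())
```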

Then use the following command to start crawling:

scrapy crawl zap [-a url=<zapimoveis-url>] [-a start=n] [-a count=n] [-a seed=<seed>]

Currently, only ZAP Imóveis is supported.

Arguments:

count: limits the number of pages to crawl. By default, the crawler continues until the last page.
start: page number to start crawling from. The default is 1.
url: website URL used to perform the search.
seed: seed for the website search engine.
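
How `start` and `count` combine can be illustrated with a small sketch. This is not the spider's actual code; `pages_to_crawl` is a hypothetical helper that only models the page numbering described above. In the real crawler, "till the end" is bounded by the site's last page, whereas here the sequence is simply unbounded when `count` is omitted.

```python
from itertools import count as counter, islice


def pages_to_crawl(start=1, count=None):
    """Return an iterator over page numbers, beginning at `start`.

    If `count` is given, stop after that many pages; otherwise the
    iterator is unbounded (islice with a stop of None does not truncate).
    """
    return islice(counter(start), count)


# Starting from page 4, crawl 3 pages.
print(list(pages_to_crawl(start=4, count=3)))
```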

Examples

Default values (properties in Pernambuco, Brazil). Crawl all pages:
```
scrapy crawl zap
```

Olinda-PE. Crawl the first 4 pages:

scrapy crawl zap -a count=4 -a urls="https://www.zapimoveis.com.br/venda/imoveis/pe+olinda/"

Rio de Janeiro-RJ - south zone. Starting at page 100, crawl till the end:

scrapy crawl zap -a start=100 -a urls="https://www.zapimoveis.com.br/venda/imoveis/agr+rj+rio-de-janeiro+zona-sul/"

All places. Starting from page 4, crawl 3 pages:

scrapy crawl zap -a start=4 -a count=3 -a urls="https://www.zapimoveis.com.br/venda/imoveis/"
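
When crawls are launched from a script rather than typed by hand, the commands above can be assembled programmatically. The sketch below is an illustration only: `build_crawl_command` is a hypothetical helper, not part of this project, and it simply turns keyword arguments into `-a name=value` pairs. The argument names passed in the example (`start`, `count`, `urls`) are copied from the examples above.

```python
import shlex


def build_crawl_command(spider="zap", **spider_args):
    """Build the argv list for `scrapy crawl <spider>` with `-a name=value` pairs.

    Arguments whose value is None are skipped, so the spider's defaults apply.
    """
    argv = ["scrapy", "crawl", spider]
    for name, value in spider_args.items():
        if value is not None:
            argv += ["-a", "{}={}".format(name, value)]
    return argv


cmd = build_crawl_command(
    start=4,
    count=3,
    urls="https://www.zapimoveis.com.br/venda/imoveis/",
)
# Print a shell-safe rendering of the command.
print(" ".join(shlex.quote(part) for part in cmd))
```

The resulting list can be passed to `subprocess.run(cmd)` from inside the project folder, which avoids shell quoting issues with URLs.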
