This small demo crawls proxy IPs from https://www.kuaidaili.com/ and checks whether each proxy can be used.
-
First, install Scrapy.
-
Make sure telnet is installed on your system.
-
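For readers curious what a telnet-style reachability check amounts to, here is a minimal Python sketch. It is not the project's own code: the function name is_port_open and the 3-second default timeout are assumptions made for illustration. It only tests whether a TCP connection to the proxy's port can be opened, which is roughly what connecting with telnet tells you.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical helper for illustration; the project itself relies on telnet.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS failures.
        return False
```

An open port does not guarantee that the proxy actually forwards traffic, so treat a True result as "reachable", not "working".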
Change to the project directory and run:
scrapy crawl proxyspider -a page=20 -o proxy.json
This command crawls 20 pages of proxy IPs and saves the results to the proxy.json file.
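Once the crawl finishes, the results can be read back in Python. The sketch below is only an illustration: Scrapy's .json output is a JSON array of items, but the field names "ip" and "port" are assumptions, so check proxy.json and adjust them to match what the spider actually emits. The helper name load_proxies is hypothetical as well.

```python
import json


def load_proxies(path: str, ip_key: str = "ip", port_key: str = "port") -> list[str]:
    """Read a Scrapy JSON export and return proxies as "ip:port" strings.

    `ip_key` and `port_key` are assumed field names, not confirmed by the project.
    """
    with open(path, encoding="utf-8") as f:
        items = json.load(f)  # a .json feed export is a single JSON array
    # Skip items that lack either field instead of raising KeyError.
    return [
        f"{item[ip_key]}:{item[port_key]}"
        for item in items
        if ip_key in item and port_key in item
    ]
```

For example, load_proxies("proxy.json") would return a list of address strings that can be passed to whatever check you prefer.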
This is a small demo for network security beginners, written to learn Scrapy. If you have any questions, please send an email to chuanlongxie@stu.xidian.edu.cn.