Image Crawler Utils Documentation
A rather customizable image crawler structure, designed to download images with their information using multi-threading method. This GIF depicts a sample run:
Besides, several classes and functions have been implemented to help better build a custom image crawler for yourself.
Please follow the rules of robots.txt, and set a low number of threads with high number of delay time when crawling images. Frequent requests and massive download traffic may result in IP addresses being banned or accounts being suspended.
Installation
It is recommended to install it by
pip install image-crawler-utils
Requires
Python >= 3.9.
Important
nodriver are used to parse information from certain websites. It is suggested to install the latest version of Google Chrome first to ensure the crawler will be correctly running.
Guides
It is recommended to start Image Crawler Utils with the Get Started chapter.
For those using the preset crawling tasks, check out the Notes for Tasks chapter.
For those planning to construct a custom crawler, check out the Build a Custom Crawler chapter.
Quick Start
Advanced Usage