README
Listal-bot
Listal-bot
is a simple command line tool that allows downloading all images
from given Listal page.
It supports downloading of person images:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures
Created destination directory ./mm-pictures
Downloading new images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 1113, errors: 0)
as well as images from other categories (sorry, lists are not supported at the moment):
$ listal-bot -u http://www.listal.com/movie/batman -o ./batman-pictures
Created destination directory ./batman-pictures
Downloading new images of movie "batman"
9% [===> ] Done 18 of 210 (saved: 18, errors: 0)
Installation
Make sure you have Node
and npm
installed.
(https://nodejs.org/)
The latest version is published to npm repository, install it using:
npm install listal-bot -g
Usage
$ listal-bot --help
listal-bot 0.5.2
Download all images from listal page.
Usage
Usage listal-bot -u <url> -o <dir> [options]
Options -u and -o are required.
Options
-u, --url <url|name> Listal URL to download (e.g. http://www.listal.com/<name>), or simply <name>
-o, --output <dir> output directory (will be created if does not exist)
-a, --append-name append name extracted from URL to output directory
-k, --append-category append category extracted from URL to output directory
-b, --append-category-name append category and name extracted from URL to output directory
-l, --limit-to download only from a single page (-l 5), a range of pages (-l 3:6), from page
to the end (-l 7:) or from the start to a page (-l :12)
-x, --overwrite overwrite existing files (by default only new files are downloaded)
-h, --help show this help
-v, --version show program version
-t, --timeout <seconds> image download timeout in seconds (default: 5)
-c, --concurrency <number> number of concurrent image downloads (default: 6)
-r, --retries <number> maximum number of retries after image download error (default: 5)
Arguments -u
(URL or name to download) and -o
(output directory) are mandatory.
Example:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures
Downloading new images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 523, errors: 0)
Re-downloading already downloaded images
By default, listal-bot
skips images that you already have in a specified directory.
Add -x
argument to force redownload all images and overwrite existing ones.
Example:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures -x
Downloading all images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 1113, errors: 0)
Automatically creating subdirectories
By default listal-bot
downloads images directly into the directory specified
with -o
option, but you can ask it to create subdirectory based on category and/or name extracted
from Listal URL. Use one of -a
, -k
or -b
options to do that:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./my-picture-collection -a
Created destination directory ./my-picture-collection/marilyn-monroe
Downloading new images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 1113, errors: 0)
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./my-picture-collection -k
Created destination directory ./my-picture-collection/person
Downloading new images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 1113, errors: 0)
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./my-picture-collection -b
Created destination directory ./my-picture-collection/person/marilyn-monroe
Downloading new images of person "marilyn-monroe"
15% [=====> ] Done 1113 of 7418 (saved: 1113, errors: 0)
Downloading only specified pages
By default listal-bot
examines all the numbered pages that belong to the given subject page.
If you want to download images only from a given subset of pages, use -l
option, which
allows specifying a single page, or range of pages.
Examples:
To download only most recent images (from first 5 pages):
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures -l :5
Created destination directory ./mm-pictures
Downloading new images of person "marilyn-monroe"
20% [======> ] Done 20 of 100 (saved: 20, errors: 0)
To download only the oldest images:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures -l 370:
Created destination directory ./mm-pictures
Downloading new images of person "marilyn-monroe"
59% [==================> ] Done 22 of 37 (saved: 22, errors: 0)
To download images only from one specific page:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures -l 17
Created destination directory ./mm-pictures
Downloading new images of person "marilyn-monroe"
75% [=======================> ] Done 15 of 20 (saved: 15, errors: 0)
Or from a specific range of pages:
$ listal-bot -u http://www.listal.com/marilyn-monroe -o ./mm-pictures -l 17:21
Created destination directory ./mm-pictures
Downloading new images of person "marilyn-monroe"
29% [=========> ] Done 29 of 100 (saved: 29, errors: 0)
Note that only one page range can be specified. You can download more images by running
the program multiple times with different -l
parameter values.
Development
Download the code and install dependencies
git clone https://github.com/Messere/listal-bot.git
cd listal-bot
npm install
Listal-bot
is written in TypeScript, for all your
hacking pleasures.
To compile the code, run
npm run compile
After that, the main executable javascript is dist/index.js
.
To check if your code is clean and conforming to standards, run
npm run lint
To run the automated test suite (written with the help of Jasmine testing framework), run
npm test