stash

This is a web article extractor that outputs epubs.

For automatic extraction it uses dom_smoothie. For manual extraction you can define CSS selectors for each field in ~/.config/stash/sites.toml:

["somedomain.com"]
title = ".content h2"
body = ".content .main"
authors = ".content .bylines"
date = ".content .published_at"

You also need to create ~/.config/stash/config.toml and define the output directory:

output_dir = "~/docs/articles"

Then to use:

stash <url>

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
readme.md		readme.md

Provide feedback