Bug#1004115: ITP: epub2txt2 -- extract text from EPUB documents
Package: wnpp
Severity: wishlist
Owner: Thiago Pezzo <pezzo@protonmail.com>
X-Debbugs-Cc: debian-devel@lists.debian.org, pezzo@protonmail.com
* Package name    : epub2txt2
  Version         : 2.01
  Upstream Author : Kevin Boone <kevin@railwayterrace.com>
* URL             : https://github.com/kevinboone/epub2txt2
* License         : GPL-3, GPL-3+
  Programming Lang: C
  Description     : extract text from EPUB documents
epub2txt is a simple command-line utility for extracting text from EPUB
documents and, optionally, re-flowing it to fit a text display of a particular
number of columns. It is intended for reading EPUB e-books on embedded systems
that can't host a graphical EPUB viewer, or for converting such e-books to be
read on those systems. However, it should be robust enough for other purposes,
such as batch indexing of EPUB document collections.
.
epub2txt favours speed and low memory usage over accuracy of rendering. Most
of the formatting of the source document will be lost but, with a text-only
display, this is likely to be of little consequence. It will output
UTF-8-encoded text by default, but can be told to output ASCII, in which case
it will try to convert non-ASCII characters into something displayable if
possible.