Name: pdfsandwich | Distribution: OpenMandriva Lx |
Version: 0.1.7 | Vendor: OpenMandriva |
Release: 2 | Build date: Wed Oct 25 23:28:23 2023 |
Group: Graphics | Build host: ph300-4.openmandriva.org |
Size: 1600552 | Source RPM: pdfsandwich-0.1.7-2.src.rpm |
Packager: mandian <mandian@tutanota.com> | |
Url: http://www.tobias-elze.de/pdfsandwich/ | |
Summary: A tool to make sandwich OCR pdf files |
pdfsandwich generates "sandwich" OCR PDF files: PDF files that contain only images (and no editable text) are processed by optical character recognition (OCR), and the recognized text is added to each page invisibly "behind" the images. pdfsandwich is a command-line tool intended for running OCR on scanned books or journals. It can recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script that calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (only if tesseract < 3.03). It is known to run on Unix systems and has been tested on Linux and Mac OS X. It supports parallel processing on multiprocessor systems. In contrast to most competing sandwich programs, it preprocesses the scanned images, for example by de-skewing them or removing dark edges.
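Because pdfsandwich delegates its work to external binaries, a quick check that those helpers are installed can save a failed run. The following is a minimal sketch, not part of the package: the names `REQUIRED_TOOLS`, `OPTIONAL_TOOLS`, and `missing_tools` are hypothetical, and the tool list is taken only from the description above.

```python
import shutil

# Helper binaries named in the package description. hocr2pdf is only
# needed with tesseract < 3.03, so it is treated as optional here.
# These lists are assumptions for illustration, not read from the package.
REQUIRED_TOOLS = ["convert", "unpaper", "tesseract", "gs"]
OPTIONAL_TOOLS = ["hocr2pdf"]


def missing_tools(tools, which=shutil.which):
    """Return the entries of `tools` that `which` cannot find on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools(REQUIRED_TOOLS)
    if missing:
        print("Missing helper binaries:", ", ".join(missing))
    else:
        print("All required helper binaries were found on PATH.")
```

The `which` parameter defaults to `shutil.which` and exists only so the lookup can be replaced in tests; on a system where the package's dependencies are installed, the script should report that nothing is missing.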
License: GPLv2
Files:
/usr/bin/pdfsandwich
/usr/share/doc/pdfsandwich
/usr/share/doc/pdfsandwich/changelog
/usr/share/licenses/pdfsandwich
/usr/share/licenses/pdfsandwich/copyright
/usr/share/man/man1/pdfsandwich.1.zst
Generated by rpm2html 1.8.1
Fabrice Bellet, Tue Dec 17 23:03:18 2024