Editing PDF documents with Master PDF Editor 4
New in Version 4
Soon after Master PDF Editor version 4.0 was released, the developers released two minor updates in January 2017. One significant new feature is the OCR function for processing scanned documents, which is roughly equivalent to the functionality that gscan2pdf [4] or Paperwork [5] attempt to provide. All three programs use the Tesseract OCR engine, which promises reasonably good results.
Using the scan function, Master PDF Editor initially generates a PDF with images (see the "Scanning" box). The OCR function, which you will find in the Document | OCR menu, converts the results back into directly editable PDFs. I ran a simple test to check how well the OCR performs. The idea was for Master PDF Editor to scan a short passage from a magazine and convert it into machine-readable text.
Scanning
The scan function in Master PDF Editor resides in the File | New | From Scanner menu. The preview is at the same resolution as the final scan, which makes little sense and is unnecessarily time consuming. If you want to digitize the document using OCR after scanning, do not choose a low scanning resolution: Usually, the higher the resolution, the better the character recognition results.
Although the scanner used for this test had a resolution of up to 600dpi, the results were not convincing. The original presented a challenge to the system: The text was not on a white background, the paper of the original was slightly wavy, and the printing was of moderate quality. However, the OCR routine should have identified these problems and correctly scanned OCR text. The results indicated much room for improvement.
Many OCR tools use a more-or-less intelligent spell checker that warns the user if many errors occur; this feature is missing in Master PDF Editor. The installation stores the Tesseract data locally on the hard drive, regardless of whether the files already exist in a system global installation under /usr/share/tessdata/
, resulting in data redundancy.
If you want to understand all the features in Master PDF Editor, you can refer to the manual, which comes as a PDF or online [6]. However, it still refers to version 3.7 and thus contains no information on scanning or OCR. That said, the manual explains all the other features well and in detail. In fact, the manual contains a great deal of information about the structure of PDF files that is otherwise difficult to find.
Conclusions
Despite minor weaknesses, Master PDF Editor 4 proves to be a fine piece of software for retroactive PDF editing. In practical terms, no other free software with a similar feature set exists (see the "Alternatives" box). The latest version of the program comes with promising new functions, such as scanning and text recognition, but does not yet deliver in practice what it promises.
Alternatives
You can find a number of free tools that also attempt to edit PDF documents. The best results are currently obtained with LibreOffice Draw, but in practical terms, LibreOffice's capabilities lag miles behind those of Master PDF Editor.
Infos
- Master PDF Editor 4: https://code-industry.net/masterpdfeditor/
- "Master PDF Editor" by Karsten Günther, Linux Pro Magazine, issue 164, July 2014, pg. 46, http://www.linuxpromagazine.com/Issues/2014/164/Master-PDF-Editor
- Free version: https://code-industry.net/free-pdf-editor
- gscan2pdf: http://gscan2pdf.sourceforge.net
- "Paperwork" by Karsten Günther, Ubuntu User, issue 33, 2017, pg. 13, http://www.ubuntu-user.com/Magazine/Archive/2017/33/Use-Paperwork-to-digitize-and-archive-documents
- Manual: https://code-industry.net/masterpdfeditor-help
« Previous 1 2
Buy Linux Magazine
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.
News
-
Red Hat Adds New Deployment Option for Enterprise Linux Platforms
Red Hat has re-imagined enterprise Linux for an AI future with Image Mode.
-
OSJH and LPI Release 2024 Open Source Pros Job Survey Results
See what open source professionals look for in a new role.
-
Proton 9.0-1 Released to Improve Gaming with Steam
The latest release of Proton 9 adds several improvements and fixes an issue that has been problematic for Linux users.
-
So Long Neofetch and Thanks for the Info
Today is a day that every Linux user who enjoys bragging about their system(s) will mourn, as Neofetch has come to an end.
-
Ubuntu 24.04 Comes with a “Flaw"
If you're thinking you might want to upgrade from your current Ubuntu release to the latest, there's something you might want to consider before doing so.
-
Canonical Releases Ubuntu 24.04
After a brief pause because of the XZ vulnerability, Ubuntu 24.04 is now available for install.
-
Linux Servers Targeted by Akira Ransomware
A group of bad actors who have already extorted $42 million have their sights set on the Linux platform.
-
TUXEDO Computers Unveils Linux Laptop Featuring AMD Ryzen CPU
This latest release is the first laptop to include the new CPU from Ryzen and Linux preinstalled.
-
XZ Gets the All-Clear
The back door xz vulnerability has been officially reverted for Fedora 40 and versions 38 and 39 were never affected.
-
Canonical Collaborates with Qualcomm on New Venture
This new joint effort is geared toward bringing Ubuntu and Ubuntu Core to Qualcomm-powered devices.