PDF creators, extractors, and editors tested
Scribus
The Scribus 1.4.3 layout program supports PDF versions 1.3, 1.4, 1.5, and PDF/X-3. On request, the DTP program either zips images or converts them to JPEG format. In the latter case, you can select one of five different quality levels. Scribus can create thumbnails and integrate bookmarks, as well as compress text and vector graphics. Furthermore, the user decides which of the fonts used in the document Scribus embeds in the PDF. Alternatively, you can convert the text to vectors.
On request, the layout program blends the changes between pages with presentation effects. As with LibreOffice Writer, users can influence the display in Adobe Reader and, for example, hide the menubar. Passwords can be used to restrict access to the document in Scribus, although rival LibreWriter offers much more granular settings.
Scribus, for example, can only completely ban printing, whereas Writer lets you print at a reduced resolution. In return, Scribus adds color bars, bleed marks, and other elements useful for printing to the PDF on request. In PDF/X-3 documents, it can save an explicit color profile.
Import: Very Poor
Although the PDF import feature in LibreOffice was middling, it proved to be simply unusable in Scribus 1.4.3. The text was missing in all of the imported PDF documents (Figure 6). Scribus handles the PDF as a large vector graphic. Users can only dissolve the group and move or delete some of the ingredients left over from the import.
Scribus really took the cake by crashing after importing PDFs that it created itself. It stands to reason, however, that the DTP program refuses to process password-protected documents.
gPDFText
The gPDFText tool opens the text from PDFs for e-books in a text editor [4]. However, this GTK+-based program can also handle normal PDFs, as long as they are not encrypted. We tested version 0.1.6 with our sample documents. gPDFText was able to extract and display all the text from the PDFs. That said, the layout was lost in all cases, and the extracted text appears a jumble of words (Figure 7).
For multicolumn text, such as the article output by InDesign, the sentences were scrambled – partly nested within one other. gPDFText apparently insists on single-column text, as is typical for e-books. In the basic settings, the user can instruct the tool not to merge separate lines. In all, this setting had no effect; the text remained a Dadaist block of letters. Additionally, gPDFText tried to reconnect words separated by hyphens, but this did not always work in our lab. In any case, the user has no alternative but to rework.
« Previous 1 2 3 4 Next »
Buy this article as PDF
(incl. VAT)
Buy Linux Magazine
Subscribe to our Linux Newsletters
Find Linux and Open Source Jobs
Subscribe to our ADMIN Newsletters
Support Our Work
Linux Magazine content is made possible with support from readers like you. Please consider contributing when you’ve found an article to be beneficial.
News
-
Endless OS 6 has Arrived
After more than a year since the last update, the latest release of Endless OS is now available for general usage.
-
Fedora Asahi 40 Remix Available for Macs with Apple Silicon
If you've been anticipating KDE's Plasma 6 for your Apple Silicon-powered Mac, then you're in luck.
-
Red Hat Adds New Deployment Option for Enterprise Linux Platforms
Red Hat has re-imagined enterprise Linux for an AI future with Image Mode.
-
OSJH and LPI Release 2024 Open Source Pros Job Survey Results
See what open source professionals look for in a new role.
-
Proton 9.0-1 Released to Improve Gaming with Steam
The latest release of Proton 9 adds several improvements and fixes an issue that has been problematic for Linux users.
-
So Long Neofetch and Thanks for the Info
Today is a day that every Linux user who enjoys bragging about their system(s) will mourn, as Neofetch has come to an end.
-
Ubuntu 24.04 Comes with a “Flaw"
If you're thinking you might want to upgrade from your current Ubuntu release to the latest, there's something you might want to consider before doing so.
-
Canonical Releases Ubuntu 24.04
After a brief pause because of the XZ vulnerability, Ubuntu 24.04 is now available for install.
-
Linux Servers Targeted by Akira Ransomware
A group of bad actors who have already extorted $42 million have their sights set on the Linux platform.
-
TUXEDO Computers Unveils Linux Laptop Featuring AMD Ryzen CPU
This latest release is the first laptop to include the new CPU from Ryzen and Linux preinstalled.