Display only first two levels of bookmarks in PDF using pdfMark/ghostscript [closed] - pdf

Closed. This question is not about programming or software development. It is not currently accepting answers.
This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Closed last month.
I'm using PDFtk to combine multiple PDF files and create bookmarks in the combined file. When the combined file is opened, I want only the first two levels of bookmarks to be displayed (the file has three levels of bookmarks in total).
The pdfmark + Ghostscript solution suggested in the links below displays bookmarks at all levels.
https://unix.stackexchange.com/questions/398806/how-to-display-bookmarks-in-the-initial-view-when-opening-a-pdf-file
https://thechriskent.com/2017/03/06/setting-pdf-view-options-with-pdfmark/
I went through the pdfmark reference manual by Adobe but did not find any option for this. I'm not an expert in pdfmark or Ghostscript.
I'm on Windows and am looking for a command-line solution (I need to process multiple files regularly). Thanks.

Try:
cpdf -bookmarks-open-to-level <n> in.pdf -o out.pdf
where <n> is 0 for the top level, 1 for the next level, and so on.
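Since the question asks about processing many files regularly, here is a minimal Python sketch that wraps the command above in a loop. It assumes cpdf is on the PATH; the function names and the folder layout are made up for illustration, and level=1 follows the numbering given above (0 is the top level, so 1 should show the first two levels).

```python
from pathlib import Path
import subprocess


def build_cpdf_command(src, dst, level, cpdf="cpdf"):
    """Return the argument list for one cpdf call (same flags as the command above)."""
    return [cpdf, "-bookmarks-open-to-level", str(level), str(src), "-o", str(dst)]


def process_folder(in_dir, out_dir, level=1, run=subprocess.run):
    """Run cpdf on every *.pdf in in_dir and write the results to out_dir.

    `run` is injectable so the loop can be dry-run without cpdf installed.
    Returns the list of commands that were issued.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    commands = []
    for src in sorted(Path(in_dir).glob("*.pdf")):
        cmd = build_cpdf_command(src, out_dir / src.name, level)
        run(cmd, check=True)  # raises CalledProcessError if cpdf fails
        commands.append(cmd)
    return commands
```

Calling process_folder("combined", "combined_fixed") would write one adjusted copy per input file and leave the originals untouched; both folder names are placeholders.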


iText Volume License print count [closed]

Closed 6 years ago.
I know the iText Volume license is based on how many PDFs are printed. I cannot find any documentation on how to set up a counter. We have the JAR files declared in the POM and we are building a WAR file. What else needs to be done?
How is this number of PDFs calculated? Who counts them, iText or the customer?
See below; the answer to this question involves programming.
The question is also answered on the official web site: How can I log the number of documents / bytes I've processed?
Customers with a capacity rental license are asked to count the metric that was agreed using the Counter mechanism. We have a very simple SysoCounter class that writes information to the System.out.
You can test this SysoCounter like this:
CounterFactory.getInstance().setCounter(new SysoCounter());
Obviously, it's not very useful to have all that information in the System.out. Depending on how you want to store the document count (e.g. you might want to update a value in a database), you should write your own Counter implementation and use CounterFactory.getInstance().setCounter(yourCounter) to get the mechanism in place.

How to make InDesign's epub file vs. PDF file compatible? [closed]

Closed 8 years ago.
We use Adobe InDesign to design story books, and we need both a PDF file and an EPUB file. We all review in PDF during the process, so the final, clean product is the PDF. When we export it as an EPUB file, the file is huge and the original design is all messed up. What can we do?
Why did it happen?
I've worked on ONE project going from InDesign to ePub about two years ago - and you are right it is a mess. It didn't understand which local overrides to keep and practically every paragraph had style="localoverride1 localoverride2 substyle3 etc" in it. It was a mess to sort and clean up.
After that miserable experience we've found that it is better to view PDF and ePub as two separate products. Our workflow takes source XML and goes EITHER into InDesign OR through an XSLT to make an ePub. We no longer use InDesign to attempt to make ePubs - with an XSLT there is a LOT more control over the look and feel of the final product.
However if you are dead set on using InDesign - I've heard that it does fixed layout "epub" fairly well (basically it ends up being a bunch of images - it's not reflowable).

PDF optimization for better rendering [closed]

Closed 6 years ago.
I am looking for PDF optimization or pre-processing techniques to convert a print-ready PDF to a media-ready one (press PDF to web PDF).
The target devices for rendering the PDF are iOS and Android.
Tools like Adobe Acrobat Pro provide settings for tasks such as reducing and merging layers.
Expected output PDF shall have only three layers:
1) Text
2) Image
3) Special effects
We can do this by using preflight to select the layers and merge them.
Are there steps to do this efficiently? I don't want to do this optimization manually, page by page.
Can I import a layer (say, multiple image.tiff files) onto multiple pages in a single run?
You can use Ghostscript for that.
If you want to do that through a Ghostscript user interface, you can download Ghostscript Studio (IDE) and use these switches in the Ghostscript Processor:
! >> interaction-related parameters
-dBATCH ! keeps gs from entering the interactive read loop
-dNOPAUSE ! disables the prompt and pause at the end of each page
! >> device selection parameters
-sDEVICE=pdfwrite ! pdf device
-dPDFSETTINGS=/ebook
Also, take a look at this answer: Optimize PDF files (with Ghostscript or other)
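For running the same switches outside the IDE, a rough Python sketch of the equivalent command line might look like the following. The function name is hypothetical, the -sOutputFile switch is added because a command-line run needs an output path, and the executable name (gswin64c on 64-bit Windows, gs elsewhere) is an assumption about a typical install.

```python
import subprocess
import sys


def build_gs_command(src, dst, preset="/ebook", gs=None):
    """Build a Ghostscript argument list using the switches listed above."""
    if gs is None:
        # Assumed executable names; adjust to match the local installation.
        gs = "gswin64c" if sys.platform == "win32" else "gs"
    return [
        gs,
        "-dBATCH",                  # exit after processing instead of going interactive
        "-dNOPAUSE",                # no prompt or pause at the end of each page
        "-sDEVICE=pdfwrite",        # write a PDF
        f"-dPDFSETTINGS={preset}",  # quality preset, /ebook as in the answer
        f"-sOutputFile={dst}",      # where the optimized PDF goes
        str(src),
    ]


def optimize(src, dst, preset="/ebook"):
    """Run Ghostscript once; raises CalledProcessError on failure."""
    subprocess.run(build_gs_command(src, dst, preset), check=True)
```

Note that this only downsamples and recompresses through pdfwrite; it is not a claim about how layers are merged, which the question also asks about.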

How to convert PDF files to spreadsheets [closed]

Closed 6 years ago.
I have been trying the whole day to convert several .pdf files, which contain traffic flow data for São Paulo, to spreadsheets for MS Office Excel or LibreOffice Calc in Ubuntu. When I open a .pdf file with LibreOffice Calc it opens LibreOffice Draw instead, and I can't get the spreadsheet.
The most promising method that I found was here, with pdftotext. It works fine and I can get the tables into LibreOffice Calc, but I have to adjust the columns manually.
My problem is that I have so many .pdf files that it would take me a lot of time.
Does anyone know a better method?
Another option is to use Okular (http://okular.kde.org).
It has a table selection tool (Ctrl+5).
You can select a table, add lines for additional rows and columns, and copy the resulting table to the clipboard.
It works fine for me.
Tabula can work quite well. PDF is not an easy format to extract structured information from, so it's not always possible.
Maybe the -layout option would be useful for you. With this option set, pdftotext will try to keep the column layout in the resulting text file.
Now you can import the text file into LibreOffice Calc with the appropriate import settings. When opening a txt file in Calc, you will be asked how to parse the file content. Under Separator Options, select both the Space option (under "Separated by") and Merge Delimiters. This way, Calc will be able to restore the column structure (assuming the cell data doesn't contain spaces).
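With many files, the Calc import step could be scripted instead. Here is a small Python sketch that mimics the "Space" plus "Merge Delimiters" settings by splitting each line on runs of spaces and producing CSV. It assumes the text was already produced by pdftotext -layout; the function names are made up, and the same caveat applies (cell data must not contain spaces).

```python
import csv
import io
import re


def layout_text_to_rows(text):
    """Split each non-empty line on runs of spaces (space separator, merged delimiters)."""
    rows = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped:
            rows.append(re.split(r" +", stripped))
    return rows


def rows_to_csv(rows):
    """Serialize the rows as CSV text that Calc or Excel can open directly."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up sample in the shape of pdftotext -layout output.
    sample = "Hour   Cars   Buses\n07:00  1200   85\n\n08:00  1510   97\n"
    print(rows_to_csv(layout_text_to_rows(sample)), end="")
```

Looping this over every .txt file produced by pdftotext would avoid adjusting the columns by hand for each PDF.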
A tool called Able2Extract is another option; it can do exactly what you want with minimal errors.

go through files and OCR pdf [closed]

Closed 2 years ago.
Is there a free way to go through a bunch of image-only PDF files and folders (in different locations) and OCR them?
I would be really interested in it. Please suggest something.
Try VietOCR, which monitors a watch folder for new input images. The program requires Ghostscript to recognize the PDF format.
I recommend OCRvision OCR PDF software. It has OCR folder watch where you can configure any folder as a monitored folder and the software will auto-OCR the PDF files there and convert any new scanned documents to searchable PDF. You can download the software from the web site.
PS: I work for OCRvision.