wkhtmltopdf Page break for TD element - webkit

I am using wkhtmltopdf to convert HTML to PDF documents. The last line on a page gets cut off and spills over onto the next page.
Here is the sample html file I am using.
http://s000.tinyupload.com/index.php?file_id=23944558487334389019
This is how the output PDF looks.
I am just passing the HTML file and the PDF filename as parameters to wkhtmltopdf.exe.
I am using version 0.12.4 of wkhtmltopdf.exe.

Related

creating pdftk watermark file from command line

I need pdftk to watermark a PDF. I'm generating the content of the watermark programmatically and writing it out to a text file. Then I use cupsfilter to create the watermark PDF, and then pdftk to apply the generated watermark PDF onto an eBook PDF.
I understand that pdftk applies the watermark PDF onto the eBook PDF page by page.
If I create a 62-line text file, with 61 empty lines and the watermark text on the 62nd, it gets applied properly at around 5/6 of the page height on every page of the eBook PDF.
If I add one more empty line, the watermark text disappears. It does not end up on the next page, it is simply not there.
My ultimate goal is to have the watermark text at the bottom of the second page of the eBook.
So I would need to create a 3-page PDF, with the first page empty, the watermark text at the bottom of the second page, and a third page that is again empty.
I tried to insert a page break into the text file using BBEdit, but I do not get the expected result.
Does anybody have a hint how I could create the required text file which, once printed out with cupsfilter into a PDF, will produce the needed watermark PDF (first and third page empty and a line or two of text at the bottom of the second page)?
OK, so first: the manual is not entirely clear about the difference between stamp and multistamp, or between background and multibackground. It explains that a multipage watermark PDF is applied page by page onto the eBook PDF, and that if the watermark PDF has fewer pages than the eBook PDF, its last page is applied to all surplus pages of the eBook. This is correct, but only for the multistamp/multibackground options. With the stamp/background options, only the first page of the watermark PDF is applied to all pages of the eBook PDF. This was the first thing to figure out.
So I created two txt files using echo, one empty (with one space in it) and one with one line of watermark text, and converted each to a PDF. Then I used the pdftk cat operation to merge the empty PDF with the watermark PDF, so I got a two-page PDF with the first page empty and the second carrying the line of text. Then I merged this file once again with the empty PDF, and ended up with a 3-page PDF.
Then I applied this 3-page watermark PDF to the eBook with the multibackground option and got what I wanted: no watermark on the first page, the line of text on the second page, and no watermark on the third and all other pages.
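The steps above could be scripted roughly as follows. This is only a sketch, not the exact commands from the answer: the file names (empty.pdf, mark.pdf, stamp3.pdf) and the function names are made up for illustration, and a single pdftk cat call with input handles stands in for the two successive merges. It assumes cupsfilter and pdftk are on the PATH.

```python
import subprocess
from pathlib import Path


def text_to_pdf(text, stem):
    """Write `text` to <stem>.txt and convert it to <stem>.pdf via cupsfilter."""
    txt, pdf = Path(f"{stem}.txt"), Path(f"{stem}.pdf")
    txt.write_text(text + "\n")
    with pdf.open("wb") as out:
        # cupsfilter writes the converted document to standard output.
        subprocess.run(["cupsfilter", str(txt)], stdout=out, check=True)
    return str(pdf)


def pdftk_commands(ebook, output, empty_pdf, mark_pdf, stamp_pdf="stamp3.pdf"):
    """Build (but do not run) the two pdftk invocations as argument lists."""
    # Pages in order: empty, watermark, empty. Handles E and M name the inputs.
    merge_cmd = ["pdftk", f"E={empty_pdf}", f"M={mark_pdf}",
                 "cat", "E", "M", "E", "output", stamp_pdf]
    # multibackground applies the stamp PDF page by page; its last (empty)
    # page is reused for all remaining pages of the eBook.
    apply_cmd = ["pdftk", ebook, "multibackground", stamp_pdf, "output", output]
    return merge_cmd, apply_cmd


def watermark(ebook, output, mark_text):
    """Run the whole workflow: two text files, two PDFs, merge, apply."""
    empty_pdf = text_to_pdf(" ", "empty")
    mark_pdf = text_to_pdf(mark_text, "mark")
    for cmd in pdftk_commands(ebook, output, empty_pdf, mark_pdf):
        subprocess.run(cmd, check=True)
```

Calling watermark("ebook.pdf", "ebook_marked.pdf", "some watermark line") would then run all four steps in order. Where the line of text lands on the second page still depends on the contents of the text file.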

Not exact format of dots in doc. document list after formatting it to html via LibreOffice

When I convert my doc document into an HTML file via LibreOffice, I'm getting black dots instead of the original list markers.
Example doc. document list before formatting:
doc document before saving as html
After saving as html page via LibreOffice:
After saving as html file
Is there any plugin for LibreOffice to make the marking exactly the same or is there any other way to make it look exactly the same after formatting?

PDF content stream operators are in reverse order. How can we reorder them using PDFBox?

Some PDFs are generated by the ocrmypdf library in hocr mode:
import ocrmypdf
ocrmypdf.ocr("input.pdf", "output.pdf", pdf_renderer="hocr", keep_temporary_files=True)
The content stream of all these PDFs is in reverse order. The last line of the page is located first in the content stream, as shown in the figure below.
PDF file
Because of this, when I tag this PDF, JAWS reads the last line first in every paragraph tag.

How to merge pages of a PDF document into a single page from command line?

I have an HTML document to be printed on an 80mm paper roll. So, I convert the HTML document to PDF and use the lp command to print it. The problem is that the generated PDF file breaks my document into pages. What I need is to merge the PDF pages into a single page which has the height of the HTML document I am converting from.

LaTeX Embedded PDFs not showing or printing

I have a set of SVGs that have been converted into PDFs using inkscape.
These are then included in a LaTeX document, which I compile and try to view in a reader.
pdflatex completes successfully and without errors, yet when I print from either Preview or Adobe Reader, the result is identical to the left pane in the enclosed screen grab.
Adobe Reader also only sees the pdf in this way.
I am able to print the original pdfs, so the error must lie in latex or the viewers.
One working solution lies in converting the SVGs to EPS:
inkscape -z --export-area-drawing <input filename+ path>.svg -E <output filename + path>.eps --export-ps-level=3
... and then letting pdflatex convert them into PDFs during compilation to embed them in the original document.
\includegraphics[...]{filename.eps}
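
For a whole set of SVGs, the conversion could be batched with a small script. This is a sketch only: the function names are hypothetical, and the flags are copied unchanged from the command above. Newer Inkscape releases changed their command-line options, so the flags may need adjusting for the installed version.

```python
import subprocess
from pathlib import Path


def eps_command(svg_path):
    """Build the inkscape argument list for one SVG, writing the EPS next to it."""
    svg = Path(svg_path)
    eps = svg.with_suffix(".eps")
    # Flags taken as-is from the command in the post.
    return ["inkscape", "-z", "--export-area-drawing", str(svg),
            "-E", str(eps), "--export-ps-level=3"]


def convert_all(folder):
    """Convert every *.svg file in `folder` to EPS, one inkscape call each."""
    for svg in sorted(Path(folder).glob("*.svg")):
        subprocess.run(eps_command(svg), check=True)
```

Running convert_all("figures") before compiling would leave an .eps file next to each .svg, ready for \includegraphics.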