Reformat image format in PDF - pdf

I'm having issues rendering a couple of images and some text in a PDF with the Telerik PDF viewer. According to Telerik's documentation, it seems those text/image formats are incompatible.
Is there a way to convert the existing images in a PDF and put them back into the PDF, so that the file is compatible with the Telerik PDF viewer?
Many thanks

Related

Google Cloud Vision PDF Gibberish

I'm trying to extract text from PDF files using the Google Cloud Vision API. It works most of the time, but I get gibberish in a few cases. I tried both DOCUMENT_TEXT_DETECTION and TEXT_DETECTION, and I tried forcing the language in the languageHints, but it didn't help.
Then I tried with a screenshot saved as tiff and this did work, so I'm guessing that Google tries to use the text in the PDF if it's not just a picture. Indeed, when I select all "text" in the PDF, I get gibberish.
When I print the tiff back into PDF, text extraction works. So it's really something weird with the PDF. But other extraction software (such as ABBYY) works well with the original PDF.
Has anyone had the same kind of issues?
One thing that could help would be an option to force the PDF to be treated as an "image PDF". Is there such an option?
Thanks for your help!
FYI, I am unfortunately not allowed to show the PDF, and I use the dotnet library.
Edit:
The info on the PDF is:
Creator: "PScript5.dll Version 5.2.2"
Producer: Acrobat Distiller 10.1.16 (Windows)

SyncFusion, Bad PDF Format, Inverted Color on Inputs

So the state publishes PDF files for us to download. However, these files only look right in Adobe Acrobat. When I try to open them in PDFSharp or SyncFusion, it fails with an error saying that there are no pages. When I have PDFSharp or SyncFusion create a new file from the pages in the PDF, the result is the following:
However, if I save this document as an archivable PDF/A in Adobe Acrobat Pro, the file straightens out and both PDFSharp and SyncFusion have no problems printing and viewing the resulting PDF file. I can't get SyncFusion to re-save the document as a PDF/A and have it fix the file the way Adobe does.
What could cause this issue?
Edit: The PDF file says it was created by an Elixir program which converts open document formats to PDF. Possibly Librex.

Convert pdf document to jpeg using LEADTOOLS and PDF-TOOLS

We have pdf documents (source: camera or scanner) that we want to convert to jpeg.
We use LEADTOOLS and PDF-TOOLS (in two separate programs) to convert these pdf files to jpeg files.
Both of these tools use a default DPI of 150, irrespective of the DPI of the source pdf file.
We would rather have this value taken from the source pdf file.
For example, Adobe Acrobat recognizes the DPI of the source pdf file and uses the same value to create the jpeg file.
Is there some way we could achieve the same with LEADTOOLS and PDF-TOOLS by determining the DPI of the source pdf file?
This feature was added to v19 of LEADTOOLS a few months ago. You can now extract images from PDF pages while preserving their original pixel dimensions using the following members of the Leadtools.Pdf.PDFDocument class:
ParseDocumentStructure method.
Images property.
DecodeImage method.
Furthermore, if the image is stretched inside the PDF page, you can detect that by examining its display size in the PDF page using the Leadtools.Pdf.PDFObject.Bounds property.
There's a dedicated demo for the PDFDocument class and related objects installed with LEADTOOLS 19 in these folders:
Examples\DotNet\CS\PDFDocumentDemo
Examples\DotNet\VB\PDFDocumentDemo
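To illustrate how an image's pixel dimensions and its display size on the page relate to DPI, here is a minimal sketch of the arithmetic in Python. It is not LEADTOOLS code: the function names are hypothetical, and it assumes the display bounds are expressed in default PDF user-space units (1/72 inch), which you would need to confirm for the values returned by Bounds.

```python
POINTS_PER_INCH = 72.0  # default PDF user-space unit is 1/72 inch (assumption: no UserUnit scaling)


def effective_dpi(pixel_width, pixel_height, bounds_width_pt, bounds_height_pt):
    """Return (horizontal_dpi, vertical_dpi) for an image placed on a PDF page.

    pixel_width / pixel_height: original pixel dimensions of the embedded image.
    bounds_width_pt / bounds_height_pt: size at which the image is displayed
    on the page, in points (hypothetical inputs, e.g. taken from an object's bounds).
    """
    if bounds_width_pt <= 0 or bounds_height_pt <= 0:
        raise ValueError("display bounds must be positive")
    horizontal = pixel_width * POINTS_PER_INCH / bounds_width_pt
    vertical = pixel_height * POINTS_PER_INCH / bounds_height_pt
    return horizontal, vertical


def is_stretched(horizontal_dpi, vertical_dpi, tolerance=1.0):
    """Treat the image as stretched when the two DPI values differ noticeably."""
    return abs(horizontal_dpi - vertical_dpi) > tolerance


if __name__ == "__main__":
    # A 2550 x 3300 pixel scan shown on a full US Letter page (612 x 792 points).
    h_dpi, v_dpi = effective_dpi(2550, 3300, 612, 792)
    print(h_dpi, v_dpi, is_stretched(h_dpi, v_dpi))
```

If the horizontal and vertical values differ, the image is displayed stretched on the page, and decoding it at its original pixel dimensions rather than rendering the page at a fixed DPI avoids resampling.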

itextsharp joined pdf size too large

I have an asp.net (c#) web application.
I merge some PDF files on the server side with iTextSharp.
The resulting PDF is too large (>1 MB) to be downloaded in our environment.
I checked the font list in Adobe Reader, and the Verdana font is embedded 10 times. That could be the problem, but I don't know why it happens. I use iTextSharp's PdfCopy for merging.
If I capture the file with Fiddler and then print it with the Bullzip PDF Printer, the size is halved (500 KB)!
I can't figure out what the Bullzip PDF Printer does to reduce the size.
All the merged PDFs are mostly text, with only a couple of small images.
Interestingly, I can copy/paste text from the original PDF, but not from the Bullzip-printed version (I get only rectangles when I paste). That is not a problem, by the way, because I don't need to edit or search the text; I only need to print it from the browser.
I need some .NET library that does the same to the PDF before I send it to the browser.
Can anyone help me?

Converting PDF to WMF

Can someone suggest the best way to convert a PDF file to WMF?
I currently make Dynamic XML Form PDFs with Adobe LiveCycle Designer, but a client requires the file in WMF format for printing. Instead of filling out the form directly, they have their users answer the questions within their own UI, then print the fields to the form.
I've tried converting the PDF to PNG within Acrobat and then converting it to WMF, but our client is having issues with the results and reports that these WMFs "appear to be graphics renamed as WMFs".
Thanks
Try writing a script that prints your PDF files into a WMF/EMF virtual printer.
There are a few out there, for example:
EMF Printer (GPL, not free for commercial use).
Amyuni EMF Printer (freeware, but a bit outdated). Disclaimer: I work for Amyuni Technologies.
I believe there is another one (as source code) in the Windows Driver Kit source code samples, but I cannot find the link now.