Can I use google API to convert a PDF into PNGs? - pdf

I have noticed that when you view PDFs in google docs the PDF viewer renders the PDF file into PNG images.
I was wondering if you could use Google Data API to upload a PDF and get the URLs of the rendered PNG files?
I have never used the google API or really had the extra time to learn it, but if it help me do this it will be well worth the extra time.

No you can't do it.
Google explicitly does not allow it.
Downloading PDFs and arbitrary files
Native PDF files cannot be exported in
a format other than .pdf.

I doubt it.. I think it would be easier/more stable to use imagemagick or some other library to handle this. Converting PDFs to images usign imagemagick is just one CLI command.

Related

API for PDF Library

I need to build a small PDF library that will display many catalogs, the user will be able to view the document and go thru pages but he will not be able to download or share the documents in any way, somehow to work like Google Books (here an example).
I have in mind something like the Google Drive API or some kind of Scribd API, but I don't know if one of those will work, I would like to know if there are more options for these application or the mentioned before will do the job.
Edit: Forgot to mention, all this done in a web browser.
In principle all you need would be the ability to render pages from a PDF file into an image. Your application (you didn't mention where you want to build this) is then responsible for displaying the images, scrolling, moving from page to page etc...
If this is correct there are multiple possible libraries that can do this:
- ImageMagick can convert PDF to images (http://www.imagemagick.org)
- GhostScript has extensions for PDF and can convert PostScript or PDF into images and other formats (http://www.ghostscript.com)
- I'm sure there are many, many more...
There are also a number of commercial tools, for example those from Adobe (licensed through DataLogics, http://www.datalogics.com) and callas software (http://www.callassoftware.com - I'm affiliated with this company)

How to convert InDesign IDML to Tiff?

I have a requirement to take idml files provided by a client, twiddle them a bit to fill in some placeholders and generate a TIFF file. This needs to happen automatically and I have InDesign Server at my disposal.
I have the first part down. I have also found how to connect to InDesign Server via SOAP and convert IDML files to hi-res PDF or low-res JPG (This implies a few other other options).
I am at a bit of a loss as to how to take it the rest of the way to generate a TIFF file, the adobe forums have not been much help. It is my impression that this sort of thing is exactly why the IDML format was introduced so I'm assuming there's decent support out there for it but the best I've been able to come up with so far is to go IDML via Indesign Server to PDF (or SVG) via Inkscape Command-line to PNG via System.Drawing to TIFF but that seems horribly contrived and fault-prone (and I have no idea how I'm going to handle multiple pages).
Any ideas?
I don't believe there is a way to export to TIFF via InDesign Server, however I did find this post on the Adobe Forums that suggests using Photoshop to render the Tiff after exporting it as a PDF from IDS. Maybe that would be an option? Otherwise maybe you could use one of the formats that you CAN export from (i.e. JPG, PDF, EPS).
Hope this helps!
For reference, I ended up using Ghostscript to achieve the results.

Any other way to read/write a PDF file using java application other than itext, PDFbox?

I Tried with iText and PDFBox .
It is not simple , we need to understand lot of code for this.
Can anybody provide a simple way of reading and writing PDF using Java Application.
Make sure the application is standalone, and no need of any web/application server.
There are loads of simple examples for manipulating PDFs with Itext in the Itext in Action Book.
PDF is a complex file format. What are you trying to do exactly?

OpenSource CMS for pdf files

I am looking for some opensource suggestions for a CMS which facilitates alot of pdf files. I have thousands of pdf files and I would like to use a CMS which makes handling these files as easy as possible.
Thanks for any suggestions.
Plone has great support for binary files like PDFs: upload, download, manage security/access restrictions, manage caching of them and have the PDF's full-text indexed for search out of the box.
I don't know your requirements, but if I were you, I'd store an editable copy of the CMS content (in HTML) and render it each time it is saved.
To render HTML in the best manner possible, use a tool like WKHTMLTOPDF - easily connected to PHP.

Anyway to automatically convert DWF to PDF?

Our eTendering solution, www.monaqasat.com, currently works exclusively with PDF documents for various reasons, some of them being security. We are being asked if we can support DWF documents. For this to happen, we would need to find a way to automatically convert DWF documents to PDF, using some kind of Unix application.
Does anybody know any such application, preferably using Rails or Java?
Thanks,
.Karim
http://www.autodwg.com/pdf/
http://www.dwgto.com/
http://www.aidecad.com/
http://en.wikipedia.org/wiki/List_of_PDF_software
http://www.cogniview.com/convert-pdf-to-excel/category/pdf/
Suggestion would be to install a software printer call its APIs and pass dwf and get back pdf and then apply security as needed.
Autodesk has its DWF Toolkit available at
http://www.autodesk.com/dwftoolkit
It contains full source code in C++ to read & write DWF files, so it should be reasonably easy to make it run under Linux and to use a PDF library to write the output.