make PDF with external sources - pdf

is there anyone who tried to make PDF file with external sources like images/style/scripts?
These external sources should be loaded from internet whenever is PDF opened.
Somebody told me that metasploit can do it, bud I didn't find out how to.

A PDF file is not a webpage, you shouldn't rely on online resources to be displayed in it when you open it.
However, if you mean using online resources when you create a PDF file, I would say any PDF creation library can do this. My personal favorite is a Java lib called iText. If you prefer C#, it's been ported as iTextSharp.

Related

Is it possible to automate Visio using a third party?

We are trying to create a Visio project file and to process it with an external tool to generate a json file that a web app can read to create a decision tree based on the json contents.
We know about 3rd parties to automate MS Word files but no idea if there are others dealing with Visio files.
Google didn't return anything valid as per Automation it always returns links to how to program macros on it.
Any help will be appreciated.
There are many ways to automate Visio: macros, .net plugins, python, pascal, even windows powershell.
But if your final aim is getting Visio to the web, you may try the following link.
http://unmanagedvisio.com/category/svgexport/
The tool exports the drawing to an SVG embedded in an HTML file with Javascript functionality. Definitely worth a look.

Selecting text and image from pdf through any programming language

I'm trying to develop a tool/web application such that it will import a PDF file and I need to select text and images available in PDF by selecting them with a mouse click and marking them as title,content and image with a button click (3 different button) where the marked contents and image will be copied to clipboard or will be pasted into a word document which is going to be a another part. So in which programming language is this possible to work with and carry on ?
I'd probably try researching pure browser-side solution using pdf.js and clipboard API.
Otherwise, you'd still need clipboard API in the browser and the server-side may actually be powered by any programming language which can be hooked into a web server and has a library to parse PDFs.
You said nothing at all about your prospective server platform but to name a few, .NET has PdfSharp which is able to read PDFs, Python has a host of tools available for it. After all, there exist a bunch of command-line utilities to extract data from PDF which can be called using any PL able to call external processes.
Note that this only appears to be a simpler solution than using pdf.js but note that unless your PDFs are really uniform (say, invoices created by some piece of software), and so you'll be able to make your PDF parser know which bits of data it has to extract and return, the parser will need to returl all the data it extracted to the client, and you'll need to somehow render it all there. May be it's exactly what you need but maybe not.
Since PDFs are really tailored for typesetting and not presenting information in a structured manner, I'd try to piggyback on an already hard-core PDF rendering solution which runs in the browser, so see above.

API for PDF Library

I need to build a small PDF library that will display many catalogs, the user will be able to view the document and go thru pages but he will not be able to download or share the documents in any way, somehow to work like Google Books (here an example).
I have in mind something like the Google Drive API or some kind of Scribd API, but I don't know if one of those will work, I would like to know if there are more options for these application or the mentioned before will do the job.
Edit: Forgot to mention, all this done in a web browser.
In principle all you need would be the ability to render pages from a PDF file into an image. Your application (you didn't mention where you want to build this) is then responsible for displaying the images, scrolling, moving from page to page etc...
If this is correct there are multiple possible libraries that can do this:
- ImageMagick can convert PDF to images (http://www.imagemagick.org)
- GhostScript has extensions for PDF and can convert PostScript or PDF into images and other formats (http://www.ghostscript.com)
- I'm sure there are many, many more...
There are also a number of commercial tools, for example those from Adobe (licensed through DataLogics, http://www.datalogics.com) and callas software (http://www.callassoftware.com - I'm affiliated with this company)

Anyway to automatically convert DWF to PDF?

Our eTendering solution, www.monaqasat.com, currently works exclusively with PDF documents for various reasons, some of them being security. We are being asked if we can support DWF documents. For this to happen, we would need to find a way to automatically convert DWF documents to PDF, using some kind of Unix application.
Does anybody know any such application, preferably using Rails or Java?
Thanks,
.Karim
http://www.autodwg.com/pdf/
http://www.dwgto.com/
http://www.aidecad.com/
http://en.wikipedia.org/wiki/List_of_PDF_software
http://www.cogniview.com/convert-pdf-to-excel/category/pdf/
Suggestion would be to install a software printer call its APIs and pass dwf and get back pdf and then apply security as needed.
Autodesk has its DWF Toolkit available at
http://www.autodesk.com/dwftoolkit
It contains full source code in C++ to read & write DWF files, so it should be reasonably easy to make it run under Linux and to use a PDF library to write the output.

How to convert Word and Excel documents to PDF programmatically?

We are developing a little application that given a directory with PDF files creates a unique PDF file containing all the PDF files in the directory. This is a simple task using iTextSharp. The problem appears if in the directory exist some files like Word documents, or Excel documents.
My question is, is there a way to convert word, excel documents into PDF programmatically? And even better, is this possible without having the office suite installed on the computer running the application?
Office 2007 allows for this. I have found PDFCreator to be good, the VBA is included in sample files, and have heard that CutePDF is also good. PDFCreator and CutePDF are free.
To work without Office, you would need viewers, as far as I know:
http://www.microsoft.com/downloads/details.aspx?FamilyID=c8378bf4-996c-4569-b547-75edbd03aaf0&displaylang=EN
http://www.microsoft.com/downloads/details.aspx?familyid=95E24C87-8732-48D5-8689-AB826E7B8FDF&displaylang=en
I needed to do this myself, but managed to get it done with .Net and without 3rd party tools:
MSDN: Saving Word 2007 Documents to PDF and XPS Formats
Pretty simple, about 50 lines of code. However I think you will need Word 2007 installed on the machine as well as the ability to Save As PDF
To convert Word documents to PDF, take a look at jWordConvert, a java library that can do exactly that. This will not work with the Excel files though, only with the Word files. The language is not Sharp, it's Java but you could switch to use IText (which is java) instead of ITextSharp.
You can also use a component like activePDF's DocConverter to convert a lot formats to PDF.
Use PDF maker that comes with adobe 7- 9
I just used this code Covert Doc to PDF
I'm surprised Aspose wasn't mentioned here, it's easy, simple, and reliable. Downside is that it is not free.
I've used iTextSharp in the past, it's really good, easy to install (one DLL I believe), the merge takes a bit of tindering so it's not as easy to use as Aspose, but hey, it's free so that is the best part.
TallPDF.NET (comes with a hefty price tag) allows you to serve dynamic PDF from any .NET application including ASP.NET pages and web services.
PDFEdit (free and open source) is an editor for manipulating PDF documents. It has a GUI version and a command-line interface. Scripting is used to a great extent in the editor and almost anything can be scripted. It is possible to create your own scripts or plugins.
The most common way to convert files to a pdf is to print them to a pdf printer driver. There are a number of such drivers, one that i know of that will do the job is Black Ice.
Another is to use Adobe Acrobat's SDK. from memory its very expensive.
Its been a while since i have actually done any work with converting pdf's and the landscape may have changed.