Merge PDF Files from Fileupload - pdf

I want to merge four separate PDF uploads' contents and keep them in database as binary format.

There's no easy to way to merge pdf files. If you're using PHP you can use an open source library to do so.
Link: PDF Merger for PHP
If you're using .NET follow this tutorial: Simple .NET PDF Merger which uses iTextSharp library.

Related

Extract xml from ZUGFeRD PDF with Ghostscript

We would like to automate the processing of Zugferd invoices.
Is there a way to extract and save the xml files embedded in the PDF using Ghostscript?
as mentioned by KenS Ghostscript can help assemble Zugferd files but not extract the contents. Below we can see those contents in the source xml (lower) and a good !? PDF where the plain text is visible (upper part of image is PDF viewed in WordPad) and can be easily extracted as text. However nothing about PDF extraction is reliable since the format of one PDF is rarely the same as the next unless you make it so.
Many PDF readers have the ability to export such attachments as the source file and many PDF libraries will allow for extraction of the named file in a scripted fashion.
The samples above are from currently very up to date Open Source Java application https://www.mustangproject.org/
For very simple cross platform use there is pdfdetach which can save any attachments by name or all attachments

Is is possible to embed UUID into PDF/A file using iText library?

My program produces a multiple-pages PDF/A files using iText API. I have two problems about this:
1.I have no idea that the PDF file does not come with both "xmpMM:DocumentID" and "xmpMM:InstanceID" tags in root metadata. I tried to add both tags into root metadata by XmpMMSchema, but it seems like doesnot support the field 'xmpMM:InstanceID'.
2. I think iText generates UUID for each PDF/A file, right? Is there a way to extract it out?
Does anyone here know how to do that?

Pdf generators from an xml template?

Are there any PDF generators out there be it commercial or open source which can be used for research purposes? That i can use such as pdfnow.com or any standalone desktop apps which allow me to generate a PDF from a XML template. I have tried researching but there is allot of ambiguity going around.
Applidok is generating PDF based on an original (raw) PDF, a template definition and dynamic/user data (e.g. from a form): http://go.applidok.com/en/howitworks.gz.html
Template format there is JSON, not XML, but approach is the same.

How to merge PDF files without external dependencies

In one of my applications I need to merge many single PDF documents into one document, where each of the original PDFs is a page. Although many PDF libraries exist for most languages, I would like to write this myself if it's not too hard.
Is it necessary to implement a full-fledged PDF parser in order to merge PDF documents? Where and what would I start to read to find out what is needed for the task?
You can use the Debenu QuickPDF Library Lite (free) version to do it. Here is a very good example how to do it:
http://www.debenu.com/kb/merge-pdf-files-together-programmatically/

create one pdf from multiple ppt files

Someone knows how can I create one pdf file from multiple ppt files ?
Whether it to write script or computer program. However if it can be done with some program it will be the best.
I searched the web for something like this but I didn't get any results.
If you want to convert the PPT/PPTX files to PDF and then join those converted PDF files into a single PDF using either .NET or Java, you may try Aspose.Slides and Aspose.Pdf.Kit components.
Aspose.Slides allows you to convert the PPT/PPTX files to PDF and Aspose.Pdf.kit allows you to join the PDF files into a single PDF. Please see if this solution can work for your scenario.
Disclosure: I work as developer evangelist at Aspose.