Google Script for PDF image/screenshot import into Google Doc - pdf

I'm trying to write a script that takes images from a PDF file and puts them into a Google Doc Template. The PDF has a bunch of images in it, one image per page. What I want to do is grab the images one at a time (or take a screenshot of the entire page) and paste it into a new document. How can I access a single image at a time to import it into the google doc?
Thanks.

There is probably no way to play directly with a pdf file but you should consider to first convert these pdf to google docs and, in a second time, use the document class that can retrieve images inside the doc quite easily...

Related

Fetch a certain page of a pdf as an image from google drive

I have some 10,000 pages of hand-written scanned documents in google drive in somewhere around 70 pdf documents.
I am making a spreadsheet index of these, with one row for each page where I make notes of what is on each page, by actually viewing those pages, reading it, and every fully typing it if required.
I need a link, which I can put in the spreadsheet, which when clicked opens up a certain page of the pdf as an image only, and not the entire pdf, the pdf is in google drive. Is there something like this possible in Google Drive? Or should I manually download all pdf, split it into images, and then re-upload and use that?
(example - java -jar pdfbox-app.jar PDFToImage -format jpg -quality 0.75 pdffile.pdf ; and then upload all this)
I have a feeling it must be possible because when we open the pdf in browser, it loads pdf pages one by one, it takes time but it opens it in some custom image+text format, so it must be exported. Also I know there is one image version for each google slide and link is stable, so there might be something for pdf also I was thinking.
There isn’t a parameter or feature to link a pdf page in Google Drive file viewer.
Indeed as mentioned, you can link to a specific slide in Google Slides, however Google file types do have additional features.
That’s not the case for PDFs for example. A workaround I can think of would be to create a comment for each page and each comment will have its own id.
After creating the comment, you can click on the three vertical dots icon and click on Link to this comment.
Alternatively, you can send feedback to Google (On file viewer page, click on three vertical dots icon and then Send feedback to Google) making sure to describe the proposed feature.

Update an existing PDF

I have an existing PDF template, and I want to add some text over the top of it in a specific location on a specific page. For example, an invoice template or a scanned document.
How would I go about this? I'm using Sheets if that narrows my options.
There is a lot of documentation on how to create PDFs from documents, but nothing that will take a particular PDF and put text over it.
A bit of blue sky thinking from my side. What if you were to do the following?
Manually:
Export the PDF as an image
Resize a Google slide to be the same shape as the image
Add the image to the google slide as a background image
In Code:
Make a copy of the google slide as needed
Add your Google sheets data to the slide positioned as needed on the slide
Export the Google slide as a pdf
I mention google slides, because Google docs to not have allow background images.
Another possibility with using Google docs is to build a document that looks like your invoice, then add to it tags. In your code search for the tags and replace them with data from your sheet. Even you could use a mail merge addon to do this if you didn't fancy writing any code.
In your Google Drive right-click on your PDF;
Open With > Connect More Apps > Search docusign > Left-click blue connect button and this should take you to Docusign.
Your PDF will then open with Docusign where you can manually add a signature or text to specific places.
Alternatively... You can use App Script.
Example,
Constructing a document using HTML and replacing the specific value with a variable which is collect from a Sheet or particular location. Writing the function to do this will allow you to interpret the HTML and replace the variable with the text or possibly an image. Then it would create this as a PDF.
There are many alternative ways to do this.
Here is one example on github;
Convert Google Doc to PDF using App Script

Extract PDF coordinates using mouse click

I want to extract the coordinates of a PDF document with the help of a mouse click. I have gone through some posts but since I'm new to this, I'm not being able to understand it properly. Also, can this be done if I render the PDF file in a web page?
You can add javascript to a pdf document. Although you only get access to a limited subset of the language.
If you only need the coordinates once (for instance when doing layout of the document), you can simply open it with adobe and activate the rulers/grid option to see where your mousepointer is currently located.

Error When Importing PDF into Word

I need to perform multiple edits in a word document and based on search criteria then import the relevant PDF for that specific page. The file path I dynamically create as per specific data on the various pages. The code works fine if there are less that roughly 70 PDF files to import.
Once it gets to importing after around 70 PDF files then Word starts to give warning about not being able to undo the action which is about to be performed, even though i clear the Undo History after I have imported each PDF.
Then there is also an error which comes up with regards to the importing of the PDF, see image below:
But I can still open PDF document when this error occurs, if I stop my code and try to manually import the PDF it doesnt work. Only if I save the word document and then reopen it, can I import PDF documents into the word document again.
I believe it has something to do with all the actions being performed in Word.
Unfortunately I am not able to create a demo code which will give the same outputs for this question.
Why would PDF documents give an error when importing into Word?

Viewing pdf large files online as thumbnail

I need to upload a pdf file in my website so that users can view/download it. But it is of 30MB file. So when the user clicks on that downloadable link it takes much time to download. So I was just thinking is there any other solution for this so that users can atleast view and read the contents of the file without downloading it and if required they can download it. ??
one suggestion is like onclick of downloadable link one new link will open where user can view pages individual as thumbnail and on click it will zoom and below we will give download link(as in google ebooks). But I was just wondering how this can be done?? Is it possible using jquery/ajax?? Please give your thoughts on this and any inputs will be highly appreciated.
You should be able to upload your PDF at http://view.samurajdata.se and then include the viewer for your PDF in an iframe.
If you like, you can even take the viewer link (something like http://view.samurajdata.se/psview.php?id=YOUR_PDF_ID), and get URLs for the individual pages as GIFs like http://view.samurajdata.se/rsc/YOUR_PDF_ID/tmp1.gif for page 1, tmp2.gif for page 2, etc. Then you can embed those GIFs how you want and customize the look and feel of your viewer.