I have a Flash file which displays a PDF file as a magazine, because the magazine is in Hebrew Google doesn't read it good.
Is there any why to display raw Text instead of the Flash for the search engine crawlers?
I heard you can do that using SWFObject, is that correct? and if so, how?
I need it to be SEO friendly...
Thanks in advance :)
SWFObject is still flash, so you don't really change the situation by using that.
recreate the pdf file in html; replace the .html file for your .pdf file. although there are extremely better ways to do this, if you're simply loading content via Ajax.
Related
We are converting large PNG images to JPEG using ImageResizer (https://imageresizing.net/). It's works great with ?format=jpg and they are delivered with the correct content type and all.
When downloading these images they are named image.jpg in all browsers except Safari. Here it asks the user if they want to go with .jpg or with .png. It also saves the images as image.png.jpg which is confusing for some users.
I read something about Content-Disposition header, but I think that's for direct download, not for right click and choosing Save as. I also don't know if it would be possible to add it without creating some kind of middle layer and probably lose performance.
Ideas? Thanks <3
I don't think there is a solution to this, as it is in the browser's control.
I have data in the form of pdfs and i want to convert it into text. I want to remove the images, header and footer, than the data will be left only in the form of multi-line tables, can you please suggest the best way to convert it? I tried tabula and apache tika but the results are not desired.
As you probably know, text in PDFs is arranged by x/y coordinates on the page. Headers are not stored/identified as such, like they are in MSWord, HTML etc.
Good to hear that you’ve tried tabula: https://github.com/tabulapdf/tabula . I’m sorry that it didn’t work for you.
If you’re working with journal articles, you might have luck with grobid https://wiki.apache.org/tika/GrobidJournalParser
To extract text by locations see: https://stackoverflow.com/a/35299074
http://pdftotext.com/
this works but probably not the way you where looking for
I am searching for a product that converts HTML to PDF in VB. I have searched alot for the products, but most of them are very expensive or doesn't work very well. Can anyone suggest me of a good quality, working product?
Thanks
Try using mPDF. mPDF is a PHP class which generates PDF files from UTF-8 encoded HTML. Its free and works very good.
http://www.mpdf1.com/mpdf/index.php
See WKHTMLPDF.
I have checked quality and used this tool. This is the good one and Opensource. This is shell utility to convert html to pdf using the webkit rendering engine, and qt.
I use htmldoc
Don't know if it fits your definition of "works very well", but for me, it does the trick.
A different approach is to install a print-to-pdf printer driver (such as freepdf), and print from the browser.
I want to create html file by which i could read any PDF file by providing the source of that PDF file. How can i do this by using only html5?
For example i want read a pdf file which is available in C drive so scr="http://virdir/mypdf.pdf".
I want something like this.
You want to use the developing HTML 5 File API. Mozilla has a good explanation, and you can also refer directly to the spec.
Since PDF is a binary format, you will probably want to use FileReader.readAsBinaryString().
Parsing and rendering (e.g. to a canvas) a PDF in JavaScript is possible, but it would be very challenging.
Here is an open source pdf reader written in javascript.
https://github.com/mozilla/pdf.js
There are APIs available to play with. It comes built into Firefox browser and has good support from Mozilla community.
I want to convert html to pdf dynamically in objective c.
what is the best way to convert it, so that it maintains its look same as html.
Thanks in advance.
Prasad.
UIGraphicsBeginPDFContextToFile will help you to create a PDF file which shows your web content. Read through the documentation and create your own solution or just follow the tutorials I found:
Convert Html or UIWebView to pdf in iPhone or iPad
Making a PDF from a UIWebView
I have used this Haru open source library, it is quite simple and useful for creating pdfs http://libharu.org/wiki/Main_Page
Objective-C is a programming language, it doesn’t know anything about HTML or PDF per se. How to do this depends on the framework you’re using. I’m assuming here you’re using Cocoa on OS X.
There you can load your HTML into a WebView provided by WebKit and then use the NSView method dataWithPDFInsideRect: to render it as PDF. I haven’t tested this, but maybe you’ll have to add the web view to a window before you can query the PDF data.