Is there a way to use docx4j library to convert doc format documents? - doc

I want to read doc format using docx4j. wondering if there's away or a workaround

You can use LibreOffice to convert binary doc to docx, and invoke it via JODConverter.
See open-rtf-with-docx4j for more on JODConverter.

Related

How to convert text pdf to image pdf using ghostscript

I need to convert text in pdf file to images, so users cannot copy it from the pdf etc.
This should be equivalent to converting the entire pdf to a set of images and then merging them to one single document. I did so, but it seems slow, is there any way to do it with ghostscipt options?
Welp, looks like I only need to specify option -dNoOutputFonts.

Is there way to convert docx to image-only pdf with convertapi?

I'm trying to find a way to convert docx to image-only pdf, so I could put a watermark on the pdf document right after conversion.
I've looked through convertapi documentation and I can't find any available options.
First convert the .docx file to a .jpg:
https://www.convertapi.com/docx-to-jpg
then feed the .jpg to
https://www.convertapi.com/jpg-to-pdf
to generate .pdf
You can chain the API calls to get your desired output.

How to generate PDF file using X++?

Can I create simple pdf file in x++? In this pdf I would like to have for example select from one table or simple static text.
MorphX reports can be saved to PDF by using the proper print settings beforehand.
SSRS reports can do this also using similar tricks.
Another way is to generate RTF, then let Word do the PDF creation. Silly, but maybe the PDF is smaller or better looking.
It is possible, but not simple, to generate PDF directly by using third party .Net components.
Some weeks ago, I used the Evo HTML to PDF library http://www.evopdf.com/ to convert simple HTML templates to PDF and it worked great. It can convert plain text as well, so maybe it could be useful for you.
Natively, AX hasn't anything to create PDF files.

How to convert marathi data from pdf to excel in proper format

I am converting Marathi data from PDF to excel or word but it is not getting proper format.
I have copied some data from PDF and pasted in word document but it was not getting proper format.
e.g. प्रविण सुधाकर शिरवाडकर this line is in PDF
but when i copied and pasted in word it has been getting
-प्रववर् सुधाकर शिरवाडकर
what should i do for this?
anyone please help me.
thank you in advance
There seem to be problems in the way PDF stores unicode devnagri text. Try this alternative route: convert your PDF to an image. Can use an online tool or downloaded, or if on linux use this command in terminal:
for f in *.pdf; do convert -density 200 "$f" "${f}_200dpi.jpg"; done
change the density from 200 to other as per need. Each page from your document should be converted into an image file. For a windows tool, try https://www.pdfill.com/pdf_tools_free.html
Then, go to http://www.i2ocr.com/free-online-hindi-ocr, upload the image and convert. That uses OCR (optical character recognition).
check the font in your PDF and try making it available to the word document.
I think you dont have perticular fonts which are used in PDF
In Adobe Reader -- -- File menu > Properties > Fonts tab gives you a list of all fonts used in the document.

How to convert PDF file to .doc format in Objective-C?

right now i am working on one ipad application where i am giving facility of opening the pdf file and also to customize it,now i want to add one functionality like i want to convert that pdf file in .doc format.
I researched but did not get any way around. Can anybody help me out?
Thanking you in advance.
I wrote an article on PDF to text conversion issues. If you look at some of the existing PDF to Word conversion tools (ie BCL) you will see what is realistically possible with a lot of work.
It’s not possible to convert a generic PDF back into a text format. I guess you could render the PDF into images and create a DOC from those, but that doesn’t sound very useful.