Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 3 years ago.
Improve this question
We are trying to View following files on Autodesk Forge Viewer in our
application.
DWF
DWFX
DWG
DXF
NWD
RVT
NWC
PDF
RCP
GBXML
IFC
As per Autodesk documentation
-(https://developer.autodesk.com/en/docs/model-derivative/v2/overview/supported-translations/)
these files format are supported to viewer.
But, on (https://viewer.autodesk.com/) this site some files are getting
Format error such as PDF, RCP.
So my questions are:
Which file format is supporting to viewer
Can we open PDF files on viewer or it require any specific PDF file to
launch on viewer.
The files that the Viewer support are in the link you provided from the Model Derivative API. so PDF and RCP are supported. The A360 Viewer does support just to a number of file types, you can see the list from that website here
As you can see those 2 types are not mentioned there, but it doesn't mean they are not supported from the Forge platform. You will have to use the Model Derivative API in order to translate those file types and use the Viewer API in order to visualize them.
Related
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 5 years ago.
Improve this question
Is there a way to convert a map pdf file to a kml file? How can I convert it or is there any guidelines to do so?
Apache PDFBox is an open source Java library that can parse PDF document and extract content. The API includes methods to extract text, metadata, and embedded files from PDF files as well create PDF files from scratch. Apache PDFBox also includes several command-line utilities. One command-line tool called pdfbox-app has options to extract all text or images from PDF files.
There is also Apache Tika library which focuses solely on text extraction from a variety of file formats including PDF.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
I ve a Asp.Net core projet. This project reference another library in which I should extract informations from a pdf. I was using itextsharp, but it seems not compatible with .net core.
Any idea how I could extract Text from a pdf file ?
If you want to write your own pdf parser, you will need to read up on all the different versions of the pdf file format. They are all officially documented here.
Text extraction from PDF is a complex task. I would not recommend you to do this without a library.
For an Asp.Net Core library I can recommend you Docotic.Pdf library (I work for the vendor). The library supports .NET Standard and can be used to extract not only text but paths and images too.
Here are some samples / tutorials:
Extract text
Extract text by words
Extract text, paths and images
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 6 years ago.
Improve this question
Setup
XSL processor: Saxon
FO processor / PDF renderer: Antenna House Formatter V6.2
Is it possible to embed a 3D PDF, XVL or 3DU via a FO transformation / PDF rendering into the current publication?
The source data would have several XML, XVL (whatever) 3D data nodes that have to be processed into the generated PDF.
Thanks in advance.
You can embed a 3D PDF using AH Formatter V6.2 or V6.3. Use fo:external-graphic to refer to the PDF just as you would for any other external image.
In the AH Formatter GUI, you can select to embed 3D annotations in the 'PDF Option Setting' dialog box (see https://www.antennahouse.com/product/ahf60/docs/ahf-gui.html#others-page). On the AHFCmd (or run.sh on Linux/Unix) command line, you may need to specify -p3da (see https://www.antennahouse.com/product/ahf60/docs/ahf-xslcmd.html#keyIDAR1YD) and/or enable 3D annotations in the Option Setting File (see https://www.antennahouse.com/product/ahf60/docs/ahf-optset.html#keyIDAVUFU).
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
i am a student and working on my college project. My problem is that i want to open documents like doc, docx, pdf, ppt, txt, xls and some office documents on the browser which are located at my server. it means I dont want to download the document first to see the matter in it. All these document should be opened inside my web browser directly.
I know about google document viewer which provides cloud services for that, but their are some limitations for their use. there are many other cloude services , but they are not fullfilling my need. Is there any api that can help me? By the way my project is in J2EE and server is tomcat, database is Mysql. and my plateform is Windows 7.
If i am able to edit those document from client side , then it will be an extra help to my project.
ViewerJS will solve your problem. It is the easiest way to use presentations, spreadsheets, PDF's and other documents on your website or blog without any external dependencies.
You just need to include the viewer.js plugin inside your project.
It is user friendly.
You can easily view your documents from server side to the browser.
Can support for .ppt files, .pdf files , .doc files and open office files.
Have a look at here http://viewerjs.org/
Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 5 years ago.
Improve this question
I need to extract the "articles" from this magazine which has both text and images. The image content has to be placed separately, the text extracted (as far as possible) and placed separately.
How do i go about doing this? Is there a commercial service / api that does this already? The input to the program/service will just be the file.
Eg of input: http://edition.pagesuite-professional.co.uk/pdfspool/rQBvRbttuPUWUoJlU6dBVSRnIlE=.pdf
(the actual file would be a normal pdf-file, not a seured one)
Docotic.Pdf library can extract images and text from PDF files for you.
Here are couple of samples for your task:
Extract text from PDFs
Extract images from a PDF
Extracted images can be saved as JPEG and TIFFs. You can extract text from each page or from the whole document. And you can extract text chunks with their coordinates.
Disclaimer: I work for Bit Miracle, vendor of the library.
Try this one:
http://asp.syncfusion.com/sfaspnetsamplebrowser/9.1.0.20/Web/Pdf.Web/samples/4.0/Importing/TextExtraction/CS/Default.aspx?args=7
The same component has the image-extraction feature also.
You could make a try!!
If you can afford a commercial option, Amyuni PDF Creator will allow you to enumerate all components inside the pdf file (text, image, etc), you will be able to extract them as independent objects and you can create new PDF files with them.
You may use Aspose.Pdf.Kit to extract text and images separately from a PDF file. The API is quite simple. You can also find samples, tutorials and support on Aspose website.
Note: I'm working as Developer Evangelist at Aspose.