I'm trying to parse a .docx (Open XML) file using Apache POI on Java. I want to be able to extract tags like these: <w:tag w:val="tag"/> from my document. The problem is that I didn't find any examples of how to do it in the internet. Is it possible to achieve something like this using Apache POI library for java or some another library?
Similiar question but in C# for reference: OpenXML tag search
Related
I am using Apache POI and having problem to find out if it´s possible to read an UserProperties which is added to an Outlook-message (.msg) using VBA. I know it can be found by using VBA (eg msg.Userproperties("outlookid")), but I want to use Apache POI instead if it´s possible.I have tried to read Chunks, CustomProperties, DocumentSummaryInformation, OutlookTextExtractor etc, etc... If I open the msg-file in an text-editor and search for an given propertyname - I find it so I know it´s there. Is there any way to read Userproperties in Apache POI - please help!
Hello Everyone m new to java and i just wanted to ask that wheather we can parse through a docx file using apache poi or tika and extract a certain pargaph with specific headings if its possible please guide me
I have a Microsoft Excel file and I need to convert it to HTML. In excel, it is pretty simple to do: there is the "Save As..." option and you select "web page (.htm)". But I need to do this from Java, using JXL. Does anyone know how to achieve this?
Take a look at either the XML or CSV classes: they are examples of how to extract data from an Excel document and do something interesting with it.
You may also want to take a look at the Apache POI project if JXL doesn't look like it's going to meet your needs.
Does anyone know if it's possible to save an Excel Spreadsheet as a jpg using java? Currently, I am reading and manipulating Excel Spreadsheets in java using Apache POI. It's working great for everything else, but I haven't been able to find an answer to this question in their documentation or online.
There are some commercial libs to do so, another solution would be to use Apache POI to render your Excel as HTML and then convert this to an image utilizing a lib like this java-html2image
I need to create spreadsheet in various file format: .xls, .xlsx, .ods, csv. I want to know how can I achieve this in Java and about available open source API.
I found some API for .xls, .xlsx but not .ods. Any information will be very helpful.
Thanks and Regards.
Have you tried the OdfToolkit? I haven't used it but it sounds like something you need.
For pdf generation you can use apache fop and altova stylevision to generate xsl-fo files. For xls - there is apache poi and jxls for templating.
For .ods I am using Simple ODF.
Update
Simple ODF is moved to Apache. New link is here.