open source OCR library that runs on windows XP - java

I'm looking for an open source OCR library that runs on windows XP. I need this to work for images and PDFs. Mostly I would like to interface this library from java . Any idea if there is anything available?
Regards.

Check Tesseract
Tesseract is probably the most accurate open source OCR engine available. Combined with the Leptonica Image Processing Library it can read a wide variety of image formats and convert them to text in over 60 languages. It was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but since then it has been improved extensively by Google. It is released under the Apache License 2.0.
Tesseract works on Linux, Windows (with VC++ Express or CygWin)
and Mac OSX
And here is a comparision table from wiki

http://roncemer.com/software-development/java-ocr/
"Java OCR is a suite of pure java libraries for image processing and character recognition."

I would take a look at the Apache Tika project and combine it with Tesseract OCR. Apache Tika manages opening and extracting content from a broad assortment of file types. And it has a very pluggable design so you can connect OCR for input and even wire it's output to Lucene for searching. And it's pure Java.

There is huge work done by Heat on Research about OCR, have a look at this

Check out Tess4J, a Java JNA wrapper for Tesseract OCR API.

Related

Java Media Framework x64bit

I've tried using JMF on a 64 bit environment I didnt find anythere 64 bit jmvfw.
Try Xuggler as Sai Ye Yan Naing Aye said in the first answer or you can also try FMJ.
From FMJ site:
FMJ is an open-source project with the goal of providing an alternative to Java Media
Framework (JMF), while remaining API-compatible with JMF. It aims to produce a single
API/Framework which can be used to capture, playback, process, and stream media across
multiple platforms.
Since FMJ is API-compatible with latest JMF, you may use existing JMF codes and run the.
There is no JMF for 64bit environment .But you can try this form oracle forum.The other alternative likes JMF is Xuggler.Try to use it.

Extract images from mp4 using java

I want to write code in java to extract the images in a mp4 file. Kindly guide me how to go about it. I am totally clueless.
Java SE has Java Media Framework (JMF) but it provides support for only a few video formats, not including MPEG-4.
Of course, you have some third-party choices to achieve your goal. One of them is Jffmpeg. It is an extension to JMF which contains support for MPEG-4.
Xuggler is another good choice with GPL/LGPL license. It works on Windows, Mac OS X and Linux.
Xuggler is an open-source third party library you can use to do this. Read through the API's. and sample code.

Trouble getting and installing the latest JAI java library

I have a couple of questions. I'm trying to install Java JAI libraries on my Eclipse on a PC so that I can create a standalone desktop application to convert TIFF files to JPEGs. I've searched and found links about how to do it but half are broken or very old and I'm not sure which one is the latest. Oracle site is no help and has links to download pages that don't exist. Any help is much appreciated.
Second question I have is will this libary work on a Mac and if so how do I install it there so that my code would work on a Mac.
Thank you!
JAI is native (written in c or c++). That means that you have to take care on which platform your application is running, is it 32bit or 64bit, etc. for each platform you need separate JAI library.
Check if java's javax.imageio.ImageIO utility supports tiff. If it does, you can simply read your tiff, then save it as jpeg. ... again i am not sure if this will work, but it is worth trying

SVG processing under Java on Android and desktop

I am trying to write a Java application that will generate SVG image based on XML file. The application should also be able to show SVG file. My application should run on Android platform and desktop PCs. I read about Swing + Batik but afaik it will not work on Android. What is the best library to achieve this ?
This article in Code Project deals with the subject. Two approaches are presented:
android-libsvg library.
Anti Grain Geometry engine.
Both solutions have dependencies on native code library, so you would need JNI.
There is also svg4mobile project, which only uses Java.
per this thread: libsvg ported
libsvg has been ported to android..see thread for details. You need a crystax form of NDK, ie exceptions enabled..build/install doc here at:build-install-doc
Also try this open-source library, Apache 2.0 license:
SVG-Android
Performance is good as the actual drawing is handled natively by an android.graphics.Picture object.

Video Thumbnails in Java

I want to generate a thumbnail preview of videos in Java. I'm mostly JMF and video manipulation alienated.
Is there an easy way to do it?
What about codecs? Will I have to deal with it?
Any video type is suported? (including Quicktime)
Well, since you're not stuck with JMF, have you considered Xuggler? Xuggler is a Java API that uses FFmpeg under the covers to do all video decoding and encoding. It's free and LGPL licensed.
In fact, we have a tutorial that shows How to Make Thumbnails of an Existing File
There seems to be a few examples out there that are far better than what I was going to send you.
See http://krishnabhargav.blogspot.com/2008/02/processing-videos-in-java.html.
I'd agree with Stu, however. If you can find a way to get what you want using some command-line tools (and run them using Commons-Exec), you might have a better overall solution than depending on what is essentially the Sanskrit of Java extensions.
Are you sure that JMF is right for you? Unfortunately, it is not in particularly good shape. Unless you are already committed to JMF, you very well may want to investigate alternatives. Wikipedia has a decent overview at en.wikipedia.org/wiki/Java_Media_Framework
Many JMF developers have complained that it supports few codecs and formats in modern use. Its all-Java version, for example, cannot play MPEG-2, MPEG-4, Windows Media, RealMedia, most QuickTime movies, Flash content newer than Flash 2, and needs a plug-in to play the ubiquitous MP3 format. While the performance packs offer the ability to use the native platform's media library, they're only offered for Linux, Solaris and Windows. Furthermore, Windows-based JMF developers can unwittingly think JMF provides support for more formats than it does, and be surprised when their application is unable to play those formats on other platforms.
Another knock against JMF is Sun's seeming abandonment of it. The API has not been touched since 1999, and the last news item on JMF's home page was posted in November 2004.
While JMF is built for extensibility, there are few such third-party extensions.
Furthermore, editing functionality in JMF is effectively non-existent, which makes a wide range of potential applications impractical.
My own server-side app shells out to FFmpeg to do the encoding. I'm 98.42% sure FFmpeg does snapshots, too. (It is an all singing, all dancing beast of a program. The command line options alone could fill a book.)
Check it out: ffmpeg.mplayerhq.hu
There is a relatively newer option called JThumbnailer that you find here: https://github.com/makbn/JThumbnail
JThumbnail is a Java library for creating Thumbnails of common types
of file including .doc, .docx, .pdf , .mp4 and etc. full list

Categories