Do I need to play video to generate transcription [closed] - java

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 5 years ago.
Improve this question
I'm making a thesis about Speech Recognition for generating subtitle out of the video itself. The concept is I'm planning to make a video player and integrate the Speech Recognition on that. So if I run the system and play the video until it ends, the system will generate another copy of that video file, but this time subtitle is already included or integrated on that. Is this concept possible? If so, can you please advise me on how can I do it, or tell me whats the best SR API or some sort out there? I'm familiar in java btw.

I also had a similar sort of Idea in my mind in my university years.
I think your approach should be work. Play the Movie first time and then generate the subtitle and export it to a ".srt" file. Because you cant do it in Real-time Processing due to the several reasons like
Sentences meanings can be change from words to word
Processing or analysing time
etc.
Disadvantage of your approach is
People won't like to play the whole movie to generate the subtitle file.
So I think it would be better to extract the Sound track (audio file), analyzing the sound track and generate the Subtitle file without Playing the Movie
In here you wont able to do the SPEECH-TO-TEXT conversion.
You have to find the way to analyse the sound waves styles and extract the data.
Good Luck

Related

voice recognition constantly in background android [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 2 years ago.
Improve this question
FfhggtffgtfgtgftI want develop application that whenever it recognizes a keyword it does something. it needs to be in listening mode all the time, in backgeound too.
I was exposed to this and this. I treid run it but it is not work when I am speaking.
actually I read it still doesn't support my native language. is that the reason?
I want to know how it works? does it is doing speach to text and saved it in assets files? does it is run in background? does it is used AI models? how it behaves when two apps need mic resource in parallel? noises? does it is work with Neural Networks API? how can I start developing such a thing?
thanks!
It is great you tried Vosk offline speech recognition on Android, here are some answers to your questions:
actually I read it still doesn't support my native language.
If you are about Hebrew, we might support it in the future, and you can build it yourself.
is that the reason?
You didn't provide enough information to answer this, please explain a bit more what is "it is not work"
I want to know how it works?
Extensive documentation on speech recognition is available on lectures, courses and books. You can find some introduction here for example: https://www.youtube.com/watch?v=q67z7PTGRi8
does it is doing speech to text and saved it in assets files?
It does speech to text, but it doesn't save results into assets, it just displays them. You can not modify assets, they are static.
does it is run in background?
Yes
does it is used AI models?
Sure
how it behaves when two apps need mic resource in parallel?
In android it is not possible to record audio from two apps in parallel, second one will be blocked.
noises?
It is robust to noises.
does it is work with Neural Networks API?
No, it is portable
how can I start developing such a thing?
Get some basic understanding and start writing the code. If you have further questions you can ask them in the Telegram chat

is it posible to compare image at realtime using camera with stored images in db, android java [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 7 years ago.
Improve this question
I am doing image processing, but stuck in. can we implement realtime image processing, like the camera captures videos and also compares the object with stored images in data base, need algorithm for implementing this idea, any tutorial, link or source will be appreciated.
I would recomend using OpenCv for android. It allows a simple and very well working live processing. This library also comes with a wide variety of imageprocessing implementations. It is easy to convert the image-Object (Mat) to Bitmaps and so Files! OpenCv allows LowLevelImageProcessing which could be usefull to you to compere these images. It maybe seems a bit difficult to add this library to your project at first but, it is worth it!

Downloading mp3 from youtube and other music-websites programatically Java [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 5 years ago.
Improve this question
I need to let my app's client paste youtube (or any other website but that later) URL and download just music (mp3) from the specified video.
I was looking for the solution on stackoverflow and google and found that it is in spite of their law. We just cant dowload anything from youtube. In case of doing it, youtube will block our service.
However, I know that there are still available sites, on which we can convert the video to mp3 easily, for example: http://2conv.com And these sites exist in web for really long time.
So how did they avoid youtube-law? How can I download videos from youtube using Java? (I dont ask for code, but just the way how to do this, what to type in google to find the answer)
Thanks for reading! I really need the answer, because downloading from youtube has to be the core-feature of my app
TO BE CLEAR AND NOT DOWN-VOTED:
I am not going to violate youtube terms of service, just asking if there is any way to reconcile my needs with youtube terms?
It's possible, just use Pafy script (Python), this API based on youtube-dl.
You can see more youtube-dl support sites at:
https://rg3.github.io/youtube-dl/supportedsites.html
Unfortunately not possible. Video files are integrated with their audio files. so if you want to extract audio from any video file you should download the whole thing and then extract its audio to mp3 yourself. Every service that does so first fetches the video and then does as mentioned. Make sure not to violate any terms of service.

Android Signal Processing [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 2 years ago.
Improve this question
I am currently working on a project that involves the use of EEG signals on an android mobile phone.
I am deciding if I should make my own signal processing library for android or not because I can't seem to find any online.
Does anybody know of a library that I can use or would it be easier and faster to make my own?
Thanks
I'm working on a similar project. I am using the NeuroSky MindSet EEG headset and Processing IDE with the Ketai library. I could not get the NeuroSky Android API to work with either of my Android 2.3.4 or 4.0.1 phones. When digging into why it didn't work, I got lost in their API's library and decided it would probably just be easier to read the data myself. I modified the Ketai bluetooth program to read in the raw data, but did it in sort of a haphazard way (no checksum was performed). It's quite a rough hack, but I got the data to draw on the screen and have not worked on it for a couple of weeks. After I comment the code some more, I could put my code up on github - it'd be nice to get some collaboration.
By the way, if you did not intend for this to be for the Processing IDE of processing.org, you're quite lucky. As Arcymag pointed out, the keyword "processing" might trigger a response from people like myself who receive emails for processing.org. In any case, the Processing IDE has an Android mode that allows you to program Android under its IDE. If you want to develop your program in say, the Eclipse IDE, you can export your project and take all of the power of Processing with you as a library. After exporting, your code will be full Java. There are only some minor differences between Java "proper" and how Processing presents Java anyhow (some aesthetic changes). I prefer to work with the power of Processing because I am not actually a "proper" programmer, as my background is in physics, and Processing makes programming a joy. (There are several questions regarding the differences between Java proper and Processing on stackoverflow.)
This is my first post on stackoverflow. I have been looking for a way to give back since this site has saved me countless hours on this project alone. If you feel I deserve an upvote, you'd make one noob very happy. I can do things such as: post more than two hyperlinks and upvote answers.

Analyzing Music in Java [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 4 years ago.
Improve this question
Greetings!
I was wondering if it's possible to make music-based games in Java? I've played games like Beat Hazzard and Audiosurf and stuff like that and I was considering making one like that - that dynamically loads songs the user selects and finds pieces of information (like bass notes) and does something in return.
I don't even know where to start, but I have found ways to play music (through certain classes others provide or the Java Sound API). Anybody ever done anything like this before? Any help is appreciated.
Of course it is. I think the term you should be Googling for is Java Signal Processing and analysis.
There is a related SO post on this as well as numerous other Java libraries and source examples (see the Google search results from above.) Here is a page with applets that perform a variety of signal analysis. Might not be exactly what you need but should be a start. And MARF, from the SO post mentioned above, looks VERY promising!

Categories