Whisper v3 Automatic Speech Recognition (ASR) for Java

For an internal product prototype, we traced OpenAI’s Whisper v3 model from Hugging Face and made it usable from Java via DJL (Deep Java Library).

We thought this might also interest other Java developers, for example for using Whisper v3 in a Spring application. The current version runs only on GPU, but we hope to release a version that works on both GPU and CPU soon.

Whisper v3 is a deep learning model for speech-to-text transcription, also known as Automatic Speech Recognition (ASR) or Speech-To-Text (STT). It handles almost 100 languages and can also translate speech into English on the fly.

The repository with installation instructions and an example can be found here: https://github.com/DIVISIO-AI/whisper-java