Enterprise Tensorflow -Python vs. Java

This is the first part of a series of posts about Java and Tensorflow interop. It is a more extensive version of my talk at ML Conference 2017 in Berlin

Why not just use Python for everything?

Tensorflow and Python go hand in hand, so when we think about using Tensorflow, we think of doing so in Python. The vast majority of the Tensorflow API is only available in Python, so when we develop our models and train them, there is pretty much no alternative to using Python (proof-of-concept or esoteric approaches not withstanding). Depending on the environment where we actually want to use our model, however, other languages might be better suited to run our model, especially when we cannot or do not want to install a Python runtime. (For reasons where a Python based approach might not be warranted, see below.) In cases where Python is not readily available, we need to remember that the actual implementation of the computation graph in Tensorflow is written in C++, i.e. it is a native binary. Hence, it can be made to run within most other languages. For many languages, Google already provides such bridging for running the graph. If the Tensorflow wrappers are not an option, there is still the possibility of using the parameterization we have trained in Tensorflow (e.g. our neural network weights) in another framework that is more readily available. In the rarest of cases where neither Tensorflow nor another framework is available for inference, it helps that inference is often much easier implemented than training, so we even have the possibility, to implement inference ourselves and just use the learned parameters like we would when using another ML Framework. In later posts we well look at each of these possibilities in turn.

When might Python not be an option?

The platform on which we want to run our model might constrain us:

Mobile: You cannot / should not embed a complete python environment in your native App
Native desktop app: A sleek small installer is wanted.
Embedded: Maybe there are not enough resources to add a Python environment (though Python can run on many “small” platforms)

The project might put constraints on your options:

Customer requirements (whether justified or not) might not allow you to use Python.
Political reasons (you have to get the right people on board) - this is especially a case in large corporations.
Time consuming certification / approval processes for each new component not already in use - again, a typical case for large corporations
Key people in the decision process are against it - and all arguments fail
Team / organizational concerns: the team building the software that runs the model is different from the developers of the model, the “Python people” might not be around for the whole lifetime of the product

There are other benefits that may not forbid usage of Python, but still make a non-Python solution more attractive in some cases:

keeping dependencies to a minimum
keeping the build simple
keeping the installer simple / the deployment process simple
keeping updates simple
keeping artifact size to a minimum
you may not want to use multiple programming languages in your project
you do not like Python (preferences do differ, after all)

Why use Java instead?

The above arguments apply for many alternative environments to Python. In the rest of this post and the following series we will focus on one alternative specifically, Java for Server Side applications, especially in conjunction with “Enterprise” frameworks. In direct response to the arguments above here are reasons why Java sometimes might be a better alternative (as software development is a complicated affair, each case must of course be judged individually):

Platform constraints:

Except for Android, where we can regard Java as a first class citizen, platform constraints that apply to Python will generally also apply to Java (we need a separate runtime environment etc.). As we do focus on server side Java, we do not regard the Android case here. Also Google is heavily invested in this area (see Tensorflow Light), so there is plenty of discussion of this scenario elsewhere. So generally, Java will not help you with platform constraints. But as we focus on server side development, luckily the platform is rarely the problem for us. Servers these days have plenty RAM, Storage and CPU power.

Project constraints:

This is the area where Java often shines. The magic incantation “We can offer you a 100% Java Enterprise Solution” opens the gates to easy project approval, larger budgets and fast acceptance. (Pro Tip: Wear a nice suit while you say it). On a more serious note, Java is one of the most widely used frameworks in the business world, especially for large companies like insurances, banks, broadcasting and industry. In those environments, a tried-and-true technology with a vibrant ecosystem, commercial support and a large potential workforce will (almost) always beat the next cool RAD technology. Hence if there is a policy that enforces the use of certain technologies, chances are good that Java (and certain Java frameworks) are among them. One also often encounters entrenched teams that have taken care of an IT project for years and are hesitant to change and might torpedo your new and cool AI projects with FUD. Being able to play the Java Enterprise card may just be the kicker you need to make your hand. Being able to use the existing team to keep the new and shiny AI solution running once it is developed, even if the team that built it is not around anymore, will let managment sleep easy and make your work a lot easier.

Other Benefits

So far we have mainly argued why a Java based integration might be a good idea if existing infrastructure / teams / organizations are already heavily invested in the Java ecosystem. But sometimes Java might even make sense for a clean slate project. What follows are some points highlighting the strength of Java in general and for a server side application in particular. As always, these opinions come with a dose of individual bias and do not claim that this is the silver bullet solving all problems. There is always more than one way to do it. Java has - as far as the “coolness” is concerned - fallen somewhat out of favor in the last years. One of the reasons being that the cathedral-building over-engineering enterprise approach has often proven to be inefficient and unnecessarily complex. But as these things often do, programming fashions come and go and in response to that (well-based) criticism many things have changed in Java land that have improved the situation for your classical Java Server Enterprise Project. Java is:

Fast. Surprisingly so, in my experience often beating hand optimized C code thanks to the JIT Comiler. While computing power is cheap, being an order of magnitude more efficient than the last cool JavaScript, Ruby etc. framework might reduce your server cost significantly.
Mature. Especially many of the Apache libraries have matured over years, and have an impressive track record regarding stability. Often new, fancy frameworks look awesome at first glance but fail when real-world edge cases come along.
Not as bloated as it used to be. New frameworks also allow for a RAD approach more commonly found in Ruby on Rails or Django - we do not spend one week writing XML for a “Hello World” server application anymore. Also Java 8 finally brings us FP and libraries like guava allow for well readable concise code. While we will never be es elegant as Haskell things have gotten a lot better.
Well supported. The documentation of the language, tooling and libraries is extremely extensive.
Statically typed. If this is a pro or a con of course very much depends on your personal programming style.

As a sidenode as of now (2017) my stack recommendation for new Java server projects would be:

Java 8
A tried and true SQL database (e.g. PostgreSQL)
JOOQ for the persistence layer
Ninja as web framework (CXF, Spring etc. are not that bad either)
any utility libraries that make your life easier, especially Guava

Using this stack, Java development is actually fun again - try it! And of course lastly many of the above points do not only apply to Java but to many other JVM based languages. All the points we will present in the following posts will apply to most JVM based languages that allow easy Java interop, like Kotlin, Scala, Clojure and others.

Where to go from here?

If after this discussion you end up using Java for the integration of a Tensorflow model into a production environment there remains the question: how to best do this? This again depends on a number of factors. In the next few posts we will discuss the various aspects of such an integration. Specifically we will cover:

How to get the parameterization (or the whole graph) from your Training code into your server
How to load that data and run the inference in the server
How to design your client side facing server interface for various inference scenarios
How to set up a build chain to automize this integration to varying degrees
How to organize your projects

25 Feb 2025
Working with Ollama, Part 2
In the first part of our article on Ollama, we demonstrated how to install Ollama and local models. In this second part, we cover advanced usage of Ollama by customizing modelfiles and integrating with the AnythingLLM frontend. We show how these tools make managing and utilizing local AI models more efficient.
weiterlesen
24 Feb 2025
Working with Ollama, Part 1
In the first installment of our two-part series “Working with Ollama,” we introduce the open-source, cross-platform solution Ollama, which simplifies both the management and usage of AI models.
weiterlesen
08 Apr 2024
Whisper 3 Large for JAVA
For an internal product prototype we have traced OpenAI’s Whisper 3 model from Huggingface and made it usable under JAVA via DJL.
weiterlesen
14 Jun 2023
ChatGPT for Teams: Privacy-Compliant Use in the Workplace
In today’s digital business world, AI-powered communication platforms like ChatGPT are essential for tasks such as answering complex code questions or creating top-notch texts for offers. However, in companies dealing with sensitive customer data, using ChatGPT can lead to a data protection dilemma. While ChatGPT offers an option to prevent the use of chat conversations for training purposes, it comes with certain limitations. Moreover, as of June 2023, there is no way to manage multiple team members or users through a company account. Each user must register individually and use their own email, phone number, and credit card. If you want to use ChatGPT+, for example, you cannot pay for all users with one credit card. Individual invoices also end up with individual users, creating an organizational and accounting nightmare. We at DIVISO have also grappled with this issue and went in search of a solution.
weiterlesen
25 Oct 2021
Git as a management tool for training data and experiments in ML
In this part of the series of articles on MLOps, we start with information that will be familiar to most of you: With the basics of Git. However, to give a different perspective on the well-known tool, these basics provide the basis to highlight the function and benefits of Git for machine learning (ML) and the difference in managing training data.
weiterlesen
02 Aug 2021
MLOps: Establishment and operation of an AI
With Machine Learning Operations (MLOps) we ensure that data is efficiently and strategically integrated into business processes through regular and automated training, thus contributing to increased revenue. The challenge is to establish and maintain these automated processes.
weiterlesen
31 Aug 2020
Types of Artificial Neural Networks
In our real-world example, we used a “feed-forward neural network” to recognise handwritten numbers. This is probably the most basic form of a NN. In reality, however, there are hundreds of types of mathematical formulas that are used – beyond addition and multiplication – to compute steps in a neural network, many different ways to arrange the layers, and many mathematical approaches to train the network.
weiterlesen
17 Jul 2020
Amazon DJL - a new DL framework for Java
Developers who wanted to explore neural networks and deep learning using the JVM, and especially Java, had little choice so far. Those who wanted to focus exclusively on Java could not get around DL4J until now. If it had to be the JVM, but not necessarily Java, the MXNet Scala Frontend was also an option. Finally, if a little Python didn’t scare you, you could try a hybrid solution, combining TensorFlow and Java just like we already explained in previous articles.
weiterlesen
29 Jun 2020
NLP, NLU and NLG: AI and text
So far, we have generally steered clear of the areas of text comprehension and text generation by ML in our practical examples for the basic understanding of AI. For good reason, we have focused primarily on two types of problems: classification of images and prediction of numerical values.
weiterlesen
23 Jun 2020
Neural networks - The five most common mistakes
AI and especially Neural Networks or Deep Learning have been the technological hype topic for some years now. However, since the subject is quite abstract – one could say it is uncharted territory for most people – we want to clear up some mistakes that we often encounter in our work.
weiterlesen
02 Jun 2020
What are Neural Networks and how do they work?
In our past articles we mainly covered the basics of current AI research and tried to shed some light on them in a way that is understandable for non-IT scientists. We are now proceeding to the probably “hottest” current AI topic: Neural Networks (NN).
weiterlesen
12 May 2020
Deep Java Learning Introduction - Part 1: NDManager & NDArray
After our first presentation of Amazon’s new Deep Learning Framework for Java, DJL, we now want to introduce the basics of Deep Learning under Java with DJL step by step in a series of beginner posts. This is not about quickly copying code snippets, but about really understanding the framework and the concepts.
weiterlesen
11 May 2020
Deep Fakes - How to spot faked Images
A (fairly) new kind of neural networks, so-called Generative Adversarial Networks or GANs, are nowadays capable of generating deceptively real images of people that do not actually exist. These fake images are indistinguishable from real photos at first glance. Fortunately, you might still uncover them if you look closely – if you know what to look for!
weiterlesen
28 Jun 2019
Recap: ML Conference 2019 in Munich
On 17.06. another round of the semi annual ML Conference started in Munich. As usual, it started with a day-long workshop with joint live coding, giving the participants an approachable introduction into Machine Learning and Deep Learning.
weiterlesen
24 May 2019
Understanding AI - Part 5: Supervised & Unsupervised Learning in ML
In the previous article we introduced the basic concepts of Machine Learning and how the training of an ML model works, using a simple but practical algorithm. Next, we want to take a closer look at the different types of Machine Learning.
weiterlesen
14 May 2019
BGL symposium 2019 - lecture 'AI and Magic'
“Any sufficiently advanced technology is indistinguishable from magic.” – Arthur C. Clarke JAX 2019 is barely over, but Christoph is already on the podium for the next talk. At the symposium of the BLG (Federal Association of Industrial Photographic Laboratories), his lecture will cover “AI and Magic – How does Artificial Intelligence work?
weiterlesen
29 Apr 2019
Jax 2019 Recap
JAX 2019 is approaching and once again Christoph is contributing two sessions. This year he’s focussing on Neural Networks and explains how to use TensorFlow-Training while working with JVM.
weiterlesen
25 Apr 2019
Understanding AI - Part 4: The basics of Machine Learning
After shedding some light onto Symbolic AI in the previous article, we’re now moving on to take a closer look at Machine Learning (ML). When it comes to Symbolic AI, breaking down a problem as minutely as possible is key for successfully solving it.
weiterlesen
08 Apr 2019
Understanding AI - Part 3: Methods of symbolic AI
In the previous article we added two distinctions to our initial definition of AI: On the one hand we distinguish between strong and weak AI (Terminator & Science Fiction vs. the scientific status quo). Also we pointed out the difference between symbolic AI and Machine Learning.
weiterlesen
21 Mar 2019
Understanding AI - Part 2: Symbolic AI, Neural Networks and Deep Learning
Artificial Intelligence (AI) is as old as computer science itself. Calculations, logical deductions, complex assignments… all this was once restricted to humans, until computers came forth.
weiterlesen
07 Mar 2019
Understanding AI - Part 1: What is AI?
From household help to doomsday scenario - there’s hardly a topic where public perception, state of research and reality seem so incongruent as with artificial intelligence. Reason enough to shed some light onto this subject with a series of articles.
weiterlesen
06 Aug 2018
DL4J Workshop at the ML Summit in Berlin
On October 1st and 2nd the first ML Summit takes place in Berlin. In 12 workshops in three parallel tracks, experts impart practical knowledge on the topics Applications for Business, Machine Learning Basics & Tools and Specialized Topics.
weiterlesen
23 Apr 2018
Jax 2018 - Talks about DL4J and more
Christoph will give two talks about Java and Machine Learning at JAX 2018
weiterlesen
29 Jan 2018
Enterprise TensorFlow 4 - Executing a TensorFlow Session in Java
A TensorFlow Session can be executed in Java in the same way as in Python. This post shows how.
weiterlesen
23 Jan 2018
Enterprise TensorFlow 3 - Loading a SavedModel in Java
Part 3 in the series about Java / TensorFlow Interoperability, showing how to load a TensorFlow SavedModel in Java
weiterlesen
22 Jan 2018
Enterprise TensorFlow 2 - Saving a trained model
Part 2 in the series about Java / TensorFlow Interoperability, discussing how to save a model so it can be reused in a different environment.
weiterlesen
11 Jan 2018
TensorFlow and Java - An interview with entwickler.de
Our CTO was interviewed about TensorFlow / Java Interoperability while at ML Conference 2017 in Berlin.
weiterlesen
08 Jan 2018
Enterprise Tensorflow: Code Examples
Overview over the example projects for TensorFlow / Java integration
weiterlesen
15 Nov 2017
ML Conference 2017 in Berlin
An announcement for my presentation at the ML Conference 2017 in Berlin
weiterlesen

Enterprise Tensorflow -Python vs. Java

Why not just use Python for everything?

When might Python not be an option?

Why use Java instead?

Platform constraints:

Project constraints:

Other Benefits

Where to go from here?

Working with Ollama, Part 2

Working with Ollama, Part 1

Whisper 3 Large for JAVA

ChatGPT for Teams: Privacy-Compliant Use in the Workplace

Git as a management tool for training data and experiments in ML

MLOps: Establishment and operation of an AI

Types of Artificial Neural Networks

Amazon DJL - a new DL framework for Java

NLP, NLU and NLG: AI and text

Neural networks - The five most common mistakes

What are Neural Networks and how do they work?

Deep Java Learning Introduction - Part 1: NDManager & NDArray

Deep Fakes - How to spot faked Images

Recap: ML Conference 2019 in Munich

Understanding AI - Part 5: Supervised & Unsupervised Learning in ML

BGL symposium 2019 - lecture 'AI and Magic'

Jax 2019 Recap

Understanding AI - Part 4: The basics of Machine Learning

Understanding AI - Part 3: Methods of symbolic AI

Understanding AI - Part 2: Symbolic AI, Neural Networks and Deep Learning

Understanding AI - Part 1: What is AI?

DL4J Workshop at the ML Summit in Berlin

Jax 2018 - Talks about DL4J and more

Enterprise TensorFlow 4 - Executing a TensorFlow Session in Java

Enterprise TensorFlow 3 - Loading a SavedModel in Java

Enterprise TensorFlow 2 - Saving a trained model

TensorFlow and Java - An interview with entwickler.de

Enterprise Tensorflow: Code Examples

ML Conference 2017 in Berlin