
Deep Java Learning - NDManager & NDArray

After our first presentation of Amazon’s new Deep Learning framework for Java, DJL, we now want to introduce the basics of Deep Learning in Java with DJL step by step in a series of beginner posts. This is not about quickly copying code snippets, but about really understanding the framework and the concepts.

If you can’t wait, you can already find plenty of complete examples in DJL’s GitHub repository, both as Java projects and as interactive Jupyter notebooks.

However, we will go a little deeper and start with the two most essential interfaces of the DJL API: ai.djl.ndarray.NDManager and ai.djl.ndarray.NDArray. Both are interfaces that are implemented at runtime by one of the underlying engines. For the time being, this will mostly be Apache MXNet, but implementations based on TensorFlow and PyTorch are already in the works.

Getting started with the API: creating an NDManager

The NDManager takes care of managing data on a device - often the GPU. Access to this data is given in the form of NDArray instances. If one trains a new DJL model or uses an existing one, the NDManager is created by the corresponding auxiliary classes. If you want to access it directly for test purposes or “non-Deep Learning” applications, you can simply create it as follows:

NDManager manager = NDManager.newBaseManager();
NDManager managerOnCPU = NDManager.newBaseManager(Device.cpu());

In the first variant, DJL selects a so-called device on which the operations are executed - usually the first available GPU, or the CPU if no GPU is usable. If you want to select a specific device manually, you use the second variant.
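If a machine has several GPUs, the manager can also be pinned to a particular one via its index - a minimal sketch, assuming a second GPU actually exists:

NDManager managerOnGpu1 = NDManager.newBaseManager(Device.gpu(1)); // GPU with index 1; assumes at least two GPUs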

The most important class of the DJL API: NDArray

If you want to perform calculations, you have to put the values you want to calculate with into NDArrays. To create a new NDArray, you need an NDManager. It then places the data on its device outside the Java heap and manages the memory required for it:

NDArray pi        = manager.create((float)Math.PI);
NDArray e         = manager.create(Math.E);
NDArray one       = manager.create((byte)1);
NDArray theAnswer = manager.create(42);
NDArray big       = manager.create(Long.MAX_VALUE);
NDArray isTrue    = manager.create(true);

The simplest way to create an NDArray is to wrap a single value in an NDArray. This can be a Java primitive or a subclass of Number, such as Integer or Float. Unlike, for example, java.util.List, NDArray is not generic, so we cannot tell from the type what data is stored. So while you can create a List<Float>, there is no NDArray<Float>. To find out the type of the data stored in an NDArray, there is the method NDArray.getDataType(). These are the data types of the NDArrays created above:

System.out.println(pi.getDataType());        //float32
System.out.println(e.getDataType());         //float64
System.out.println(one.getDataType());       //int8
System.out.println(theAnswer.getDataType()); //int32
System.out.println(big.getDataType());       //int64
System.out.println(isTrue.getDataType());    //boolean

The possible data types of an NDArray can be found in the enum ai.djl.ndarray.types.DataType. Most NDArray data types correspond 1:1 to a Java primitive:

  • float → DataType.FLOAT32
  • double → DataType.FLOAT64
  • byte → DataType.INT8
  • int → DataType.INT32
  • long → DataType.INT64
  • boolean → DataType.BOOLEAN

The data type of the created NDArray thus depends on the Java data type passed to the create method. However, there are also two data types that have no Java equivalent: UINT8 (an unsigned byte) and FLOAT16 (a half-precision float: less precise, but it saves memory, which can sometimes be scarce on graphics cards). To create NDArrays of these types, one must first create an array of another type and then convert the data type manually:

NDArray pi16 = pi.toType(DataType.FLOAT16, true);

The second parameter, copy, specifies whether the existing NDArray is modified or whether a new copy is returned and the old NDArray is retained.

Other ways to create NDArrays

There are a number of other ways to create an NDArray. Practically all of them are member functions of the NDManager. The most important method is - as above - the create method. However, it accepts not only single values, but also arrays of Java primitives and Number instances. Very often you will create NDArrays from one- or two-dimensional float[] or int[] arrays, as in the following sketch.
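A minimal example (the values are arbitrary): a two-dimensional float[][] becomes a matrix-shaped NDArray, and a flat int[] becomes a vector:

float[][] values = {{1f, 2f, 3f},
                    {4f, 5f, 6f}};
NDArray matrix = manager.create(values);                // shape (2, 3), data type float32
NDArray vector = manager.create(new int[]{1, 2, 3, 4}); // shape (4), data type int32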

In addition, there are the methods NDManager.arange and NDManager.linspace, with which one can create sequences of numbers as NDArrays, e.g. 0, 1, 2, 3 or 0.0, -0.1, -0.2, -0.3. The start value, end value and step size can be set. This is very useful to quickly create some test data, but also, for example, to create offsets for input data in very small calculation steps in a neural network.
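A quick sketch of both methods, with example values of our own choosing:

// 0, 1, 2, 3 - integer sequence, default step size 1, end value exclusive
NDArray counting = manager.arange(0, 4);
// 0.0, -0.1, -0.2, -0.3 - float sequence with step size -0.1
NDArray negatives = manager.arange(0f, -0.4f, -0.1f);
// 0.0, 0.25, 0.5, 0.75, 1.0 - five evenly spaced values between 0 and 1
NDArray spaced = manager.linspace(0f, 1f, 5);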

With NDManager.ones and NDManager.zeros you can create NDArrays of any size, filled with ones or zeros. Finally, the methods with which one creates NDArrays filled with random numbers are very important in practice. With NDManager.randomNormal, NDManager.randomUniform and NDManager.randomMultinomial one can generate random numbers with the corresponding probability distributions. This is especially important for neural networks, because they have to be randomly initialised before they can be trained.
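For instance (shapes and parameters chosen arbitrarily; Shape lives in ai.djl.ndarray.types):

NDArray ones    = manager.ones(new Shape(2, 3));               // 2x3 matrix filled with 1.0
NDArray zeros   = manager.zeros(new Shape(5));                 // vector of five 0.0
NDArray normal  = manager.randomNormal(new Shape(2, 2));       // samples from a standard normal distribution
NDArray uniform = manager.randomUniform(0f, 1f, new Shape(4)); // uniform samples between 0 and 1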

Calculations on NDArrays

Now that we have packaged data so that DJL can work with it, we can also perform mathematical operations:

System.out.println(pi.sin().getFloat()); //-8.742278E-8

All calculations are now performed natively on the device of the underlying NDManager. When calculating a single value, this is of course neither exciting nor useful. Shuttling a single value back and forth between GPU and JVM takes longer than simply calculating everything in Java. It becomes exciting when we have a lot to calculate at once. For testing, we generate 100 million random numbers:

float[] random = new float[1000 * 1000 * 100];
Random rand = new Random();
for (int i = 0; i < random.length; ++i) {
    random[i] = rand.nextFloat();
}

Now we calculate the sine of each of these numbers in Java:

float[] sines1 = new float[random.length];
for (int i = 0; i < random.length; ++i) {
    sines1[i] = (float)Math.sin(random[i]);
}

On one of our work laptops this takes about 3s. Now we perform the same calculation using DJL on the GPU:

NDArray randOnGpu = manager.create(random);
float[] sines2 = randOnGpu.sin().toFloatArray();

This takes about 500ms, so it is six times as fast. As a rule, calculations with DJL beat plain Java by an even larger factor. The main time sink in our example is the transfer to and from the graphics card. If one stays on the GPU and performs many operations in succession, the relative gain compared to an unaccelerated solution becomes greater and greater.
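To illustrate this, here is a small sketch (reusing the random array from above): only the final scalar crosses back into the JVM, while all intermediate arrays stay in the device’s native memory:

NDArray x = manager.create(random);    // one transfer to the device
NDArray result = x.sin().mul(x).sum(); // several chained operations, all executed on the device
System.out.println(result.getFloat()); // one transfer back: a single float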

The shape of NDArrays - Shape

What makes NDArrays attractive compared to normal arrays is not only the higher speed, but also the much more readable code. All operations are executed “vectorised”, that is, on all elements at once. With an operation like sin() this is easy to imagine, because a sine needs only one input - the operation is simply repeated on every element of the array.

It gets exciting with operations where NDArrays are combined, e.g. with a simple addition (the result is always given in the comment above the call):

// I. 4
manager.create(2).add(manager.create(2));
// II. [10, 12, 14, 16, 18, 20, 22, 24]
manager.arange(0, 8).add(manager.arange(10, 18));
// III. [ 2,  3,  4,  5,  6,  7,  8,  9]
manager.arange(0, 8).add(manager.create(2));
// IV.
// [[ 100, 1001],
//  [ 102, 1003],
//  [ 104, 1005],
//  [ 106, 1007]]
manager.arange(0, 8).reshape(4, 2)
    .add(manager.create(new int[]{100, 1000}));

The first example is unsurprising: 2 + 2 = 4. The second is more interesting: you can simply add two arrays with one call, and the elements are added pairwise (this corresponds to a vector addition). The third example is even more interesting: it shows that the NDArrays do not necessarily have to have the same size. If you add a single value, it is added to every element of the first NDArray. It gets really exciting in example IV. Here we see a new, important operation on arrays: reshape. If you omit it in this example, the code crashes, because an array of eight elements and an array of two cannot be combined directly. But what does reshape do, and how does the result come about?

So far we have learned that an NDArray has a data type (e.g. FLOAT32) and a size (the number of elements in the array). But an NDArray also has a shape. The shape determines how arithmetic operations that combine arrays must handle the array. In the example above, the number series [0, 1, ... , 7] is given a new shape by reshape. It is no longer a series of numbers (a vector), but a series of series (a matrix). The call reshape(4, 2) means that the existing series is to be divided into four pieces of length two. For this to work, the resulting shape must have the same size as the original one. Since 4 * 2 = 8, this is no problem here. And since the “end” of the NDArray now consists of rows of length two, another row of length two can be added to it. There is only one such row, but it is used for every row of the matrix. This behaviour is called broadcasting and is an essential feature of all deep learning frameworks.

If you don’t know what shape an NDArray has, you can always find out with getShape. The reshaping of NDArrays and the correct combination of NDArrays of different shapes is one of the most important and trickiest tasks in programming Deep Learning systems. For the budding Java Deep Learning expert, it is important to know the broadcasting behaviour of a number of important operations like add, sub, mul, dot, matMul etc. in order to effectively and elegantly translate formulas and pseudocode into a chain of NDArray operations.
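For example:

NDArray vector = manager.arange(0, 8);
System.out.println(vector.getShape()); // (8)
NDArray matrix = vector.reshape(4, 2);
System.out.println(matrix.getShape()); // (4, 2)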

The memory management of NDArrays

Now that we know how to create and use NDArrays, all that remains is to clean up after ourselves when the work is done. As mentioned earlier, NDArrays are placeholders for data on a Device used by the NDManager. The memory of this device cannot be managed by the garbage collector of the JVM, however, so we have to take care of it ourselves. Each individual NDArray must be closed again (as with streams, via .close()), so that the underlying native memory, e.g. on the GPU, becomes available again.

This could of course be done for each array individually, preferably with try-finally blocks. However, operations on NDArrays also create new NDArrays in the native memory of the Device. If we add two arrays, a new one is created for the result. (The exceptions are special in-place variants of operations, such as addi, that modify the NDArray directly; they carry the suffix -i for “in place”.) Closing all these intermediate results can quickly become tedious. But fortunately there is a simple solution: all these NDArrays are linked to an NDManager, through which they were created directly or indirectly. And NDManager itself also implements AutoCloseable! Closing the manager in turn closes all NDArrays “descended” from it, so one can easily free all that memory with a single operation, as sketched below.
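A minimal sketch of this pattern using try-with-resources:

try (NDManager manager = NDManager.newBaseManager()) {
    NDArray a = manager.create(new float[]{1f, 2f, 3f});
    NDArray b = a.add(a); // intermediate result, also attached to manager
    System.out.println(b);
} // closing the manager frees a, b and all other NDArrays descended from it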

But what if you don’t want to close all NDArrays, but only those that have been created, for example, during an intermediate calculation? This is also quite simple: With NDManager.newSubManager() you can create a “submanager” that behaves like the original manager but does not “inherit” its NDArrays. With this submanager one can now perform calculations, and then close only the submanager. The original manager and its arrays are then retained.
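A sketch of this pattern (the calculation here is just a stand-in for some intermediate computation):

float meanSine;
try (NDManager sub = manager.newSubManager()) {
    NDArray seq = sub.arange(0f, 10f, 0.1f); // attached to the sub-manager
    meanSine = seq.sin().mean().getFloat();  // copy the plain Java result out before closing
} // closes seq and all intermediates; manager and its NDArrays live on
System.out.println(meanSine);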

Conclusion

In this introduction we have seen how to use the most basic classes of the DJL API: NDManager and NDArray. In the following post, we will take the next step towards Deep Learning and load data for our first example in such a way that it can be used by DJL for training. To do this, we will have to create, fill and transform NDArrays, and perform the first calculations so that our data can also be “digested” by a neural network.