Text comprehension and automated text generation with NLP, NLU and NLG

So far, we have generally steered clear of the areas of text comprehension and text generation by ML in our practical examples for the basic understanding of AI. For good reason, we have focused primarily on two types of problems: classification of images and prediction of numerical values.

This is because in such tasks it is obvious what the input and output are to a machine learning algorithm: Images can be expressed by Numbers as a sequence of colour and brightness values. Numerical problems such as “How many crickets chirp per minute at a summer temperature of 27°C?” are already expressed in numbers, so it is intuitively clear that a neural network, for example, can “calculate” with them.

Challenge for the algorithm: language and text

But how does it behave if, for example, you want to know whether an email is spam or not? Or if you want to filter out the most urgent and the most negative from a mass of customer requests for the purpose of prioritisation? This is also a classification problem. However, it is not immediately clear here how an email or a request can be expressed by numbers in such a way that machine learning algorithms can be used to solve this problem.

In the next articles, we will look specifically at which methods and functions are used in NLP, in the automated capture and creation of texts. Today, however, we first want to give an overview. What exactly is NLP? Which sub-areas does it cover and what are the current challenges and possible applications?

Natural Language Processing - NLP

The technical term for understanding and processing text is “natural language processing”, usually abbreviated to NLP. In German, the term “computer linguistics” is still often used. However, the term NLP is increasingly gaining acceptance in this country as well, which is why we also want to use it.

For outsiders, it may sound strange that we always speak of natural language, i.e. “natural” language. Is there such a thing as “unnatural language”? This emphasis is based on the fact that computers were already able to understand languages perfectly very early on - but only those that were made for them, namely programming languages. In order to make a clear distinction here, the term “natural” was chosen, since human language (with a few exceptions) is not formally constructed, but has developed naturally.

Written language vs. spoken word in ML

To anticipate a common misunderstanding right away: NLP is about processing written language, not the spoken word. The conversion of spoken language into text is called speech-to-text (STT). Conversely, e.g. with a screen reader, one speaks of text-to-speech (TTS). Since audio signal processing also plays a role here, this is usually not included in NLP.

The same applies to the conversion of scanned documents into text files (optical character recognition - OCR) or handwriting recognition. Since this is more about recognition of visual components, it (like STT and TTS) does not properly belong to NLP. Simply put, NLP is anything where a simple text file (.txt) can serve as input and/or output to an AI system.

Breakdown of NLP into Understanding (NLU) and Generation (NLG).

NLP breaks down further into natural language understanding (NLU) and natural language generation (NLG).

Acquisition of text by an AI model (NLU)

In NLU, a computer program must “understand” some aspect of the text in order to solve a problem. Most of the time, however, this understanding of text by the AI is only focused on a very specific aspect, so it is not the same understanding that a human has when reading a text, for example.

NLU starts with tasks that we still know from primary school - grammatical analysis of texts, e.g.:

Distinguishing types of words: What is a verb, what is an adjective?
Recognising plural, singular and grammatical gender
Recognising cases

The NLU becomes more complex when the meaning of words (semantics) plays a role alongside simple grammatical tasks. Then the tasks become more difficult, but also more exciting and useful:

Distinguishing between subject and object (Who does something to what?)
Recognising people, places, products, brands etc. (What is being talked about in a text?)
Recognising key words in a text (Which words are particularly important? What is a customer complaining about?)
Classifying text (Contract or invoice? Spam or important email? Urgent task or small talk? Positive or negative customer feedback?)

With systems trained to handle such tasks, even simple office tasks that were previously reserved for humans can be automated. Documents and emails can be automatically recognised and sent to the right recipients, important information such as invoice recipients, order numbers can be automatically extracted and transferred to CRM and ERP systems.

Generation of texts by an AI model (NLG)

The counterpart to NLU is natural language generation (NLG). Here the focus is on the language as the output of the system. Classic NLG applications are, for example:

Generation of text based on machine-readable data, such as weather reports, sports reports, financial texts or product descriptions.
Summarising texts
Translation of texts

NLG systems enable the development of new business areas that would not have been profitable in the past due to the high costs of manual text creation, i.e. the “long tail” area. For example, a large number of differentiated landing pages can be created for SEO, which would not be affordable without machine support.

In cooperation with a good NLP system, NLG systems become even more interesting, as the automatic generation of text can completely automate significantly more tasks than pure understanding. A typical example are chatbots that have to understand the request of a conversation partner (NLU) and then generate a suitable answer (NLG).

In the next articles, we will then take a closer look at how a machine learning and, in particular, deep learning system can solve such problems.

FAQs

What is NLP?

The abbreviation NLP stands for natural language processing and refers to a focus of machine learning in which natural (i.e. human) language is processed automatically by algorithms.

NLP is in turn divided into two sub-areas, namely NLU (natural language understanding), i.e. the automated acquisition of language, and NLG (natural language generation), the automated generation of texts. Ideally, both tasks are solved by combining different ML methods (e.g. determination of word type, case or gender, classification of different text genres, recognition of proper names or keywords).

A challenge in NLP is the fact that (for example, due to varying sentence length, etc.) the input values are always changing and this must be taken into account.

What is the difference between NLU and NLG?

Both NLU and NLG are approaches to automatically process natural language using artificial intelligence and machine learning.

NLU stands for natural language understanding - the algorithm is to be trained to correctly capture machine texts. An example of an application would be spam filters that sort corresponding mails without errors, ML systems that correctly classify and file documents and files, or sentiment analysis, which can be used to pre-sort and prioritise customer enquiries or reviews based on the sentiment of the text.

NLG (short for natural language generation) refers to the automated generation of texts. As with other ML approaches, the error-free input of machine-readable data and a successful training phase are necessary for this. Correctly trained, the system can then produce, for example, weather reports or sports news.

Why do we speak of natural language in NLP (natural language processing)?

Natural language is explicitly used to exclude, for example, computer or programming languages. NLP (also known as computational linguistics) is about the processing of human language.

Is NLP (natural language processing) also applied to the processing and generation of spoken language?

NLP refers exclusively to machine-readable information. In order to process spoken language, it must first be converted or written down. This is called speech-to-text (STT) or - for example in the case of a screen reader/reading device - text-to-speech (TTS). Since audio signal processing also plays a role here, this is usually not included in NLP.

The same applies to document capture or handwriting recognition by scanning, where methods of image recognition (optical character recognition - OCR) are applied.

25 Feb 2025
Working with Ollama, Part 2
In the first part of our article on Ollama, we demonstrated how to install Ollama and local models. In this second part, we cover advanced usage of Ollama by customizing modelfiles and integrating with the AnythingLLM frontend. We show how these tools make managing and utilizing local AI models more efficient.
weiterlesen
24 Feb 2025
Working with Ollama, Part 1
In the first installment of our two-part series “Working with Ollama,” we introduce the open-source, cross-platform solution Ollama, which simplifies both the management and usage of AI models.
weiterlesen
08 Apr 2024
Whisper 3 Large for JAVA
For an internal product prototype we have traced OpenAI’s Whisper 3 model from Huggingface and made it usable under JAVA via DJL.
weiterlesen
14 Jun 2023
ChatGPT for Teams: Privacy-Compliant Use in the Workplace
In today’s digital business world, AI-powered communication platforms like ChatGPT are essential for tasks such as answering complex code questions or creating top-notch texts for offers. However, in companies dealing with sensitive customer data, using ChatGPT can lead to a data protection dilemma. While ChatGPT offers an option to prevent the use of chat conversations for training purposes, it comes with certain limitations. Moreover, as of June 2023, there is no way to manage multiple team members or users through a company account. Each user must register individually and use their own email, phone number, and credit card. If you want to use ChatGPT+, for example, you cannot pay for all users with one credit card. Individual invoices also end up with individual users, creating an organizational and accounting nightmare. We at DIVISO have also grappled with this issue and went in search of a solution.
weiterlesen
25 Oct 2021
Git as a management tool for training data and experiments in ML
In this part of the series of articles on MLOps, we start with information that will be familiar to most of you: With the basics of Git. However, to give a different perspective on the well-known tool, these basics provide the basis to highlight the function and benefits of Git for machine learning (ML) and the difference in managing training data.
weiterlesen
02 Aug 2021
MLOps: Establishment and operation of an AI
With Machine Learning Operations (MLOps) we ensure that data is efficiently and strategically integrated into business processes through regular and automated training, thus contributing to increased revenue. The challenge is to establish and maintain these automated processes.
weiterlesen
31 Aug 2020
Types of Artificial Neural Networks
In our real-world example, we used a “feed-forward neural network” to recognise handwritten numbers. This is probably the most basic form of a NN. In reality, however, there are hundreds of types of mathematical formulas that are used – beyond addition and multiplication – to compute steps in a neural network, many different ways to arrange the layers, and many mathematical approaches to train the network.
weiterlesen
17 Jul 2020
Amazon DJL - a new DL framework for Java
Developers who wanted to explore neural networks and deep learning using the JVM, and especially Java, had little choice so far. Those who wanted to focus exclusively on Java could not get around DL4J until now. If it had to be the JVM, but not necessarily Java, the MXNet Scala Frontend was also an option. Finally, if a little Python didn’t scare you, you could try a hybrid solution, combining TensorFlow and Java just like we already explained in previous articles.
weiterlesen
23 Jun 2020
Neural networks - The five most common mistakes
AI and especially Neural Networks or Deep Learning have been the technological hype topic for some years now. However, since the subject is quite abstract – one could say it is uncharted territory for most people – we want to clear up some mistakes that we often encounter in our work.
weiterlesen
02 Jun 2020
What are Neural Networks and how do they work?
In our past articles we mainly covered the basics of current AI research and tried to shed some light on them in a way that is understandable for non-IT scientists. We are now proceeding to the probably “hottest” current AI topic: Neural Networks (NN).
weiterlesen
12 May 2020
Deep Java Learning Introduction - Part 1: NDManager & NDArray
After our first presentation of Amazon’s new Deep Learning Framework for Java, DJL, we now want to introduce the basics of Deep Learning under Java with DJL step by step in a series of beginner posts. This is not about quickly copying code snippets, but about really understanding the framework and the concepts.
weiterlesen
11 May 2020
Deep Fakes - How to spot faked Images
A (fairly) new kind of neural networks, so-called Generative Adversarial Networks or GANs, are nowadays capable of generating deceptively real images of people that do not actually exist. These fake images are indistinguishable from real photos at first glance. Fortunately, you might still uncover them if you look closely – if you know what to look for!
weiterlesen
28 Jun 2019
Recap: ML Conference 2019 in Munich
On 17.06. another round of the semi annual ML Conference started in Munich. As usual, it started with a day-long workshop with joint live coding, giving the participants an approachable introduction into Machine Learning and Deep Learning.
weiterlesen
24 May 2019
Understanding AI - Part 5: Supervised & Unsupervised Learning in ML
In the previous article we introduced the basic concepts of Machine Learning and how the training of an ML model works, using a simple but practical algorithm. Next, we want to take a closer look at the different types of Machine Learning.
weiterlesen
14 May 2019
BGL symposium 2019 - lecture 'AI and Magic'
“Any sufficiently advanced technology is indistinguishable from magic.” – Arthur C. Clarke JAX 2019 is barely over, but Christoph is already on the podium for the next talk. At the symposium of the BLG (Federal Association of Industrial Photographic Laboratories), his lecture will cover “AI and Magic – How does Artificial Intelligence work?
weiterlesen
29 Apr 2019
Jax 2019 Recap
JAX 2019 is approaching and once again Christoph is contributing two sessions. This year he’s focussing on Neural Networks and explains how to use TensorFlow-Training while working with JVM.
weiterlesen
25 Apr 2019
Understanding AI - Part 4: The basics of Machine Learning
After shedding some light onto Symbolic AI in the previous article, we’re now moving on to take a closer look at Machine Learning (ML). When it comes to Symbolic AI, breaking down a problem as minutely as possible is key for successfully solving it.
weiterlesen
08 Apr 2019
Understanding AI - Part 3: Methods of symbolic AI
In the previous article we added two distinctions to our initial definition of AI: On the one hand we distinguish between strong and weak AI (Terminator & Science Fiction vs. the scientific status quo). Also we pointed out the difference between symbolic AI and Machine Learning.
weiterlesen
21 Mar 2019
Understanding AI - Part 2: Symbolic AI, Neural Networks and Deep Learning
Artificial Intelligence (AI) is as old as computer science itself. Calculations, logical deductions, complex assignments… all this was once restricted to humans, until computers came forth.
weiterlesen
07 Mar 2019
Understanding AI - Part 1: What is AI?
From household help to doomsday scenario - there’s hardly a topic where public perception, state of research and reality seem so incongruent as with artificial intelligence. Reason enough to shed some light onto this subject with a series of articles.
weiterlesen
06 Aug 2018
DL4J Workshop at the ML Summit in Berlin
On October 1st and 2nd the first ML Summit takes place in Berlin. In 12 workshops in three parallel tracks, experts impart practical knowledge on the topics Applications for Business, Machine Learning Basics & Tools and Specialized Topics.
weiterlesen
23 Apr 2018
Jax 2018 - Talks about DL4J and more
Christoph will give two talks about Java and Machine Learning at JAX 2018
weiterlesen
29 Jan 2018
Enterprise TensorFlow 4 - Executing a TensorFlow Session in Java
A TensorFlow Session can be executed in Java in the same way as in Python. This post shows how.
weiterlesen
23 Jan 2018
Enterprise TensorFlow 3 - Loading a SavedModel in Java
Part 3 in the series about Java / TensorFlow Interoperability, showing how to load a TensorFlow SavedModel in Java
weiterlesen
22 Jan 2018
Enterprise TensorFlow 2 - Saving a trained model
Part 2 in the series about Java / TensorFlow Interoperability, discussing how to save a model so it can be reused in a different environment.
weiterlesen
11 Jan 2018
TensorFlow and Java - An interview with entwickler.de
Our CTO was interviewed about TensorFlow / Java Interoperability while at ML Conference 2017 in Berlin.
weiterlesen
08 Jan 2018
Enterprise Tensorflow: Code Examples
Overview over the example projects for TensorFlow / Java integration
weiterlesen
30 Nov 2017
Enterprise Tensorflow - Java vs. Python
This is the first part of a series of posts about Java and Tensorflow interop. It is a more extensive version of my talk at ML Conference 2017 in Berlin
weiterlesen
15 Nov 2017
ML Conference 2017 in Berlin
An announcement for my presentation at the ML Conference 2017 in Berlin
weiterlesen

Text comprehension and automated text generation with NLP, NLU and NLG

Challenge for the algorithm: language and text

Natural Language Processing - NLP

Written language vs. spoken word in ML

Breakdown of NLP into Understanding (NLU) and Generation (NLG).

Acquisition of text by an AI model (NLU)

Generation of texts by an AI model (NLG)

FAQs

What is NLP?

What is the difference between NLU and NLG?

Why do we speak of natural language in NLP (natural language processing)?

Is NLP (natural language processing) also applied to the processing and generation of spoken language?

Working with Ollama, Part 2

Working with Ollama, Part 1

Whisper 3 Large for JAVA

ChatGPT for Teams: Privacy-Compliant Use in the Workplace

Git as a management tool for training data and experiments in ML

MLOps: Establishment and operation of an AI

Types of Artificial Neural Networks

Amazon DJL - a new DL framework for Java

Neural networks - The five most common mistakes

What are Neural Networks and how do they work?

Deep Java Learning Introduction - Part 1: NDManager & NDArray

Deep Fakes - How to spot faked Images

Recap: ML Conference 2019 in Munich

Understanding AI - Part 5: Supervised & Unsupervised Learning in ML

BGL symposium 2019 - lecture 'AI and Magic'

Jax 2019 Recap

Understanding AI - Part 4: The basics of Machine Learning

Understanding AI - Part 3: Methods of symbolic AI

Understanding AI - Part 2: Symbolic AI, Neural Networks and Deep Learning

Understanding AI - Part 1: What is AI?

DL4J Workshop at the ML Summit in Berlin

Jax 2018 - Talks about DL4J and more

Enterprise TensorFlow 4 - Executing a TensorFlow Session in Java

Enterprise TensorFlow 3 - Loading a SavedModel in Java

Enterprise TensorFlow 2 - Saving a trained model

TensorFlow and Java - An interview with entwickler.de

Enterprise Tensorflow: Code Examples

Enterprise Tensorflow - Java vs. Python

ML Conference 2017 in Berlin