
Methods of symbolic AI

In the previous article we added two distinctions to our initial definition of AI: on the one hand, the distinction between strong and weak AI (Terminator and science fiction vs. the scientific status quo); on the other hand, the difference between symbolic AI and Machine Learning.

Let’s remember: Symbolic AI attempts to solve problems using a top-down approach (example: chess computer). Machine Learning uses the bottom-up principle to gradually adjust a large number of parameters - until it can deliver the expected results.

Deep Learning - a sub-category of Machine Learning - is currently on everyone’s lips. In order to understand what is so special about it, we will first take a look at the classical methods. Even though the major advances are currently being achieved in Deep Learning, no complex AI system - from voice-controlled personal assistants to self-driving cars - can do without one or several of the following techniques. As is so often the case in software development, a successful piece of AI software is based on the right interplay of several parts.

The simplest AI: Learning by heart

The so-called “table-driven agent” is the simplest AI imaginable: All correct solutions are available to the AI in the form of a table, which it can access to solve a specific problem.

This “primal” and simplest concept regularly meets with resistance at our lectures and events: “This has nothing to do with intelligence!” Yet it is a technique that we humans obviously use as well - albeit much less efficiently. Anyone who has ever had to learn vocabulary knows this. Memorizing is an aspect of the human mind - so why should it not also be one of an AI? After all, computers are particularly good at it.

The table-driven agent: it simply stores all solutions in a table; every solution is learned ahead of time, by heart.

So let’s say I want to create AI software that solves a specific problem. And assuming this works flawlessly, quickly and easily using a “simple” table - why shouldn’t I choose exactly this solution?
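
To make this concrete, here is a minimal sketch of a table-driven agent in Python. The percepts, actions and the `SOLUTION_TABLE` are invented for illustration and not taken from any particular system.

```python
# A minimal table-driven agent: every known situation (percept) is mapped
# directly to a stored action. Nothing is computed - it is pure lookup.
SOLUTION_TABLE = {
    "traffic light is red": "stop",
    "traffic light is green": "drive",
    "obstacle ahead": "brake",
}

def table_driven_agent(percept: str) -> str:
    """Return the memorized action for a known percept."""
    return SOLUTION_TABLE[percept]

print(table_driven_agent("traffic light is red"))  # -> stop
```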

Some might wonder where the “expertise” lies in building such a system. The trick in development is first and foremost to recognize which subproblems can be solved efficiently and simply with a table (often called a “lookup”). Using a table-driven agent for simple subproblems allows the overall system to focus on the more complex parts of a task, which improves the efficiency of the entire system. In this way table-driven agents can usefully support Neural Networks: you simply extend the input of a Machine Learning system with the possibility of looking things up in the table. The learning system can then focus on the exceptional cases, since it does not have to learn all the information from the table first - that information is already reliably available to it. This can increase the performance of the whole system or shorten the time needed for development and training - in the best case both.
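
A sketch of this division of labour could look like the following. The `ml_model` is purely hypothetical and stands in for whatever learned component is in use; the point is only the order of the two steps: try the cheap, reliable lookup first and leave the exceptions to the learning system.

```python
def hybrid_decision(percept: str, ml_model) -> str:
    """Answer from the table where possible, otherwise ask the learned model."""
    # Cheap, reliable and comprehensible: the lookup table from above.
    if percept in SOLUTION_TABLE:
        return SOLUTION_TABLE[percept]
    # Exceptional cases are delegated to the Machine Learning component.
    # (ml_model is a placeholder object with a hypothetical predict() method.)
    return ml_model.predict(percept)
```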

Alternatively, the table itself can be the result of learning: instead of determining its content manually, it can be “learned” through a Machine Learning process.

Is every table therefore an AI? Of course not - it depends on how it is used. A system this simple is usually not useful on its own, but if an AI problem can be solved with a table containing all the solutions, one should swallow one’s pride and not insist on building something “truly intelligent”. A table-driven agent is cheap, reliable and - most importantly - its decisions are comprehensible.

Taking it one step further: The decision tree

Decision trees can easily be found in our everyday life: official instructions, traffic rules, game rulebooks or tax returns are just a few examples of procedures that can be implemented as decision trees. You begin at a specific starting point and then work through a series of questions, each one determined by your previous answers, until you reach a result, e.g. the applicable income tax rate.

A decision tree encodes a step-by-step, rule-based decision process. In this example we decide what to take with us on a walk.

Of course, this technique is not only found in AI software, but for instance also at the checkout of an online shop (“credit card or invoice” - “delivery to Germany or the EU”). As with the table-driven agent, not every decision tree is an AI. However, simple AI problems can easily be solved by decision trees (often in combination with table-driven agents). The rules for the tree and the contents of the tables are often provided by experts in the respective problem domain. In this case we like to speak of an “expert system”, because one tries to capture the knowledge of experts in the form of rules.
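
As a sketch, the “what to take on a walk” tree from the figure above could be written down directly as nested rules; the concrete questions and answers are invented for illustration.

```python
def what_to_take_on_a_walk(raining: bool, cold: bool) -> str:
    """A tiny hand-written decision tree: each question follows from the previous answer."""
    if raining:
        if cold:
            return "rain jacket"
        return "umbrella"
    if cold:
        return "warm coat"
    return "sunglasses"

print(what_to_take_on_a_walk(raining=True, cold=False))  # -> umbrella
```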

The classic expert system: an AI using a decision tree.

Like tables, decision trees have the enormous advantage that their decisions are comprehensible. Thinking of innovative technologies like self-driving cars, the advantage of this kind of AI becomes apparent: the rules are as transparent and deterministic as possible. Probably everyone wants to be able to understand which “decisions” actually determine the manoeuvres of a self-driving vehicle.

Speaking of self-driving cars: we have already mentioned that, in the ideal case, a symbolic AI can be combined with modern methods and thus perform particularly well. In the case of a self-driving car, this interplay could look like this: the Neural Network detects a stop sign (with Machine Learning based image analysis), and the decision tree (symbolic AI) decides to stop. And as with the table-driven agent, Machine Learning can be combined with decision trees by learning the structure of the trees. This is called “decision tree learning”. The advantage is apparent: you don’t have to create the tree yourself, but the decision making (“if this, then that”) remains comprehensible in the application afterwards and can be adjusted if necessary.
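
Decision tree learning is available out of the box in common libraries. A minimal sketch with scikit-learn, using an invented toy data set, could look like this - note that the learned rules can be printed and inspected afterwards:

```python
from sklearn.tree import DecisionTreeClassifier, export_text

# Invented toy data: [temperature in °C, raining (0/1)] -> activity
X = [[25, 0], [5, 1], [18, 0], [2, 0], [22, 1]]
y = ["walk", "stay home", "walk", "stay home", "stay home"]

# The structure of the tree is learned from the data instead of written by hand.
tree = DecisionTreeClassifier(max_depth=2).fit(X, y)

# The learned rules remain readable - and could be adjusted afterwards.
print(export_text(tree, feature_names=["temperature", "raining"]))
print(tree.predict([[20, 0]]))  # e.g. ['walk']
```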

Seek and you shall find: Search algorithms

Search is the quintessential symbolic AI technique. In this context, “search” means that the computer tries out different solutions step by step and evaluates the results. The classic example is a chess computer that “imagines” millions of possible future moves and combinations and, based on the outcomes, “decides” which moves promise the highest probability of winning. The analogy to the human mind is obvious: anyone who has ever played a board or strategy game intensively will have “gone through” moves in their head at least once in order to decide on one.

When using search algorithms, an AI inspects all possible solutions step by step. Only the part of the search that is currently being investigated is held in computer memory.

Naturally, a program has the advantage of being able to check vastly more moves and scenarios thanks to its computing power. This method is the foundation of most turn-based gaming AI; even AlphaGo works with a variant of this technique at its core. However, there is one important difference to humans: a computer, equipped with the appropriate computing power, can and will run through all possible moves, including the senseless ones, in an incredibly structured way. Humans, however, can partially rely on their “gut”. We usually decide early on, based on our gut feeling, which moves actually make sense and thus limit the number of potential moves we think about.
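
The core of such a game-playing AI is usually some form of game-tree search. The following sketch plays a simple Nim variant (take one to three sticks per turn; whoever takes the last stick wins) with a plain minimax search - the exhaustive, structured approach described above, with no “gut feeling” at all.

```python
def minimax(sticks: int, maximizing: bool) -> int:
    """Exhaustively evaluate a Nim position: +1 = we win, -1 = the opponent wins."""
    if sticks == 0:
        # The previous player took the last stick and won the game.
        return -1 if maximizing else 1
    scores = [minimax(sticks - m, not maximizing) for m in range(1, min(3, sticks) + 1)]
    return max(scores) if maximizing else min(scores)

def best_move(sticks: int) -> int:
    """Try every legal move and pick the one with the best minimax outcome."""
    return max(range(1, min(3, sticks) + 1),
               key=lambda m: minimax(sticks - m, maximizing=False))

print(best_move(10))  # -> 2 (taking two sticks leaves the opponent in a losing position)
```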

Recently, though, the combination of symbolic AI and Deep Learning has paid off: Neural Networks can give classic AI programs something like a “human” gut feeling - and thus reduce the number of moves that have to be calculated. Using this combined technique, AlphaGo was able to win a game as complex as Go against a human being. Had the computer computed all possible moves at each step, this would not have been possible.
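
Greatly simplified, the “gut feeling” part can be sketched as a pre-selection step: a learned policy scores the candidate moves and only the most promising ones are searched further. The `policy` function below is just a placeholder (in AlphaGo this role is played by a deep Neural Network), and the sketch reuses the `minimax` function from above.

```python
def guided_best_move(sticks: int, policy, top_k: int = 2) -> int:
    """Let a learned policy pre-select moves, then search only those candidates."""
    moves = list(range(1, min(3, sticks) + 1))
    # The policy plays the role of the "gut feeling": it cheaply ranks the moves.
    candidates = sorted(moves, key=policy, reverse=True)[:top_k]
    return max(candidates, key=lambda m: minimax(sticks - m, maximizing=False))

# A dummy policy that (arbitrarily) prefers taking more sticks.
print(guided_best_move(10, policy=lambda m: m))  # still finds the winning move: 2
```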

Intelligence based on logic

We encounter this kind of AI in old Science Fiction movies: When the computer goes crazy and becomes a threat, you give the command: “Ignore this command”. The logical paradox (if it executes the command, it doesn’t ignore it, if it doesn’t ignore it, it doesn’t execute it) causes it to crash, explode, or restart. This nicely illustrates the functionality, but also the limits of a purely logical AI.

The classic, symbolic Science Fiction AI: thoroughly determined by logic.

Such a system needs a representation of the world in unique logical values: true/false, yes/no, zero/one… and then uses logical formulas to draw conclusions. Again, the analogy to the human mind is not too far-fetched: The ideal mind should act logically and rationally (as established in our table in the previous article). Such inflexible systems unfortunately fail spectacularly at the real world, which is enormously messy, complex and not always quite logical.
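
A minimal sketch of such a logic-based system: facts are plain true/false statements, rules have the form “if all premises hold, the conclusion holds”, and the rules are applied until nothing new can be derived (so-called forward chaining). The facts and rules are invented for illustration.

```python
# Known facts and "if premises, then conclusion" rules - all purely symbolic.
facts = {"it_is_raining", "i_am_outside"}
rules = [
    ({"it_is_raining", "i_am_outside"}, "i_get_wet"),
    ({"i_get_wet"}, "i_need_a_towel"),
]

# Forward chaining: apply the rules until no new conclusions can be drawn.
changed = True
while changed:
    changed = False
    for premises, conclusion in rules:
        if premises <= facts and conclusion not in facts:
            facts.add(conclusion)
            changed = True

print("i_need_a_towel" in facts)  # -> True
```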

Although a specialized programming language (Prolog) was even developed for building such systems, this is in practice the least important of the classical techniques presented here - even though it was once the poster child for a “real” AI. And even if one manages to express a problem in such a deterministic way, the complexity of the computations grows exponentially: useful applications might quickly take several billion years to solve.

Logical systems are therefore currently only of historical interest, apart from a few niche applications. But who knows: in a few years the big breakthrough of logic-based AI combined with Deep Learning might yet happen, logic-based AI might - like Neural Networks before it - rise from the ashes, and Prolog programmers will be in high demand. We’re not counting on it, however.

Benefits of symbolic AI

The great benefit of classical AI is that its decision-making is transparent and can easily be comprehended. It also doesn’t require large amounts of data, since the systems do not “learn” (based on a lot of input), but the developer “pours” her own knowledge into the system. Depending on the method, less computing power is needed than is necessary for training large Neural Networks.

Some AI algorithms, like the “random forest” algorithm, use multiple decision trees and arrive at a solution by combining their individual results.
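
In scikit-learn, for example, such an ensemble of trees takes only a few lines; the toy data is the same kind of invented weather example as above.

```python
from sklearn.ensemble import RandomForestClassifier

# Invented toy data: [temperature in °C, raining (0/1)] -> activity
X = [[25, 0], [5, 1], [18, 0], [2, 0], [22, 1]]
y = ["walk", "stay home", "walk", "stay home", "stay home"]

# A random forest trains many decision trees on random variations of the data
# and combines their individual votes into one answer.
forest = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)
print(forest.predict([[20, 0]]))
```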

There are still tasks in which a symbolic AI performs better - especially when the problem can be formulated as a search over all (or most) possible solutions. However, hybrid approaches increasingly merge symbolic AI and Deep Learning, with the goal of balancing the weaknesses and problems of the one with the benefits of the other - be it the aforementioned “gut feeling” or the enormous computing power required. Apart from niche applications, it is becoming more and more difficult to assign complex contemporary AI systems to one approach or the other, let alone attribute weaknesses or strengths to them on that basis.

The last great bastion of symbolic AI is computer games. In games, a lot of computing power is needed for graphics and physics calculations, and real-time behavior is desired. Thus the vast majority of computer game opponents are (still) recruited from the camp of symbolic AI.

Disadvantages of symbolic AI

The biggest problem with symbolic AI: it often cannot solve problems from the real world. Since we have to formulate our solutions using clear rules (tables, decision trees, search algorithms, symbols…), we encounter a massive obstacle the moment a problem cannot be described that easily. Interestingly, this often happens in exactly those situations where a person can reliably fall back on her “gut”. The classic example is the recognition of objects in a picture: trivial even for small children, yet an insurmountable problem for computers for decades.

Symbolic AI regrettably fails on many real world tasks: e.g. telling cats and dogs apart in pictures.

In general, it is always challenging for symbolic AI to leave the world of rules and definitions and enter the “real” world. Nowadays it frequently serves only as an assistive technology for Machine Learning and Deep Learning.

In the next part of the series we will leave the deterministic and rigid world of symbolic AI and have a closer look at “learning” machines.