Unravelling the Enigma: Why Do AI Systems Hallucinate Facts and Figures

In the ever-evolving landscape of artificial intelligence (AI) (Figure 1), the phenomenon of hallucinating facts and figures has emerged as a perplexing challenge. Determination of the root causes behind this curious behaviour involves a deep understanding of the underlying technology. In this article, we will delve into the intricacies of why large language models (LLMs) occasionally veer into the realm of the untrue, exploring the technical, ethical, and practical implications of this enigmatic occurrence.

Figure 1: The Map of Artificial Intelligence by Dr. Milan Milanovic. Originally published on LinkedIn.

What is a Large Language Model

A LLM is a type of artificial intelligence system designed to understand and generate human-like language. It is built on sophisticated neural network architectures, trained on vast datasets and can perform various natural language processing tasks, such as language translation, text completion, and answering questions. LLMs can comprehend context, infer meanings, and generate coherent and contextually relevant text based on the input it receives.

The Black Box of Neural Networks

LLMs are built on a type of neural network called a transformer model (Figure 2), and there is no way of working out exactly how these models arrive at specific conclusions. In other words, the output of an LLM is considered to be ‘Non-deterministic’. This means that from the output, it is not possible to determine the input, meaning that the detection of AI-generated content can only be evaluated based on a margin of confidence rather than a certain ‘true/false’ evaluation.

Figure 2: Transformer Architecture; the backbone of LLMs

In the absence of a concrete understanding of every learned parameter, the models may occasionally generate seemingly plausible information that is, in fact, a fabrication. This phenomenon, known as hallucination, occurs when the LLM extrapolates from its training data, creating information that appears accurate but lacks a factual basis.

Overfitting challenges

Hallucination in LLMs can be attributed, in part, to the challenges associated with overfitting. Overfitting occurs when a model becomes too closely tailored to its training data. As a result, the model may hallucinate information that aligns with the peculiarities of the training dataset. For example, If the machine learning model was trained on a data set that contained mostly photos showing dogs outside in parks, it may learn to use grass as a feature for classification and may not recognise a dog inside a room. When faced with novel scenarios or inputs, LLMs may resort to generating responses based on superficial similarities to the learned data, leading to the production of inaccurate or hallucinated information.

Ethical considerations: The fine line between assistance and misinformation

The implications of LLMs hallucination extend beyond technical challenges, delving into ethical territory. As these systems become integral to decision-making processes in various fields, from healthcare to finance, the potential for disseminating misinformation raises concerns.

When an LLM hallucinates facts or figures, it may inadvertently contribute to the spread of false information (Figure 3), with consequences ranging from misinformation in news articles to inaccuracies in critical decision-making processes.

Figure 3: Oops.

Striking the delicate balance between providing assistance and avoiding the propagation of misinformation poses a significant ethical challenge for developers, researchers, and policymakers.

The quest for explainability and accountability

Addressing the issue of hallucination requires a concerted effort to enhance the reference-ability of LLMs. Researchers are exploring methods to make neural networks more interpretable, allowing stakeholders to trace the decision-making processes of these systems. Additionally, accountability measures must be implemented to ensure responsible development and deployment of LLMs.

The road ahead involves refining algorithms, establishing robust evaluation frameworks, and fostering interdisciplinary collaboration to create LLMs that not only perform functionally but also uphold ethical standards. As we navigate the evolving landscape of LLMs, a deeper understanding of hallucination paves the way for more transparent, accountable, and reliable artificial intelligence systems.

Emerging Irish Tech and R&D Series

This is the third article in KPMG’s 4-part series on emerging technologies in Ireland and their potential application for research and development. The next article will focus on AgriTech.

For more insights or if you have an R&D-related query, visit KPMG’s R&D Incentives practice.

Guest post by Nigel Brennan, Software Consultant, R&D Tax Incentives Practice at KPMG.

Irish Tech News

Next 70% of Employers had Employees Leave Within the First Year due to a Poor Match with the Organisation »

Previous « Ministers O’Donovan, McConalogue and Heydon announce €104 million investment for scientific research

Published by

Irish Tech News

2 years ago

From Classrooms to Careers: Dell Simplifies Learning With Purpose-Built Education PCs and Future-Ready Programs

We're at a critical moment in education. New research and emerging technologies, such as Generative…

7 hours ago

Business

University of Galway launches new prototype hub in partnership with Medtronic

The University of Galway has today launched its new Medical Device Prototype Hub, supported by…

8 hours ago

Business

Making healthcare better: The manufacturing technologies powering MedTech innovation

Innovation in medical technology (MedTech) has always been driven by curiosity, creativity and the pursuit…

9 hours ago

Salesforce survey reveals AI Is the #2 Growth Tactic for 2026 as Irish Sales Teams Turn to Agents

As sales teams kick off 2026 with ambitious new quotas, they're turning to AI, especially…

11 hours ago

SciFest celebrates 20 years of student innovation as 2026 competition launches

SciFest, Ireland’s largest and most inclusive second-level STEM fair programme, has announced its return for…

12 hours ago

Tech News

Sustainability in the sugarcane sector, Global Week 9–13 March

Join the global sugarcane community in Delhi to shape the future of sustainable agriculture. Bonsucro…

14 hours ago

More about Irish Tech News

Irish Tech News are Ireland’s No. 1 Online Tech Publication and often Ireland’s No.1 Tech Podcast too.

You can find hundreds of fantastic previous episodes and subscribe using whatever platform you like via our Anchor.fm page here: https://anchor.fm/irish-tech-news

If you’d like to be featured in an upcoming Podcast email us at Simon@IrishTechNews.ie now to discuss.

Irish Tech News have a range of services available to help promote your business. Why not drop us a line at Info@IrishTechNews.ie now to find out more about how we can help you reach our audience.

You can also find and follow us on Twitter, LinkedIn, Facebook, Instagram, TikTok and Snapchat.