Your message has been sent

You can find the message in your personal profile at "My messages".

An error occured

Please try again.

Make an appointment with

So that you can make an appointment, the calendar will open in a new tab on the personal profile of your contact person.

Create an onsite appointment with

So that you can make an onsite appointment, the appointment request will open in a new tab.

Make onsite appointment

8/10/2023

Industry News
Artificial intelligence (AI)

Attacks using AI that are almost undetectable: Indirect Prompt Injections

The German Federal Office for Information Security (BSI) warns of the dangers of using large language models: Indirect prompt injections enable attacks that are almost impossible to detect and give attackers unauthorized access to information. Attackers bypass security mechanisms when using AI; plug-ins and new functions increase the danger.

Caution is advised when using Large Language Models (LLM). The AI can evaluate unnoticed "unverified data from insecure sources", behind which attacks can be hidden, warns the German BSI.

The German BSI warns to be careful when using AI systems such as ChatGPT: attacks using indirect prompt injections are almost impossible to detect and could give attackers unauthorised access to information.

Indirect Prompt Injections are a new AI-based attack option
The systems can access "unverified data from insecure sources" unnoticed
The danger increases due to new functions and the use of plugins

The application forms of AI are increasing, as are new dangers. The German Federal Office for Information Security (BSI) has now warned has now warned of so-called indirect prompt injections. This is an extended form of the already known prompt injections. These are clever inputs and query formulations that aim to elicit a result from the AI system that it would not produce in response to a normal query. The focus is on the language-based systems, Large Language Models (LLM), which are able to automatically process natural language in written form.

Early on, ChatGPT was misused to generate malicious code. When this became known, the developers made changes to the AI to prevent this. Resourceful attackers then took other paths. Instead of directly launching a query for a malicious program, they tried this in a roundabout way. For example, through a query that asks how a firewall can be bypassed and asks for the creation of a corresponding sample program.

Dangerous access to unverified data

The more such detours became known, the more these backdoors were closed. That's why attackers now fall back on queries that don't lead directly to the target but use external data, hence Indirect Prompt Injections. In doing so, AI evaluates "unverified data from insecure sources", as the BSI puts it in a nutshell.

Such an input could look like this, for example: Go to my website http://beispiel.com, search it for descriptions of algorithms and create corresponding example programs for them.

For attacks, data is manipulated in external sources and unwanted instructions for LLMs are placed there, which are then accessed by means of a query in order to generate harmful results. Attackers can thus specifically manipulate the behaviour of LLMs. "The potentially malicious commands can be coded or hidden and may not be recognizable to users," warns the BSI. This is not without reason, as the office shows: "In simple cases, this could be, for example, a text on a web page with font size zero or a hidden text in the transcript of a video". In addition, instructions could be obfuscated "so that they continue to be easily interpreted by LLMs, but are difficult to read by humans".

Users have little chance

The danger is made even more complex by the constantly increasing functionality of AI systems. "For example, it is now possible for chatbots to use plug-ins to automatically evaluate internet pages or documents and to access programming environments or email inboxes," explains the Federal Office.

The BSI therefore advises caution when using LLM systems, because it is virtually impossible for users to detect such manipulations. There is not much users can do about it: "Since this is an intrinsic vulnerability of the current technology, attacks of this kind are fundamentally difficult to prevent," writes the BSI in its report. And further: "Currently, no reliable and sustainably secure mitigation measure is known that does not also significantly limit functionality".

The BSI has already addressed such attack vectors in its position paper "Large AI Language Models - Opportunities and Risks for Industry and Authorities".

You might also be interested in

Exhibition hall of it-sa Expo&Congress with many visitors in the aisles.

11/12/2024

Problem case AI, databases and apps - solutions for secure use at the it-sa Expo&Congress

External attack surface management, data security posture management, cloud security posture mana...

Symbolic image of artificial intelligence

8/30/2024

The key to AI-supported cyber data resilience

Advances in artificial intelligence (AI) are causing many companies to change their business mode...

Symbolic image: Manipulated content on the internet

8/14/2024

Algorithms as a weapon against democracy

Automated propaganda networks use AI to undermine democratic societies. Targeted disinformation a...

Jake Moore, Global Cybersecurity Advisor ESET

7/11/2024

Real or fraud? Expert shows how to become a managing director with a LinkedIn hack and deepfakes

British security expert Jake Moore tested the impact of AI and deepfakes on security ¬ with devas...

Three IT security specialists look at one screen

11/9/2023

AI capabilities in threat defence: more to appear than to be?

Risks and opportunities go hand in hand when it comes to artificial intelligence. Many providers ...

Your digital "Home of IT Security"

Send message to