Chatbot Security Risks - Trends and Guidance

Chatbot security risks are now a major concern for cyber professionals, as the use of generative artificial intelligence (AI) and predictive AI tools continues to proliferate around the world.

While the rise of ChatGPT and other AI chatbots has been hailed as a business game-changer, it is increasingly being seen as a critical security issue. Previously, we outlined the challenges created by ChatGPT and other forms of AI.

In this blog post, we look at the growing threat from AI-associated cyber-attacks and discuss new guidance from the National Institute of Standards and Technology (NIST).

The sharp rise in chatbot security risks

From content creation to customer service, AI innovations such as ChatGPT are being hailed as hugely beneficial for businesses and consumers. A recent Kroll survey of senior leaders found that more than half the respondents have turned to AI technology to address a rise in financial crime risks.

While there is little doubt that AI is changing the way businesses operate and provide services, its potential has also been recognised by threat actors. The security risks associated with AI are growing alongside the development of the technology itself. So much so that, in mid-2023, the FBI issued a warning in which it stated that the harm caused by cybercriminals leveraging AI is now increasing at a concerning rate.

In late 2023, a survey highlighted increasing disquiet among security professionals about the use of AI by cybercriminals, with many taking the view that AI is indeed driving up the volume of attacks. A particular area of worry is around the use of deepfakes. However, despite this, many of the professionals surveyed did not seem to fully understand the full extent of AI’s potential impact on cyber security. New research from Microsoft and OpenAI has confirmed that nation states are exploring AI’s capabilities and leveraging large language models (LLMs) such as ChatGPT to support campaigns.

A rise in threats associated with AI has also been the subject of new analysis from the National Cyber Security Centre (NCSC), which warns that it will “almost certainly increase the volume and heighten the impact of cyber attacks” over the next two years, with ransomware in the spotlight as a particularly significant threat. While the use of AI is currently limited to certain types of threat actors, the NCSC assessment also states that between 2024 and 2026, there will be a rise in its use for nefarious purposes by novice cybercriminals, hackers-for-hire and others. This is anticipated to accelerate UK cyber resilience challenges in the near term for the UK government and for businesses.

Adversarial machine learning

The issue is so pressing that NIST recently released a new publication, Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations (NIST.AI.100-2). Aimed at individuals and groups responsible for designing, developing, deploying and evaluating AI systems, the report highlights the very real threat presented by AI, outlining the specific types of attacks it can give rise to and how organisations can respond. NIST comments that these defences may not be able to fully mitigate all the threats and that the security community is encouraged to help develop better approaches for addressing growing chatbot security risks. The report is intended to support progress towards developing a taxonomy and terminology of adversarial machine learning (AML) that could help to secure AI applications against manipulation.

As the report highlights, the data-driven approach of machine learning (ML) introduces additional security and privacy challenges, including:

Adversarial manipulation of training data
Adversarial exploitation of model vulnerabilities to adversely affect the performance of the AI system
Malicious manipulations, modifications or interactions with models to exfiltrate sensitive information about people represented in the data, the model or proprietary enterprise data

As the report points out, AML is a key defence in the fight against chatbot security risks because it focuses on understanding the capabilities of attackers and their goals, as well as the design of attack methods that exploit ML vulnerabilities during the development, training and deployment phase of the ML lifecycle. AML also includes the design of ML algorithms that can stand up to the security and privacy threats posed by AI.

Chatbot security attack objectives

The NIST report classifies AI attackers’ objectives according to three main types of security violations, with adversarial success indicated by achieving one or more of the following goals:

Availability Breakdown
Integrity Violations
Privacy Compromise

Types of chatbot security risks

Image of taxonomy of attacks on generative AI systems from the NIST report: Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations (NIST.AI.100-2)

The new NIST report outlines types of attacks in which threat actors both target AI and leverage it to execute other types of cybercrime. NIST classifies these into four specific groups, outlining mitigations for each.

Evasion attacks

Aimed at generating adversarial output after a machine learning model is deployed. The NIST guidance looks at white box and black box types of attacks.

Poisoning attacks

Aimed at targeting the training phase of the algorithm by introducing corrupted data. Types of attacks include:

Availability poisoning: The entire ML model is corrupted in an availability attack, leading to model misclassification on the majority of testing samples.
Targeted poisoning: Poisoning attacks against machine learning that change the prediction on a small number of targeted samples
Backdoor poisoning attacks: Poisoning attacks against machine learning that change the prediction on samples, including a backdoor pattern
Model poisoning: Model poisoning attacks attempt to directly modify the trained ML model to inject malicious functionality into the model.

Privacy attacks

Aimed at gaining sensitive information about the system or data it was trained on through the use of questions that work around existing guardrails. Types of attacks include:

Data reconstruction: The ability to recover an individual’s data from released aggregate information
Membership inference: Aimed at determining whether a particular record or data sample was part of the training dataset used for the statistical or ML algorithm
Model extraction: Extracts information about the model architecture and parameters by submitting queries to the ML model trained by a machine learning as a service (MLaaS) provider
Property inference attacks: The attacker attempts to learn global information about the training data distribution by interacting with an ML model.

Abuse attacks

Aimed at compromising legitimate sources of information, such as a web page with incorrect information, to repurpose the system’s intended use through indirect prompt injection in order to execute fraud, malware or manipulation. Types of attacks include phishing, masquerading, malware attacks and historical distortion, in which an attacker can prompt the model to output disinformation. NIST gives the example of researchers who demonstrated this by successfully prompting Bing Chat to deny that Albert Einstein won a Nobel Prize.

For proactive steps to defend against chatbot security risks, check out our post on ChatGPT security.

How Kroll can help

Ensuring effective cyber resilience against the increasing chatbot security risks demands a comprehensive approach to cyber defence. Kroll Responder, our Managed Detection and Response (MDR) service, supplies EDR and other detection technologies, as well as the people and intelligence required to utilise them effectively, to continuously hunt for threats across networks and endpoints and help shut them down before they cause damage and disruption. Functioning as an extension of your IT team, Kroll Responder combines world-class security expertise, leading network and endpoint detection technologies, and aggregated security intelligence to defend against current and emerging security threats, 24/7/365.

Learn more about Kroll Responder

Cookie	Duration	Description
__cf_bm	1 hour	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
_ok	session	The cookie is set by Olark live chat software and is used to store most recent Olark site for security purposes.
_okdetect	session	This cookie is set by Olark live chat software. The cookie is used for detecting when storage contexts have changed due to things like ssl or host transitions.
_oklv	session	The cookie is set by Olark live chat software. According to Olark documentation, the cookie is the Olark Loader version used for improved caching.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie records the user consent for the cookies in the "Advertisement" category.
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent	1 year	CookieYes sets this cookie to record the default button state of the corresponding category and the status of CCPA. It works only in coordination with the primary cookie.
hblid	1 year 1 month 4 days	The cookie is set by Olark live chat software and is used as a visitor identifier to remember a visitor between visits.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
lang	session	LinkedIn sets this cookie to remember a user's language setting.
li_gc	6 months	Linkedin set this cookie for storing visitor's consent regarding using cookies for non-essential purposes.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
yt-player-headers-readable	never	The yt-player-headers-readable cookie is used by YouTube to store user preferences related to video playback and interface, enhancing the user's viewing experience.
yt-remote-cast-available	session	The yt-remote-cast-available cookie is used to store the user's preferences regarding whether casting is available on their YouTube video player.
yt-remote-cast-installed	session	The yt-remote-cast-installed cookie is used to store the user's video player preferences using embedded YouTube video.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-fast-check-period	session	The yt-remote-fast-check-period cookie is used by YouTube to store the user's video player preferences for embedded YouTube videos.
yt-remote-session-app	session	The yt-remote-session-app cookie is used by YouTube to store user preferences and information about the interface of the embedded YouTube video player.
yt-remote-session-name	session	The yt-remote-session-name cookie is used by YouTube to store the user's video player preferences using embedded YouTube video.
ytidb::LAST_RESULT_ENTRY_KEY	never	The cookie ytidb::LAST_RESULT_ENTRY_KEY is used by YouTube to store the last search result entry that was clicked by the user. This information is used to improve the user experience by providing more relevant search results in the future.

Cookie	Duration	Description
_okbk	session	The cookie is set by Olark live chat software and is used to store extra state information of the chat box.
olfsk	1 year 1 month 4 days	This cookie is set by Olark live chat software. This cookies is a storage identifier used to maintain chat state across pages.
SRM_B	1 year 24 days	Used by Microsoft Advertising as a unique ID for visitors.
wcsid	session	This cookie is set by Olark live chat software. The cookie is a session identifier that is used to keep track of a single at session.

Cookie	Duration	Description
_ce.gtld	session	Crazyegg sets this cookie to identify the top-level domain.
_clck	1 year	Microsoft Clarity sets this cookie to retain the browser's Clarity User ID and settings exclusive to that website. This guarantees that actions taken during subsequent visits to the same website will be linked to the same user ID.
_clsk	1 day	Microsoft Clarity sets this cookie to store and consolidate a user's pageviews into a single session recording.
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.
_gat_UA-*	1 minute	Google Analytics sets this cookie for user behaviour tracking.
_gid	1 day	Google Analytics sets this cookie to store information on how visitors use a website while also creating an analytics report of the website's performance. Some of the collected data includes the number of visitors, their source, and the pages they visit anonymously.
AnalyticsSyncHistory	1 month	Linkedin set this cookie to store information about the time a sync took place with the lms_analytics cookie.
cebs	session	Crazyegg sets this cookie to trace the current user session internally.
CLID	1 year	Microsoft Clarity set this cookie to store information about how visitors interact with the website. The cookie helps to provide an analysis report. The data collection includes the number of visitors, where they visit the website, and the pages visited.
MR	7 days	This cookie, set by Bing, is used to collect user information for analytics purposes.
SM	session	Microsoft Clarity cookie set this cookie for synchronizing the MUID across Microsoft domains.
vuid	1 year 1 month 4 days	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos on the website.

Cookie	Duration	Description
ANONCHK	10 minutes	The ANONCHK cookie, set by Bing, is used to store a user's session ID and verify ads' clicks on the Bing search engine. The cookie helps in reporting and personalization as well.
bcookie	1 year	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser IDs.
bscookie	1 year	LinkedIn sets this cookie to store performed actions on the website.
li_sugr	3 months	LinkedIn sets this cookie to collect user behaviour data to optimise the website and make advertisements on the website more relevant.
MUID	1 year 24 days	Bing sets this cookie to recognise unique web browsers visiting Microsoft sites. This cookie is used for advertising, site analytics, and other operations.
NID	6 months	Google sets the cookie for advertising purposes; to limit the number of times the user sees an ad, to unwanted mute ads, and to measure the effectiveness of ads.
test_cookie	15 minutes	doubleclick.net sets this cookie to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	6 months	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
VISITOR_PRIVACY_METADATA	6 months	YouTube sets this cookie to store the user's cookie consent state for the current domain.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.