Can AI Write Malware? What Research Shows 2026 | Securityelites

Yes — AI tools can assist in generating malicious code, and security researchers have been documenting this capability since 2022. My assessment after tracking this research closely: the threat is real, the defensive adaptations are working, and the honest picture is more nuanced than most headlines suggest. The important nuances: what AI produces still requires human expertise to weaponise effectively, existing defences are adapting, and the documented threat looks different from the sensationalised version in headlines. Here is what the published research actually shows, what it means specifically for defenders trying to protect organisations in 2026, and why calibrated understanding is more useful than exaggeration in either direction.

What You’ll Learn

What published research documents about AI and malicious code generation

Why AI-generated threats challenge traditional detection approaches

The documented real-world incidents and research findings

How defenders are adapting their detection and response capabilities

What this means for organisations and security teams right now

⏱️ 12 min read

Can AI write malware – AI-Generated Malware — Defender’s Guide 2026

What Published Research Shows
Why Detection Is Harder
Documented Real-World Incidents
How Defenders Are Responding
What Organisations Should Do Now

I wrote this for defenders and security-aware users who want to understand the threat landscape. The technical detail on AV evasion methodology from a red team perspective is in the AI-Generated Malware and AV Bypass guide. The broader AI vulnerability landscape is in the 10 AI Vulnerabilities overview.

What Published Research Shows

My starting point for any discussion of AI-generated malware is always the published research record, not speculation. Several credible security research firms and academic groups have documented specific capabilities, all of which are publicly available. Here is what the evidence actually shows.

PUBLISHED RESEARCH — DOCUMENTED FINDINGS

# CyberArk Research (2023) — key findings

Demonstrated: using commercial LLMs to generate malware code variants iteratively

Key finding: AI can generate numerous functional variants rapidly — overwhelming signature detection

Implication: the “signature per variant” defence model becomes less effective at scale

Publication: CyberArk Blog, publicly available

# Recorded Future research findings

Documented: threat actors discussing and sharing AI-generated code on dark web forums

Finding: LLM-generated scripts appearing in criminal forums from late 2022 onward

Context: most were basic automation scripts, not sophisticated targeted malware

# Check Point Research (2023)

Documented: ChatGPT bypassed by threat actors to create basic infostealer code

Finding: safety guardrails on commercial AI can be bypassed for code generation tasks

Context: researchers alerted OpenAI, who improved content filters

# What the research does NOT show

AI autonomously creating sophisticated nation-state-grade malware without human expertise

AI replacing skilled malware developers for complex targeted attacks

AI creating novel attack techniques that humans couldn’t develop manually

Why Detection Is Harder

The detection challenge created by AI-assisted malware development is not primarily about sophistication of individual samples — it is about volume and variety at a scale that outpaces traditional signature-based defences. Traditional signature-based detection works by matching known patterns. AI enables rapid generation of functional variants with no matching signatures. My explanation of why this changes the defender’s calculus.

DETECTION CHALLENGES — WHY AI CHANGES THE CALCULUS

# How signature detection works

AV vendors: identify malicious code patterns → add to signature database

Works when: the same code pattern is used repeatedly

Limitation: new variants with different byte patterns evade existing signatures

# How AI changes the variant generation equation

Manual variant generation: skilled developer creates 5–10 variants per day

AI-assisted variant generation: LLM generates hundreds of syntactically different versions

Impact: signature-per-variant approach cannot keep pace with AI generation speed

# What still works for detection

Behaviour-based detection: what the code DOES, not what it looks like (bytes/patterns)

Sandboxing: detonate the file in isolation, observe behaviour regardless of surface appearance

ML-based classifiers: trained on behaviour patterns rather than static signatures

Network-layer detection: C2 communication patterns are harder to vary than code patterns

Documented Real-World Incidents

My review of incident reports and threat intelligence from 2023–2026: documented AI-generated malware in real attacks has mostly appeared in lower-sophistication attacks — script kiddies and low-skill actors producing code they could not previously write, rather than nation-state actors replacing their sophisticated manual development processes.

AI MALWARE — DOCUMENTED THREAT ACTOR USE

# What threat intelligence firms have documented (2023–2026)

Dark web forum discussions: AI-generated scripts shared as attack tools (lower-skill actors)

Infostealer variants: AI-generated code variants deployed in commodity malware campaigns

Phishing kit improvements: AI-generated convincing phishing page HTML and JavaScript

Script automation: AI-written automation scripts reducing attack operational burden

# Who benefits most from AI code generation (honest assessment)

Lower-skill actors: AI lets them produce code they couldn’t write manually

Speed: more experienced actors work faster with AI assistance

NOT primarily: nation-state groups whose manual capabilities exceed what AI currently produces

# The threat actor AI toolkit (as documented in public threat intel)

Commercial LLMs with jailbreaks for initial code generation

Private/local models without safety filters for more targeted use

Specialised underground AI tools marketed to criminal communities

How Defenders Are Responding

The security industry has not been standing still while AI-assisted threats evolved. The most significant defensive adaptation is the shift away from purely signature-based detection toward behaviour-based and ML-enhanced approaches that are inherently more resilient to variant generation at scale.

DEFENSIVE ADAPTATIONS — 2026

# AV/EDR evolution

Major AV vendors (CrowdStrike, SentinelOne, Microsoft Defender) have shifted to ML+behaviour

These approaches detect what code does, not what it looks like — AI variant generation doesn’t defeat this

Real-time sandboxing: suspicious files detonated in isolation before execution

# AI being used for defence too

AI models trained specifically to detect AI-generated code patterns

Adversarial ML training: defensive models trained against AI-generated variants

Threat intelligence AI: faster detection of new variant families across customer telemetry

# The practical guidance for defenders

Prioritise EDR with behaviour detection over purely signature-based AV

Enable sandboxing features — executables and Office macros especially

Patch fast: AI-accelerated vulnerability exploitation narrows the patch window

Network segmentation: limits blast radius even if endpoint protection is bypassed

What Organisations Should Do Now

My practical guide for security teams and IT managers updating their defences to account for AI-assisted threats. My core insight after reviewing the research: the controls that protect against AI-generated variants are largely the same controls that protect against any novel threat — behaviour detection, layered defences, fast patching, and network segmentation.

DEFENSIVE POSTURE — AI THREAT ADAPTATION

# Endpoint protection

If still running legacy AV: upgrade to EDR with behaviour-based detection

CrowdStrike Falcon, SentinelOne, Microsoft Defender for Endpoint — all have AI-era capabilities

Enable all behaviour monitoring features, not just signature scanning

# Email and web gateway

Enable sandboxing for email attachments — detonate before delivery

URL rewriting and detonation: check links when clicked, not just at receipt

# Patch management urgency

AI-assisted exploitation compresses the window between CVE publication and exploitation

Critical CVEs on internet-facing systems: patch within 48–72 hours, not 30 days

# Detection investment

The AI arms race in security means detection quality matters more than before

Invest in: threat hunting, incident response capability, and detection engineering

How AI Safety Systems Create Barriers

A significant part of the AI malware story that often gets missed: commercial AI platforms have implemented safety systems that create real friction for malicious code generation. My assessment of how effective these barriers are and where the gaps remain.

AI SAFETY BARRIERS — HOW THEY WORK

# What commercial AI safety systems do

Content classifiers: detect and refuse requests to generate malicious code

Context analysis: identify when a code request has malicious framing

API logging: commercial platforms log all API usage — accountability exists

# Where barriers have gaps

Jailbreaking: safety systems can sometimes be bypassed through creative prompting

Local models: open-source models run locally have no safety filters and no logging

Underground tools: purpose-built criminal AI tools with no safety restrictions documented

Fragmented code: requesting individual components that are later assembled manually

# The overall picture

Commercial AI: provides meaningful barrier — raises the effort required significantly

Determined, skilled actors: will find or create tools without safety restrictions

The risk reduction from safety barriers is real, but not a complete solution

The Democratisation Risk — Lower Skill Floor

My assessment of the most important implication of AI-assisted malware development for security teams is not that nation-state actors have a new tool — those actors already had sophisticated manual capabilities. The more impactful concern is the skill floor reduction: actors who previously lacked the technical expertise to develop functional malicious code can now produce it with AI assistance.

DEMOCRATISATION OF ATTACK CAPABILITY

# Pre-AI skill requirement for attack code development

Functional scripting and automation: intermediate skill required

Custom malware: advanced skill, typically years of development experience

Evasion techniques: specialist knowledge of AV engine internals

# Post-AI skill requirement change

Basic scripting and automation: now accessible to minimal-skill actors

Custom malware: AI reduces skill requirement — still needs human expertise to weaponise

Net result: more actors capable of producing functional attack tools than before

# Why this matters for defence

More actors + lower barrier = higher attack volume overall

SOC teams see more low-quality attacks from actors who couldn’t attack before

Defensive automation (AI-assisted detection) is the appropriate scale response

The Open-Source Model Risk

Commercial AI platforms implement safety barriers that create friction for malicious use. Open-source models run locally have no such barriers. My concern about this for the threat landscape: the democratisation effect is amplified when safety-unrestricted models are accessible at zero cost.

OPEN-SOURCE MODEL THREAT LANDSCAPE

# What open-source models change

No content filtering: models run locally can be used without any safety restrictions

No logging: no audit trail — usage is unattributable

No cost: Llama, Mistral, and others are free to download and run

Fine-tuning: models can be trained specifically for malicious code generation tasks

# Documented underground tools (threat intel)

WormGPT: fine-tuned model marketed to cybercriminals, documented by SlashNext (2023)

FraudGPT: similar underground LLM for fraud use cases

Both tools were documented by security researchers, reported widely, and saw their infrastructure disrupted — but successor tools continue to emerge

# What this means for defenders

Do not assume commercial AI safety controls represent the attacker’s full toolkit

Actors willing to use underground tools or run local models face no AI safety friction

Behaviour-based detection remains the right answer — agnostic to how the code was produced

AI-Generated Malware — Key Points for Defenders in 2026

Published research confirms AI assists malware generation — variant creation at scale is the main threat

Detection challenge: signature-based AV struggles with high-volume AI-generated variants

What works: behaviour-based EDR, sandboxing, network segmentation — all AI-resilient

Real-world impact: mostly lower-skill actors so far — AI enabling code generation for those who couldn’t before

Action: upgrade legacy AV to behaviour-based EDR, enable sandboxing, accelerate patch cycles

AI Malware Threats — Your Defensive Response

The defensive adaptations that most directly address AI-generated malware threats are the same ones that improve your overall security posture: behaviour-based EDR, sandboxing, faster patching, and network segmentation. Start with your EDR — if it is still purely signature-based, it is already inadequate for the current threat landscape regardless of AI.

Quick Check

A security team is told that AI can now generate thousands of malware variants per hour. Which defensive response most directly addresses this specific threat?

Frequently Asked Questions

Can AI actually write working malware?

Published research confirms that AI tools can generate functional code that serves malicious purposes, particularly when combined with human expertise for refinement and deployment. CyberArk, Check Point Research, and others have documented this capability. The threat is real but differs from sensationalised headlines: AI assists and accelerates malware development rather than autonomously creating sophisticated targeted attacks. The most documented real-world use is lower-skill actors using AI to produce code they couldn’t write manually.

Why is AI-generated malware harder to detect?

Traditional signature-based antivirus detection works by matching code patterns against a database of known malicious signatures. AI can generate many functional variants of the same malicious code with different byte patterns, syntax, and structure — each evading existing signatures while maintaining the same malicious capability. Behaviour-based detection (monitoring what code does rather than what it looks like) is inherently more resilient to this because the malicious behaviours (process injection, C2 communication, data exfiltration) remain consistent across variants.

What is the best protection against AI-generated malware?

Behaviour-based EDR (Endpoint Detection and Response) with sandboxing is the most directly effective control against AI-generated malware variants. Major platforms — CrowdStrike Falcon, SentinelOne, Microsoft Defender for Endpoint — all offer behaviour-based detection that catches malicious activity regardless of code surface appearance. Complementing this with email attachment sandboxing, fast patch cycles, and network segmentation creates a layered defence that addresses the threat across multiple stages.

Are AI companies doing anything to prevent malware generation?

Yes — all major commercial AI platforms (OpenAI, Anthropic, Google) have safety systems designed to detect and refuse requests to generate malicious code. These systems are not perfect — researchers have documented bypasses — but they create meaningful friction. AI companies update these systems as new bypass techniques are discovered. Additionally, all major commercial AI platforms log API usage, which creates accountability that purely local AI models (run on personal hardware) do not have.

→ Technical Deep Dive

AI-Generated Malware and AV Bypass — Red Team Guide

→ Related

Adversarial Machine Learning 2026

Can AI Write Malware? What the Research Shows — And What Defenders Must Know (2026)