LLM security field atlas

LLM Threat Coverage Atlas

Overall coverage map, not a perfect score. This atlas is a map of LLM attack surfaces: use it to ask better questions across prompt input, RAG, memory, tools, identity, quorum approval, MCP plugins, output handling, and incident response.

480currently mapped vectors

Inventory is not completeness

The 480 leaves are review prompts for coverage, not a grade, target score, or claim that the threat model is finished. Real coverage depends on your architecture, data sensitivity, tool permissions, tenant boundaries, deployment model, and human approval flow.

Domain counts show coverage density, not risk score. A system with one powerful tool may be riskier than a system with dozens of prompt-only vectors, so use the map to find missing surfaces and then prioritize by blast radius.

364137554124402825272225212830
Traffic signal

Visits and dwell time

Free public counters. Totals are approximate and do not expose raw IP addresses.

...page views
...unique browser-days
...foreground minutes
...1m+ sessions
01Input authorityPrompt injection, instruction hierarchy, templates, metadata, hidden fields, and role impersonation.
02Context and retrievalRAG, embeddings, vector stores, memory, cache bleed, corpus poisoning, and stale authorization.
03Identity boundariesUser, tenant, service account, delegated identity, token scope, and authorization propagation.
04Tools and actionsFunction calls, browser automation, code execution, file access, APIs, side effects, and egress.
05Model supply chainModels, adapters, prompts, datasets, guardrails, parsers, providers, and deployment changes.
06Approval and agencyQuorum, human review, autonomous loops, multi-agent delegation, rubber-stamping, and race conditions.
07Output and user trustGenerated HTML, Markdown, SQL, code, reports, citations, UI wording, and downstream ingestion.
08Operations and responseLogging, telemetry, cost abuse, kill switches, rollback, memory purge, and incident reconstruction.

Measurement model

Use the leaf score as a starting point only. Validate applicability first, then score likelihood and impact after looking at architecture, controls, exposure, and blast radius.

1-5Likelihood: exposure, attacker control, repeatability, and control bypass ease.
1-5Impact: data sensitivity, tool privilege, tenant reach, reversibility, and legal harm.
L x IStarter score: Critical 20-25, High 15-19, Medium 8-14, Low 1-7.
EvidenceEvery applicable leaf needs owner, test, logs, control proof, and residual-risk decision.
174 Prompt-only chatbot
157 RAG / knowledge assistant
362 Tool-using agent
180 Multi-agent or quorum workflow
60 MCP / plugin ecosystem
41 Multimodal, voice, or computer-use
134 Training, fine-tuning, or model ops
268 Governance, privacy, and audit

Framework cross-walk

Use this to reconcile the A-O domains with OWASP LLM, OWASP Agentic, OWASP MCP, MITRE ATLAS, MITRE ATT&CK, NIST AI RMF, privacy, provenance, and governance workstreams.

Atlas domainPrimary mappingsConfidence note
APrompt and Input Manipulation LLM01:2025 Prompt InjectionLLM07:2025 System Prompt LeakageASI01 Agent Goal HijackAML.T0051 LLM Prompt InjectionAML.T0068 LLM Prompt ObfuscationT1027 Obfuscated Files or InformationMAPMEASURE Domain-level mapping; leaf cards add keyword-derived technique chips.
BRAG, Context, Memory, and Embeddings LLM08:2025 Vector and Embedding WeaknessesLLM04:2025 Data and Model PoisoningLLM02:2025 Sensitive Information DisclosureASI06 Memory & Context PoisoningAML.T0070 RAG PoisoningAML.T0080 AI Agent Context PoisoningAML.T0080.000 MemoryAML.T0064 Gather RAG-Indexed TargetsT1005 Data from Local SystemT1213 Data from Information RepositoriesMAPMEASUREMANAGEGDPR Art. 17CCPA deletion rights Domain-level mapping; leaf cards add keyword-derived technique chips.
CSensitive Data and Privacy LLM02:2025 Sensitive Information DisclosureLLM07:2025 System Prompt LeakageASI03 Identity & Privilege AbuseAML.T0057 LLM Data LeakageAML.T0056 Extract LLM System PromptAML.T0024 Exfiltration via AI Inference APIT1552 Unsecured CredentialsT1020 Automated ExfiltrationGOVERNMAPMEASUREMANAGEGDPR Art. 17CCPA privacy rights Domain-level mapping; leaf cards add keyword-derived technique chips.
DTool Use, Function Calling, and Execution LLM06:2025 Excessive AgencyLLM05:2025 Improper Output HandlingASI02 Tool MisuseASI05 Unexpected Code ExecutionAML.T0053 AI Agent Tool InvocationAML.T0086 Exfiltration via AI Agent Tool InvocationAML.T0101 Data Destruction via AI Agent Tool InvocationT1059 Command and Scripting InterpreterT1190 Exploit Public-Facing ApplicationMAPMEASUREMANAGE Domain-level mapping; leaf cards add keyword-derived technique chips.
EQuorum, Approval, Consensus, and Control Gates LLM06:2025 Excessive AgencyASI09 Human-Agent Trust ExploitationASI08 Cascading FailuresAML.T0053 AI Agent Tool InvocationT1078 Valid AccountsGOVERNMANAGEISO/IEC 42001 controls Domain-level mapping; leaf cards add keyword-derived technique chips.
FIdentity, Authorization, and Tenant Boundaries LLM02:2025 Sensitive Information DisclosureLLM06:2025 Excessive AgencyASI03 Identity & Privilege AbuseAML.T0055 Unsecured CredentialsAML.T0083 Credentials from AI Agent ConfigurationAML.T0098 AI Agent Tool Credential HarvestingT1078 Valid AccountsT1552 Unsecured CredentialsGOVERNMAPMANAGEISO/IEC 42001 controls Domain-level mapping; leaf cards add keyword-derived technique chips.
GSupply Chain, Models, Datasets, and Deployment LLM03:2025 Supply ChainLLM04:2025 Data and Model PoisoningASI04 Agentic Supply Chain VulnerabilitiesAML.T0010 AI Supply Chain CompromiseAML.T0019 Publish Poisoned DatasetsAML.T0020 Poison Training DataAML.T0058 Publish Poisoned ModelsT1195 Supply Chain CompromiseGOVERNMAPMEASUREMANAGEISO/IEC 42001 controlsEU AI Act lifecycle controls Domain-level mapping; leaf cards add keyword-derived technique chips.
HOutput Handling and Downstream Injection LLM05:2025 Improper Output HandlingLLM09:2025 MisinformationASI09 Human-Agent Trust ExploitationAML.T0077 LLM Response RenderingAML.T0067 LLM Trusted Output Components ManipulationT1059 Command and Scripting InterpreterT1566 PhishingMEASUREMANAGEC2PA content provenance Domain-level mapping; leaf cards add keyword-derived technique chips.
IDenial of Service, Cost Abuse, and Reliability LLM10:2025 Unbounded ConsumptionASI08 Cascading FailuresAML.T0034.001 Resource-Intensive QueriesAML.T0034.002 Agentic Resource ConsumptionAML.T0046 Spamming AI System with Chaff DataT1499 Endpoint Denial of ServiceMEASUREMANAGE Domain-level mapping; leaf cards add keyword-derived technique chips.
JModel Extraction, Inference, and Safety Evasion LLM07:2025 System Prompt LeakageLLM04:2025 Data and Model PoisoningLLM02:2025 Sensitive Information DisclosureASI01 Agent Goal HijackAML.T0024.000 Infer Training Data MembershipAML.T0024.001 Invert AI ModelAML.T0024.002 Extract AI ModelAML.T0056 Extract LLM System PromptT1592 Gather Victim Host InformationMAPMEASUREMANAGEC2PA content provenance Domain-level mapping; leaf cards add keyword-derived technique chips.
KMulti-Agent and Delegation Risks LLM06:2025 Excessive AgencyASI07 Insecure Inter-Agent CommunicationASI08 Cascading FailuresAML.T0108 AI AgentAML.T0080 AI Agent Context PoisoningAML.T0081 Modify AI Agent ConfigurationT1053 Scheduled Task/JobGOVERNMAPMANAGEISO/IEC 42001 controls Domain-level mapping; leaf cards add keyword-derived technique chips.
LMultimodal, Document, and File-Based Inputs LLM01:2025 Prompt InjectionLLM05:2025 Improper Output HandlingASI01 Agent Goal HijackASI09 Human-Agent Trust ExploitationAML.T0051.001 IndirectAML.T0052.001 Deepfake-Assisted PhishingAML.T0088 Generate DeepfakesT1566 PhishingMAPMEASUREMANAGEC2PA content provenance Domain-level mapping; leaf cards add keyword-derived technique chips.
MHuman Factors, UI, and Social Engineering LLM09:2025 MisinformationLLM06:2025 Excessive AgencyASI09 Human-Agent Trust ExploitationAML.T0052 PhishingAML.T0100 AI Agent ClickbaitT1566 PhishingGOVERNMEASUREMANAGEEU AI Act transparency obligationsISO/IEC 42001 controls Domain-level mapping; leaf cards add keyword-derived technique chips.
NMonitoring, Audit, Incident Response, and Governance LLM10:2025 Unbounded ConsumptionLLM02:2025 Sensitive Information DisclosureASI08 Cascading FailuresAML.T0084 Discover AI Agent ConfigurationAML.T0085 Data from AI ServicesTA0040 ImpactTA0010 ExfiltrationGOVERNMAPMEASUREMANAGEEU AI ActISO/IEC 42001GDPR Art. 17 Domain-level mapping; leaf cards add keyword-derived technique chips.
OMCP, Plugin, and Agent Server Specific Risks LLM01:2025 Prompt InjectionLLM03:2025 Supply ChainLLM06:2025 Excessive AgencyASI02 Tool MisuseASI04 Agentic Supply Chain VulnerabilitiesMCP1:2025 Token Mismanagement & Secret ExposureMCP2:2025 Privilege Escalation via Scope CreepAML.T0110 AI Agent Tool PoisoningAML.T0104 Publish Poisoned AI Agent ToolAML.T0098 AI Agent Tool Credential HarvestingAML.T0099 AI Agent Tool Data PoisoningT1195 Supply Chain CompromiseT1552 Unsecured CredentialsGOVERNMAPMEASUREMANAGEISO/IEC 42001 controls Domain-level mapping; leaf cards add keyword-derived technique chips.

Usability hooks

Each leaf has a stable hash URL such as #LLM-001, searchable metadata, architecture tags, a starter score, and framework chips. Export the same catalog to JSON or CSV for Jira, GRC, spreadsheets, or control libraries.

How to read a leaf bubble

The badge is a stable vector ID, the bold line names the attack path, and the smaller line is the question to ask during threat modeling. Treat each bubble as a review checkpoint: decide whether the vector applies, what trust boundary it crosses, what control should stop it, and how you would prove that control works.

LLM-091 Quorum bypass Can privileged actions execute without the required approvals?

Workshop rhythm

01
Filter by architectureSearch for RAG, tool, MCP, memory, quorum, output, or supply chain depending on what the system actually uses.
02
Mark trust boundariesFocus first where untrusted content, sensitive data, privileged tools, external connectors, and autonomous decisions meet.
03
Turn leaves into controlsConvert relevant bubbles into validation checks, approval gates, isolation rules, audit events, or abuse-case tests.
04
Record the decisionFor every applicable leaf, capture owner, impact, existing control, missing evidence, and residual risk.
Leaf anatomy

Each bubble is a testable question

A good review does not stop at "could this happen?" It asks where the input enters, which identity or tool is used, what data is exposed, and what deterministic control prevents the model from turning text into authority.

  • ID for tracking across notes and tickets
  • Attack path name for fast triage
  • Question phrased as a design-review check
Relevance test

Decide whether the vector applies

A leaf is relevant when the system has the matching capability or trust boundary. Prompt-only chatbots will not need every tool risk, but RAG agents with browser, email, filesystem, or payment tools should review the high-impact clusters first.

  • Does the system ingest untrusted content?
  • Does it retrieve private or tenant-scoped data?
  • Can it call tools, write data, or trigger actions?
Evidence

Capture more than a yes or no

When a vector applies, record the concrete component, input source, actor, privilege, expected control, and verification method. The atlas is useful when each leaf becomes evidence that engineering, security, and product can act on.

  • Preconditions and abuse path
  • Preventive, detective, and recovery controls
  • Test case, log signal, or approval artifact
Prioritization

Start where blast radius is highest

Prioritize leaves that combine private data, untrusted input, autonomous execution, weak identity boundaries, or irreversible actions. Those combinations usually create the biggest real-world LLM incidents.

  • Privileged tools or service accounts
  • Cross-tenant retrieval or memory
  • Quorum, approval, and audit bypass
480 matching vector prompts
A 36

Threat Domain

Prompt and Input Manipulation

B 41

Threat Domain

RAG, Context, Memory, and Embeddings

C 37

Threat Domain

Sensitive Data and Privacy

D 55

Threat Domain

Tool Use, Function Calling, and Execution

E 41

Threat Domain

Quorum, Approval, Consensus, and Control Gates

F 24

Threat Domain

Identity, Authorization, and Tenant Boundaries

G 40

Threat Domain

Supply Chain, Models, Datasets, and Deployment

H 28

Threat Domain

Output Handling and Downstream Injection

I 25

Threat Domain

Denial of Service, Cost Abuse, and Reliability

J 27

Threat Domain

Model Extraction, Inference, and Safety Evasion

K 22

Threat Domain

Multi-Agent and Delegation Risks

L 25

Threat Domain

Multimodal, Document, and File-Based Inputs

M 21

Threat Domain

Human Factors, UI, and Social Engineering

N 28

Threat Domain

Monitoring, Audit, Incident Response, and Governance

O 30

Threat Domain

MCP, Plugin, and Agent Server Specific Risks