Wiki / Blog
AI Security Research Blog
AI-SPM research and engineering wiki.
15 practitioner articles on AI-SPM, LLM security testing, MCP security, RAG security, and AI compliance. Written by the engineering, security research, and compliance teams.
15 articles, newest first. Click a category in the left sidebar to narrow the list; the column headers below are static labels.
| Article | Category | Read | Published |
|---|---|---|---|
| The Q1 2026 GenAI Exploit Round-up: Eight Incidents, One CVE A read-through of the quarter's eight notable GenAI security incidents, why only one of them carried a CVE, and what that gap means for how you track AI risk. | Attacks and defence | 8 min | |
| The First Autonomous AI-Agent Intrusion: What It Means for Defenders An LLM agent reportedly ran a full intrusion, from RCE to database exfiltration, in under an hour with no operator. What changed, and where you catch it. | Attacks and defence | 7 min | |
| The Self-Propagating AI Worm: Separating the Signal From the Panic Researchers demonstrated an open-weight LLM driving a self-propagating worm across a simulated network. Here is what actually changed for defenders, and what did not. | Attacks and defence | 7 min | |
| AI Security in the First Half of 2026: The Breaches That Ended the Debate H1 2026 AI security review: the breaches that turned theory into incident queues, why posture management is now essential, and the H2 outlook. | Attacks and defence | 11 min | |
| Frontier Agents Cut Both Ways - Opus 4.8, Dynamic Workflows, and the First In-the-Wild LLM-Agent Intrusion The week frontier models learned to run a thousand parallel subagents is the same week someone pointed one at a network and dumped a database in under an hour. Notes for anyone shipping agents to production. | Attacks and defence | 10 min | |
| MCP Tool Poisoning - How the Attack Works and How to Stop It MCP tool poisoning explained: line jumping, rug pulls, and the malicious tool descriptions that hijack AI agents, with byte-level payloads and the seven runtime controls that actually held. | Attacks and defence | 13 min | |
| EU AI Act Cybersecurity Requirements for High-Risk AI Systems A practitioner's guide to Article 15 cybersecurity, Article 9 risk management, and Article 17 quality management for high-risk AI providers. What auditors will actually ask in 2026. | Compliance and regulation | 8 min | |
| MCP Security Checklist - Securing Model Context Protocol Servers in Enterprise AI Systems Practical checklist for security teams reviewing MCP server deployments. Covers tool surface, permission scoping, indirect injection, confused deputy, and runtime enforcement. | Attacks and defence | 6 min | |
| Prompt Injection Testing for Enterprise LLM Apps How to test enterprise LLM applications against prompt injection in a way that produces auditor-acceptable evidence. Covers direct injection, indirect injection via RAG, tool-output injection, and judge bias. | Attacks and defence | 7 min | |
| AI-SPM vs LLM Security - What is the Difference? AI-SPM and LLM Security are complementary disciplines, not substitutes. LLM Security is the focused layer; AI-SPM is the broader programme. Here is what each covers. | AI-SPM fundamentals | 6 min | |
| What is AI Security Posture Management? AI Security Posture Management (AI-SPM) is the continuous process of discovering, assessing, securing, and proving the compliance posture of AI systems. Here is what it covers and why it matters. | AI-SPM fundamentals | 7 min | |
| How to Build an AI Asset Inventory That Survives the First Audit Building an AI asset inventory looks easy until the first auditor asks for it. Eight categories, two failure modes, and the practical script that worked for us. | AI-SPM fundamentals | 6 min | |
| OWASP Agentic Top 10 Walkthrough - What Actually Matters in Production A practitioner's reading of the OWASP Agentic Top 10 T1-T15 list. Which entries we see exploited in the wild, which are still mostly theoretical, and the controls that work. | Attacks and defence | 6 min | |
| NIST AI 600-1 Profile in 20 Minutes - What Auditors Care About A practitioner's reading of NIST AI 600-1, the Generative AI Profile under the NIST AI RMF. Which functions matter most, the three controls auditors ask about every time, and how to map them to live evidence. | Compliance and regulation | 6 min | |
| Vector Database Tenant Isolation - The Quiet Failure Mode We Keep Finding A walkthrough of the cross-tenant retrieval failure mode in production vector stores, why namespace separation alone is not enough, and the test that catches it. | Architecture and operations | 5 min |