Abstract

AttackFactory, our new vulnerability research project, demonstrates public disclosures can now be turned into validated exploit evidence in minutes, not days.

According to the latest reporting from Google Threat Intelligence, adversaries are already using AI to speed up vulnerability exploitation at scale.

While AI developed zero day exploits are a real concern, the more immediate operational risk is AI accelerating exploitation of vulnerabilities that are already public. Once a CVE is disclosed, attackers often have the affected versions, the advisory, and the patch. That is enough for an automated workflow to assess exploitability, create the exploit code and discover vulnerable systems. Defenders need matching automation for intake, exposure mapping, prioritisation, and remediation.

1. Overview

AttackFactory takes a newly disclosed CVE, finds the relevant patch, assesses the likely impact, creates exploit code, and verifies the result in an isolated lab.

CVE appears → patch is found → risk is assessed → exploit code is created → lab validation confirms impact

2. Vulnerability Selection

AttackFactory prioritises issues that are most likely to be exploited by attackers.

  • Open source code, as it is public, the vulnerable and fixed versions can be compared.
  • High and critical CVEs are targeted for maximum impact.
  • Internet-facing software is easier for attackers to discover.

3. Architecture

AttackFactory follows a single validation flow from disclosure to evidence.

flowchart TD
  cve["New CVE"] --> classified["Classified"]
  classified --> patch["Patch reviewed"]
  patch --> lab["Lab environment built"]
  lab --> exploit["Exploit code created"]
  exploit --> test["Exploit tested"]
  test --> works{"Exploit works?"}
  works -- "Yes" --> patched["Patch applied in lab"]
  works -- "No" --> exploit
  patched --> confirmed["Exploit fails after patch"]
  confirmed --> code["Exploit code"]
  confirmed --> report["Technical report and evidence"]

4. Agentic Attack Pattern: CVE Release to Targeting in Minutes

The primary concern is attackers with the ability to connect exploit code generation with other autonomous workflows: discovery, discovering internet exposed vulnerable systems, and launching automated attacks.

flowchart LR
  feed["CVE Feed"] --> attackfactory["AttackFactory"]
  feed --> hunter["Exposure Hunter"]
  attackfactory --> merged((" "))
  hunter --> merged
  merged --> remote["Remote Attack"]

For attackers, this becomes a release-to-targeting pipeline. For defenders, the same pattern needs to run in reverse: identify the issue, determine whether it matters, find exposed assets, prioritise by business impact, apply patches or controls, and verify remediation.

Human-centred vulnerability management often assumes days of analysis. Agentic systems can run continuously.

5. Risk of Known Vulnerabilities

AI-assisted zero-day discovery is a serious concern, but N-day exploitation is the more immediate operational problem. N-day means the vulnerability is already public.

  • The affected versions are usually known.
  • The patch often explains the bug.
  • Exploitability can be tested in a lab.
  • Public exposure can be searched at scale.
  • Many organisations will still be waiting on patch windows.

6. Defender Use Case

For defenders, AttackFactory can move vulnerability response from advisory tracking to evidence-based validation. When paired with Seer and public infrastructure visibility, the workflow can identify relevant emerging vulnerabilities, show which edge-facing systems are discoverable from the outside, create PoCs for validation against those technologies, and produce detection content such as YARA rules for exploitation attempts.

  • Seer: identify high-signal vulnerabilities and candidate disclosures affecting watched technologies.
  • Public infrastructure visibility: use the same external discovery perspective attackers rely on to understand what is exposed.
  • AttackFactory: create controlled PoCs to validate whether discoverable edge systems are actually vulnerable.
  • Detection output: generate YARA rules and supporting evidence from the exploit path and observed artefacts.
  • Outcome: combine exposure, exploitability, and detection evidence to find and verify weaknesses before threat actors can act on the same public information.

This is not a replacement for governance or human approval. It is a way to move vulnerability response at the speed now being enabled by AI workflows, while keeping testing controlled, auditable, and defensively focused.

7. Validated Results: End-to-End Pipeline

For the evaluated CVE's, AttackFactory developed a working exploit in minutes.

7.1 Summary: Four CVE Types, One Pipeline

CVE Risk Type Severity Project Business Impact Validated
CVE-2026-40982 Path traversal 9.1 CRITICAL Spring Cloud Config Remote file exposure Yes
CVE-2026-42047 Information disclosure 8.6 HIGH Inngest JS SDK Secrets-like environment data exposure Yes
CVE-2026-8133 SQL injection 7.3 HIGH FilePress CMS Database extraction Yes
CVE-2026-43186 Memory corruption 8.2 HIGH Linux kernel IOAM6 Remote crash/corruption Yes

The given CVEs are illistrative of recent High/Critical CVEs in open source software - the key is that a single automated workflow handled web application, Java, TypeScript, PHP, database, and kernel-level cases without changing the operating model for each domain.

7.2 CVE-2026-40982 — Spring Cloud Config Path Traversal (CVSS 9.1 CRITICAL)

Attribute Value
Project Spring Cloud Config Server
Business risk Remote file read from a configuration service
Pipeline runtime ~4 minutes
Validation result Vulnerable version exposed a system file; patched version blocked the request

Outcome: A remote attacker could use the service to read files outside the intended configuration path. For a configuration server, that creates a credible risk of exposing sensitive application or environment data.

7.3 CVE-2026-42047 — Inngest JS Environment Variable Disclosure (CVSS 8.6 HIGH)

Attribute Value
Project Inngest JS SDK
Business risk Unauthenticated exposure of server environment variables
Pipeline runtime ~4 minutes
Validation result Vulnerable version leaked secrets-like values; patched version did not

Outcome: The vulnerable handler could disclose environment variables, including values that commonly hold database URLs, API keys, cloud credentials, and signing secrets. That is a direct confidentiality risk and a potential stepping stone to wider compromise.

7.4 CVE-2026-8133 — FilePress Pre-Auth SQL Injection (CVSS 7.3 HIGH)

Attribute Value
Project FilePress CMS
Business risk Unauthenticated database access through application input
Pipeline runtime ~11 minutes
Validation result Vulnerable version allowed database extraction in the lab; patched version rejected the payload

Outcome: A public-facing content system accepted input that could influence database queries before login. The lab proof confirmed that database information could be extracted and that the patch removed the behaviour.

7.5 CVE-2026-43186 — Linux Kernel IOAM6 Memory Corruption (CVSS 8.2 HIGH)

Attribute Value
Project Linux kernel
Business risk Remote packet-triggered kernel crash and memory corruption
Pipeline runtime ~25 minutes, including virtual machine boot and crash capture
Validation result Packet-only denial-of-service/corruption validated; packet-only arbitrary RCE not validated

Outcome: A crafted IPv6 packet could crash a vulnerable kernel in the lab.

8. Conclusion

AttackFactory demonstrates a change in vulnerability economics. Public disclosure is no longer the beginning of a slow manual research cycle. It can be the trigger for continuous automated analysis, validation, and exposure correlation.

The old model assumed:

Disclosure → manual analysis → exploit development (days) → target discovery → compromise

Updated agentic model:

Disclosure → automated analysis, development, testing, target discovery → compromise (hours)

For defenders, the response must be equally operational. Security teams need systems that can ingest disclosures, identify exposed assets, validate exploitability, and trigger remediation faster than human-only processes can manage.