Home / Security & Performance / Defending Against the Rise of AI-Driven Cybercrime

Defending Against the Rise of AI-Driven Cybercrime

May 18, 2026 Interview

Kendra HainesNetwork Security Specialist

Matilda Bailey is a distinguished networking specialist whose work sits at the critical intersection of infrastructure and emerging intelligence. With a deep focus on how cellular and wireless technologies integrate with next-generation solutions, she has spent years navigating the evolving threat landscape that businesses face today. As artificial intelligence moves from a novelty to a core operational tool, Bailey’s expertise has become essential for organizations trying to outpace increasingly sophisticated threat actors. In this discussion, we explore the alarming rise of autonomous cyberattacks, the financial devastation caused by deepfakes, and the rigorous governance structures required to secure the modern digital enterprise.

AI-driven cybercrime has transitioned from simple automated spam to fully autonomous agentic AI that manages the entire attack lifecycle. How does this shift change the speed of a typical breach, and what specific technical hurdles do security teams face when defending against human-free operations? Please elaborate with step-by-step defensive strategies.

The shift to agentic AI represents a fundamental change in the tempo of cyber warfare, moving from human-speed decision-making to machine-speed execution. In the past, a hacker had to manually pivot from reconnaissance to exploitation, but today’s AI agents can plan and execute an entire series of steps across an attack lifecycle with minimal human oversight. This drastically compresses the time between initial entry and full system compromise, often leaving security teams struggling to react before the damage is done. To defend against these human-free operations, organizations must first deploy a new generation of AI-specific security tools that can identify prompt injection and adaptive behaviors. Second, it is vital to establish strict governance that keeps a “human in the loop” for critical decisions, ensuring that automated systems don’t have unchecked authority over sensitive data. Finally, a robust defense requires real-time monitoring of network traffic for the subtle, rapid-fire signatures that indicate an autonomous agent is testing the perimeter.

Deepfake technology now allows for realistic voice clones and video impersonations, leading to massive financial losses and eroded customer trust. Beyond multi-channel verification, what cultural shifts are needed within a company to foster a “verification first” environment, and how can leaders implement these changes without hindering productivity?

The psychological weight of seeing a CFO’s face or hearing their voice on a video call is incredibly difficult to overcome, as evidenced by the British multinational engineering firm that lost $25 million to a deepfake video conference. To combat this, we need a cultural shift where skepticism is seen as a professional asset rather than a lack of trust among colleagues. Leaders must normalize the practice of “stopping the clock” on any high-stakes request—such as urgent wire transfers or credential resets—to perform a secondary check through a different, secure internal channel. This verification shouldn’t feel like a bureaucratic hurdle; it should be integrated into the workflow as a standard safety protocol, much like a pilot’s pre-flight checklist. By framing these checks as a collective defense of the company’s $1.1 billion at-risk assets—an estimate for deepfake losses in 2025—employees feel empowered to question even the most convincing digital impersonations.

Modern malware can now modify its own code signature and generate harmful scripts in real-time to evade traditional detection. What are the limitations of standard antivirus software in this scenario, and how can enterprises transition to security tools that specifically identify and block prompt-driven malicious behavior?

Standard antivirus software is largely built on the concept of recognizing known signatures, but when malware like PromptLock can generate harmful Lua scripts in real-time based on text prompts, those static signatures become useless. This autonomous malware can independently decide whether to steal or encrypt data based on the environment it finds, working across Windows, Linux, and macOS simultaneously. Enterprises must move away from reactive, signature-based tools and toward behavioral AI security platforms that analyze the intent and origin of system commands. These advanced tools are designed to recognize “prompt-driven” anomalies, such as an LLM suddenly requesting access to sensitive system files or attempting to rewrite its own execution parameters. Transitioning to this model requires a deep audit of current infrastructure and the implementation of security tools specifically engineered to intercept the output of “criminal LLMs” before they can execute their payloads.

Even a minor disturbance of 0.001% in training data can drastically reduce the accuracy of internal AI models or create hidden backdoors. When building proprietary models, what rigorous processes ensure the integrity of the data supply chain, and what metrics should companies monitor to catch subtle deviations in model output?

The terrifying reality of data poisoning is that a microscopic disturbance of just 0.001% in a training set can lead to a staggering 30% reduction in model accuracy or create invisible backdoors for attackers. To ensure the integrity of the data supply chain, companies must implement a rigorous “chain of custody” for every data point used in their proprietary models, treating data with the same scrutiny as a physical manufacturing supply chain. This involves using cryptographic hashing to verify data sets and conducting regular “clean room” audits where the model is tested against known-good benchmarks to see if its logic has drifted. Metrics like “output variance” and “adversarial robustness scores” should be monitored constantly to catch the subtle deviations that suggest a model has been compromised. If a customer service chatbot begins giving slightly inappropriate or “off-book” results, it could be the first sign of a poisoned model that has been operating under the radar for months.

Establishing an Acceptable Use Policy and maintaining an AI Bill of Materials are critical components of modern governance. How do these documents help manage the risks of shadow AI, and what specific components must be cataloged to ensure a business can trace the origin of every model response or decision?

Governance is the only way to prevent “shadow AI” from turning an organization’s internal network into a playground for threat actors. An Acceptable Use Policy (AUP) sets the ground rules, ensuring that employees understand the risks of data leaks and copyright infringement when they turn to unauthorized AI tools for productivity. Meanwhile, an AI Bill of Materials (SBOM) acts as a comprehensive ledger that catalogs the specific models, datasets, and prompts used to build any given system. By documenting these components, a business can trace a faulty or malicious decision back to its source, answering critical questions about where a model came from and what data influenced its behavior. This level of transparency is essential for maintaining digital trust and ensuring that the organization can quickly patch or replace a compromised component in its AI ecosystem.

Sophisticated AI threats often require cross-industry collaboration and the adoption of frameworks like the OWASP Top 10 for LLMs. How does sharing real-time threat intelligence help firms defend against “criminal LLMs,” and what role does global community participation play in staying ahead of emerging jailbreak techniques?

Defending against “criminal LLMs” like WormGPT, GhostGPT, or the open-source KawaiiGPT is a task far too large for any single company to handle in isolation. These tools are specifically designed to bypass the safety restrictions of standard AI models, and their tactics evolve daily as hackers find new ways to “jailbreak” legitimate systems. By participating in global threat intelligence feeds, firms can share signatures of new prompt injection payloads and indicators of compromise as they appear in the wild. Frameworks like the OWASP Top 10 for LLMs provide a common language for security professionals to describe these attacks, allowing for a faster and more coordinated response. When the community works together to document and neutralize a new jailbreak technique, it raises the “cost of entry” for criminals, making it much harder for them to successfully exploit the broader ecosystem.

What is your forecast for the future of AI crimes?

I believe we are entering an era of “hyper-personalized” warfare where AI crimes will move from mass-market phishing to surgically precise, long-term infiltrations. In the coming years, we will see the rise of AI agents that can maintain a believable presence within a company’s Slack or email threads for months, slowly building trust and gathering intelligence before striking. The financial impact will likely grow as deepfake technology becomes indistinguishable from reality, making traditional remote identity verification almost obsolete. However, I also forecast a massive surge in “defensive AI” capabilities, where autonomous security agents will live on our networks, constantly hunting for and neutralizing threats before they ever reach a human user’s screen. The battle will ultimately be won by those who can govern their AI systems with the most transparency and speed, turning the technology into a shield that is just as powerful as the weapons used against us.

Defending Against the Rise of AI-Driven Cybercrime

Related Publications

Subscribe to our weekly news digest.