Welcome to this edition of our AI Security Newsletter, where we’re tracking the rapid evolution of AI technologies and their security implications. This edition highlights the emergence of agentic browsers and autonomous AI agents reshaping commerce, with significant developments from Perplexity AI, OpenAI, and Google. We’re also covering critical security concerns, including a 156% surge in AI-driven supply chain attacks and new vulnerabilities in AI infrastructure. Additionally, we examine the business landscape with Anthropic’s profitability trajectory and strategic concerns about the US losing its AI edge due to energy constraints.
Risks & Security
Whisper Leak: Privacy Risks in Language Models Exposed
Researchers from Microsoft identified a novel side-channel attack, dubbed “Whisper Leak,” that threatens the confidentiality of user interactions with AI chatbots. By analyzing encrypted network traffic, attackers can infer sensitive topics of discussion despite end-to-end encryption. The findings underscore the need for stronger privacy measures, prompting industry partners like OpenAI and Microsoft to implement mitigations, emphasizing the importance of user privacy in AI communications.
Rising Threat: AI-Driven Supply Chain Attacks Increase by 156%
AI-enabled supply chain attacks have surged by 156% over the past year, showcasing the inadequate efficacy of traditional security measures against evolving threats. Characterized by polymorphism and context-awareness, AI-generated malware adapts to bypass defenses, complicating detection and response. Organizations are urged to adopt new strategies, including AI-aware security measures and thorough audits of code dependencies, to better safeguard against these sophisticated attacks.
Implementing AI in Security Operations: Lessons from Redis
Redis highlights the transformational impact of integrating AI in Security Operations Centers (SOCs). By leveraging AI tools, the average investigation time plummeted to about 10 minutes from hours, enhancing efficiency in alert triage. Key lessons include prioritizing transparency and aligning success metrics with organizational goals. As AI solutions evolve, establishing trust and improving workflows will be critical to realizing their full potential in cybersecurity strategies.
CyberSOCEval: New Benchmarks for LLMs in Cybersecurity
Meta AI has unveiled CyberSOCEval, an open-source benchmark suite designed to evaluate Large Language Models (LLMs) on Malware Analysis and Threat Intelligence Reasoning. Aimed at enhancing Security Operations Center (SOC) automation, the benchmarks reveal that while modern LLMs perform well, they are underutilized in real-world cybersecurity contexts. This initiative seeks to guide AI developers and improve models’ capabilities against evolving cyber threats.
AI-Powered Malware Evolves: Google Identifies New Threats
Google has reported the emergence of five AI-based malware families that adapt and conceal themselves during attacks, complicating detection efforts for security teams. Notably, PROMPTFLUX and PROMPTSTEAL utilize AI capabilities to dynamically alter their code and execute commands, illustrating a significant shift in cyber threats. As attackers increasingly leverage AI technology, the need for advanced detection measures becomes critical in the evolving landscape of cybersecurity.
AI Misuse Evolving: Google Threat Intelligence Report 2025
The Google Threat Intelligence Group highlights a concerning shift in AI misuse, with adversaries employing tools to alter malware behavior mid-execution, marking a new phase of cyber threats. Some malware now dynamically generates malicious scripts using large language models, while state-sponsored actors leverage AI for all attack lifecycle phases. Google emphasizes its commitment to responsible AI development, enhancing security measures to thwart emerging threats from malicious actors.
Vulnerabilities Discovered in AI Infrastructure of Ollama and NVIDIA Triton
Security researchers have identified significant vulnerabilities in Ollama and NVIDIA Triton Inference Server. These include critical flaws enabling remote code execution, such as a command injection vulnerability in Triton Server and several issues in Ollama, including CVE-2024-12886, rated at a CVSS of 7.5. The findings underscore a growing focus on AI infrastructure security, set to be presented at Black Hat Europe 2025.
Technology & Tools
Google Introduces Private AI Compute: Enhancing AI Security and Privacy
Google has unveiled Private AI Compute, a cloud-based platform designed to leverage Gemini models while ensuring user data privacy. This innovative system allows for advanced AI processing power without exposing personal information, even to Google. By employing a secure framework, Private AI Compute enables timely suggestions and interactions, marking a significant step in the evolution of user-focused AI experiences.
Kimi K2 Thinking: Open-Source Model Surpassing Proprietary Rivals
Moonshot AI’s Kimi K2 Thinking has emerged as a leading open-source AI model, surpassing both OpenAI’s GPT-5 and Anthropic’s Claude Sonnet 4.5 in multiple benchmarks. With a trillion parameters and advanced reasoning capabilities, K2 achieved notable scores, including 60.2% on BrowseComp. Its modified MIT license allows full commercial use with light attribution, marking a significant shift in the competitive landscape of high-end AI systems.
The Shift to Smaller, Specialized AI Models
Recent research from MIT and IBM reveals a movement towards smaller, specialized AI models outperforming larger systems in specific tasks. These smaller models, designed with strategic insights rather than size alone, demonstrate superior efficiency and environmental benefits. Applications in domains like climate prediction and object localization highlight their effectiveness. This revolution signifies a shift from “bigger is better” to prioritizing performance, resource efficiency, and accessibility in AI development.
ElevenLabs Introduces Advanced Speech-to-Text Model: Scribe v2 Realtime
ElevenLabs has unveiled Scribe v2 Realtime, a low-latency Speech-to-Text model supporting 90+ languages. The model boasts a 93.5% accuracy rate and is designed for real-time applications ranging from customer service to education. Key features include under 150 ms latency, voice activity detection, and customizable vocabularies. It’s tailored for enterprise use and complies with Indian data regulations, enhancing voice interactions globally.
Business & Products
The Rise of Agentic Browsers: New Players in Online Shopping
Perplexity AI, OpenAI, and Google are leading the charge in the emerging field of agentic browsers—AI tools that autonomously handle web tasks like shopping and booking. With the launch of Perplexity’s Comet and OpenAI’s Atlas, Amazon has responded with legal action against Comet over unauthorized transactions. As these browsers optimize user preferences, traditional retailers must adapt to remain visible in this evolving marketplace.
Anthropic Poised for Profitability Ahead of OpenAI
Financial documents reveal that Anthropic is on track to achieve profitability by 2028, significantly outpacing OpenAI, which anticipates $74 billion in operating losses that year. Anthropic’s success is attributed to its Claude chatbot, while OpenAI’s strategy focuses on heavy investments in technology and talent, aiming for long-term growth despite short-term financial strain. The contrasting business models highlight differing approaches to the AI industry’s rapid evolution.
Agentic Commerce: AI Agents Reshape Shopping Dynamics
A new report reveals that agentic commerce, where AI agents act autonomously on consumer behalf, is set to revolutionize retail, forecasting $1 trillion in US B2C revenue by 2030. This shift will alter traditional e-commerce, emphasizing personalization and seamless transactions while posing challenges in trust and business model adaptation. Companies must move quickly to harness this opportunity or risk being sidelined in the evolving marketplace.
Microsoft’s AI Agents Face Collaboration Challenges in New Marketplace Test
Microsoft’s recent experiment in the “Magentic Marketplace” revealed unexpected weaknesses in AI agents’ performance. With a setup involving 100 customer agents and 300 business agents, the study found that agents struggled with overwhelming options and effective collaboration. Although improved instructions helped, Ece Kamar emphasized the need for inherent capabilities to evolve as businesses look to leverage AI agents for decision-making and negotiation.
Regulation & Policy
EU’s Digital Omnibus: A Shift in Privacy and AI Regulations
Proposed changes within the EU’s Digital Omnibus threaten to dilute existing GDPR protections, allowing companies to process personal data for AI training without explicit consent. The reforms introduce exceptions for “legitimate interest,” potentially widening data access for US tech firms while curbing individual rights to access and correct data. Critics argue this could undermine public safety and privacy, leading to greater risks in AI accountability.
CISA Faces Challenges Amid Budget Cuts and Workforce Changes
The Cybersecurity and Infrastructure Security Agency (CISA) is experiencing significant challenges, including budget cuts, workforce reassignments, and politicization that threaten its critical role in U.S. cyber defense. With a proposed $3 billion budget reduction and approximately one-third of its workforce impacted, CISA’s ability to secure vital infrastructure against cyber threats is diminishing, raising concerns about America’s cybersecurity resilience.
Opinions & Analysis
US Risks Losing AI Edge Due to Energy Constraints
A recent analysis highlights that the US may jeopardize its technological advantage in AI due to energy supply limitations. In 2024, China significantly outpaced the US by adding over six times the new power capacity. With the US focusing on reviving coal while renewable energy projects face political hurdles, experts warn that unless changes are made, the nation could fall behind in both energy and AI innovation.

Leave a comment