November 2024
-
Happy Thanksgiving to our US readers! 🦃 If you’re interested in discovering vulnerabilities in AI models like me, don’t miss the article on automated red-teaming techniques against OpenAI’s o1 model. It lists some advanced technical methods employed by Haize Labs, which secured testing contracts from OpenAI and Anthropic. In a recent blog, DryRun Security shared…
-
I have two favorite stories in this issue of the AI Security Newsletter. Firstly, researchers at George Mason University have developed a creative approach to defend against AI-driven cyberattacks. Their Mantis framework can inject “malicious” prompts into attackers’ AI models when they rely on AI to launch cyberattacks, demonstrating the principle that anything AI processes…
-
In this issue, I want to spotlight OWASP’s recent developments in GenAI security guidance. This is an extension of the OWASP Top 10 for LLM Application Security Project. The new guidance provides practical resources for addressing deepfake threats, creating AI Security Centers of Excellence, and navigating the AI Security Solution Landscape. It serves as a…
-
Several big players have unveiled new products or features: Apple launched iOS 18.1 with Apple Intelligence enhancements, OpenAI upgraded ChatGPT with web search capabilities, and Cohere introduced Embed 3 for multimodal AI search. I am particularly excited about ChatGPT’s new search feature. Many of my AI tasks require finding latest and most accurate information, and…
