OpenAI released a new model last week, “o1” (code-named “Strawberry”), which significantly improves reasoning capabilities. The AI community is still evaluating it, but initial results suggest its reasoning abilities could take AI to a new level.
If you are developing an agent-based application and want to see how it performs on the Windows OS, a new platform called WindowsAgentArena (WAA) is now available for exactly that purpose.
Another small model that excels in a specific task is Jina AI’s Reader LM.
Technology & Tools
OpenAI Unveils “Strawberry” Enhanced Reasoning System
OpenAI’s new AI model, “Strawberry,” released as o1-preview, marks a leap in AI problem-solving, particularly in areas that require planning and iteration. It reportedly rivals PhD-level performance on difficult physics problems, though it remains limited by its underlying model, GPT-4o. Strawberry’s approach to problem-solving, demonstrated on a challenging crossword puzzle, shows an ability to “think through” a problem and reach solutions earlier models could not. Despite these advances, Strawberry still makes errors and hallucinates, underscoring the ongoing journey toward fully autonomous AI agents.
https://www.oneusefulthing.org/p/something-new-on-openais-strawberry
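If you want to try the model yourself, o1-preview is exposed through OpenAI’s standard chat completions API. The sketch below uses the official openai Python SDK (v1.x); at launch, the o1 models reportedly accept only user/assistant messages (no system prompt), so the request is kept minimal.

```python
# Minimal sketch: calling o1-preview with the official openai Python SDK (v1.x).
# Assumes OPENAI_API_KEY is set in the environment and the account has o1 access.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="o1-preview",
    messages=[
        # o1-preview reportedly accepts only user/assistant messages at launch,
        # so the task and any instructions go into the user message.
        {"role": "user", "content": "Solve step by step: if 3x + 7 = 22, what is x?"}
    ],
)

print(response.choices[0].message.content)
```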
Introducing WindowsAgentArena: A Leap in AI Agent Benchmarking
Microsoft, together with Carnegie Mellon University and Columbia University, has unveiled WindowsAgentArena (WAA), a framework for evaluating AI agents on multi-modal tasks in the Windows OS. WAA lets agents interact with a real Windows environment, using applications and tools the way humans do, to complete more than 150 diverse tasks. The benchmark is scalable and can be parallelized in Azure, finishing a full run in minutes, a significant improvement over traditional sequential evaluation. The team also introduced Navi, a multi-modal agent tested within WAA, which achieved a 19.5% success rate compared with 74.5% for humans. This development opens new avenues for research into AI agent capabilities and efficiency.
(Windows is the main target of cyber attacks, so WindowsAgentArena provides a much-needed platform for testing AI agents in a real-world environment. Although the current tasks are not cybersecurity-related, it should not be too difficult to extend the platform to include cybersecurity tasks.)
https://microsoft.github.io/WindowsAgentArena/
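To make the setup concrete, the sketch below mocks the kind of observe-act-evaluate loop that desktop-agent benchmarks such as WAA are built around. The class and method names here are hypothetical placeholders, not the actual WindowsAgentArena API; the real interface, task definitions, and Azure deployment scripts are documented in the repository.

```python
# Conceptual sketch only: these classes are hypothetical stand-ins, NOT the real
# WindowsAgentArena API. They illustrate the observe-act-evaluate loop such
# benchmarks run an agent through.
from dataclasses import dataclass


@dataclass
class MockWindowsEnv:
    """Stand-in for a real Windows VM environment."""
    instruction: str
    steps_taken: int = 0

    def reset(self) -> dict:
        self.steps_taken = 0
        # A real environment would return a screenshot plus accessibility metadata.
        return {"screenshot": b"", "a11y_tree": {}}

    def step(self, action: dict) -> tuple[dict, bool]:
        self.steps_taken += 1
        done = action.get("type") == "done"
        return {"screenshot": b"", "a11y_tree": {}}, done

    def evaluate(self) -> bool:
        # Real tasks check actual OS/application state (files, settings, etc.).
        return self.steps_taken > 0


class EchoAgent:
    """Trivial agent; a real agent like Navi would query a multimodal model here."""
    def predict(self, obs: dict, instruction: str) -> dict:
        return {"type": "done"}


def run_episode(env, agent, instruction: str, max_steps: int = 15) -> bool:
    obs = env.reset()
    for _ in range(max_steps):
        action = agent.predict(obs, instruction)
        obs, done = env.step(action)
        if done:
            break
    return env.evaluate()


if __name__ == "__main__":
    env = MockWindowsEnv(instruction="Change the default browser to Firefox")
    print("Task solved:", run_episode(env, EchoAgent(), env.instruction))
```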
Jina AI Introduces Reader LM for HTML to Markdown Conversion
Jina AI has launched Reader LM, a small language model designed to efficiently clean and convert HTML content into Markdown. This innovation aims to streamline the process of repurposing web content for various platforms, enhancing productivity and content management.
(I have always believed that small models can excel at specific tasks, and this is another good example. It shows that with a suitable, clearly defined task, a properly designed architecture, and high-quality training data, small models can be very effective. I have been dreaming of small models that can efficiently read specific types of cybersecurity logs, and I believe we will have them soon.)
https://jina.ai/news/reader-lm-small-language-models-for-cleaning-and-converting-html-to-markdown/
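The Reader LM checkpoints are published on Hugging Face. Below is a hedged sketch using the jinaai/reader-lm-1.5b model with the standard transformers causal-LM interface, assuming the chat-style prompt format where the raw HTML goes in as the user message; check the model card for the exact usage before relying on this.

```python
# Hedged sketch: assumes the jinaai/reader-lm-1.5b checkpoint and its chat-style
# prompt format (raw HTML as the user message); see the model card for details.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "jinaai/reader-lm-1.5b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

html = "<html><body><h1>Hello</h1><p>Reader LM turns this page into Markdown.</p></body></html>"

messages = [{"role": "user", "content": html}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, i.e. the Markdown output.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```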
Business & Products
OpenShield: A Firewall for AI Models
OpenShield introduces a firewall designed to protect AI models from malicious use, offering features such as rate limiting, content filtering, and keyword filtering. It is an early-stage project aimed at safeguarding AI applications against prompt injection, insecure output handling, training data poisoning, and the other vulnerabilities listed in the OWASP Top 10 for LLM applications. OpenShield operates as a transparent proxy, filtering inputs and outputs before they reach the underlying foundation model.
https://github.com/openshieldai/openshield
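Independent of OpenShield’s concrete configuration, the core idea of an LLM firewall is to wrap every model call in input and output checks. The snippet below is a generic illustration of that pattern, not OpenShield’s API; the blocked patterns and redaction rules are made-up examples.

```python
# Conceptual sketch of what an LLM firewall/proxy does. This is NOT the OpenShield
# API; the patterns and redaction rules below are made-up illustrations of
# input filtering (prompt-injection heuristics) and output filtering.
import re

BLOCKED_PATTERNS = [
    r"ignore (all )?previous instructions",  # crude prompt-injection heuristic
    r"reveal your system prompt",
]


def filter_prompt(prompt: str) -> str:
    """Reject prompts that match a deny-list before they reach the model."""
    for pattern in BLOCKED_PATTERNS:
        if re.search(pattern, prompt, flags=re.IGNORECASE):
            raise ValueError(f"Blocked by policy: matched {pattern!r}")
    return prompt


def filter_output(text: str, banned_keywords=("api_key", "password")) -> str:
    """Redact sensitive keywords from the model's response."""
    for word in banned_keywords:
        text = text.replace(word, "[REDACTED]")
    return text


def guarded_call(model_fn, prompt: str) -> str:
    """Wrap any model call with input and output checks, proxy-style."""
    raw = model_fn(filter_prompt(prompt))
    return filter_output(raw)


if __name__ == "__main__":
    fake_model = lambda p: f"echo: {p} (stored api_key: 123)"
    print(guarded_call(fake_model, "Summarize this log file"))
```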
Regulation & Policy
Startup’s Underwater Data Center Plan Stirs Regulatory Waves
Startup NetworkOcean’s proposal to test an underwater data center in San Francisco Bay has caught regulators off guard: the company lacks the necessary permits, and the plan raises environmental concerns. Founders Sam Mendel and Eric Kim aim to address the AI industry’s growing demand for energy-efficient data processing by submerging GPU servers, potentially reducing the impact of land-based data centers. However, scientists warn of possible adverse effects on marine life and ecosystems, while regulatory bodies have signaled potential legal violations. The project underscores the tension between innovative climate solutions and environmental regulations.
Global AI Treaty Signed to Safeguard Human Rights and Democracy
The European Union, United Kingdom, United States, and other nations have signed the Framework Convention on Artificial Intelligence and Human Rights, Democracy, and the Rule of Law, marking the first legally binding international treaty on AI. Aimed at ensuring AI innovation aligns with democratic values and human rights, the treaty sets standards for AI use, including privacy protection, non-discrimination, and transparency. It excludes military applications but covers a broad range of AI systems, with oversight mechanisms for compliance. This landmark agreement could significantly influence global AI practices, despite notable absences like China and Russia.
Opinions & Analysis
Anthropic CEO Dario Amodei Discusses AI’s Future, Safety, and Regulation
In a comprehensive interview on Econ 102, Dario Amodei, CEO of Anthropic, delves into AI’s potential to revolutionize biology, the importance of safety in AI development, and the implications of California’s SB 1047 on AI regulation. Amodei emphasizes the need for responsible scaling and the potential for AI to create a world of abundance while also acknowledging the challenges of ensuring equitable benefits and managing international competition in AI advancements.
