May 2025

  • After I put together the first version of the Fabric extension (my previous post about the first version), it really helped me use those Fabric patterns on web pages. But there were some downsides: it couldn’t read PDFs on the web, nor did it work with YouTube video pages. This was a bit of a…

  • Anthropic has released new models, Cloude Opus 4 and Sonnet 4, claiming exceptional coding and reasoning capabilities. Are they as impressive as advertised? We include a post that evaluated Claude 4 Opus in this issue. The results show promise, though some persistent issues remain. It’s also encouraging to see Stripe’s AI efforts improving fraud detection…

  • In this article, Rohit Krishnan explores the challenges and considerations of working with large language models (LLMs). Having developed several LLM applications from the ground up, I couldn’t agree more with his key observations: achieving perfect verifiability of LLM output is unattainable, increased AI usage in applications leads to more hallucinations, and trial and error…

  • This issue of AI newsletter includes Meta’s LlamaFirewall for AI security, WhatsApp’s Private Processing for enhanced privacy, and OpenAI’s retraction of the sycophantic GPT-4o update. Concerns over AI reliability pitfalls and privacy issues with ChatGPT’s location identification are also highlighted. On the technology front, we cover DARPA’s AI Cyber Challenge and advancements in jailbreaking resistance…