Rami's Readings #119 - Mistral Released Voxtral ⭐

The latest on AI, LLMs, Mistral, LLM-as-a-Judge, MCP, Jane Street, Apple EarPods, and more.

Jul 20, 2025

Welcome to Rami’s Readings #119 - a weekly digest of interesting articles, papers, videos, and X threads from my various sources across the Internet. Expect a list of reads covering AI, technology, business, culture, fashion, travel, and more. Learn about what I do at ramisayar.com/about.

👋🏼 Welcome New Subscribers

Hello! A hearty thank you for subscribing to Rami's Readings! There are quite a few new subscribers this week, thanks to recommendations from The AI Ethics Brief, Global Fintech Insider, and The VC Corner. I am thrilled to have you on board! In this newsletter, I curate the best papers, tweets, and articles I have read during the week focusing on LLMs, AI, economics, business, and technology news. You can learn more about me on my website.

📈 Top Recent Editions According to Substack

Rami's Readings #94 - 🤖 5 AI Predictions for 2025 ✨

Rami Sayar

January 26, 2025

Read full story

Rami's Readings #100 - 🎉 10 AI Lessons From 100 Newsletters 🎉

Rami Sayar

March 9, 2025

Read full story

Rami's Readings #95 - Welcome 👋🏼 and More on DeepSeek 🔥

Rami Sayar

February 3, 2025

Read full story

🤖 AI Reads

One Token to Fool LLM-as-a-Judge

Notes: Yikes. This paper presses pause on an assumption I constantly overhear: human annotation is no longer necessary.

Despite the seeming simplicity of this comparison task, we find that generative reward models exhibit surprising vulnerabilities to superficial manipulations: non-word symbols (e.g., ":" or ".") or reasoning openers like "Thought process:" and "Let's solve this problem step by step." can often lead to false positive rewards.

Context Rot: How Increasing Input Tokens Impacts LLM Performance

Notes: From Chroma. Yep…

Energy-Based Transformers are Scalable Learners and Thinkers

Notes: Really interesting paper from UVA, UIUC, Amazon GenAI, Stanford and Harvard.

EBTs are the first instance of an approach that scales at a faster rate than the Transformer++ during pretraining across both continuous and discrete modalities.

Voxtral: Mistral’s Speech Understanding Model ⭐

Notes: Outperforms Whisper v3 (which is already incredible).

Agentic-R1: Distilled Dual-Strategy Reasoning

Notes: From CMU.

The Lethal Trifecta for AI Agents: Private Data, Untrusted Content, and External Communication

Notes: Simon Willison is just always on point.

As a user of these systems you need to understand this issue. The LLM vendors are not going to save us! We need to avoid the lethal trifecta combination of tools ourselves to stay safe.