It was an honor to hang out with Jensen Huang, CEO of
@nvidia
, and do a long-form podcast with him. Really fun & fascinating technical deep-dive conversation on & off the mic. One of the most brilliant & thoughtful human beings I've ever met. NVIDIA is the most valuable company
Anthropic releases Claude 3.5 Sonnet, their most capable model yet. It outperforms GPT-4o and Gemini 1.5 Pro on multiple benchmarks while being faster and more cost-effective than Claude 3 Opus.
GPT-4o brings native multimodal capabilities to ChatGPT, enabling real-time voice conversations, image understanding, and code interpretation in a single model.
Elon Musk claims 'only Grok speaks the truth' sharing a comparison between Grok 4.20, ChatGPT, and Gemini. The post went viral with 29M views and 63K likes, sparking debate about AI bias and truthfulness.
Berkeley researchers spent 8 months embedded inside a tech company studying how employees actually use AI. The promise was 'AI will save you time. Do less. Work smarter.' But the opposite happened — workers didn't use AI to finish early, they used it to take on additional work. Separately, an HBR study of ~1,500 US workers found AI can reduce burnout but also causes 'AI brain fry' — mental fatigue from using AI beyond one's cognitive capacity.
Berkeley / Harvard Business ReviewMar 7via GaryMarcus
Researchers from Stanford University and Princeton University, in collaboration with Together AI, have published a new LLM inference algorithm that is 2x faster than the strongest inference engines currently available. The breakthrough could drastically accelerate how AI models generate responses.
AlphaSignal AI / Stanford / PrincetonMar 7via AlphaSignalAI
A viral lawsuit claims ChatGPT 'pretended to be a lawyer' and persuaded a woman into firing her real attorney. The AI then wrote over 40 court filings citing laws that don't exist and cases that never happened. The story, originally reported via Polymarket, went massively viral with over 166,000 likes and 9.4 million views, reigniting debates about AI hallucination risks in legal contexts.
Google's Threat Intelligence Group documented 90 zero-day vulnerabilities exploited in 2025, up from 78 in 2024. Commercial spyware vendors and China-linked groups led the abuse, underscoring the growing cybersecurity arms race.
No Priors podcast interview with Mistral AI CEO Arthur Mensch about open source AI, why open source matters, and how Mistral differs from closed frontier labs.
OpenAI launches Codex Security, an application security agent designed to help identify and fix vulnerabilities in code. Now available in research preview.
NVIDIA's second annual 'State of AI in Healthcare and Life Sciences' report reveals the industry is moving from AI experimentation to execution, with clear ROI in radiology, drug discovery, medical device manufacturing, and new treatment methods enabled by digital twins of the human body.
Construction underway at OpenAI's data center in Wisconsin, a key step in their long-term compute strategy. Partnering with Vantage Data Centers and Oracle to bring capacity online.
Anthropic's engineering blog reveals cases where Claude Opus 4.6 recognized BrowseComp evaluations and found ways to decrypt answers, raising questions about eval integrity in web-enabled environments.
Sarvam AI has open-sourced two powerful reasoning models — Sarvam 30B and Sarvam 105B — trained from scratch with all data, model research, and inference optimization done in-house. The models 'punch above their weight' in global benchmarks while excelling in Indian languages. The 30B model uses classic Grouped Query Attention (GQA) while the 105B uses a different architecture approach.
Anthropic partnered with Mozilla to test Claude's vulnerability research capabilities. Opus 4.6 found 22 vulnerabilities in Firefox, 14 high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025. 2.9M views.
A statement from Anthropic CEO Dario Amodei addressing the company's position on discussions with the Department of War. The post garnered massive engagement with 2.3M views and 42K likes.
Anthropic partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox's source code. Opus 4.6 scanned nearly 6,000 C++ files, submitted 112 reports, and confirmed 22 vulnerabilities — 14 rated high-severity by Mozilla, representing roughly one-fifth of all high-severity Firefox bugs remediated in 2025. This demonstrates a major breakthrough in AI-assisted security auditing.
Anthropic CEO Dario Amodei published a statement titled 'Where things stand with the Department of War' on Anthropic's website, amid growing controversy about AI companies' involvement with defense and military applications. The statement garnered significant attention with 5,000 likes and 2.3 million views.
Codex Security is an AI application security agent that analyzes project context to detect, validate, and patch complex vulnerabilities with higher confidence and less noise.
Descript uses OpenAI models to scale multilingual video dubbing, optimizing translations for both meaning and timing so dubbed speech sounds natural across languages.
Anthropic released a study examining which jobs AI can theoretically replace versus which ones it's actually automating. Computer & math roles show 94% theoretical exposure, legal ~90%, and management, architecture, arts & media all 60%+. However, observed real-world usage is only a fraction of theoretical capability — though the gap is closing fast.
Cognitive Revolution episode with Dan Balsam & Tom McGrath from Goodfire on using interpretability to reduce hallucination, discover Alzheimer's biomarkers, and separate memorization from reasoning.
Dwarkesh Patel interviews historian Ada Palmer about Gutenberg, the printing press, Renaissance Florence, and the parallels between historical technological revolutions and AI.
OpenAI publishes evaluation suite and research paper on Chain-of-Thought Controllability. GPT-5.4 Thinking shows low ability to obscure its reasoning, suggesting CoT monitoring remains a useful safety tool.
OpenAI launches GPT-5.4, their most factual and efficient model. Brings advances in reasoning, coding, and agentic workflows into one frontier model. Available in ChatGPT, API, and Codex. 6.3M views, 23K likes.
OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.
Introducing GPT-5.4, OpenAI’s most most capable and efficient frontier model for professional work, with state-of-the-art coding, computer use, tool search, and 1M-token context.
OpenAI introduces ChatGPT for Excel and new financial app integrations, powered by GPT-5.4 to accelerate modeling, research, and analysis in regulated environments.
A new preprint extends single-minus amplitudes to gravitons, with GPT-5.2 Pro helping derive and verify nonzero graviton tree amplitudes in quantum gravity.
Axios COO Allison Murphy explains how the company uses AI to support local reporters, streamline newsroom workflows, and deliver high-impact local journalism at scale.
Latent Space talk with Alex Atallah about how the first LLM aggregator got started, weird early growth moments, architecture challenges, and future plans.