It was an honor to hang out with Jensen Huang, CEO of
@nvidia
, and do a long-form podcast with him. Really fun & fascinating technical deep-dive conversation on & off the mic. One of the most brilliant & thoughtful human beings I've ever met. NVIDIA is the most valuable company
Anthropic releases Claude 3.5 Sonnet, their most capable model yet. It outperforms GPT-4o and Gemini 1.5 Pro on multiple benchmarks while being faster and more cost-effective than Claude 3 Opus.
GPT-4o brings native multimodal capabilities to ChatGPT, enabling real-time voice conversations, image understanding, and code interpretation in a single model.
Facebook Marketplace is adding a bunch of new AI-powered tools that are supposed to make selling items on the platform a little more efficient. One feature will use Meta AI to automatically respond to those annoying "Is this still available?" messages.
You can toggle on the auto-reply option when creating a listing, and Meta AI will draft an editable response to any questions related to availability. In an example shown by Meta, its AI assistant creates an auto-reply saying, "Yes, it's still ...
Computers ordering cappuccinos.
A couple of weeks ago, Google and Samsung announced a big Gemini development coming to their newest devices: task automation. Starting with food delivery and rideshare apps, Gemini would be able to use certain apps on your behalf in a virtual window to take care of things like ordering dinner or getting a car to the airport - all based on simple prompts. You know, all the stuff that we've been promised for years AI assistants will be able to do. That feature ...
Pinecone expands its serverless vector database free tier to 100M vectors with no time limit. New features include hybrid search (dense + sparse), metadata filtering, and integration with LangChain, LlamaIndex, and OpenAI.
Google launches Gemini 1.5 Flash optimized for high-volume applications. The model offers 1M token context at 2x the speed and half the cost of 1.5 Pro. Ideal for chatbots, content moderation, and data extraction tasks.
OpenAI opens Sora text-to-video model to ChatGPT Plus subscribers. Users can generate 1080p videos up to 20 seconds with custom aspect ratios. Features include video extension, style presets, and frame-level editing.
LangChain releases version 0.2 with major architecture improvements including native streaming support, async-first agents, and built-in LangSmith observability. The update reduces latency by 40% for complex agent workflows.
Midjourney V7 brings photorealistic video generation with camera motion controls, supporting clips up to 60 seconds. New features include character consistency across frames, style transfer from images, and improved text rendering.
Figma introduces AI-powered design features allowing users to generate auto-layout designs from screenshots or voice descriptions. The update includes AI component suggestions, accessibility checks, and design-to-code export.
Replicate secures $200M funding to expand its AI model hosting platform. The startup now serves 50,000+ developers running 100M+ predictions monthly. New features include fine-tuning UI, model versioning, and enterprise SLAs.
Elon Musk's xAI releases Grok-3, claiming top spot on MMLU with 89.2% accuracy. The model features real-time X integration, image understanding, and a 1M token context window. Available to X Premium+ subscribers immediately.
Mistral AI announces Codestral 2, a 32B parameter code-specific model achieving 92.1% on HumanEval benchmark. The model supports 80+ programming languages and offers a 256k context window for large codebase analysis. Released under Apache 2.0 license for commercial use.
Google rolled out spend caps for the Gemini API, letting developers define a hard monthly budget limit. Logan Kilpatrick (Google AI Studio DevRel) announced the feature and recommended all developers set a cap immediately. Simon Willison called it great news for anyone running Gemini prompts in CI or building agents that experiment with the API. There is up to a 10-minute delay before a newly set cap takes effect, and developers remain responsible for usage incurred in that window. The feature is the first in a planned series of cost-control tools for Gemini API users.
Anthropic's latest update to Claude will allow the AI chatbot to generate custom charts, diagrams, and other visualizations during your conversation. If Claude determines a visual is useful based on the context of your chat, it will insert the image in-line, rather than in its side panel.
As an example, Anthropic says a conversation about the periodic table could lead Claude to generate a visualization of it, featuring interactive elements that let you click inside the table for more informat...
Anthropic rolled out two significant Claude updates this week. First, Claude can now build interactive charts and diagrams directly inside the chat window — available in beta on all plans including free. Second, Claude for Excel and Claude for PowerPoint now share full conversation context when multiple files are open, letting users pull data from spreadsheets into presentations without manually switching tabs. The dual release reinforces Anthropic's push into enterprise productivity workflows. Separately, the Ramp AI Index flagged Anthropic as the top AI stack choice for businesses, adding third-party validation to the product momentum.
Google launched Immersive Navigation, a new 3D navigation mode for Google Maps that Sundar Pichai called the product's biggest upgrade in over a decade. The view renders a vivid, real-time 3D picture of your surroundings with road-level details including lane markings and crosswalks. The update is part of a broader reimagining of Google Maps that the team described as built for the Gemini era, using AI to bring richer context and spatial understanding to everyday navigation. Logan Kilpatrick, who sat down with the Maps team, called it an impressive demonstration of Gemini in action at product scale.
Today we’re talking about the messy, fast-moving situation at Anthropic, the maker of Claude that now finds itself in a very ugly legal battle with the Pentagon.
The back-and-forth is complicated, but as of a few days ago, the Pentagon had deemed Anthropic a supply chain risk, and Anthropic has filed a lawsuit challenging that designation, saying the government has violated its First and Fifth Amendment rights by “seeking to destroy the economic value created by one of the world’s fastest-gr...
The impact of artificial intelligence extends far beyond the digital world and into our everyday lives, across the cars we drive, the appliances in our homes, and medical devices that keep people alive. More and more, product engineers are turning to AI to enhance, validate, and streamline the design of the items that furnish our…
Multiple AI agent companies announce major funding rounds, reflecting investor enthusiasm for autonomous AI systems. Notable raises include MultiOn ($35M), Adept ($150M), and Imbue ($200M) for agent development platforms.
The former Tesla AI director releases a comprehensive free course covering neural networks from scratch. Topics include backpropagation, transformers, and LLM training, with hands-on coding exercises in Python.
Zapier introduces AI-powered automation that understands natural language instructions. Users can describe workflows in plain English, and the AI builds and configures the appropriate Zaps across 6,000+ integrated apps.
SD4 brings photorealistic image generation and 4-second video clips from text prompts. The model shows significant improvement in text rendering and human anatomy, addressing long-standing issues with previous versions.
Windows 12 preview showcases Copilot deeply integrated into the OS, with ability to control settings, manage files, and automate workflows. New "Recall" feature provides photographic memory of user activity for instant retrieval.
Perplexity Enterprise allows companies to connect internal documents, databases, and wikis for AI-powered search. New features include role-based access control, audit logs, and integrations with Slack, Notion, and Google Workspace.
The AI-powered code editor Cursor secures major funding to expand its team and capabilities. The company reports 2M+ active developers and plans to introduce collaborative coding features and enterprise security controls.
SmolVLM enables multimodal AI on smartphones and IoT devices with models under 2B parameters. The release includes optimized versions for iOS and Android, bringing vision capabilities to mobile apps without cloud dependencies.
Meta's Llama 4 family includes models from 8B to 400B parameters, with the largest variant matching GPT-4 on most benchmarks. Released under permissive license for commercial use, marking a significant milestone for open source AI.
Gemini 2.5 Pro introduces native multimodal understanding across text, images, audio, and video. New agentic capabilities allow the model to perform complex tasks autonomously, including research, data analysis, and content creation.
Claude 3.7 Sonnet sets new records on SWE-bench, solving complex software engineering problems with 62% accuracy. The model introduces enhanced tool use capabilities and improved instruction following for enterprise workflows.
OpenAI announces GPT-4.5, featuring significant improvements in mathematical reasoning and code generation. The new model demonstrates 15% better performance on MATH benchmark and supports longer context windows up to 256k tokens.
Augment Code argued that modern development is shifting away from classic IDE assumptions and toward workspaces where developers define intent and delegate execution to agents. The company said the basic unit of interest is no longer a single file but an agent, suggesting the next generation of developer tooling will be organized around orchestration rather than manual code navigation. It is a notable framing because it pushes the coding-agent conversation beyond model quality into the shape of the actual interface developers may end up using every day.
Replit announced it raised $400 million at a $9 billion valuation, with investors including Georgian and G Squared. On the same day, it launched Replit Agent 4, featuring real-time multi-user collaboration — multiple people building in the same workspace simultaneously — and a new canvas mode that renders live app previews inline while you code. Early testers described the leap as the biggest product improvement they had felt in any tool. The timing of the funding and launch together signals Replit is positioning itself as the primary platform for AI-native software creation.
Lightning AI promoted Nvidia's Nemotron 3 Super as a model developers can customize, fine-tune, and deploy for reasoning agents in minutes, while related posts from Nvidia and Artificial Analysis emphasized the model's open weights, efficiency, and launch-day availability across inference providers. Taken together, the posts frame Nemotron 3 Super as more than another model release: it is being positioned as an open reasoning model with a real deployment ecosystem already wrapped around it. That combination of openness, benchmark credibility, and immediate infrastructure support is what gives the launch its weight.
Kaggle said its Community Benchmark SDK now supports automatic tracking for token usage, cost, and latency alongside standard evaluation results. The update pushes benchmark workflows closer to real product decisions, where teams need to understand not just which model performs best, but which one is cheapest and fastest to run. It is a useful signal that practical model economics are becoming part of mainstream benchmark tooling rather than an afterthought.
Google has completed its acquisition of Wiz, the cloud security company, with Sundar Pichai welcoming the Wiz team publicly. The deal gives Google a broad cloud security platform that protects workloads across providers, strengthening its pitch to enterprise customers who run multicloud environments. Pichai framed the acquisition as giving customers a comprehensive platform to secure their cloud and AI workloads. Wiz co-founder Assaf Rappaport previously turned down a $23 billion offer from Google, making this closing a notable reversal and one of the largest cybersecurity acquisitions in recent memory.
Rakuten uses Codex, the coding agent from OpenAI, to ship software faster and safer, reducing MTTR 50%, automating CI/CD reviews, and delivering full-stack builds in weeks.
Feng Qingyang had always hoped to launch his own company, but he never thought this would be how—or that the day would come this fast. Feng, a 27-year-old software engineer based in Beijing, started tinkering with OpenClaw, a popular new open-source AI tool that can take over a device and autonomously complete tasks for a…
Wayfair uses OpenAI models to improve ecommerce support and product catalog accuracy, automating ticket triage and enhancing millions of product attributes at scale.
How OpenAI built an agent runtime using the Responses API, shell tool, and hosted containers to run secure, scalable agents with files, tools, and state.
Abacus.AI CEO Bindu Reddy said her team is racing to switch coding workloads to GPT-5.4 because it performs better on fairly complex codebases and harder problems. In parallel, Augment Code said GPT-5.4 had become the default model in its agent development environment and framed it as especially strong for agent coordination. Taken together, the posts suggest GPT-5.4’s momentum is no longer just about benchmarks: it is turning into real adoption pressure inside coding and agent-engineering workflows.
A widely shared post from trq212 said Claude Code now supports `/btw`, a command for starting side-chain conversations while Claude continues working on the main task. Jason Liu reposted the feature to his audience, helping turn it into one of the most visible coding-agent workflow updates in this scrape batch. The change points toward a more interruptible, multitasking model for agent-assisted development rather than a single linear prompt-and-wait loop.
AgentMail announced a $6 million seed round led by General Catalyst, and Yohei Nakajima amplified the raise by arguing that email is becoming a core layer for AI agents, not just a communications channel. In his framing, email gives agents identity, authentication, notifications, and access to account creation flows such as self-service API signup and key retrieval. The combined posts cast AgentMail as infrastructure for practical autonomous workflows rather than another narrow inbox tool.
Two posts from the last 48 hours point to GPT-5.4 gaining real traction in coding-agent workflows. Augment Code said GPT-5.4 is now the default model in Intent and highlighted it as especially strong for agent coordination, while Abacus.AI CEO Bindu Reddy said her team is racing to move coding workloads onto GPT-5.4 because it performs better on fairly complex codebases and hard problems. The takeaway is that GPT-5.4 may be moving beyond benchmark headlines into day-to-day developer tooling decisions.
Andreessen Horowitz says new per-capita usage analysis across the 10 largest LLM products shows the United States ranking only 20th in AI adoption despite building many of the category’s biggest products. The firm framed the finding as evidence that consumer AI is becoming a more global market than top-line traffic charts imply. If that view is right, AI distribution and user behavior may become at least as strategically important as model leadership in the next phase of consumer competition.
AgentMail says it has raised a $6 million seed round led by General Catalyst, with investors including Y Combinator, Paul Graham, Dharmesh Shah, and Matt Shumer. The pitch is simple but strategic: every AI agent needs its own inbox. If that framing sticks, email could become one of the core infrastructure layers for autonomous software, giving agents a native way to receive updates, authenticate into workflows, and interact with the outside world without piggybacking on a human account.
Techmeme highlighted Axios reporting that Nielsen’s Gracenote has sued OpenAI for copyright infringement, alleging that OpenAI copied Gracenote’s data and the relational framework it uses to connect metadata. The case is notable because it extends the AI copyright fight beyond training on expressive works into the structured data layers that help classify and link content. If courts take these claims seriously, the legal risk for AI companies could widen from scraped media itself to the metadata systems that make media usable at scale.
A post flagged by The Rundown AI says OpenAI has added interactive visuals for more than 70 math and science concepts inside ChatGPT, including variable sliders, live graphs, and animated demonstrations. The update matters because visual explanations can make ChatGPT more useful for education and self-study than text responses alone, especially for subjects where intuition comes from seeing systems change in real time. It is another sign that OpenAI is expanding ChatGPT from a chatbot into a broader interactive product surface for learning and problem-solving.