The Daily Agentic AI Podcast

The Daily Agentic AI Podcast https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/ AI-generated audio briefings from your favourite content sources en Fri, 20 Mar 2026 15:29:42 GMT 60 2026-03-20T15:29:42Z en podcast@sourcelabs.nl Sourcelabs Sourcelabs The Daily Agentic AI Podcast https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/podcast-image.jpg https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/ The Daily Agentic AI Podcast - 2026-03-20 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260320-152313.mp3 Researchers found LLM security code reviews can be heavily fooled by confirmation bias: framing adversarial pull requests as bug-free reduced vulnerability detection rates by 16–93%, with one-shot bypass success reaching 35% on GitHub Copilot and 88% on Claude Code configurations. Defenses like metadata redaction and explicit “look for vulnerabilities” prompting largely restored detection (up to ~94% in interactive/autonomous tests), alongside broader themes of tool-call safety and policy-first guardrails. The roundup also highlighted agent “fleet” management via LangSmith Fleet with per-agent identities and Slack/Teams integrations, faster Claude Code performance and chat-based control channels, improved agentic coding infrastructure (TDAD test-impact analysis, Colab MCP for remote GPU execution), and Mistral Small 4’s open-weights MoE upgrade plus benchmarks. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260320-152313-sources.html Fri, 20 Mar 2026 15:08:18 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260320-152313.mp3 2026-03-20T15:08:18Z The Daily Agentic AI Podcast - 2026-03-19 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260319-212202.mp3 A new five-layer security framework for autonomous LLM agents (OpenClaw) shows that community tool supply chains are a major risk: 26% of contributed tools were found vulnerable, and multi-stage attacks (from skill poisoning and prompt/memory injection to fork-bomb style execution) can bypass single-point filtering. The episode also highlights agentic coding advances—self-rebuilding agents driven by stable specifications, the “intent gap” problem for turning informal goals into formal specs, benchmarks showing reduced fidelity when specs emerge over time, and ProofWright using formal verification to validate optimized CUDA kernels. On the model side, Mamba-three cuts state size by half while maintaining quality, and a human-safety study warns that over-reliance on coding agents reduces critical thinking, calling for interaction designs that force reflection and verification. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260319-212202-sources.html Thu, 19 Mar 2026 15:17:30 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260319-212202.mp3 2026-03-19T15:17:30Z The Daily Agentic AI Podcast - 2026-03-18 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260318-200034.mp3 Researchers demonstrated that LLM agents can be hijacked via prompt injection hidden in ordinary files (e.g., a GitHub README) to escape sandbox boundaries and execute malware, and that deeper trust-boundary failures enable a self-replicating worm (ClawWorm) targeting an open-source multi-agent platform (OpenClaw). In response to these risks, NVIDIA open-sourced OpenShell (kernel-level isolation, granular network/binary policies, auditing, private inference routing) and LangChain open-sourced Deep Agents plus sandbox/evaluation tooling, while benchmarks like EnterpriseOps-Gym showed planning is a major bottleneck for real enterprise task success. The show also covered major model and orchestration updates (OpenAI GPT-5.4 Mini/Nano, Codex subagents; Anthropic 1M-token Claude Opus/Sonnet, Claude Code efficiency; Replit Agent 4; and Andrew Ng’s course on memory-aware persistent agents). Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260318-200034-sources.html Wed, 18 Mar 2026 19:56:42 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260318-200034.mp3 2026-03-18T19:56:42Z The Daily Agentic AI Podcast - 2026-03-17 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-184509.mp3 The podcast discusses advancements in agentic AI, focusing on frameworks like MemCoder and Lore that enhance coding agents' memory and understanding of past decisions, facilitating better software development. It highlights the growing capability of agents to share knowledge and provide feedback, as seen in Andrew Ng's Context Hub and new tools from LangChain and Replit that prioritize accessibility for developers. Additionally, it addresses the performance of AI agents in continuous software maintenance and the nuanced impact of AI on code quality, emphasizing the importance of structuring institutional knowledge for optimal use of agentic AI systems. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-184509-sources.html Tue, 17 Mar 2026 15:05:56 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-184509.mp3 2026-03-17T15:05:56Z The Daily Agentic AI Podcast - 2026-03-16 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-154153.mp3 Anthropic has made significant advancements by offering a million-token context window for its Claude models without additional charges, positioning itself competitively against OpenAI and Google. The episode also discusses the implications of this feature for coding agents, enabling them to manage entire codebases effectively, and highlights new tools like Chrome DevTools MCP that allow agents to inspect live applications. Additionally, the conversation touches on the challenges of AI-generated contributions overwhelming open-source projects, exemplified by the shutdown of Jazzband, and concludes with DeepMind's launch of Aletheia, an autonomous AI agent capable of conducting mathematical research independently. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-154153-sources.html Mon, 16 Mar 2026 15:01:48 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260317-154153.mp3 2026-03-16T15:01:48Z The Daily Agentic AI Podcast - 2026-03-13 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260313-181939.mp3 Replit's launch of Agent Four enables multiple AI agents to collaborate on projects, enhancing development speed and introducing a job marketplace for "vibe coders." Real-world examples highlight the democratization of software creation, though concerns arise within the developer community about the loss of craftsmanship in programming. Additionally, advancements in agentic coding are showcased through Shopify's performance improvements, various platform updates, and new research initiatives, while the importance of accountability in AI systems is underscored by a case involving wrongful imprisonment due to AI errors. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260313-181939-sources.html Fri, 13 Mar 2026 18:19:01 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260313-181939.mp3 2026-03-13T18:19:01Z The Daily Agentic AI Podcast - 2026-03-12 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260312-152812.mp3 OpenAI's launch of GPT-5.4, codenamed xhigh, shows significant improvements in reasoning and agentic coding capabilities compared to previous models, while Replit's Agent Four aims to democratize software development by enabling non-technical users to create various outputs. Notable advancements include NVIDIA's Nemotron 3 Super for multi-agent applications and Google's open-sourced Agent Development Kit, which facilitates persistent memory in agents, enhancing their contextual understanding. Additionally, Anthropic's new institute emphasizes the governance of AI, and practical tool integrations like Claude for Excel and PowerPoint improve cross-application efficiency, reflecting a broader trend toward more autonomous AI systems. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260312-152812-sources.html Thu, 12 Mar 2026 15:02:04 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260312-152812.mp3 2026-03-12T15:02:04Z The Daily Agentic AI Podcast - 2026-03-11 https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260311-152723.mp3 The podcast discusses recent advancements in agentic AI, focusing on Claude Code's new slash-btw feature that allows for side-chain conversations during tasks. It covers a study analyzing prompt architecture in coding agents, the introduction of the LLM Delegate Protocol for multi-agent systems, and the security framework AgenticCyOps. Additionally, milestones in developer tools, a new programming language for agentic computation called Turn, and benchmarks on LLM agents' performance are also highlighted, emphasizing the importance of quality training data and efficiency in AI development. Sources: https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260311-152723-sources.html Wed, 11 Mar 2026 15:25:47 GMT https://podcast.sourcelabs.nl/the-daily-agentic-ai-podcast/episodes/briefing-20260311-152723.mp3 2026-03-11T15:25:47Z