Tenkai Daily — May 15, 2026
Open Source Releases
- OpenCode v1.15.0 — Effect-based core event system for more reliable event delivery across sessions and integrations. Also stops custom tool modules from exploding on invalid exports and handles missing project instructions gracefully. Solid housekeeping release.
- corpulse 0.1.0 — Corpus health analytics for RAG pipelines. Tracks document quality, coverage, and drift. If your retrieval system is a black box, this gives you some instrumentation. 🔥
- octopoda 3.1.9 — Persistent memory kernel for AI agents with crash recovery, shared memory, audit trails, and a real-time dashboard. Basically the infrastructure layer your long-running multi-agent systems have been missing — or cobbling together with duct tape.
- opik 2.0.36
— Comet’s LLM observability tool. Logging, tracing, eval for LLM app runs. Useful if you’re tired of
print()debugging your agent chains in production. - Tavily MCP — MCP server that gives AI agents clean, structured web search with source URLs and relevance scoring. Nearly 5k installs, squarely aimed at RAG use cases. Worth a look if you’re building agent toolkits. 🤖
- OpenCode v1.14.51 — Experimental background subagents so your tasks keep churning while you keep coding. Plus the billing origin header NVIDIA decided was needed. Two things you didn’t know you wanted, simultaneously.
Research Worth Reading
- GraphBit — Replaces prompted orchestration in agent frameworks with explicit DAG-based workflow definitions. No more hallucinated routing or infinite loops from the model “winging it.” Deterministic execution is underrated. 📄
- SPIN — Planning wrapper that combines validated DAG planning with prefix-based execution to stop LLM planners from generating structurally broken or wildly bloated workflows. Targets the brittle-failure problem in production agent systems.
- EvolveMem — Co-evolves stored content and the retrieval/scoring infrastructure in agent memory. The core insight: your scoring functions shouldn’t be frozen at deploy while your content keeps changing. AutoResearch-driven adaptation. Smart.
- TabPFN-3 — Scales tabular foundation models to 1M-row datasets with substantially lower training and inference cost. If you’ve been ignoring foundation models for tabular data because of scale limits, this one’s for you. 🔥
- HodgeCover — Tackles an irreducible cycle obstruction in MoE compression where pairwise-compatible experts form incompatible triples. Higher-order topological coverage framework. Niche but real if you care about efficient MoE inference.
- PREPING — Pre-task memory construction for agents. Addresses the cold-start problem: what do you stuff in agent memory before it has any task experience? No curated demos, no post-deploy bootstrapping required.
AI Dev Tools
- NVIDIA AI Blueprints: Video Search & Summarization — Reference architectures for GPU-accelerated vision agents and video analytics. Practical starting points if you want to build video search/summarization on NVIDIA kit without reinventing the pipeline. 🛠️
- Claude Code v2.1.142
— New
claude agentsflags (--add-dir,--settings,--mcp-config,--plugin-dir,--permission-mode,--model,--effort,--dangerously-skip-permissions) for configuring background sessions. Fast mode now defaults to Opus 4.7. The--dangerously-skip-permissionsflag is doing a lot of heavy lifting in that name, btw. 🤖
Community Finds
- Symbolic Algebra with MCP — Hooking symbolic math systems into MCP so LLMs can do precise mathematical reasoning through external tools instead of confidently hallucinating integrals. The future is LLMs knowing when they’re out of their depth. 📄
Today’s Synthesis
If you’re building agent systems headed for production, today’s reads converge on one theme: stop letting LLMs wing the control flow and start treating your data layer as infrastructure, not an afterthought. GraphBit and SPIN both argue for replacing prompted orchestration with explicit DAG-based planning — one at the workflow definition level, the other at execution validation. Together they sketch a clear pattern: define routing as structure, not suggestion. But deterministic execution only helps if the data you’re retrieving is actually any good. That’s where corpulse fits — it gives you instrumentation for the corpus feeding your RAG pipeline, tracking quality, coverage, and drift over time. The concrete play: wire your agent orchestration through DAG-defined workflows, then instrument the retrieval corpus with corpulse so you can distinguish routing bugs from data rot when things break. Two different failure modes, two different fixes, and both are now actionable.