Dev Systems
Show HN: ChatML - Run Claude Code Parallel Sessions in a Desktop app
Hey HN,Over the past 10 months I've been using Claude Code heavily, and one limitation kept coming up: you can really only run one coding agent at a time.While one agent is refactoring something, the rest of the repo is basically blocked unless you start manually juggling branches and working directories.The core issue is that AI coding agents operate directly in your filesystem. If two agents run in the same working directory they quickly start stepping on each other’s changes.Git worktree
Show HN: Twick – React SDK for building timeline-based video editors
Hi HN - I am building Twick, a React video editor SDK for teams that want to ship video workflows inside their product.Core features:
- Drag and drop video editor UI with timeline + canvas (`@twick/studio`)- Auto Caption flow for generating timed captions from video- Asset management for both user uploads and public assets- Modular packages if you want headless/custom UI (`@twick/timeline`, `@twick/canvas`, `@twick/live-player`)
- MP4 export in browser (`@twick/brow
Show HN: cursor-tg – Run Cursor Cloud Agents from Telegram
I built cursor-tg, a Telegram bot that lets me use Cursor Cloud Agents from my phone.The idea is simple: sometimes I want to start an agent run, reply to it, or check what changed without opening my laptop. With cursor-tg, I can talk to Cursor agents in Telegram, track their progress, view generated diffs/PRs, and handle simple review actions from chat.I made this mainly for remote/asynchronous development workflows, where I want quick access to my coding agent while away from my desk.
Show HN: AC-trace–map acceptance criteria to code and tests,then mutate the code
I built a small open-source tool to explore a software confidence gap that seems to be getting worse with AI-assisted coding.Passing tests and coverage are often weaker signals than teams assume. Code can be exercised, tests can pass, and yet the actual acceptance criteria may still be only weakly protected.More about the problem: https://www.dmytrohuz.com/p/i-built-ac-trace-to-question-theac-trace maps acceptance criteria to code and tests, then mutates the mapped code to ch
What changes in engineering teams once AI tools "click"?
A founder I work with recently ran a small experiment with an engineering team at a Series B company that I thought was interesting; I'm not technical but was curious as to how this session would play out.The setup was simple: six engineers spent a few 2 days experimenting with AI in their real development workflows. Nothing theoretical — just trying things directly against their codebase and normal tasks.A few observations stood out.1. Initial skepticism was highSeveral engineers were open
Ask HN: Are we going to see more job postings asking for only agentic coding?
Was perusing job postings today and saw this on a Zapier listing:"You work through AI agents, not alongside them. Your daily development workflow is built around directing and reviewing agent-written code, not writing it by hand. You have opinions about which models to use for which tasks, you've hit real failure modes and built mitigations, and your workflow is actively evolving. Bonus: you use multi-agent patterns, enable others on your team to build faster with AI, or have scaled AI
Show HN: Fakebase – a lightweight PostgreSQL browser for development databases
Hi HN,Over the past year I’ve been building a lot of products on PostgreSQL and kept needing a very simple way to quickly inspect tables, run queries, and sanity-check data without opening a heavy database client.So I built a small tool for myself called Fakebase - a lightweight PostgreSQL browser for local and dev databases.I’ve been using it daily for a while now, and a couple of colleagues started using it too, which made me think it might be useful for others.So I cleaned it up a bit and dec
Show HN: Time Machine – Debug AI Agents by Forking and Replaying from Any Step
Hey HN! We are building Time Machine, a debugging and replay platform for AI agents. We would love your feedback.Here's a demo: https://youtu.be/KyOP9BY0WiY
Website Link: https://timemachinesdk.dev/Here is the initial problem we are trying to solve: Imagine it's Step 9 of 10 of an agent running, and it hallucinated a tool call, wrote garbage to your database, and crashed. You fix the prompt. You re-run. $1.50 gone. This happens six more times before lunch.
300 Founders, 3M LOC, 0 engineers. Here's our workflow
My co-founder Tyler Brown and I have been building our product for 6 months. The co-working space that Tyler founded that we work out of houses 300 founders that we've gleaned agentic coding tips and tricks from.Neither of us came from traditional SWE backgrounds. Tyler was a film production major. I did informatics. Our codebase is a 300k line Next.js monorepo and at any given time we have 3-6 AI coding agents running in parallel across git worktrees.Every feature follows the same four-pha
How Advanced Browsing Protection Works in Messenger
We’re sharing the technical details behind how Advanced Browsing Protection (ABP) in Messenger protects the privacy of the links clicked on within chats while still warning people about malicious links.We hope that this post has helped to illuminate some of the engineering challenges and infrastructure components involved for providing this feature for our users.While end-to-end encryption (E2EE) on Messenger ensures that direct messages and calls are protected, Messenger’s Safe Browsing feature
Stop Burning Money in Azure Storage
Audience: Engineers, Architects, FinOps teams (and anyone whose finance team sends "friendly" cost emails) Your blobs called. They want to talk about your spending habits.Look, we've all been there. You spin up a storage account, dump everything into Hot tier, and walk away feeling productive. Six months later, your finance team sends you a cost report that looks like a phone number.Let's fix that — without a 47-page whitepaper.────────────────────────────────────────────────────────────1.
Show HN: key-carousel - Key rotation for LLM agents
I think in-process key management is the right abstraction for multi-key LLM setups. Not LiteLLM, not a Redis queue, not a custom load balancer.The failure modes are well-understood: a key gets rate-limited, you wait, you try the next one. Billing errors need a longer cooldown than rate limits. This is not a distributed systems problem — it's a state machine that fits in a library.
The problem is everyone keeps solving it with infrastructure instead. Spin up LiteLLM, now you have a Python s
Show HN: I Built Systems AGI – 1600 Verticals, Self-Healing\Self-Evolving
Hey HN,For a while now, I've been building a cognitive AI business operating system from the ground up natively as a solo developer. I never coded before 6 months ago. I was frustrated with wrappers and fragile single-agent setups, so I decided to approach this as a distributed systems problem instead. It's called *Sophify*.Instead of trying to make one monolithic capability do everything, I built a 12-server, 28+ microservice swarm orchestrated by over 40 highly specialized cognitive
Show HN: OpenMandate – Declare what you need, get matched
Hi HN, I'm Raj.We all spend a bulk of our time looking for the right job, cofounders, hires. Post on boards, search, connect, ask around. Hit ratio is very low. There's this whole unsaid rule that you have to build your network for this kind of thing. Meanwhile the person you need is out there doing the exact same thing on their side. Both of you hunting, neither finding.What if you just declare what you need and someone does the finding for you?That's what I built - OpenMandate.
Ask HN: How valuable is production scale experience vs. LeetCode in startups?
Backend engineer from India working mainly on distributed systems and backend infrastructure.Over the past few years, I’ve worked on systems serving ~6M monthly users and currently operate backend services handling ~1300 rps production traffic. A lot of my work involves things like autoscaling services, async job processing, data pipelines (~400GB), and deployments where we need zero downtime (blue-green / rolling deploys).I’ve also solved 300+ LeetCode problems and understand the algorithm
Show HN: Jido 2.0, Elixir Agent Framework
Hi HN!I'm the author of an Elixir Agent Framework called Jido. We reached our 2.0 release this week, shipping a production-hardened framework to build, manage and run Agents on the BEAM.Jido now supports a host of Agentic features, including:- Tool Calling and Agent Skills
- Comprehensive multi-agent support across distributed BEAM processes with Supervision
- Multiple reasoning strategies including ReAct, Chain of Thought, Tree of Thought, and more
- Advanced workflow capabilities
- Durabi
Ask HN: Transforming My Resumé for the Age of AI
Now that array of language expertise and distributed systems stuff is no longer impressive or a strong signal, how should I reframe my 20+ years of programming and management experience? Should I go higher level and focus only on business impact and drop the technical aspects?
Show HN: PantheonOS–An Evolvable, Distributed Multi-Agent System for Science
We are thrilled to share our preprint on PantheonOS, an evolvable, distributed multi-agent system for automatic genomics discovery.Preprint: https://www.biorxiv.org/content/10.64898/2026.02.26.707870v1
Website(online platform free to everyone): https://pantheonos.stanford.edu/PantheonOS unites LLM-powered agents, reinforcement learning, and agentic code evolution to push beyond routine analysis — evolving state-of-the-art algorithms to super-human performa
Show HN: I Built Systems AGI – 1600 Verticals, Self-Healing, Self-Evolving
Hey HN,For a while now, I've been building a cognitive AI business operating system from the ground up natively as a solo developer. I never coded before 6 months ago. I was frustrated with wrappers and fragile single-agent setups, so I decided to approach this as a distributed systems problem instead. It's called *Sophify*.Instead of trying to make one monolithic capability do everything, I built a 12-server, 28+ microservice swarm orchestrated by over 40 highly specialized cognitive
Show HN: I built a LLM human rights evaluator for HN (content vs. site behavior)
I built Observatory to automatically evaluate Hacker News front-page stories against all 31 provisions of the UN Universal Declaration of Human Rights — starting with HN because its human-curated front page is one of the few feeds where a story's presence signals something about quality, not just virality. It runs every minute: https://observatory.unratified.org. Claude Haiku 4.5 handles full evaluations; Llama 4 Scout and Llama 3.3 70B on Workers AI run a lighter free-tier pass.M