Dev Systems

Escaping the Fork: How Meta Modernized WebRTC Across 50+ Use Cases

At Meta, WebRTC powers real-time audio and video across various platforms. But forking a large open-source project like WebRTC within our monorepo presents unique challenges – over time, an internal fork can drift behind upstream, cutting itself off from community upgrades.We’re sharing how we escaped this “forking trap” – from building a dual-stack architecture that enabled safe A/B testing across 50+ use cases, to the workflows that now keep us continuously upgraded wit

Show HN: Splice CAD – Wiring and cable assembly CAD with an agentic assist

Still working on Splice CAD. The two most significant updates focus on handling design complexity and automating using agentic workflows. Previous Show HN here:https://news.ycombinator.com/item?id=449781401. Projects:This workflow is more amenable to system-level modeling. It overlaps somewhat with the original workflow but includes improvements including:Complex Routing: Model topologies with branchpoints.Mating Relationships: Represent physical mating (e.g., a receptacle into a

Show HN: PromptJuggler – A dev env and runner for prompts, workflows, agents

Backstory: At work I had to build an AI pipeline to run millions of prompts. First I just put the prompts into string consts and integrated directly with api, chaining one run onto the output of another – but it quickly became a maintenance nightmare. Iterating on prompts, testing them over datasets, experimenting with different chaining did not fit into the regular sdlc and running them at our scale was quite difficult as most of the time is spent on waiting for the api response while holding o

Show HN: Secure SDLC Agents for Claude and Cursor (MCP)

Hey HN,I have been using Claude Code and Cursor lately and as we all know, they write code incredibly fast but a few times i have noticed they can introduce the same security flaws. For example, you ask the LLM to build a file upload feature, you will get working code in minutes, but it would almost always miss magic-byte validation or leaves you vulnerable to SVG XSS. The LLM optimizes for code that compiles not code that is secure.To fix this for my own workflow, I made a set of 8 security-foc

Show HN: jmux – tmux-based development environment for humans and coding agents

I've been a tmux user for years. When I started running 5-10 Claude Code sessions in parallel, I tried the tools that are out there: Conductor, cmux, the GUI orchestrators. None of them felt right. They either wanted me to leave tmux entirely for a 100MB+ Electron app with its own editor and Git workflow, or they were thin wrappers that didn't solve the actual problem: I need to parallelize my entire development environment, agents, editors, servers, logs, and keep track of all of it.S

ACID vs BASE Explained | Why Distributed Systems Break Under Load (System Design Deep Dive)

ACID vs BASE — Why Money Disappears in Distributed Systems An e-commerce system crashed mid-transaction. Customers ...

Designing Reliable Health Check Endpoints for IIS Behind Azure Application Gateway

Why Health Probes Matter in Azure Application GatewayAzure Application Gateway relies entirely on health probes to determine whether backend instances should receive traffic.If a probe:Receives a non‑200 responseTimes outGets redirectedRequires authentication…the backend is marked Unhealthy, and traffic is stopped—resulting in user-facing errors. A healthy IIS application does not automatically mean a healthy Application Gateway backend. Failure Flow: How a Misconfigured Health Probe L

Build a multi-tenant configuration system with tagged storage patterns

In modern microservices architectures, configuration management remains one of the most challenging operational concerns. Two gaps emerge as organizations scale: handling tenant metadata that changes faster than cache TTL allows, and scaling the metadata service itself without creating a performance bottleneck. Traditional caching strategies force an uncomfortable trade-off: either accept stale tenant context (risking incorrect data isolation or feature flags), or implement aggressive cache inva

Trust But Canary: Configuration Safety at Scale

As AI increases developer speed and productivity it also increases the need for safeguards.On this episode of the Meta Tech Podcast, Pascal Hartig sits down with Ishwari and Joe from Meta’s Configurations team to discuss how Meta makes config rollouts safe at scale. Listen in to learn about canarying and progressive rollouts, the health checks and monitoring signals used to catch regressions early, and how incident reviews focus on improving systems rather than blaming people.They also tal

Ably Python SDK v3: realtime for Python, built for AI

Python dominates AI development. It's where teams build their agents, orchestration layers, and the backend systems that turn LLM calls into products people actually use. Over the past year, those systems have matured rapidly. What used to live in notebooks and prototypes is now running in production, serving real users with real expectations around reliability and performance.That maturity brings infrastructure requirements. Tokens need to stream in order. Sessions need to survive refreshes, re

Laid Off from Oracle(OCI). Looking for Software Roles (USA)

10+ yrs of experience working in distributed backend systems(Java). Founding Engineer in early stage cyber security startup, Worked on tier 1 service in oracle cloud infrastructure (OCI) which handled 295~ millions requests / operations. Scaled services for Series B Startup.

Show HN: Composer – AI architect / MCP for software architecture diagrams

Hi everyone!I built Composer, which is a tool turns your ideas into architecture diagrams. You can also use MCP to turn your EXISTING codebase into a visual diagram!It connects to all possible tools using MCP (Claude Code, Codex, OpenCode, etc.)The goal was to make system design easier and to be able to draw out what I wanted to make before I started / to explain to others.Its currently live at usecomposer.com and free!I’d love feedback on whether it feels useful for real projects and where

Unlock efficient model deployment: Simplified Inference Operator setup on Amazon SageMaker HyperPod

Amazon SageMaker HyperPod offers an end-to-end experience supporting the full lifecycle of AI development—from interactive experimentation and training to inference and post-training workflows. The SageMaker HyperPod Inference Operator is a Kubernetes controller that manages the deployment and lifecycle of models on HyperPod clusters, offering flexible deployment interfaces (kubectl, Python SDK, SageMaker Studio UI, or HyperPod CLI), advanced autoscaling with dynamic resource allocation, and com

How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines

AI coding assistants are powerful but only as good as their understanding of your codebase. When we pointed AI agents at one of Meta’s large-scale data processing pipelines – spanning four repositories, three languages, and over 4,100 files – we quickly found that they weren’t making useful edits quickly enough. We fixed this by building a pre-compute engine: a swarm of 50+ specialized AI agents that systematically read every file and produced 59 concise context files encoding tribal

Ask HN: Academic study on AI's impact on software development – want to join?

Would you like to participate in a study on AI’s impact on software development? We are researchers at New York University and City, University of London conducting an interview study on how new AI tools are changing the work of software developers. We are looking to speak with developers of all seniority levels, including those in leadership roles, who can share their experiences and perspectives on using (or choosing not to use) AI in their day-to-day work.Interviews will last 45 to 60 minutes

Secure HTTP‑Only AKS Ingress with Azure Front Door Premium, Firewall DNAT, and Private AGIC

 Reference architecture and runbook (Part 1: HTTP-only) for Hub-Spoke networking with private Application Gateway (AGIC), Azure Firewall DNAT, and Azure Front Door Premium (WAF)0. When and Why to Use This ArchitectureSeries note: This document is Part 1 and uses HTTP to keep the focus on routing and control points. A follow-up Part 2 will extend the same architecture to HTTPS (end-to-end TLS) with the recommended certificate and policy configuration.What this document containsScope: Archite

Blue‑Green Strategy for Always‑On TCP Workloads on Azure Container Apps

Scenario: Always‑on workloads in Azure Container Apps continuously pull from a TCP source, process the stream, and push into Azure Managed Redis, which is then consumed by another always‑on Container Apps workload that writes to a database. Challenge: Standard revision traffic splitting isn’t a fit because there’s no HTTP ingress-based routing for this workload pattern as defined here; instead, the approach uses a flag‑controlled activation plus a temporary/mock Redis path to validate a new

AKS cluster with AGIC hits the Azure Application Gateway backend pool limit (100)

I’m writing this article to document a real-world scaling issue we hit while exposing many applications from an Azure Kubernetes Service (AKS) cluster using Application Gateway Ingress Controller (AGIC). The problem is easy to miss because Kubernetes resources keep applying successfully, but the underlying Azure Application Gateway has a hard platform limit of 100 backend pools—so once your deployment pattern requires the 101st pool, AGIC can’t reconcile the gateway configuration and traffic sto

Show HN: Open-source distributed quantum compute network

Hey HN. I'm Colton (YC S21, ex-Acorns), one of the founders of Postquant Labs. My cofounder Richard is a cryptographer out of Draper Labs and DARPA. We're building Quip.Network, the first distributed quantum compute network. We just opened our testnet and wanted to share it here.The basic problem: quantum hardware is here and already competitive on certain optimization problems, but for most people, there's no way to access it. The machines cost millions and the hardware and resea

Show HN: I couldn't compare storage topologies without 3 forks, so I built this

I was reading Designing Data-Intensive Applications and kept wanting to run the examples, not just read them. I understand systems through code. So I started building one.Sandstore is a hyperconverged distributed file system in Go. Every node runs control plane, data plane, and Raft consensus together. BoltDB metadata, full POSIX semantics, 2PC chunk lifecycle, gRPC, Kubernetes. The problem I kept hitting was simpler than any of that: I wanted to compare this design against a disaggregated one u