Dev Systems
Show HN: Supaqueue – Node.js background job queue (no Redis needed)
Hi everyone,I have been using BullMq for most of my background job related work but lately I have been working on some smaller scale app where I having a full blown Redis setup with separate worker process would have been overkill.That is why I built a lightweight, in-memory Node.js background job queue. It comes with a Bull/BullMq-type API, concurrency control, schedulers, job retention and much more. It has zero dependencies and is fully typesafe. Use this when you need a simple, performa
Ask HN: Switching from backend development to graphics programming
I love computers. I wrote my first program in Borland C++ when I was 11.By chance, I managed at my 15 to get a job where I did some HTML pages and later some PHP programming. Making websites wasn't as fun as making games in C++. Overall PHP didn't seem as fun as C++. I made all of my lab projects at the university in C++ with Qt, or wxWidgets or bare WinAPI. In fact, I improved the university the internal testing system on our Math/Physics faculty. I got a privilege to do la
How to become an AI infrastructure engineer?
Hi,I currently work on a GenAI platform for one of the largest local industrial companies. My daily work mostly involves building inference infrastructure on top of a 48x H200 GPU, Kubernetes and vLLM. Hence, I'd say it's 80% SRE and 20% software engineering when it comes to building request routing and internal control planes.Although I have a background in backend engineering rather than ML research or low-level GPU programming, I am trying to understand what I need to learn to becom
Introducing AI Transport v0.3.0
Last week we introduced AI Transport v0.2.0 and made one idea the centre of the design: the session is the channel. Every input, output, and lifecycle event for an AI conversation is just a message published to an Ably channel, which is what makes a session durable, multi-party, and resumable. In v0.3.0, we added first-class support for presence and LiveObjects to AI sessions, allowing you and your agent to see who's online and update shared state in real time.We made the codec interface declara
How Meta Engineered Ultra-Narrow Batteries for AI Glasses
Smart glasses like the Ray-Ban Meta and Oakley Meta Vanguards need to pack enough energy to power features like cameras, speakers, AI workloads, and even a display. But it all has to fit into the glasses’ temple arms.So how do you place a battery with enough power to run a pair of smart glasses all day into a form factor narrower than an adult’s pinky finger? You have to rethink how batteries are made. In episode 86 of the Meta Tech Podcast, host Pascal Hartig sat down with Karthik and Myuran, t
Nobody trusted our internal dashboards. Now they live in code
How we used AI to fix a data trust problem, and built a governed reporting system the whole company can contribute to.We audited our skills library a few months ago and found twelve dashboards hiding in it.Not dashboards. Skills that built dashboards. Someone needed a view of some data, asked Claude to put it together, got a long HTML page out of it, and then wrapped the whole thing in a skill so others could run it again. Twelve times over, by different people, for different questions.This is w
See Anthropic Orchestrate the Narrative
tl;dr FOSS is the biggest threat to the largest new economic sector, so everything that economic sector does should be viewed through the lens of trying to kill it.I occasionally see articles and sentiments along the lines of, Anthropic is or is not, "scare mongering to boost the perceived cultural impact of their AI/ML tools; a sort of underhanded advertisement".If your job is to defend Anthropic online, it's a good angle to fight from. It's a viral subject with no prac
Toward More Controllable AI Video Editing: An Early Research Exploration at Netflix
By Zhuoning Yuan, Ta-Ying Cheng, Benjamin Klein, Bahareh AzarnoushIntroductionAt Netflix, we build technology to help storytellers bring their creative visions to life and to help members discover the stories they love.To connect stories with diverse audiences around the world, we produce promotional assets, including trailers, teasers, and social short‑form videos, that build on and elevate the original footage. Through close collaboration with the teams crafting these assets, we identified a r
How Netflix Simplified Batch Compute with Kueue
By Alvin Bao, Alex Petrov, Jennifer Lai, Aidan Sherr, and Samartha ChandrashekarAs a part of the journey to transition Netflix’s compute infrastructure to be more Kubernetes-native, we have leaned into incorporating components from the Kubernetes ecosystem into our container platform Titus. One example of this is our use of Kueue, a cloud-native job queueing system for batch workloads, which has largely replaced the custom queuing and scheduling logic in our homegrown managed batch solution Comp
Secure multi-tenant RAG with Amazon Bedrock and Verified Permissions
Large organizations building internal generative AI applications face a recurring challenge: controlling which teams or departments can access which documents, without duplicating infrastructure for each group. Within a single tenant, employees from a specific department should only access material assigned to that department. However, executives, with a wider span of control, will require access to material across multiple departments. Retrieval Augmented Generation (RAG) is one of several comp
Modernizing financial analytics with Amazon SageMaker Unified Studio
Avanse Financial Services is one of India’s leading education loan providers. Their Data Engineering Team had built a data lake on AWS using Amazon Simple Storage Service (Amazon S3), Amazon Athena, and AWS Glue for data ingestion and processing. However, their analytics and reporting layer ran on an external analytics application that wasn’t integrated with AWS. Data had to be copied from Amazon S3 into this external application before analysts could run any report, its license consumed a signi
Architecting AI-powered resilience framework on AWS
When your production system goes down, you often discover the hard way that your resilience testing missed critical dependencies. Building an AI-powered resilience framework on AWS helps you find those weaknesses before your customers do. Your systems don’t fail because your infrastructure isn’t resilient. They fail because resilience is assumed, not proven. Every deployment introduces new dependencies, every configuration change creates untested paths, and every gap between design intent and ru
Adopting AV1 for Real-Time Communication (RTC) at Scale
Adopting AV1 for real-time communication at Meta has been a multi-year effort spanning codec selection, device eligibility, rate control, and error resilience.We’re sharing the technical and operational challenges while deploying AV1 and expanding coverage, and how we addressed them for real-time communication.We’re presenting several technologies for improving AV1 call quality, including rate control and error resilience.The AV1 video codec, first standardized by AOMedia in 2018, has rapidly ev
Vercel AI SDK in production: when DefaultChatTransport needs a session layer
You've built an AI chat app on the Vercel AI SDK. It works in development. The model responds, the stream comes through, and the UI updates cleanly. Then you ship to production, and the transport layer starts showing its edges.Most of these failures are quiet: things that work in demos and break in ways that are hard to pin down until you know where to look. They share a common cause: DefaultChatTransport is built for HTTP, and HTTP has structural properties that some production requirements exc
Vercel AI SDK in production: when DefaultChatTransport needs a session layer
You've built an AI chat app on the Vercel AI SDK. It works in development. The model responds, the stream comes through, and the UI updates cleanly. Then you ship to production, and the transport layer starts showing its edges.Most of these failures are quiet: things that work in demos and break in ways that are hard to pin down until you know where to look. They share a common cause: DefaultChatTransport is built for HTTP, and HTTP has structural properties that some production requirements exc
Show HN: Intelligrade – EU Based Digital Exams
Hey HN!I am Kevin and together with my co-founder Steven I have built Intelligrade over the last 2 years. Steven is a teacher and he got sick of having to deal with outdated, overly expensive and inadequate tooling to create, conduct and grade exams. Most of them don't respect privacy either or are US based which generally disqualifies them from usage in many parts of the EU.So we set out with a mission: Create a tool for teachers and schools that covers exams E2E. The reality is that teach
Show HN: Alpenglow, a Linux distribution that boots to login in 0.6s
Alpenglow is a general-purpose (focused on appliance use right now) Linux distribution focused on fast boot times, small system size, and minimal runtime overhead.The project supports both traditional root-on-disk installations and diskless immutable deployments from the same codebase. In diskless mode the entire system runs from an initramfs with a read-only root and optional persistent state. In rootfs mode it behaves like a conventional Linux installation with package management and writable
Show HN: Pacwich – lightweight new monorepo tooling on top of Bun, NPM, or pnpm
I developed a package simply called bun-workspaces that worked on top of Bun workspaces directly with zero required config, using plain package.json scripts for orchestration.I have re-developed this package into pacwich, which
supports Bun, npm, or pnpm. I decided it would be a better direction for it to be decoupled from a particular package manager, so it needed a new name (but I wanted to keep my logo).I write about the development strategy and my engineering philosophy (including disclosing
Ask HN: How are you enabling your employees to do AI dev in the cloud?
Sure, us engineers can Claude Code up a storm locally on our laptops these days. But now with everyone trying to vibe code everything, there's quite a few people that don't have a "proper" local dev environment to do that same kind of development. Let's just take running a test suite. Our devs need a pretty beefy environment to run that.So ideally, these environments are just in the cloud. But Claude Code web, is so "environment lite" that it really isn't
The Data Canary: How Netflix Validates Catalog Metadata
By Celina AmadosAt Netflix, our catalog metadata is crucial to our member experience, and a single corrupted data state can impact millions of viewers immediately. To protect streaming reliability, we built an automated data canary system that validates data transformations using production traffic. This canary detects issues in under 10 minutes, and blocks bad data from reaching our members.IntroCatalog metadata is what makes Netflix functional. It defines what titles exist, where they’re avail