Dev Systems

See Anthropic Orchestrate the Narrative

tl;dr FOSS is the biggest threat to the largest new economic sector, so everything that economic sector does should be viewed through the lens of trying to kill it.I occasionally see articles and sentiments along the lines of, Anthropic is or is not, "scare mongering to boost the perceived cultural impact of their AI/ML tools; a sort of underhanded advertisement".If your job is to defend Anthropic online, it's a good angle to fight from. It's a viral subject with no prac

Secure multi-tenant RAG with Amazon Bedrock and Verified Permissions

Large organizations building internal generative AI applications face a recurring challenge: controlling which teams or departments can access which documents, without duplicating infrastructure for each group. Within a single tenant, employees from a specific department should only access material assigned to that department. However, executives, with a wider span of control, will require access to material across multiple departments. Retrieval Augmented Generation (RAG) is one of several comp

Modernizing financial analytics with Amazon SageMaker Unified Studio

Avanse Financial Services is one of India’s leading education loan providers. Their Data Engineering Team had built a data lake on AWS using Amazon Simple Storage Service (Amazon S3), Amazon Athena, and AWS Glue for data ingestion and processing. However, their analytics and reporting layer ran on an external analytics application that wasn’t integrated with AWS. Data had to be copied from Amazon S3 into this external application before analysts could run any report, its license consumed a signi

Architecting AI-powered resilience framework on AWS

When your production system goes down, you often discover the hard way that your resilience testing missed critical dependencies. Building an AI-powered resilience framework on AWS helps you find those weaknesses before your customers do. Your systems don’t fail because your infrastructure isn’t resilient. They fail because resilience is assumed, not proven. Every deployment introduces new dependencies, every configuration change creates untested paths, and every gap between design intent and ru

Adopting AV1 for Real-Time Communication (RTC) at Scale

Adopting AV1 for real-time communication at Meta has been a multi-year effort spanning codec selection, device eligibility, rate control, and error resilience.We’re sharing the technical and operational challenges while deploying AV1 and expanding coverage, and how we addressed them for real-time communication.We’re presenting several technologies for improving AV1 call quality, including rate control and error resilience.The AV1 video codec, first standardized by AOMedia in 2018, has rapidly ev

Vercel AI SDK in production: when DefaultChatTransport needs a session layer

You've built an AI chat app on the Vercel AI SDK. It works in development. The model responds, the stream comes through, and the UI updates cleanly. Then you ship to production, and the transport layer starts showing its edges.Most of these failures are quiet: things that work in demos and break in ways that are hard to pin down until you know where to look. They share a common cause: DefaultChatTransport is built for HTTP, and HTTP has structural properties that some production requirements exc

Vercel AI SDK in production: when DefaultChatTransport needs a session layer

You've built an AI chat app on the Vercel AI SDK. It works in development. The model responds, the stream comes through, and the UI updates cleanly. Then you ship to production, and the transport layer starts showing its edges.Most of these failures are quiet: things that work in demos and break in ways that are hard to pin down until you know where to look. They share a common cause: DefaultChatTransport is built for HTTP, and HTTP has structural properties that some production requirements exc

Ask HN: How are you enabling your employees to do AI dev in the cloud?

Sure, us engineers can Claude Code up a storm locally on our laptops these days. But now with everyone trying to vibe code everything, there's quite a few people that don't have a "proper" local dev environment to do that same kind of development. Let's just take running a test suite. Our devs need a pretty beefy environment to run that.So ideally, these environments are just in the cloud. But Claude Code web, is so "environment lite" that it really isn't

Show HN: Alpenglow, a Linux distribution that boots to login in 0.6s

Alpenglow is a general-purpose (focused on appliance use right now) Linux distribution focused on fast boot times, small system size, and minimal runtime overhead.The project supports both traditional root-on-disk installations and diskless immutable deployments from the same codebase. In diskless mode the entire system runs from an initramfs with a read-only root and optional persistent state. In rootfs mode it behaves like a conventional Linux installation with package management and writable

Show HN: Pacwich – lightweight new monorepo tooling on top of Bun, NPM, or pnpm

I developed a package simply called bun-workspaces that worked on top of Bun workspaces directly with zero required config, using plain package.json scripts for orchestration.I have re-developed this package into pacwich, which supports Bun, npm, or pnpm. I decided it would be a better direction for it to be decoupled from a particular package manager, so it needed a new name (but I wanted to keep my logo).I write about the development strategy and my engineering philosophy (including disclosing

Show HN: Intelligrade – EU Based Digital Exams

Hey HN!I am Kevin and together with my co-founder Steven I have built Intelligrade over the last 2 years. Steven is a teacher and he got sick of having to deal with outdated, overly expensive and inadequate tooling to create, conduct and grade exams. Most of them don't respect privacy either or are US based which generally disqualifies them from usage in many parts of the EU.So we set out with a mission: Create a tool for teachers and schools that covers exams E2E. The reality is that teach

Show HN: Numax - a portable runtime for distributed apps

Hi, over the past few months I've been working on this project: Numax is a small Rust runtime that does three things: it runs WebAssembly modules in a sandbox, has a built-in local key-value store, and syncs everything across nodes with CRDTs and gossip. Basically, you write a wasm module, run it on two machines, and they converge (I hope !). It's a decentralized system... I hope someone finds it interesting! There's a whitepaper I've put a lot into, and I think the code isn&

Revolutionizing Document Intelligence: Scaling Construction Industries with AI-Driven Extraction

IntroductionGenerative AI (GenAI) is poised to transform the construction industry by addressing chronic challenges such as low productivity, cost overruns, schedule delays, and labor shortages. By automating the analysis of drawings, specifications, contracts, and project documentation, GenAI can reduce manual effort, accelerate decision-making, and improve coordination across architects, engineers, contractors, and suppliers. Industry studies indicate that AI-powered workflows can increase pro

Powering the world’s first AI arts museum

<img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/images/Gallery_A__Data_Pavilion.max-600x600.format-webp.webp">Rafik Anadol Studio opens Dataland, the first museum of AI arts, powered by Google Cloud and supported by Google Arts & Culture.

Show HN: ELDC – Natural language identification, faster than FastText and CLD2

I want to introduce ELDC, an efficient language detector, written in C, designed to maximize speed and accuracy within a relatively constrained memory footprint.ELDC is the latest iteration of the ELD software I made years ago. This version is available as an executable, a library, and a Python package.This is my first C software, or anything compiled for that matter, I previously built this in pure PHP, JavaScript, and Python.Highlights: - Performance: In my benchmarks, it runs faster than CLD2

Show HN: Fata – Spaced repetition to fight skill rot from AI coding

Hi HN, I&#x27;m Djoumé. I&#x27;ve been a developer for over 20 years, and like a lot of you I&#x27;ve been coding almost exclusively through an agent in the past few months.It&#x27;s been amazing to vibe code prototypes in any stack, but when it comes to building something reliable&#x2F;scalable, I couldn&#x27;t effectively guide the agent unless I knew the technology. And the scariest part is that I&#x27;m seeing a lot of my technical skills decreasing due to AI coding.Reflecting on my journey,

Show HN: The Ruby AI Newsletter

Now on its 32nd edition, the Ruby AI Newsletter tracks what’s happening at the intersection of Ruby, Rails, and AI coding agents.YC recommends Rails for new startups, YC’s internal software like Bookface, Work at a Startup, and the software that runs the accelerator all run on Rails. Garry has been a vocal advocate for Ruby on Rails for agentic startups, calling it a &quot;crazy unlock&quot;. And there is an incredible ecosystem forming around RubyLLM. Ruby and Rails gives agents conventions, st

Ask HN: Does your mind drift while waiting for AI prompts to finish?

I&#x27;ve been a software engineer for 9 years now, and I noticed a very new weirdness in my workflows. Once I finish the architecture of a project and i have my context engineering prompt ready, I hit SEND, and then is just void lol. I have realized that between that time of waiting till AI is done, my mind drifts, I go use other apps, search random stuff and if AI fucks things up I go back and forth supervising it. I feel like this whole experience has generated some kind of fatigue and comes

Ask HN: How do you deal with the feeling of "loss of control" with AI coding?

I have to admit that newest models and harness tools have gotten really good and produce working code really fast with even mediocre prompts &quot;wish programming&quot;.The problem is more on my side than the LLM side. I feel I am 1) losing control of my project, even my hobby ones, because LLM outputs code so fast that I am 2) too lazy to manually review all of them, even if, indeed, I had the overall architecture design choices and sometimes straight up pseudo-code in natural language. But NL

Reducing SMS OTP fraud with Vonage network-powered solutions and Amazon Cognito

User authentication remains one of the most targeted touchpoints in application security. With the industrialization of fraud threats by generative AI, cybercrime costs are expected to reach $23 trillion in 2027, an increase of 175 percent from 2022. 20 percent of fraud is attributed to synthetic identity and authentication exploits, with account takeover (ATO) surging 141 percent since 2021. But the damage goes beyond security. SMS One-time passcodes (OTPs) achieve only approximately 80 percent