Dev Systems

Ask HN: I built an AI-native codebase framework–could you evaluate it?

I built this open-source project and would really appreciate technical feedback from people here:https://github.com/xodn348/ai-nativeThe goal is to make AI-assisted development more reliable through clearer project structure, explicit contracts, and verification workflow.I made this because applying these patterns from scratch in every project was repetitive and hard to maintain, so I wanted a reusable framework.If you have time, I’d love your evaluation on:1. What is useful

Show HN: I built a 38K-line Rust CLI using 3 AI models as my engineering team

Hi HN,I was getting incredibly frustrated with the current state of AI agent skill managers (like skills.sh). They rely on heavy Node.js runtimes just to manage a few skills, symlinks break constantly across environments, and there's no real determinism.I wanted a tool that treats agent skills like Terraform treats infrastructure: Config-as-Code, purely deterministic, and zero-dependency.So I built eden-skills — a single ~10MB Rust binary based on Tokio that uses a skills.toml to lock every

Show HN: GladeKit – AI agent for Unity game development

We’ve been building GladeKit, an AI agent for Unity game development that works directly inside the editor. Instead of just generating code, it reads the actual project state: scene hierarchy, scripts, selected objects, component values, installed packages, and compiler errors - then takes actions inside Unity.What it does today: - 150+ native Unity actions (create prefabs, wire components, configure Animator Controllers, build UI, set up physics, lighting, and NavMeshes) - Reads real editor c

Show HN: From Claude Code to OpenCode – My Evolution in Vibe AI Engineering

I’ve spent the last few months iterating on my "Vibe Coding" workflow, moving away from closed-box solutions toward a more transparent, multi-provider stack. I documented the transition from Codex and Claude Code to an open-source setup using OpenCode and opencode serve.Cursor -> Claude Code -> OpenCode -> OpenCode + OpenCode-Manager -> Codex + Tmux + Tailscale -> OpenCode Serve + Tailscale.Press enter or click to view image in full sizeKey takeaways from the journey:The

Show HN: Mesa – A collaborative canvas IDE built for agent-first development

Hi HN - I'm Ryan a product designer who codes, and I built Mesa. Current IDEs feel wrong for the type of development being done now - the focus is still on files.Mesa puts the focus on the full workflow: your agent, terminal, browser, and files all live as equal nodes on a canvas with full multiplayer support. (think figma but for code)I was tired of the overhead of switching windows, tabs, and terminals across multiple projects. Inspired by TouchDesigner and Factorio, I wanted something mo

Show HN: Mozzie – a local desktop orchestrator for Codex, Claude Code and Gemini

Mozzie started as a tool I built for my own workflow.I like working on multiple things at once, but most development tools split the workflow across different places: tickets live in issue trackers, execution happens in terminals, and context gets lost between them. I wanted my work items and their context right next to the place where the work actually happens.Mozzie is a local desktop workspace where each work item can spawn its own terminal or coding agent. The idea is that tasks become the p

Toolpack SDK, an Open Source TypeScript SDK for Building AI-Powered Applications

Just Released Toolpack SDK — a completely Open-Source unified TypeScript SDK for AI developmentIf you've worked with multiple LLM providers, you know the pain: each has different APIs, different tool formats, different quirks.Toolpack SDK gives you a single interface across OpenAI, Anthropic, Gemini, and Ollama.It comes with 77 built-in tools for file ops, git, databases, web scraping, code analysis, and shell commands. You can also create and integrate your own custom tools.The workflow en

Workflow to build context for coding agents

Here’s the workflow my team and I have found works best with coding agents:- Plan: Write a plan in markdown. Edit this. Iterate. The plan isn’t a throwaway note. It tracks status as work progresses (draft -> in-development -> in-review -> completed), versions with git alongside the code, and serves as the single source of truth. When the agent later implements, it reads this document. When we review the work, we compare against it.- Diagram: Have the agent enrich the plan with architect

Show HN: I built an AI comic generator from scratch using only natural language

I spent ~4 hours building AIComicBuilder [1], a full-stack AI comic/drama generation platform, without writing a single line of code manually. This is a writeup of my "vibe coding" workflow using Claude Code. The app lets you: input a script → AI generates screenplay → character analysis → storyboard generation (with first/last frame images) → video generation. Supports OpenAI-compatible, Gemini, and Bytedance Seedance APIs for text/image/video models, conf

Patch Me If You Can: AI Codemods for Secure-by-Default Android Apps

Even seemingly simple engineering tasks — like updating an API — can become monumental undertakings when you’re dealing with millions of lines of code and thousands of engineers, especially if the changes are security-related. Nowhere is this more apparent than in mobile security, where a single class of vulnerability can be replicated across hundreds of call sites scattered throughout a sprawling, multi-app codebase serving billions of users.Meta’s Product Security team has develope

Resume tokens and last-event IDs for LLM streaming: How they work and what they cost to build

When an AI response reaches token 150 and the connection drops, most implementations have one answer: start over. The user re-prompts, you pay for the same tokens twice, and the experience breaks.Resume tokens and last-event IDs are the mechanism that prevents this. They make streams addressable – every message gets an identifier, clients track their position, and reconnections pick up from exactly where they left off. The concept is straightforward. The production scope is not: storage design,

Cost per outcome: measuring the real economics of AI workflows

Hi HN, I’m the technical founder of botanu (https://www.botanu.ai ).I started building this after repeatedly running into the same problem on AI teams: we could see total LLM spend, but we couldn’t answer a simple question:“What did one successful outcome actually cost?”In real systems, a single business event often requires multiple attempts before it succeeds — retries, fallbacks, tool calls, escalations, async workers, etc. Most tooling measures individual model calls or sometimes a

Show HN: Pacto – OCI-distributed contracts for cloud-native services

Author here.I work as a platform engineer and kept running into the same problem: services are described in fragments across different tools.APIs live in OpenAPI specs. Deployment assumptions end up in Helm values. Runtime details are hidden in Kubernetes manifests. Configuration lives in environment variables. Dependencies are often documented in READMEs or tribal knowledge.There is no single machine-readable contract that describes how a service actually behaves operationally.So I started buil

Show HN: Rust primitives for AI agents, LLM infrastructure, and financial data

I've been building the substrate layer for production AI systems in Rust. This is the index repo for 10 crates I shipped this week. The AI side covers agent memory (episodic, semantic, working with decay and multi-agent bus), cost governance with hard budget enforcement and automatic model downgrade cascade, CRDT state sync for distributed agent fleets, output versioning with full lineage tracking, a knowledge graph on top of agent memory, LLM inference primitives for WASM and edge runtimes

Ask HN: 1 Hash/Sec paced PoW making 51% attacks impossible – seeking engineers

I’ve been working on a new Proof-of-Work architecture built around deterministic mining pacing (what I call Capped PoW).The core idea is a protocol-enforced pacing model where each admitted identity progresses through the mining process at a fixed rate (currently modeled as 1 hash/sec). Instead of competing through raw hashpower, nodes advance deterministically through the mining process.The broader architecture aims to neutralize large-scale parallel mining advantages by making horizontal

Show HN: OpenVerb – A deterministic action layer for AI agents

Hi HN,I’ve been working on a project called OpenVerb, which explores an architectural idea for AI systems: separating reasoning from execution.Most AI agent frameworks today focus on improving reasoning loops, planning, and orchestration (LangChain, LangGraph, etc.). But once an agent decides to perform an action, execution usually becomes a direct tool call, script, or API invocation.That approach works, but it also creates some issues: • custom glue code for every integration • inconsistent

Infrastructure Vulnerabilities and White-Label Abuse: Is Opnex a Scam?

In the current digital finance landscape, fraudulent entities are becoming increasingly adept at using mature technical tools (like MetaTrader 5) to manufacture credit endorsements for their illicit operations. Today, we dissect OPNEX from a network infrastructure, information security, and architectural perspective to answer the tech community's most pressing question: Is OPNEX a scam?1. High-Risk Shared Network Infrastructure During our OPNEX in-depth analysis, topology scans of its core

Show HN: OpenMolt – A programmatic AI agent framework for Node.js

Hi HN!I built OpenMolt, an open-source framework for creating programmatic AI agents in Node.js.Instead of wiring complex workflows (that can break), you define agents in code with instructions, tools, integrations, and memory. The agent then plans steps and executes actions like calling APIs, updating data, or interacting with services. When things break or change, the agent can adapt instead of failing like static workflows.The goal is to make agents behave more like real software systems than

Building a better tool for documenting software architecture

Having spent extensive time researching options to document software architecture, not finding a satisfying solution to serve my needs, I’ve decided to build something that does.I want to make it worth the effort by founding this project on solid data, using the jobs-to-be-done framework and outcome driven innovation (ODI) to identify underserved needs in the segment.I’ve created a ~15 minute survey to establish that foundation: https://forms.gle/2dtEAad6xirS4Ejy5The aggregated re

Show HN: BXP – An open standard for atmospheric exposure data

Air pollution kills 7 million people a year. The sensors exist. The data infrastructure does not. Every device, app, and agency stores air quality data in incompatible formats. A sensor in Accra cannot speak to a hospital in Nairobi. A researcher in London cannot access ground-truth data from Lagos. There is no open standard for atmospheric exposure data — the same way there was no open standard for the web before HTTP.BXP (Breathe Exposure Protocol) is my attempt to fix that.It defines: — A uni