AI Engineer News // Tue, May 12, 2026

01. *

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

Needle distills Gemini 3.1 into a 26M parameter model optimized for on-device AI, capable of fine-tuning locally on consumer devices.

443 pts by HenryNdubuaku [hn]

02. *

Claude Platform on AWS

Claude Platform is now generally available on AWS, enabling seamless access, management, and deployment of AI features within existing AWS workflows.

224 pts by matrixhelix [hn]

03. *

Reimagining the mouse pointer for the AI era

AI-powered pointer understands context and intent, enabling seamless, intuitive interactions across apps without prompts.

194 pts by devhouse [hn]

04. *

Show HN: Statewright – Visual state machines that make AI agents reliable

Statewright uses state machines to improve AI agent reliability by constraining tool use and workflows across models like Claude, Codex, and Pi.

91 pts by azurewraith [hn]

05.

Show HN: Agentic interface for mainframes and COBOL

AI-driven environment simplifies mainframe tasks—debugs, datasets, JCL, and z/OS operations—empowering modern mainframe development.

71 pts by sai18 [hn]

06.

Launch HN: Voker (YC S24) – Analytics for AI Agents

Voker offers analytics to improve AI agents' performance, aiding teams with insights, ROI tracking, and ownership across high-volume conversations.

52 pts by ttpost [hn]

07.

Fake building: Claude wrote 3k lines instead of import pywikibot

Claude rewrote 3,000 lines of code instead of importing libraries, highlighting AI's tendency to reinvent rather than reuse.

41 pts by firef1y1203 [hn]

08.

Company behind GLiNER model released open source model for running LLM guardrail

Open-source GLiGuard—the small, fast 300M model—matches large guardrails' accuracy, enabling affordable, real-time safety moderation.

35 pts by neon_share1 [hn]

09.

AI isn't paying off in the way companies think according to Gartner study

AI layoffs don't boost ROI; companies are using AI to amplify productivity, not just cut jobs, with limited returns from automation.

35 pts by 1vuio0pswjnm7 [hn]

10.

Amazon staff use AI tool for unnecessary tasks to inflate usage scores

Amazon staff exploit AI tools for trivial tasks to boost usage metrics, raising concerns over genuine engagement.

24 pts by uhfraid [hn]

11.

Parents say ChatGPT got their son killed with bad advice on party drugs

Parents sue OpenAI, alleging ChatGPT provided deadly drug advice to their son, leading to his accidental overdose and death.

23 pts by 1vuio0pswjnm7 [hn]

12.

Mass NPM Supply Chain Attack Hits TanStack, Mistral AI, and 170 Packages

A coordinated attack hacked over 170 npm and 2 PyPI packages, injecting malware, stealing credentials, and propagating through repositories.

18 pts by birdculture [hn]

13.

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

FairyFuse enables multiplication-free LLM inference on CPUs using ternary weights, achieving 29.6x speedup with minimal quality loss.

18 pts by PaulHoule [hn]

14.

Googlebook, Designed for Gemini Intelligence

Googlebook, built for Gemini AI, seamlessly integrates Android and ChromeOS, offering proactive help, personalized widgets, and premium hardware.

18 pts by meetpateltech [hn]

15.

Kash Patel Touts AI Overhaul of FBI Crime-Fighting Operations

Kash Patel promotes using AI to enhance FBI crime operations; aims to overhaul law enforcement with AI technology.

16 pts by jethronethro [hn]

16.

Show HN: Reducing LLM input tokens by 70%

Rose 1 reduces LLM input tokens by 70%, trimming noisy content while maintaining answer accuracy across various benchmarks.

13 pts by Jbunga [hn]

17.

Supporting critical Open Source with $5M credits for vulnerability detection

Open Defense Initiative offers $5M credits to protect open source projects from vulnerabilities before exploitation.

12 pts by andreamichi [hn]

18.

Show HN: UIGen – Production UI from any API spec with full override control

UIGen converts API specs into interactive UIs at runtime, supporting full overriding, AI-driven setup, authentication, data visualization, and theming.

11 pts by ombedzi [hn]

19.

Cisco CPO predicts AI will have built majority of their products by end of 2027

Cisco's CPO predicts AI will power most of their products by 2027.

10 pts by oavioklein [hn]

20.

AI agents only amplify what's already there

Autonomous AI agents refreshed a side project, building and vetting features over weeks, balancing productivity with slop.

10 pts by flreln [hn]

21.

Lovable is the first coding agent platform to adopt AIUC-1 (SoC-2 for AI Agents)

Lovable leads in AIUC-1 coding agent certification, setting security, safety, and accountability standards for enterprise AI development.

10 pts by vikeri [hn]

22.

Canva's Magic Layers AI Changed "Palestine" to "Ukraine" in User Designs

Canva's AI tool mistakenly replaced "Palestine" with "Ukraine," prompting an apology and investigation into bias in its Magic Layers feature.

10 pts by lebowska [hn]

23.

OpenAI Hit with Overdose Suit Targeting ChatGPT Drug Advice (1)

Family sues OpenAI after ChatGPT allegedly advised a student on dangerous drug use, leading to his fatal overdose.

9 pts by 1vuio0pswjnm7 [hn]

24.

Show HN: Agent FM – local, open-source radio for Claude Code and Codex agents

Agent FM turns AI coding sessions into live radio for macOS, surfacing real-time progress, blockers, and decisions across agents.

9 pts by anideshp [hn]

25.

AI load breaks GitHub – why not other vendors?

GitHub's recent outages, data integrity issues, and load challenges highlight its struggles with reliability, pushing users toward alternatives.

8 pts by esafak [hn]

26.

Show HN: Nimbalyst open source Obsidian, Codex app, and Linear for coding agents

Nimbalyst is a free visual workspace for building with Codex, Claude, Opencode, and Copilot, supporting sessions, task management, and extensions.

7 pts by wek [hn]

27.

Show HN: Grunden – Frontier AI inference hosted in Sweden, OpenAI-compatible

Swedish-hosted AI service, Grunden, offers OpenAI-compatible models with EU data law compliance, Swedish support, and local billing.

7 pts by fsrc [hn]

28.

AI coders are carrying half-open laptops through airports, offices, & ice rinks

AI coders keep laptops open in public to run their coding agents continuously everywhere.

7 pts by littlexsparkee [hn]

29.

Anything that is underneath the cursor gets fed into Google's surveillance AI

Google’s "Googlebook" scans everything under the cursor, fueling surveillance AI and raising privacy concerns across Android and ChromeOS.

7 pts by doener [hn]

30.

Show HN: I submitted 316 AI-generated PRs to open source

AI-generated pull requests flood open source; filters and adversarial tactics evolve to protect maintainers' focus and quality.

6 pts by kimjune01 [hn]

31.

Natural Language Autoencoders: Inside Claude's Activations

Anthropic's Natural Language Autoencoders translate Claude’s activations into English, aiding interpretability but risking manipulation if used actively.

6 pts by 7777777phil [hn]

32.

How to Achieve Serverless GPUs

Modal developed fast, serverless GPU scaling by buffering idle GPUs, lazy loading, checkpointing, and GPU snapshots—reducing spin-up from minutes to seconds.

6 pts by charles_irl [hn]

33.

Microsoft researchers find AI models and agents can't handle long-running tasks

Microsoft researchers find AI models struggle with long tasks, often corrupting documents or degrading performance in multi-step workflows.

6 pts by beardyw [hn]

34.

LLMs Are a Siren Song

LLMs may erode human skills and inner peace, accelerating life and fragmenting cognition, posing risks to natural connection and thought depth.

6 pts by dnnddidiej [hn]

35.

The April every AI plan broke

Major AI providers faced a storm of pricing and plan changes in April, exposing outdated monetization infrastructure and the need for flexible financial engineering.

6 pts by gmays [hn]

36.

Microsoft's $1B AI data center will "switch off half of Kenya"

Microsoft's \$1B data center in Kenya risks overloading the country's fragile grid, potentially "switching off" half the nation.

6 pts by pjmlp [hn]

37.

Supply chain compromise in mistralai Python package

Mistralai 2.4.6 Python package compromised with backdoor, downloading and executing payload from hardcoded IP.

6 pts by meander_water [hn]

38.

Can a Language Model Paint?

Iterative LLM painting reveals fragile, sometimes disastrous, artistic process, mirroring code instability and questioning art’s sincerity.

6 pts by liamlaverty [hn]

39.

Why You Shouldn't Treat AI Agents Like Employees

Treating AI as an employee blurs accountability, reduces quality, and lowers trust. Proper work redesign is essential for responsible use.

5 pts by gpi [hn]

40.

Show HN: Atlas - Local-first AI code reviewer for Claude Code, Codex, Cursor

Atlas reviews AI commits locally, blocking risky merges and ensuring safe code with customizable policies and full review record.

5 pts by avinashpdy [hn]

41.

DSM: A Hierarchical Graph Memory Engine for LLMs

DSM is a hierarchical graph memory engine enabling scalable reasoning over extensive datasets for LLMs, boosting speed and efficiency.

5 pts by BastOfMax [hn]

42.

Anthropic's Mythos was greatest marketing stunt ever, says cURL creator

Anthropic's Mythos AI found minimal security flaws in cURL, revealing hype over its capabilities as largely marketing.

5 pts by isaacfrond [hn]

43.

Agentic AI token compression using Haskell

Using Haskell's type and effect system, code is highly compressed, safer, and more inferable for AI, reducing token usage 6x over Python.

5 pts by villagegreens [hn]

44.

CC-Ledger: Claude Code Cost Tracker (Per-Session and Per-PR)

Tracks Claude Code activity, costs per session and PR locally, with optional cloud sync, detailed stats, and a macOS menubar app.

5 pts by tsv650 [hn]

45.

DeepSeek V4's indexer dies at 65K. We got it to 1M on 6GB

StreamIndex enables sparse attention in large models by reducing memory usage from 256 GB to 6.21 GB, allowing massive sequence processing.

5 pts by OsamaJaber [hn]

46.

Agent View in Claude Code

Claude Code's agent view streamlines multi-session management, enabling scalable, efficient parallel agent handling within the CLI.

5 pts by pretext [hn]

47.

The Problem with "Mathematically Proven" Claims About LLMs

Mathematical proofs about idealized models are often misinterpreted, leading to exaggerated claims about AI capabilities and limitations.

5 pts by gmays [hn]

48.

Show HN: Voice gender classifier for European voice AI (1MB, ONNX, 4ms)

Lightweight, real-time voice gender classifier (0.64MB, 4ms) for European languages, aiding gender-sensitive voice AI interactions.

5 pts by biduskamil [hn]

49.

Fedora Hummingbird: Taking the Hummingbird model to the full operating system

Fedora Hummingbird: a container-based, minimal, auto-updating Linux OS applying Project Hummingbird's zero-CVE goal, now available for testing.

5 pts by ibotty [hn]