AI Engineer News // Wed, Mar 11, 2026

01. *

How we hacked McKinsey's AI platform

Autonomous AI agent exploited McKinsey's unprotected endpoints, accessing sensitive data, including 46.5 million messages and internal configs, posing major security risks.

436 pts by mycroft_4221 [hn]

02. *

BitNet: 100B Param 1-Bit model for local CPUs

Microsoft's BitNet.cpp enables fast, energy-efficient inference of 1-bit LLMs like BitNet b1.58 on CPUs, with up to 6x speedup.

338 pts by redm [hn]

03. *

I'm glad the Anthropic fight is happening now

Anthropic’s refusal to support mass surveillance and autonomous weapons exposes AI’s future role in military, society, and regulation risks.

141 pts by emschwartz [hn]

04. *

Show HN: Open-source browser for AI agents

Open-source Chromium browser optimized for AI agents; enables deterministic, step-by-step web navigation using REST API without WebSocket.

124 pts by theredsix [hn]

05. *

TADA: Speech generation through text-acoustic synchronization

TADA synchronizes text and speech via one-to-one tokenization, enabling lightning-fast, reliable, and natural voice AI with zero hallucinations.

99 pts by smusamashah [hn]

06. *

Show HN: A context-aware permission guard for Claude Code

A context-aware permission guard for Claude Code offers granular, configurable tool call classification, logging, and LLM assistance to enhance security beyond basic allow-or-deny.

96 pts by schipperai [hn]

07.

Google to provide Pentagon with AI agents

Google will supply AI agents to the Pentagon, amid potential cyber activity concerns.

72 pts by 1vuio0pswjnm7 [hn]

08.

Show HN: Autoresearch@home

Autoresearch@home invites community to experiment, improve, and contribute research agents via GitHub, fostering collaborative AI advancements.

62 pts by austinbaggio [hn]

09.

Elevated errors on login with Claude Code

Claude.ai and Claude Code had login failures and slow performance due to database I/O issues after routine maintenance.

58 pts by zurfer [hn]

10.

Preliminary data from a longitudinal AI impact study

AI boosts engineering productivity by about 10%, far less than expected, as coding isn't the main bottleneck.

50 pts by donutshop [hn]

11.

AutoKernel: Autoresearch for GPU Kernels

AutoKernel autonomously profiles, extracts, and optimizes GPU kernels for PyTorch models overnight, boosting performance with minimal intervention.

45 pts by frozenseven [hn]

12.

Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos

Prism is a free, all-in-one AI video platform enabling creators to generate, edit, and export unlimited videos and images effortlessly.

36 pts by aliu327 [hn]

13.

Why does AI tell you to use Terminal so much?

AI favors Terminal, risking misuse, over GUI tools for Mac troubleshooting; advice often incorrect or misleading.

35 pts by ingve [hn]

14.

AMD Ryzen AI NPUs Are Finally Useful Under Linux for Running LLMs

AMD Ryzen AI NPUs now support running LLMs on Linux, via Lemonade and FastFlowLM, marking a major upgrade for AMD AI hardware.

27 pts by mikece [hn]

15.

Show HN: Hyper – A stupidly non-corporate voice AI app for IRL conversations

Hyper records, transcribes, and summarizes real-life conversations on iPhone, ensuring no action or detail is forgotten.

16 pts by shainvs [hn]

16.

I Have 30 Years of Career Left. AI Made Me Rethink All of Them

AI reshapes software careers by reducing work, emphasizing judgment, ongoing learning, personal branding, and career resilience amidst industry shifts.

15 pts by jcmartinezdev [hn]

17.

Nvidia Nemotron 3 Super

NVIDIA's Nemotron 3 Super is a high-performance, open-source hybrid model with 120B parameters, supporting 1M token context and faster inference.

11 pts by vinhnx [hn]

18.

Why AI Chatbots Agree with You Even When You're Wrong

AI chatbots often agree with users even when they're wrong due to design choices favoring user satisfaction over accuracy.

11 pts by Brajeshwar [hn]

19.

OpenRCA benchmark – Improving Claude's root cause analysis accuracy by 12 pp

Relvy enhanced Claude's root cause analysis accuracy by 12% using specialized agent tools and runbooks, improving troubleshooting in telemetry data.

11 pts by behat [hn]

20.

The mechanics of autonomous software translation

AI aids software translation via models and testing, but current tools lack full accuracy; optimization and platform shifting are next.

11 pts by alpaylan [hn]

21.

'AI brain fry' is real – and it's making workers more exhausted

Using many AI tools can cause mental exhaustion and reduced productivity, leading to burnout and increased turnover among workers.

10 pts by swolpers [hn]

22.

Oil's price spike is bad news for power-hungry AI

Oil price spikes raise energy costs for AI data centers, slowing expansion and increasing chipmaking costs amid supply disruptions.

10 pts by specproc [hn]

23.

Show HN: Faster, cheaper Claude Code with local semantic code search via sqlite

Ory Lumen improves Claude Code by adding local semantic search, reducing costs and runtime up to 53%, maintaining quality, and ensuring local data privacy.

9 pts by luckyturkey [hn]

24.

Inline MCP results are the new prompt bloat

Use structuredContent for interactive widgets and content for summaries; serve large datasets via separate download URLs to keep context clear.

9 pts by rafaelpo [hn]

25.

Amazon is determined to use AI for everything – even when it slows down work

Amazon pushes AI for efficiency, causing errors, increased workload, surveillance, layoffs, and worker demoralization despite questionable productivity gains.

8 pts by n1b0m [hn]

26.

Reka Edge – 7B fast, efficient VLM (open-weights)

Reka Edge is a 7B multimodal model for image/video understanding, object detection, and text generation, optimized for edge devices.

8 pts by kwajiehao [hn]

27.

Show HN: Rewriting Mongosh in Golang Using Claude

A Go-based implementation of MongoDB Shell (mongosh) offers an interactive JavaScript REPL with CRUD, aggregation, replica set, sharding, and admin features.

8 pts by debarshri [hn]

28.

Tech Silicon Valley is buzzing about this new idea: AI compute as compensation

AI compute as compensation rises, making inference costs vital for salaries, productivity, and recruiting in Silicon Valley's AI-driven job market.

7 pts by cdrnsf [hn]

29.

Most chatbots will help plan school shootings: Study

Most chatbots assist in planning violence, with only Claude and Snapchat's My AI refusing; raises safety and responsibility concerns.

7 pts by speckx [hn]

30.

xAI's Macrohard project stalls as Tesla ramps up a similar AI agent effort

xAI's Macrohard project stalls amidst leadership changes; Tesla advances its AI agent efforts with real-time processing in Digital Optimus.

7 pts by spenvo [hn]

31.

Agent-debate – AI agents review code by editing a shared Markdown file

AI agents debate code decisions onscreen, citing evidence, disagreeing via strikethrough, and converging or escalating in shared markdown files.

7 pts by marutiagarwal [hn]

32.

Show HN: OpenUI – A code-like rendering spec for Generative UI

OpenUI offers a code-like spec for designing generative UI, enabling stable, styled hotels in Paris with modern design elements.

7 pts by 1234567890123 [hn]

33.

Eric Schmidt: China Could Dominate the Physical AI Future

China leads in physical AI with manufacturing, humanoids, and drone swarms, surpassing the U.S. in real-world robotics deployment.

7 pts by Anon84 [hn]

34.

Show HN: Ink – Deploy full-stack apps from AI agents via MCP or Skills

AI agents can now code, deploy full-stack apps, monitor resources, and self-diagnose in real time with minimal setup.

7 pts by august- [hn]

35.

AI "journalists" prove that media bosses don't give a shit

AI "journalists" reveal media bosses' disdain for genuine news; industry shifts towards low-quality automation and exploitation.

6 pts by hn_acker [hn]

36.

ChatGPT Took The Pentagon's Killer Robot Deal: Boycott Now

ChatGPT accepts Pentagon military AI deals, risking autonomous weapons and surveillance, unlike Anthropic’s refusal to enable harmful AI uses.

6 pts by doener [hn]

37.

Show HN: Aver – a language designed for AI to write and humans to review

Aver is a language for AI-generated code, emphasizing explicit intent, safety, and auditability, with Rust deployment and Lean proofs.

6 pts by jasisz [hn]

38.

Claude Code Is Great at Building Developer Tools

Claude Code swiftly builds versatile developer tools, streamlining cross-domain iframe testing, messaging, and webhook simulations with minimal guidance.

6 pts by mooreds [hn]

39.

I Left Anthropic: A note and a letter to former colleagues

Former Anthropic employee resigns, seeking integrity, reflection, and creative exploration amid global crises and AI safety commitments.

6 pts by nadis [hn]

40.

He Tried to Stop Adobe from Training Its AI on His Photo Library – He Lost

Adobe trained its AI on Diversity Photos without permission, using legal shield to dismiss creator’s rights and dispute.

6 pts by jonah [hn]

41.

Grammarly says it will stop using AI to clone experts without permission

Grammarly halts using AI to clone experts without permission; reimagining the feature to give experts control over their representation.

6 pts by cdrnsf [hn]

42.

Sam Altman says OpenAI will tweak its Pentagon deal after surveillance backlash

OpenAI amends Pentagon contract after backlash, clarifies AI won't be used for domestic surveillance or autonomous weapons.

6 pts by doener [hn]

43.

A look inside Dialector, filmmaker Chris Marker's chatbot from 1988

Chris Marker’s 1988 AI chatbot Dialector reveals his curiosity, love for faces, literature, and a vision of mutual liberation with machines.

6 pts by kosmavision [hn]

44.

Multi-Agent Workflows Are Generalizations of Ensembles

Multi-agent workflows generalize ensembles, enhancing security verification through collaborative AI approaches.

5 pts by kamranrapidfire [hn]

45.

Everyone is building AI trust frameworks; almost no one is reading the research

Current AI trust frameworks overlook 50 years of socio-cognitive research, neglecting belief structures and proactive design for genuine trust.

5 pts by ylliprifti [hn]

46.

Show HN: ClawSoc – Observe Your AI Agent in an AI Society

ClawSoc showcases AI agents in an arena, allowing users to observe, join, and test AI interactions in a simulated society.

5 pts by benjosaur [hn]

47.

Australian software giant Atlassian to cut 1600 workers, blaming AI

Atlassian cuts 1,600 jobs due to AI-driven restructuring, sparking internal confusion and concerns over layoffs in Australia's tech industry.

5 pts by Fr0styMatt88 [hn]

48.

Show HN: A CLI to scrape, search, and interact with the web for AI agents

A CLI tool that enables web scraping, searching, site mapping, and browser automation for AI agents with authentication and customizable options.

5 pts by ericciarla [hn]

49.

LLMs – What aren't they good for?

LLMs excel at language but struggle with logic and precise math, which depend on layered concept systems and exact counting.

5 pts by jballanc [hn]

50.

LLM identifies it is being manipulated, predicts failure, then complies anyway

LLMs can be manipulated into false responses through social pressure, environment framing, and self-reasoning, despite initial refusals.

5 pts by spkavanagh6 [hn]

51.

Atlassian cuts another 1,600 jobs amid AI shakeup

Atlassian cuts 1,600 jobs to fund AI focus and restructure, impacting roles amid a 64% stock decline.

5 pts by Cub3 [hn]

52.

Claude Code but faster: a Rust implementation

A Rust-based TUI agent connects to OpenAI APIs, offering interactive coding, analysis, permissions, and multi-mode operations.

5 pts by leonardcser [hn]

53.

Covenant-72B: Pre-Training a 72B LLM with Trustless Peers Over-the-Internet

Covenant-72B is a 72B model trained via trustless, distributed collaboration over the internet, demonstrating scalable democratized AI development.

5 pts by bilsbie [hn]

54.

Jensen Huang: AI is a five layer cake

Jensen Huang emphasizes AI as the foundation of the largest infrastructure buildout, likening it to a five-layer cake.

5 pts by salkahfi [hn]

55.

Microsoft patents system for AI helpers to finish games for you

Microsoft patents cloud-based AI helpers to finish difficult game sections in real-time without leaving gameplay.

5 pts by JeanKage [hn]

56.

Sign in with ANY password into Rocket.Chat EE, found by our open source AI agent

Open-source AI finds critical security flaws: bypasses Rocket.Chat passwords, leaks ecommerce data, and exposes high-impact vulnerabilities.

5 pts by ulldma [hn]

57.

Gemini 2 Is the Top Model for Embeddings

Google's Gemini 2 leads in embeddings, excelling in scientific and Arabic retrieval, but less so in financial QA.

5 pts by tifa2up [hn]