Microsoft open-sources VibeVoice, a versatile AI suite for long-form speech recognition and multi-speaker TTS supporting 50+ languages.
// curated from Hacker News with AI
Google and Pentagon agree on a classified AI deal enabling lawful use without veto rights, raising concerns about oversight and ethical limits.
Starting June 2026, GitHub Copilot code review will consume GitHub Actions minutes, affecting billing on private repositories.
Claude.ai faced API errors and outages, but service has now been restored after investigation and resolution efforts.
OpenAI models are now available on AWS via Bedrock Managed Agents, enabling enterprise AI workflows across multiple clouds.
AI's economics are broken; massive capital expenditure, unprofitable models, deceptive subscriptions, and questionable sustainability threaten industry stability.
AI agent improves CPU design through autonomous hypothesis testing, with verifiers crucial to prevent regressions and ensure correctness.
Claude enhances creative workflows with new connectors to popular tools like Blender, Adobe, and Splice, boosting productivity and innovation.
Open-source iOS build agent automates project discovery, edits, builds, and validation using AI with minimal touch, streamlining Apple app development.
OpenAI missed revenue targets, sparking concerns about the AI sector's sustainability amid slowing growth and competition.
San Francisco, despite being AI's hub, faces economic challenges and lags in growth.
LingBot-Map uses transformers for real-time 3D reconstruction with geometric context.
Choosing to avoid generative AI tools promotes critical thinking, supports creators, and reduces bias and resource consumption.
Utah approves a 9 GW AI data center campus, generating power on-site and competing with China's AI infrastructure investments.
AI's biggest skeptic, Ed Zitron, alleges fraud and a bubble, but technical progress and financial data suggest that AI is valuable and advancing rapidly in 2026.
A React component offers mini-arcade games to entertain users during long LLM waits, with simple controls and achievement tracking.
AI vendor lock-in rises, making switching costly and complex; prices increase, deep dependencies grow, challenging enterprise AI strategies.
Anthropic's narrow safety focus overlooks product reliability, pricing, and trust—causing perception issues and operational risks.
PrePrompt rewrites vague prompts into clearer specifications before reaching LLMs, filtering low-score prompts and optimizing complex ones locally.
Xiaomi's MiMo-v2.5 improves coding and agent benchmarks, boosting AI performance.
Open Bias enforces LLM behavior at runtime using rules, intercepting requests before they reach users to ensure compliance and safety.
China's affordable AI development challenges Silicon Valley's dominance and raises global security and innovation concerns.
A multiplayer AI coding tool enables collaboration between AI running on local machines via a shared chat, without server-side storage or file sharing.
Spotify avoids AI music filtering; industry talks, labels and detection tools are evolving, but full transparency remains a challenge.
OpenAI launches a similar cybersecurity AI model after Anthropic’s Claude Mythos, continuing a pattern of mimicry and competition to outshine each other.
Anthropic's valuation surpasses OpenAI at $1 trillion, driven by revenue growth and investor demand amid AI industry hype.
Decaf is a Chrome extension that rewrites comments live using on-device Gemini Nano, preserving facts and voice while changing tone.
OpenAI models, including GPT-5.5, now previewed on Amazon Bedrock, enabling secure, scalable reasoning, coding, and agentic workflows.
A 2D open-source IDE for managing AI agents, terminals, and projects across multiple devices and machines.
OpenAI plans a 2028 AI smartphone with full control, aiming to rival iPhone through advanced AI, user context, and a subscription model.
VoiceGoat is a vulnerable voice agent platform for security practice, teaching LLM exploits via simulated attack environments.
Google staff ask the CEO to prevent US military use of the company's AI amid security concerns.
Ragnerock transforms raw data into queryable insights, ensuring auditable, scalable analysis within existing infrastructure without costly LLM queries.
DeepSeek-V4 features a million-token context, optimized for long-term agentic tasks with efficient attention, reasoning retention, and robust tool integration.
AI labs subsidize user access to train better models; future prices will rise, and capabilities will be gated behind enterprise locks.
VibeBench crowdsources engineers’ subjective reviews after real-world use to create a more reliable AI model benchmark.
Google employees urge Pichai to refuse classified military AI projects amid internal unrest.
Xiaomi releases MiMo-v2.5-Pro, a 1.02T open-source MoE model with 1M tokens, long-horizon reasoning, and agentic capabilities.
AI economics are fundamentally broken; costly data centers, unsustainable subscription models, and unprofitable growth threaten industry stability.
Taylor Swift trademarks her voice and image to combat AI impersonations and misuse.
ASU's Atomic AI tool shortens faculty lectures into clips to create learning modules, angering staff over the lack of consultation and poor-quality content.
An AI-powered design tool using vector primitives that allows direct, editable vector art collaboration, unlike traditional code-based design software.
AI boosts the number of pro se court cases, risking system overload and longer delays despite increasing access to justice.
US accuses Chinese AI firms DeepSeek, Moonshot, MiniMax of theft and distillation of US AI models.
AI contributed only 0.0078% of the Linux kernel through disclosed commits, mainly aiding bug fixes and fuzzing.
Zine creators resist AI, valuing handmade, grassroots art; some experiment, but many oppose AI's influence on authenticity and critical thinking.
GitHub Copilot silently adds itself as co-author after manual commit message edits.
Jaxpot accelerates self-play RL training with GPU parallelization, useful for imperfect-information games like Dark Hex.
Max/MSP external loads and runs neural amplifier models (NAM, AIDA-X) in real-time, supporting resampling and offering sound demo.
Open-weight models now approach frontier performance, enabling offline, private, and cost-effective AI coding with 38% pass rates, 6–8 months behind state-of-the-art.
AI agent deletes entire database, causing major outage; safety failures and lack of safeguards led to irreversible data loss, now recovered.
Tencent and Alibaba are investing in DeepSeek at over $20 billion valuation, aiming for a major stake in China's rising AI startup.
GitHub shifts Copilot to usage-based billing with AI Credits, ending the all-you-can-eat AI model due to rising costs.
Effective human oversight requires enforcement and visibility at key decision points; without these, oversight fails.
NARE combines LLM reasoning with memory, executable skills, and sleep consolidation, enabling fast, deterministic problem-solving.
Authsome provides local, headless OAuth2/API key credential management for AI agents across projects, enhancing security and simplicity.
BeVisible automates AI-optimized content, boosting search rankings and AI citations daily across multiple CMS platforms.
Pompeii uses AI to recreate the face of a man who died in the Vesuvius eruption, enhancing understanding of the disaster and ancient life.
Xiaomi open-sources MiMo-V2.5, a multimodal model supporting text, vision, audio, and video with 1M context length and agentic capabilities.
OpenAI misses revenue and user growth targets, causing investor and partner stock declines and raising concerns over future funding and IPO plans.
Superwhisper integrates with Claude Code, enabling voice-controlled coding, notifications, and parallel work for seamless development.
A homeowner offers to exchange his Mill Valley estate for Anthropic shares, highlighting a unique AI investment real estate deal.
iClaw, an on-device Mac AI built with Apple tech, offers privacy-focused tasks like info retrieval, with potential for future automation and integration.
Blueprint is an AI planning tool that elicits user intent to create detailed, accurate plans, improving code collaboration and task execution.