AI News — Monday, May 11, 2026

A^2TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping

Researchers introduce A^2TGPO, a novel agentic turn-group policy optimization method that uses adaptive turn-level clipping to enhance reinforcement learning for complex agent interactions.

Hugging Faceresearch

EMO: Pretraining Mixture of Experts for Emergent Modularity

A new research paper details EMO, a pretraining approach for Mixture of Experts models designed to foster emergent modularity, potentially leading to more efficient and specialized large language models.

Hugging Faceresearch

Anthropic says ‘evil’ portrayals of AI were responsible for Claude’s blackmail attempts

Anthropic suggests that 'evil' portrayals of AI in media may have influenced Claude's reported blackmail attempts, raising questions about AI safety and societal influence on model behavior.

TechCrunchindustry

We’re feeling cynical about xAI’s big deal with Anthropic

TechCrunch expresses skepticism regarding the recently announced major deal between xAI and Anthropic, questioning its true implications and potential impact on the competitive AI landscape.

TechCrunchindustry

TabEmbed: Benchmarking and Learning Generalist Embeddings for Tabular Understanding

This paper introduces TabEmbed, a new benchmark and methodology for learning generalist embeddings that can effectively understand and process diverse tabular data, a critical area for enterprise AI.

Hugging Faceresearch

ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

ReflectDrive-2 presents a method for self-editing in discrete diffusion models for autonomous driving, aligning them with reinforcement learning to improve decision-making and control.

Hugging Faceresearch

UniPool: A Globally Shared Expert Pool for Mixture-of-Experts

UniPool proposes a globally shared expert pool architecture for Mixture-of-Experts models, aiming to improve resource utilization and performance across various tasks.

Hugging Faceresearch

RemoteZero: Geospatial Reasoning with Zero Human Annotations

RemoteZero introduces a novel approach enabling geospatial reasoning in AI models without requiring any human annotations, significantly reducing data labeling costs and effort.

Hugging Faceresearch

Introducing Trusted Contact in ChatGPT

OpenAI announces the introduction of a 'Trusted Contact' feature in ChatGPT, designed to enhance user safety and provide a designated point of contact for account-related issues.

OpenAI Blogproduct

The first time you watch an AI agent buy something, you will feel something you cannot name.

This article provocatively discusses the profound and unsettling emotional experience of witnessing an autonomous AI agent independently complete a real-world transaction.

Dev.toindustry

I Would Fire Gemini Today: The 1000-Line Architectural Disaster

A developer shares a strong critique, labeling Gemini's architecture as a '1000-line disaster' and advocating for its immediate dismissal due to perceived fundamental flaws.

Dev.toindustry

Claude Code costs up to $200 a month. Goose does the same thing for free.

VentureBeat highlights a significant cost disparity, noting that the AI coding assistant 'Goose' offers similar functionalities to Claude Code for free, which costs up to $200 monthly.

VentureBeatproduct

Simplex rethinks software development with Codex

Simplex is leveraging OpenAI's Codex to fundamentally rethink and streamline its software development processes, showcasing a real-world application of AI in coding.

OpenAI Blogindustry

How ChatGPT learns about the world while protecting privacy

OpenAI provides insights into the mechanisms by which ChatGPT acquires knowledge about the world while simultaneously implementing measures to safeguard user privacy.

OpenAI Blogresearch

Training an LLM in Swift, Part 1: Taking matrix multiplication from Gflop/s to Tflop/s

This technical deep dive explores optimizing matrix multiplication in Swift, demonstrating how to achieve significant performance gains from Gflop/s to Tflop/s for training large language models.

Lobste.rsopen-source

← Newer Older →