OpenAI CEO Sam Altman addressed a controversial New Yorker article and a recent attack on his home, highlighting the intense scrutiny and personal risks associated with leading the frontier of AI development.
AI News — Sunday, April 12, 2026
Salesforce has launched an enhanced Slackbot AI agent, intensifying its competition with Microsoft and Google in the rapidly evolving market for AI-powered workplace productivity tools.
Google introduces new AI-powered features for Google Vids, allowing users to create, edit, and share videos at no cost, further integrating generative AI into its Workspace suite.
Researchers introduce OpenVLThinkerV2, a new generalist multimodal reasoning model designed to excel across a wide array of visual tasks by integrating diverse reasoning capabilities.
LPM 1.0 presents a novel video-based character performance model, enabling realistic and dynamic animation of digital characters directly from video input.
A new paper introduces DMax, an aggressive parallel decoding method aimed at significantly improving the inference speed and efficiency of distributed Large Language Models (dLLMs).
KnowU-Bench proposes a new benchmark for evaluating mobile AI agents, focusing on interactive, proactive, and personalized behaviors in real-world scenarios.
This paper offers a comprehensive review of externalization techniques in LLM agents, covering memory, skills, protocols, and harness engineering to enhance their capabilities and robustness.
Researchers explore methods to cultivate meta-cognitive tool use in agentic multimodal models, enabling them to 'act wisely' by strategically selecting and applying tools for complex tasks.
MolmoWeb introduces an open visual web agent and corresponding open datasets, aiming to democratize research and development for AI agents interacting with the web.
OpenSpatial presents a principled data engine designed to empower spatial intelligence in AI systems by providing structured and efficient access to spatial data.
A developer created an open-source AI content detection tool in Python after experiencing false positives from GPTZero on their own original writing, highlighting the challenges of AI authorship verification.
The Agent Reading Test introduces a novel benchmark designed to rigorously evaluate the reading comprehension and understanding capabilities of AI agents.
This research demonstrates that small vision-language models can act as efficient compressors, significantly improving the processing and understanding of long video sequences.
Flux Attention proposes a new context-aware hybrid attention mechanism to achieve more efficient inference for Large Language Models, optimizing performance without sacrificing accuracy.