AI News — Sunday, April 12, 2026

9
Sam Altman responds to ‘incendiary’ New Yorker article after attack on his home

OpenAI CEO Sam Altman addressed a controversial New Yorker article and a recent attack on his home, highlighting the intense scrutiny and personal risks associated with leading the frontier of AI development.

TechCrunchindustry
8
Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce has launched an enhanced Slackbot AI agent, intensifying its competition with Microsoft and Google in the rapidly evolving market for AI-powered workplace productivity tools.

VentureBeatproduct
8
Google Vids: Create, edit and share videos at no cost with new AI features

Google introduces new AI-powered features for Google Vids, allowing users to create, edit, and share videos at no cost, further integrating generative AI into its Workspace suite.

Google AI Blogproduct
7
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Researchers introduce OpenVLThinkerV2, a new generalist multimodal reasoning model designed to excel across a wide array of visual tasks by integrating diverse reasoning capabilities.

Hugging Faceresearch
7
LPM 1.0: Video-based Character Performance Model

LPM 1.0 presents a novel video-based character performance model, enabling realistic and dynamic animation of digital characters directly from video input.

Hugging Faceresearch
7
DMax: Aggressive Parallel Decoding for dLLMs

A new paper introduces DMax, an aggressive parallel decoding method aimed at significantly improving the inference speed and efficiency of distributed Large Language Models (dLLMs).

Hugging Faceresearch
6
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

KnowU-Bench proposes a new benchmark for evaluating mobile AI agents, focusing on interactive, proactive, and personalized behaviors in real-world scenarios.

Hugging Faceresearch
6
Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering

This paper offers a comprehensive review of externalization techniques in LLM agents, covering memory, skills, protocols, and harness engineering to enhance their capabilities and robustness.

Hugging Faceresearch
6
Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Researchers explore methods to cultivate meta-cognitive tool use in agentic multimodal models, enabling them to 'act wisely' by strategically selecting and applying tools for complex tasks.

Hugging Faceresearch
6
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

MolmoWeb introduces an open visual web agent and corresponding open datasets, aiming to democratize research and development for AI agents interacting with the web.

Hugging Faceopen-source
5
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

OpenSpatial presents a principled data engine designed to empower spatial intelligence in AI systems by providing structured and efficient access to spatial data.

Hugging Faceresearch
5
I got mass-flagged by GPTZero for my own writing. So I built an open-source alternative in pure Python.

A developer created an open-source AI content detection tool in Python after experiencing false positives from GPTZero on their own original writing, highlighting the challenges of AI authorship verification.

Dev.toopen-source
5
Agent Reading Test: A New Benchmark for AI Agent Comprehension

The Agent Reading Test introduces a novel benchmark designed to rigorously evaluate the reading comprehension and understanding capabilities of AI agents.

Lobste.rsresearch
5
Small Vision-Language Models are Smart Compressors for Long Video Understanding

This research demonstrates that small vision-language models can act as efficient compressors, significantly improving the processing and understanding of long video sequences.

Hugging Faceresearch
5
Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference

Flux Attention proposes a new context-aware hybrid attention mechanism to achieve more efficient inference for Large Language Models, optimizing performance without sacrificing accuracy.

Hugging Faceresearch