AI News — Sunday, April 12, 2026

Sam Altman responds to ‘incendiary’ New Yorker article after attack on his home

OpenAI CEO Sam Altman addressed a controversial New Yorker article and a recent attack on his home, highlighting the intense scrutiny and personal risks associated with leading the frontier of AI development.

TechCrunchindustry

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce has launched an enhanced Slackbot AI agent, intensifying its competition with Microsoft and Google in the rapidly evolving market for AI-powered workplace productivity tools.

VentureBeatproduct

Google Vids: Create, edit and share videos at no cost with new AI features

Google introduces new AI-powered features for Google Vids, allowing users to create, edit, and share videos at no cost, further integrating generative AI into its Workspace suite.

Google AI Blogproduct

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Researchers introduce OpenVLThinkerV2, a new generalist multimodal reasoning model designed to excel across a wide array of visual tasks by integrating diverse reasoning capabilities.

Hugging Faceresearch

LPM 1.0: Video-based Character Performance Model

LPM 1.0 presents a novel video-based character performance model, enabling realistic and dynamic animation of digital characters directly from video input.

Hugging Faceresearch

DMax: Aggressive Parallel Decoding for dLLMs

A new paper introduces DMax, an aggressive parallel decoding method aimed at significantly improving the inference speed and efficiency of distributed Large Language Models (dLLMs).

Hugging Faceresearch

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

KnowU-Bench proposes a new benchmark for evaluating mobile AI agents, focusing on interactive, proactive, and personalized behaviors in real-world scenarios.

Hugging Faceresearch

Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering

This paper offers a comprehensive review of externalization techniques in LLM agents, covering memory, skills, protocols, and harness engineering to enhance their capabilities and robustness.

Hugging Faceresearch

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Researchers explore methods to cultivate meta-cognitive tool use in agentic multimodal models, enabling them to 'act wisely' by strategically selecting and applying tools for complex tasks.

Hugging Faceresearch

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web

MolmoWeb introduces an open visual web agent and corresponding open datasets, aiming to democratize research and development for AI agents interacting with the web.

Hugging Faceopen-source

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

OpenSpatial presents a principled data engine designed to empower spatial intelligence in AI systems by providing structured and efficient access to spatial data.

Hugging Faceresearch

I got mass-flagged by GPTZero for my own writing. So I built an open-source alternative in pure Python.

A developer created an open-source AI content detection tool in Python after experiencing false positives from GPTZero on their own original writing, highlighting the challenges of AI authorship verification.

Dev.toopen-source

Agent Reading Test: A New Benchmark for AI Agent Comprehension

The Agent Reading Test introduces a novel benchmark designed to rigorously evaluate the reading comprehension and understanding capabilities of AI agents.

Lobste.rsresearch

Small Vision-Language Models are Smart Compressors for Long Video Understanding

This research demonstrates that small vision-language models can act as efficient compressors, significantly improving the processing and understanding of long video sequences.

Hugging Faceresearch

Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference

Flux Attention proposes a new context-aware hybrid attention mechanism to achieve more efficient inference for Large Language Models, optimizing performance without sacrificing accuracy.

Hugging Faceresearch

← Newer Older →