Multimodal AI | Digital Mind News

Enterprise

Alibaba Cloud's HappyHorse 1.1 video model reached No. 2 globally in June 2026 as OpenAI's Sora…

2026-06-24

Google

Google unveiled Gemini Omni at I/O 2026 as its first native any-to-any multimodal model, while ByteDance…

2026-06-23

AI

ByteDance Research released Lance, a 3B-parameter open-source model handling image and video generation and editing in…

2026-05-19

AI

Thinking Machines Lab previewed real-time multimodal interaction models built by Mira Murati's team, while Perceptron launched…

2026-05-18

OpenAI

Perceptron Inc. launched Mk1, a video analysis model priced at $0.15/$1.50 per million tokens — 80–90%…

2026-05-17

OpenAI

Thinking Machines Lab previewed real-time interaction models for continuous audio and video input, while Perceptron released…

2026-05-16

OpenAI

Thinking Machines Lab previewed real-time multimodal interaction models, Perceptron released a video analysis model priced 80–90%…

2026-05-16

AI

Multimodal AI systems are advancing rapidly with new real-time interaction capabilities, 80–90% cost reductions for video…

2026-05-16

AI

Thinking Machines introduced real-time interaction models that process voice and video simultaneously, while Perceptron launched video…

2026-05-15

AI

Multimodal AI systems are advancing rapidly with new models offering 80–90% cost reductions, real-time interaction capabilities,…

2026-05-15

AI

Multimodal AI made significant strides this week with Thinking Machines previewing real-time interaction models and Perceptron…

2026-05-14

Enterprise

AI models trained on different data types are converging toward identical internal representations of reality as…

2026-05-14