Multimodal AI Accelerates: Five Models Redefine Vision and Video
ByteDance Research released Lance, a 3B-parameter open-source model handling image and video generation and editing in…
Thinking Machines Lab previewed real-time multimodal interaction models built by Mira Murati's team, while Perceptron launched…
Perceptron Inc. launched Mk1, a video analysis model priced at $0.15/$1.50 per million tokens — 80–90%…
Thinking Machines Lab previewed real-time interaction models for continuous audio and video input, while Perceptron released…
Thinking Machines Lab previewed real-time multimodal interaction models, Perceptron released a video analysis model priced 80–90%…
Multimodal AI systems are advancing rapidly with new real-time interaction capabilities, 80–90% cost reductions for video…
Thinking Machines introduced real-time interaction models that process voice and video simultaneously, while Perceptron launched video…
Multimodal AI systems are advancing rapidly with new models offering 80–90% cost reductions, real-time interaction capabilities,…
Multimodal AI made significant strides this week with Thinking Machines previewing real-time interaction models and Perceptron…
AI models trained on different data types are converging toward identical internal representations of reality as…
Three AI startups launched specialized models this week: Perceptron's video analysis model at 90% lower cost…
Research shows major AI models are converging on similar internal representations of reality despite different training…