Browsing: multimodal-AI

Google’s DeepMind has enhanced its Gemini 2.5 Flash Native Audio model with improved function calling precision, robust instruction following, and smoother conversational capabilities. The technical improvements are being deployed through Google Translate’s live speech translation feature, currently rolling out to Android users in select markets as a real-world testbed for the enhanced multimodal AI architecture.