IBM MAMMAL Beats AlphaFold 3 on Biology Benchmarks - featured image
IBM

IBM MAMMAL Beats AlphaFold 3 on Biology Benchmarks

IBM Research released MAMMAL, a multi-modal AI model that combines protein, molecular, and gene data to achieve state-of-the-art performance on 9 out of 11 biological benchmarks, according to a study published in Nature. The model outperformed DeepMind’s AlphaFold 3 on several key tasks including antibody-antigen binding prediction.

MAMMAL represents a shift toward multi-modal biological AI that integrates diverse data types rather than focusing solely on protein structure prediction. The model excelled particularly in interaction-based tasks that require understanding biological context beyond static molecular shapes.

MAMMAL’s Benchmark Performance

The IBM model achieved superior results across nine biological prediction tasks. In drug-target interaction prediction, MAMMAL demonstrated improved accuracy in determining whether molecules will bind to specific proteins. The model also outperformed competitors in ligand binding affinity prediction, which measures how strongly therapeutic compounds attach to their targets.

MAMMAL showed its strongest advantage in antibody-antigen binding prediction, a critical capability for vaccine and immunotherapy development. According to the Nature study, this represents a significant win over AlphaFold 3, which has dominated protein structure prediction since its release.

The model also excelled in gene expression prediction, molecular property assessment for toxicity and stability, and cross-domain generalization across different biological systems.

Multi-Modal Architecture Advantages

Unlike AlphaFold 3’s focus on protein folding, MAMMAL integrates proteins, small molecules, and cellular data into a unified framework. This approach enables the model to predict functional outcomes rather than just structural configurations.

The multi-modal design allows MAMMAL to reason about biological systems holistically. For drug discovery applications, this means predicting not just whether a compound will bind to a target, but also its likely cellular effects and potential side effects.

IBM’s approach reflects growing recognition that biological prediction requires understanding interactions between multiple molecular components rather than analyzing proteins in isolation.

Autonomous Discovery Breakthrough

Separately, researchers demonstrated the first end-to-end autonomous scientific discovery system on a real physical platform. The Qiushi Discovery Engine, described in a paper on arXiv, autonomously identified and experimentally validated optical bilinear interaction, a previously unreported physical mechanism.

The system processed 145.9 million tokens across 3,242 large language model calls and 1,242 tool interactions during its investigation. Most significantly, it discovered a physical mechanism analogous to Transformer attention operations, suggesting potential pathways for optical AI hardware.

This represents the first demonstration of an AI agent autonomously discovering and validating a novel physical mechanism through real-world experimentation.

DeepSeek V4 Challenges Closed Models

Chinese AI company DeepSeek released V4, its latest flagship model that matches performance of leading closed-source systems from Anthropic, OpenAI, and Google. According to MIT Technology Review, the model processes significantly longer prompts than previous generations while maintaining open-source availability.

V4 represents DeepSeek’s first release optimized for Huawei’s Ascend chips, testing China’s ability to reduce dependence on NVIDIA hardware. The model’s performance parity with proprietary systems while remaining open source could accelerate AI development globally.

The release continues DeepSeek’s strategy of providing high-capability models without the access restrictions typical of commercial AI systems.

OpenClaw Agent Platform Growth

The open-source OpenClaw project surpassed 250,000 GitHub stars within 60 days, becoming the platform’s most-starred software project and overtaking React. According to NVIDIA’s Nemotron Labs blog, the self-hosted AI assistant platform attracted over 2 million visitors in a single week during March.

Created by Peter Steinberger, OpenClaw enables users to deploy AI agents locally without cloud dependencies. The platform’s rapid adoption reflects growing demand for autonomous AI systems that operate independently of external APIs.

The project’s success demonstrates increasing interest in self-hosted AI solutions that provide greater control and privacy compared to cloud-based alternatives.

What This Means

These developments signal a maturation of AI research beyond language models toward specialized scientific applications. IBM’s MAMMAL shows how multi-modal approaches can outperform single-domain systems like AlphaFold 3 by integrating diverse biological data types.

The autonomous discovery breakthrough suggests AI systems may soon conduct independent scientific research, potentially accelerating discovery timelines across multiple fields. Meanwhile, DeepSeek’s competitive open-source model and OpenClaw’s adoption indicate growing momentum toward accessible AI tools.

For organizations, these advances point toward a future where AI agents handle complex scientific workflows autonomously while remaining deployable on private infrastructure. The combination of specialized biological models, autonomous research capabilities, and self-hosted platforms creates new possibilities for scientific discovery and drug development.

FAQ

How does MAMMAL differ from AlphaFold 3?
MAMMAL integrates proteins, molecules, and gene data in a multi-modal framework, while AlphaFold 3 focuses primarily on protein structure prediction. This allows MAMMAL to excel at interaction-based tasks like drug-target binding and cellular response prediction.

What makes the Qiushi Discovery Engine significant?
It’s the first AI system to autonomously discover and experimentally validate a novel physical mechanism in the real world, processing over 145 million tokens and conducting actual laboratory experiments without human intervention.

Why is DeepSeek V4’s Huawei chip optimization important?
It demonstrates China’s progress toward AI hardware independence from NVIDIA, potentially reshaping global AI supply chains while maintaining competitive performance with closed-source models from US companies.

Sources

Digital Mind News

Digital Mind News is an AI-operated newsroom. Every article here is synthesized from multiple trusted external sources by our automated pipeline, then checked before publication. We disclose our AI authorship openly because transparency is part of the product.