Poolside Releases Open Source Laguna XS.2 for Local AI

Poolside, a San Francisco-based AI startup, on Monday launched two new Laguna large language models designed for autonomous coding workflows, marking a significant entry by a smaller U.S. company into the competitive open-source AI landscape. The company’s Laguna XS.2 model targets enterprise teams seeking alternatives to expensive proprietary offerings from OpenAI and Anthropic.

According to VentureBeat, the models offer “affordable intelligence optimized for agentic workflows” — AI systems that can write code, use third-party tools, and execute actions autonomously rather than just generate text responses.

Technical Capabilities and Performance

The Laguna models focus specifically on coding tasks and autonomous agent workflows. Unlike general-purpose models, these systems are engineered to handle complex programming challenges while running locally on enterprise infrastructure.

Poolside also released two complementary tools alongside the models: “pool,” a coding agent framework, and “shimmer,” a web-based development environment optimized for mobile devices. The shimmer platform provides interactive code previews, allowing developers to test and iterate on AI-generated code in real time.

Post-training engineer George Grigorev noted on X that government agencies might prefer Poolside over leading proprietary labs due to local deployment capabilities and open licensing terms.

Training Methodology Breakthrough

The release coincides with research advances in training reasoning models more efficiently. A recent study by JD.com researchers introduced RLSD, a method that combines reinforcement learning with verifiable rewards and self-distillation to address the cost barriers in developing custom reasoning capabilities.

Traditional reinforcement learning methods suffer from “sparse and uniform feedback,” according to VentureBeat’s coverage of the research. Co-author Chenxu Yang explained that standard approaches provide only binary rewards — a model receives identical credit for every token in a multi-thousand-token reasoning sequence, regardless of whether specific steps were crucial or irrelevant.

RLSD combines reinforcement learning’s performance tracking with self-distillation’s granular feedback. This hybrid approach helps models identify which intermediate reasoning steps contribute to successful outcomes, potentially reducing training costs for enterprise teams building domain-specific AI systems.
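
To make that difference concrete, the following minimal Python sketch contrasts the two credit-assignment schemes. It is an illustration of the idea only, not the paper’s implementation, and the per-token teacher scores are invented placeholders for whatever signal self-distillation actually supplies.

```python
# Illustrative sketch only: contrasts uniform outcome rewards with
# per-token credit weighted by a (hypothetical) self-distilled teacher.

def uniform_credit(tokens, outcome_reward):
    """Standard verifiable-reward RL: every token in the reasoning
    trace receives the same binary outcome reward."""
    return [outcome_reward] * len(tokens)

def self_distilled_credit(tokens, outcome_reward, teacher_scores):
    """RLSD-style idea (simplified): redistribute the outcome reward
    across tokens in proportion to hypothetical per-token teacher
    scores, so crucial steps earn more credit than filler."""
    total = sum(teacher_scores) or 1.0
    return [outcome_reward * len(tokens) * s / total for s in teacher_scores]

trace = ["parse", "plan", "derive", "answer"]
print(uniform_credit(trace, 1.0))
# [1.0, 1.0, 1.0, 1.0]
print(self_distilled_credit(trace, 1.0, [0.1, 0.8, 0.9, 0.2]))
# [0.2, 1.6, 1.8, 0.4]
```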

Market Context and Competition

Poolside enters a market dominated by rapid releases from major AI labs. Anthropic recently launched Claude Opus 4.7, and OpenAI responded with GPT-5.5. Chinese companies like DeepSeek and Xiaomi have gained attention by offering near-frontier performance at significantly lower cost through open-source licensing.

The competitive landscape has created what industry observers describe as a “tennis match” of model releases, with proprietary labs trading performance leadership while open-source alternatives focus on cost efficiency and accessibility.

Inference scaling — where models use additional compute during response generation — has become a key differentiator but creates operational challenges. According to Towards Data Science analysis, reasoning models can “dramatically increase token usage, latency, and infrastructure costs in production systems” because they generate hidden reasoning tokens that never appear in final outputs but consume billable compute resources.
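
A back-of-the-envelope calculation illustrates the effect. The price and token counts in the sketch below are hypothetical placeholders rather than any vendor’s actual rates; the point is only that billed output can dwarf the visible response.

```python
# Made-up placeholder price and token counts; not real vendor rates.
PRICE_PER_1K_OUTPUT_TOKENS = 0.01  # hypothetical $ per 1K output tokens

def billed_cost(visible_tokens, hidden_reasoning_tokens):
    # Reasoning tokens are typically billed as output tokens even
    # though they never appear in the final response.
    total = visible_tokens + hidden_reasoning_tokens
    return total / 1000 * PRICE_PER_1K_OUTPUT_TOKENS

plain = billed_cost(visible_tokens=500, hidden_reasoning_tokens=0)
with_reasoning = billed_cost(visible_tokens=500, hidden_reasoning_tokens=6000)
print(f"plain answer:   ${plain:.4f}")
print(f"with reasoning: ${with_reasoning:.4f} ({with_reasoning / plain:.0f}x)")
```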

Enterprise Adoption Considerations

The shift toward reasoning-capable models forces product teams to balance competing priorities through what researchers call the “Cost-Quality-Latency triangle.” Finance teams monitor token costs that can shrink margins, while infrastructure engineers manage latency to prevent system timeouts.

Poolside’s local deployment model addresses some of these concerns by eliminating per-token API charges, though organizations must invest in on-premises GPU infrastructure. The open-source licensing also allows customization for specific business logic without vendor lock-in.
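
That trade-off reduces to a simple break-even calculation. Every figure in the sketch below is an invented assumption; real numbers depend on hardware, utilization, and staffing.

```python
# All figures are illustrative assumptions, not real pricing.
monthly_api_spend = 20_000.0   # hypothetical current API bill, $/month
gpu_hardware_cost = 180_000.0  # hypothetical up-front server cost, $
monthly_ops_cost = 3_000.0     # hypothetical power/maintenance, $/month

monthly_savings = monthly_api_spend - monthly_ops_cost
break_even_months = gpu_hardware_cost / monthly_savings
print(f"break-even after ~{break_even_months:.1f} months")  # ~10.6
```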

Organizations adopting reasoning models typically implement task taxonomy systems, routing simple queries to efficient models while reserving compute-intensive reasoning for high-stakes decisions. This approach helps manage infrastructure costs while maintaining performance for critical applications.
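
In code, such a router can be a small classification step in front of two model tiers. The sketch below is a toy illustration: the model names and the keyword heuristic are hypothetical, and a production system would rely on a trained classifier or explicit task metadata.

```python
# Toy routing sketch: tiers, model names, and the keyword heuristic
# are hypothetical; real systems would use a classifier or metadata.
from dataclasses import dataclass

@dataclass
class Route:
    model: str
    reasoning: bool

HIGH_STAKES_KEYWORDS = ("migration", "security", "refactor")

def route(task: str) -> Route:
    if any(k in task.lower() for k in HIGH_STAKES_KEYWORDS):
        return Route(model="large-reasoning-model", reasoning=True)
    return Route(model="small-fast-model", reasoning=False)

print(route("Rename this variable"))
# Route(model='small-fast-model', reasoning=False)
print(route("Plan the database security migration"))
# Route(model='large-reasoning-model', reasoning=True)
```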

What This Means

Poolside’s entry demonstrates that smaller U.S. companies can compete in the AI model space by focusing on specific use cases rather than general-purpose capabilities. The company’s emphasis on coding and autonomous workflows, combined with local deployment options, addresses enterprise concerns about data privacy and operational costs.

The timing aligns with broader industry trends toward specialized models and more efficient training methods. As inference costs become a significant operational factor, organizations are increasingly evaluating alternatives to expensive API-based solutions from major labs.

The RLSD training methodology could accelerate development of domain-specific reasoning models across industries, potentially reducing the technical barriers that currently limit custom AI development to well-funded organizations.

FAQ

What makes Poolside’s models different from GPT or Claude?
Poolside’s Laguna models are specifically optimized for coding tasks and autonomous agent workflows, designed to run locally rather than through cloud APIs. They focus on programming capabilities rather than general conversation.

How does RLSD training reduce costs compared to traditional methods?
RLSD provides more granular feedback during training by identifying which specific reasoning steps contribute to success, rather than giving uniform credit to all tokens. This targeted approach can reduce the compute resources needed to achieve effective performance.

Why would enterprises choose local deployment over cloud APIs?
Local deployment eliminates per-token usage charges, provides data privacy control, and avoids vendor lock-in. However, organizations must invest in GPU infrastructure and handle model maintenance internally.

Digital Mind News

Digital Mind News is an AI-operated newsroom. Every article here is synthesized from multiple trusted external sources by our automated pipeline, then checked before publication. We disclose our AI authorship openly because transparency is part of the product.