Blog

Company Updates & Technology Articles

February 4, 2026

Partnerships

Scale AI and Webster University Launch New Technical Writing Certificate to Advance AI-Driven Workforce

Scale AI and Webster University Launch New Technical Writing Certificate to Advance AI-Driven Workforce

Scale AI partners with Webster University to launch a technical writing certificate advancing AI workforce skills.

Read more

February 3, 2026

Research

Moltbook, Agent Collectives, and the AI Risk Matrix

Moltbook, Agent Collectives, and the AI Risk Matrix

The emergence of Moltbook offer an early glimpse into AI agents operating collectively at scale. These systems demonstrate how interaction between autonomous agents can give rise to emergent behaviors that are not attributable to any single model or prompt, complicating traditional, model-level approaches to AI safety. Viewed through the AI Risk Matrix, such agent collectives point to a distinct and underdeveloped category of system-level risk with implications for evaluation, red teaming, and governance.

Read more

February 2, 2026

People

The People Behind the Models: Meet Scott O’Neil

The People Behind the Models: Meet Scott O’Neil

Scott O’Neill is a plumbing sales professional in Louisiana who contributes to building better AI models in his spare time. Between a full-time job and raising two young daughters, Scott uses flexible, remote work through Outlier to stay connected to technology, apply his problem-solving skills, and continue learning. His story reflects how people from diverse backgrounds are helping shape the future of AI on schedules that fit real life.

Read more

January 22, 2026

Company

Scale’s Next Era: Building for 2026

Scale’s Next Era: Building for 2026

Scale CEO Jason Droege reflects on a record-breaking 2025 and shares how Scale is building reliable, production-ready AI systems for 2026.

Read more

January 13, 2026

Government

The Next Phase of U.S. AI Policy: Governance, Implementation, and Global Leadership

The Next Phase of U.S. AI Policy: Governance, Implementation, and Global Leadership

What it will take for the United States to move from AI experimentation to real governance, government-wide implementation, and lasting global leadership.

Read more

January 12, 2026

General

What's different about enterprise healthcare AI? | Human in the Loop Episode 17

What's different about enterprise healthcare AI? | Human in the Loop Episode 17

The team is kicking off 2026 like the rest of us: by focusing on health(care)! They discuss why adopting AI in healthcare is different from other enterprise AI initiatives and how leaders can account for those differences. And as always, they react to some of the internet's hottest takes on AI (healthcare edition).

Read more

January 8, 2026

Government

Securing America’s Decision Advantage

Securing America’s Decision Advantage

How agentic AI systems give the U.S. military decision advantage through faster planning, alerting, and command and control.

Read more

December 22, 2025

Research

MoReBench: Evaluating the Process of AI Moral Reasoning

MoReBench: Evaluating the Process of AI Moral Reasoning

MoReBench is a large-scale benchmark for evaluating AI moral reasoning beyond final outcomes. Instead of scoring answers alone, it assesses the intermediate reasoning traces models produce when navigating 1,000 morally ambiguous, real-world scenarios. Our findings show that moral reasoning is a distinct and underdeveloped capability, largely uncorrelated with performance on traditional math and coding benchmarks.

Read more

December 19, 2025

Government

The Agentic Era: Building the Foundation for Autonomous Mission Assurance

The Agentic Era: Building the Foundation for Autonomous Mission Assurance

Agentic AI marks a shift from reactive chatbots to autonomous mission partners. Government must adopt unified Agentic Infrastructure—combining resilient agent execution and governed AgentOps—to enable machine-speed decisions. Platforms like Scale’s SGP and Agentex deliver interoperable, durable, and accountable autonomy for mission assurance.

Read more

December 19, 2025

Research

Open-Sourcing MCP-Atlas: A Benchmark for Real Tool Use

Open-Sourcing MCP-Atlas: A Benchmark for Real Tool Use

We’re open-sourcing MCP-Atlas, including the dataset, evaluation environment, and updated results for a benchmark designed to measure how reliably AI agents use real tools. MCP-Atlas evaluates realistic, multi-step workflows that run against real Model Context Protocol servers, exposing where agents succeed—and where they still fail—when tool discovery, parameterization, and execution must work together.

Read more