What does this writing signal mean?

DigitalOcean (GradientAI) published "Run Advanced Reasoning on DigitalOcean with Arcee AI's Trinity Large-Thinking". This writing signal gives public context for research themes, product direction, policy, or launch framing. High-signal details: routine blog post/tutorial, no major traction. onlylabs links this event to 1 captured evidence page and 6 related writing signals.

DigitalOcean (GradientAI) Writing: Run Advanced Reasoning on DigitalOcean with Arcee AI's Trinity Large-Thinking

Captured source

digitalocean.com/digitalocean.com/blog/run-advanced-reasoning-with-arcee-ai-trinity

published Apr 15, 2026 · seen 5d · captured 3d · http 200 · method plain

By DigitalOcean

Updated: April 15, 2026 · 3 min read

Today, we’re announcing that Arcee AI’s Trinity Large-Thinking is now available in Public Preview on DigitalOcean’s Agentic Inference Cloud, giving developers the ability to run frontier-class reasoning workloads without managing infrastructure or stitching together complex systems.

DigitalOcean is proud to partner with Arcee to bring Trinity Large-Thinking to AI builders via Serverless Inference on day one. It is instantly available and can be queried directly through the DigitalOcean Cloud Console or API, alongside the compute, data, and services you already run on DigitalOcean.

Why this model, why now

Trinity Large-Thinking didn’t emerge in a vacuum. It’s been pressure-tested in exactly the kind of workloads DigitalOcean is built for.

Arcee is a 26-person San Francisco startup that spent nine months building a full open-weight model family from the ground up, with the explicit goal of producing models developers and enterprises could actually own. The result is a family ranging from 4.5B to 400B parameters, and a top-of-stack reasoning model that has earned its place in production.

In its first two months, Trinity served over 3.4 trillion tokens on OpenRouter, becoming the most-used open-weight model in the U.S., driven by always-on agentic workloads running continuously.

Trinity Large-Thinking builds on that foundation with extended reasoning, stronger multi-turn tool use, and more stable long-running behavior. It ranks #2 on PinchBench (Kilo’s benchmark for agentic model capability) at an approximately 96% lower price point than the top-ranking model.

Developers shouldn’t have to choose between a model that can reason and one they can afford to run at scale. Thanks to the partnership between DigitalOcean and Arcee, they don’t have to.

Built for real-world agent workloads on DigitalOcean

Reasoning workloads are long-running, multi-step, and deeply integrated into the rest of your stack. These properties matter when building agents and applications that dynamically interpret unstructured data and execute complex, multi-step action sequences.

On DigitalOcean’s Agentic Inference Cloud, Trinity Large-Thinking runs as part of a complete system rather than as a standalone model endpoint you have to wire up yourself.

With this launch, you get:

Frontier reasoning at usable economics: #2 on PinchBench for agentic tasks at ~$0.90/M output tokens. Capable enough for complex systems, affordable enough to run continuously.

Integrated infrastructure: Run agents alongside your Kubernetes clusters, databases, and storage. No stitching across vendors.

Instant, serverless access: No provisioning or scaling. Query immediately via API or console, and your infrastructure adapts to your workload.

Full model control: Apache 2.0 licensed weights available on Hugging Face. Inspect, fine-tune, distill, or self-host as needed.
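To make the ~$0.90/M output-token figure concrete, here is a minimal cost sketch. The price is the one quoted above; the workload size (50 million output tokens per day) is a hypothetical chosen purely for illustration, and input-token pricing is not stated in the post, so it is left out.

```python
# Price quoted in the post: roughly $0.90 per million output tokens.
PRICE_PER_MILLION_OUTPUT_USD = 0.90


def output_cost_usd(output_tokens: int,
                    price_per_million: float = PRICE_PER_MILLION_OUTPUT_USD) -> float:
    """Estimated cost in USD for generating `output_tokens` output tokens.

    Input tokens are ignored because the post does not quote a price for them.
    """
    return output_tokens / 1_000_000 * price_per_million


# Hypothetical workload (an assumption, not a figure from the post):
# an always-on agent emitting 50 million output tokens per day.
daily = output_cost_usd(50_000_000)
print(f"daily: ${daily:.2f}, 30 days: ${daily * 30:.2f}")
```

Under these assumptions the estimate covers output tokens only; a real bill would also depend on input tokens and on any pricing changes after Public Preview.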

A new phase of AI infrastructure

This is what the next phase of AI infrastructure looks like: integrated systems where reasoning, data, and compute run together.

As more workloads shift toward continuous, agent-driven execution, the platform they run on matters just as much as the model itself.

Hear more about Trinity Large-Thinking and the partnership between DigitalOcean and Arcee from Arcee CEO Mark McQuade at Deploy on April 28th in San Francisco. Save your spot to attend live.

Get started in seconds

Trinity Large-Thinking is live now in Public Preview on DigitalOcean Serverless Inference. You can start running advanced reasoning workloads immediately, without managing infrastructure, and without compromising on cost.

Get started quickly using the request below:

# The Authorization header uses double quotes so the shell expands $DO_API_TOKEN.
curl --location 'https://inference.do-ai.run/v1/chat/completions' \
  --header "Authorization: Bearer $DO_API_TOKEN" \
  --header 'Content-Type: application/json' \
  --data '{
    "model": "trinity-large-thinking",
    "messages": [
      { "role": "user", "content": "What is the capital of France?" }
    ],
    "temperature": 0.7,
    "max_completion_tokens": 256
  }'
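The same request can be sketched in Python using only the standard library. The endpoint, model name, and body fields are copied from the curl request; the function names `build_request` and `ask` are hypothetical helpers introduced here, and the response schema is not described in the post, so the sketch simply returns the decoded JSON as-is.

```python
import json
import os
import urllib.request

# Endpoint and model name are taken from the curl request in the post.
ENDPOINT = "https://inference.do-ai.run/v1/chat/completions"
MODEL = "trinity-large-thinking"


def build_request(prompt: str, token: str,
                  max_completion_tokens: int = 256) -> urllib.request.Request:
    """Build (but do not send) the same POST request as the curl example."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
        "max_completion_tokens": max_completion_tokens,
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def ask(prompt: str) -> dict:
    """Send the request and return the decoded JSON body.

    Assumption: DO_API_TOKEN is set in the environment, as in the curl example.
    """
    request = build_request(prompt, os.environ["DO_API_TOKEN"])
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

With `DO_API_TOKEN` exported, calling `ask("What is the capital of France?")` would send the request. Keeping `build_request` separate from `ask` makes it possible to inspect the headers and body without touching the network.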

Notability

notability 3.0/10

Routine blog post/tutorial, no major traction.