WritingCoreWeaveCoreWeavepublished May 20, 2026seen 6d

Announcing Distributed AI on CoreWeave With Fully Managed Ray on Anyscale

Open original ↗

Captured source

source ↗

Announcing distributed AI on CoreWeave with fully managed Ray on Anyscale

Announcement

Announcement

Webinar

Announcement

Podcast

Announcement

GTC 2026

Announcement

CoreWeave brings up the industry’s first NVIDIA Vera Rubin NVL72 deployment.

Read more

Products

Data and storage

Infrastructure control

Runtime acceleration

Model and agent development

Mission control

Solutions

Pricing

Resources

About us

Contact us Login

Contact us Login

Clear

We’re excited to announce that Anyscale, powered by Ray, can now be deployed directly in CoreWeave customer accounts via BYOC on CoreWeave Kubernetes Service (CKS) .

Ray is an open-source distributed framework for parallelizing Python and AI applications. As a python-native framework that supports unstructured data formats and orchestration over heterogeneous clusters, those with CPUs and GPUs, Ray abstracts the complexity of any distributed AI workloads such as training, fine-tuning, inference, or multimodal data processing. Better workload balancing can help unlock the full potential of your AI ambitions. Learn in our blog how CoreWeave and Ray can maximize utilization, reduce idle time, and accelerate time to market. Developed by the original creators of Ray, Anyscale eliminates the complexity of developing, operating, and scaling distributed data and AI workloads with Ray. Anyscale helps teams build and scale AI fast and efficiently with extensive developer tooling, resilient clusters, proprietary performance improvements (RayTurbo), and built-in governance controls—all purpose-built for Ray. ‍

Anyscale on CoreWeave CKS ‍ With today’s announcement, CoreWeave now offers first-party support of Anyscale for CKS, making it easier than ever to run distributed AI workloads with Ray directly within your CoreWeave environment. By deploying fully managed Ray clusters in their own account, CoreWeave customers maintain full control of their data in CoreWeave AI Object Storage while being able to efficiently use a wide range of accelerated compute options. Together with the Weights & Biases integration with Anyscale via Ray, CoreWeave customers can further streamline the path to production for their AI workloads with scalable experiment tracking and artifact management.

We built CoreWeave AI Object Storage to specifically meet the needs and conquer the usual storage challenges of working with AI workloads. See how CoreWeave AI Object Storage supports the performance, speed, and scale AI ambitions need to succeed. ‍

Seamless Ray Workload Experiment Tracking and Observability with Weights & Biases ‍

OOTB Integration Between Experiment Tracking in W&B and Anyscale Logs ‍ Enhanced performance and reliability with distributed AI on CoreWeave

Anyscale offers many clear advantages over open-source Ray. By running Anyscale on  CoreWeave’s purpose-built AI Cloud platform, customers can make full use of AI infrastructure highly optimized for machine learning at scale. By running Anyscale workloads on CoreWeave clusters, customers gain significant benefits that result in clear competitive advantage: Highly efficient cluster validation: A best-in-class validation suite performs comprehensive checks to ensure cluster readiness. This continuously evaluates infrastructure components, including GPUs, CPUs, memory, storage, and networking. It also verifies functional health to confirm the cluster is fully prepared to support large-scale, production-grade workloads at the time of delivery.

Proactive health checking and monitoring: Automated, proactive health checks continuously monitor Ray nodes, detecting early signs of potential issues and remediating the source of anomalous behavior before it impacts workloads.

World-class observability: Users benefit directly from a vast and ultra-granular array of metrics as detailed as GPU temperatures or real-time ingress and egress traffic right at their fingertips. Utilize powerful, intuitive dashboards with a comprehensive view of entire fleets, allowing customers to identify under-optimized workloads and correlate job interruptions to underlying issues.

Industry-leading performance: All of these reliability and observability features combined allow customers to get the most out of their AI workloads, unlocking an industry-leading standard of up to 96% goodput.

‍ Want to learn how CoreWeave provides up to 96% goodput across AI workloads? Learn in this blog how our AI Cloud platform optimizes every layer of the stack. Finally, Anyscale is a fully managed platform, with access control, single sign-on with SAML, and quota management. These features are critical to scaling Ray beyond just a few clusters. ‍ Why Anyscale vs. Self-Managed Ray on CKS?

While the Ray compute framework is built for distributed execution, and KubeRay—the open-source Kubernetes (K8s) operator—can help teams get started with managing their first Ray workload, together they are just great starting foundations. To reliably scale Ray to a production-ready platform and get the most out of CoreWeave’s cutting-edge GPU infrastructure , teams still need to build out many other components that include developer tooling, comprehensive observability, cost controls, job queues, user management, and more. Anyscale shortcuts the path to production and eliminates the operational burden of building out developer tooling and managing Ray by providing: Anyscale Workspaces: Use an i nteractive dev console with advanced workload observability to help you debug quickly, build faster, and seamlessly transition from dev to prod. This interface also makes it easy for devs to self-serve infrastructure needed for development without any K8s expertise required. ‍ Fully Managed Ray Clusters: Offload operational complexity with proactive unhealthy node draining and replacement, fast autoscaling, and integrated monitoring using managed Prometheus and Grafana dashboard. ‍ RayTurbo: Boost performance with workload-specific optimizations not available in the open-source distribution. This includes the ability to run 5x faster data preprocessing with RayTurbo Data alongside other optimizations available as part of the RayTurbo Train and RayTurbo Serve libraries. ‍ Budget and Cost Dashboards: Set and track budgets at the organization or project-level and receive alerts when spending limits are exceeded. ‍ SLA-Backed Support. Get guaranteed response times from Ray experts to ensure your production AI workloads stay up and running with paid support plans ....

Excerpt shown — open the source for the full document.

Notability

notability 6.0/10

Substantive partnership announcement for managed AI services