Why Volcano
Unified Scheduling
Queue Management
Heterogeneous Device Support
Network Topology Aware Scheduling
Multi-cluster Scheduling
Online and Offline Workloads Colocation
Load Aware Descheduling
Multiple Scheduling Policies
Cloud native batch scheduling system for compute-intensive workloads
Unified Scheduling
Queue Management
Heterogeneous Device Support
Network Topology Aware Scheduling
Multi-cluster Scheduling
Online and Offline Workloads Colocation
Load Aware Descheduling
Multiple Scheduling Policies
Seamlessly integrate with mainstream computing frameworks for AI, big data, and scientific computing

Apache Spark™ is a unified analytics engine for large-scale data processing

Apache Flink is a framework and distributed processing engine for stateful computations over unbounded and bounded data streams

An end-to-end open source machine learning platform

An open source machine learning framework that accelerates the path from research prototypes to production deployment

Argo Workflows is an open source container-native workflow engine for orchestrating parallel jobs on Kubernetes. Argo Workflows is implemented as a Kubernetes CRD.

The all-scenario deep learning framework developed by Huawei.

Ray is a high-performance distributed computing framework that supports machine learning, deep learning, and distributed applications.

The Kubeflow project is dedicated to making deployments of machine learning (ML) workflows on Kubernetes simple, portable, and scalable.

The Open MPI Project is an open source Message Passing Interface implementation that is developed and maintained by a consortium of academic, research, and industry partners.

Horovod is a distributed deep learning training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

A truly open source deep learning framework suited for flexible research prototyping and production.

PaddlePaddle is an open source deep learning platform derived from industrial practice initiated by Baidu.
Volcano v1.11.0 introduces enhanced GPU scheduling capabilities, improved job management, and better integration with AI/ML frameworks.
Learn how Volcano optimizes resource allocation and scheduling for distributed AI/ML workloads, significantly improving training and inference performance.
Volcano v1.10.0 has been released with new features including enhanced queue management, improved job scheduling, and better integration with Kubernetes.