Cerebro
The Vision
Deep learning (DL) is revolutionizing data analytics applications across many domains. But making effective use of DL is often a painful empirical process, since accuracy is tied to the data representation, neural architecture, and hyperparameter settings. This process, called model selection, is a bottleneck to democratizing DL due to both its resource costs and the user time it consumes.
Cerebro is a first-of-its-kind platform that mitigates this bottleneck for DL model selection at scale. It raises model building throughput without raising resource costs, while ensuring accuracy, reproducibility, and generality across multiple DL frameworks (PyTorch and TensorFlow). Our target setting is small clusters, which cover the vast majority of DL use cases in practice.
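To make the model selection bottleneck concrete, here is a minimal, hypothetical sketch (the grid values below are illustrative, not from Cerebro) of how quickly candidate configurations multiply once you vary architecture and hyperparameters together:

```python
from itertools import product

# Hypothetical model-selection grid: every combination is a separate
# training run that must be executed and evaluated. Even this small
# grid already yields 3 * 2 * 3 = 18 full training runs.
learning_rates = [1e-2, 1e-3, 1e-4]
batch_sizes = [32, 256]
architectures = ["resnet18", "resnet50", "mobilenet_v2"]

configs = [
    {"lr": lr, "batch_size": bs, "arch": arch}
    for lr, bs, arch in product(learning_rates, batch_sizes, architectures)
]
print(len(configs))  # 18 candidate models to train and compare
```

Executing such grids naively, one run per GPU or one run at a time, is what drives up both resource costs and user wait time; Cerebro's goal is to execute them far more efficiently.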
Cerebro is open source under the Apache License v2.0.
Components and Capabilities
Deep Learning
Cerebro will be the first DL platform to offer unified execution support with holistic resource optimization across all axes of scalability: model size, dataset size, example size, number of tasks, and transfer learning. Its carefully layered architecture decouples the specification of model building tasks (e.g., in Keras APIs or AutoML heuristics) from the execution backend, making it portable across multiple backends: Kubernetes, Spark, Dask, Greenplum, Ray, and soon, cloud-native IaaS.
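As an illustration of this decoupling, here is a hedged sketch loosely following the Keras-style API from Cerebro's documentation for the Spark backend. Names such as `SparkBackend`, `SparkEstimator`, `GridSearch`, and `hp_choice` follow the project's docs, but treat the exact module paths, signatures, and the toy model/data below as assumptions that may have drifted from the current release:

```python
# Sketch only: module paths and signatures follow Cerebro's docs for the
# Spark backend, but verify against the installed version before use.
import tensorflow as tf
from pyspark.sql import SparkSession
from cerebro.backend import SparkBackend
from cerebro.keras import SparkEstimator
from cerebro.storage import LocalStore
from cerebro.tune import GridSearch, hp_choice

spark = SparkSession.builder.appName("cerebro-demo").getOrCreate()

# The backend and storage layers are pluggable; only these two lines
# would change to target, say, a different cluster manager or filesystem.
backend = SparkBackend(spark_context=spark.sparkContext, num_workers=4)
store = LocalStore(prefix_path="/tmp/cerebro_experiments")  # illustrative path

def estimator_gen_fn(params):
    # Builds one Keras model per hyperparameter configuration drawn
    # from the search space below (toy architecture for illustration).
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(64, activation="relu", input_shape=(100,)),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    optimizer = tf.keras.optimizers.Adam(learning_rate=params["lr"])
    return SparkEstimator(
        model=model,
        optimizer=optimizer,
        loss="binary_crossentropy",
        metrics=["acc"],
        batch_size=params["batch_size"],
    )

search_space = {
    "lr": hp_choice([1e-3, 1e-4]),
    "batch_size": hp_choice([32, 256]),
}

grid_search = GridSearch(
    backend, store, estimator_gen_fn, search_space,
    num_epochs=5, evaluation_metric="loss",
    label_columns=["label"], feature_columns=["features"],
)
best_model = grid_search.fit(train_df)  # train_df: a prepared Spark DataFrame
```

Note how the model building task (the Keras estimator and the search space) is specified independently of the backend object; that separation is what makes the same workload portable across Spark, Dask, Ray, and the other supported backends.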
Overview Resources
Some Damaging Delusions of Deep Learning Practice (and How to Avoid Them)
Arun Kumar, Supun Nakandala, and Yuhao Zhang
KDD 2021 Deep Learning Day | Extended Abstract PDF | Talk slides | Talk video
Cerebro: A Layered Data Platform for Scalable Deep Learning
Arun Kumar, Supun Nakandala, Yuhao Zhang, Side Li, Advitya Gemawat, and Kabir Nagrecha
CIDR 2021 (Vision paper) | Paper PDF and BibTeX | Talk video
Our Sponsors
This project has been supported in part by a Hellman Fellowship, the NIDDK of the NIH under award number R01DK114945, an NSF CAREER Award under award number 1942724, and gifts from VMware.