MoE Inference

latest

Benchmarking

Benchmarking

Design and Implementation

Design and Implementation

MoE Inference

Benchmarking
Existing frameworks
Edit on GitHub

Existing frameworks

We have documentation on the other inference frameworks we are benchmarking

Fairseq MoE
DeepSpeed-MoE
FasterTransformer MoE

Previous Next

© Copyright 2022, the authors. Revision 46cc660a.

Built with Sphinx using a theme provided by Read the Docs.