Site is undergoing maintenance! Please check back shortly

Deploying Transformers at Scale

Transformer networks have taken the NLP world by storm, but the sheer size of these networks presents new challenges for deployment, such as how to provide acceptable latency and unit economics.