Deploying Transformers at Scale
Transformer networks have taken the NLP world by storm, but their sheer size presents new challenges for deployment, such as how to achieve acceptable latency and unit economics.