Language Modelling via Learning to Rank

Language Modelling via Learning to Rank

In the previous episode of Private AI’s ML Speaker Series, Patricia Thaine (CEO of Private AI) sat down with Arvid Frydenlund (PhD candidate at the University of Toronto in the Computer Science Department and Vector Institute) to discuss his latest paper Language...
MLOps & Machine Learning Deployment at Scale

MLOps & Machine Learning Deployment at Scale

In the latest episode of Private AI’s ML Speaker Series, Patricia Thaine (CEO of Private AI) sits down to chat about MLOps and Machine Learning Deployment at Scale with Luke de Oliveira from Twilio. Luke de Oliveira is the Director of Machine Learning at Twilio,...
Deploying Transformers at Scale

Deploying Transformers at Scale

Key takeways ONNXRuntime is the best inference package for Transformer networks; Nvidia Triton, together with ONNXRuntime is the best solution for GPU inference; Optimization matters. It’s quite easy to unlock a >10X performance gain in 2022. About this...