optional image

Mariko Wakabayashi

‎@mwkby‎

Sr. ML Engineer

Insights Speeding up Transformer CPU inference in Google Cloud

This blog post shares optimization findings to speed up Transformer-based models’ CPU inference and improve computational demand in Google Cloud.