Apparate: Early-Exit Models for ML Latency and Throughput Optimization - Additional Related Work
:::info
Authors:
(1) Yinwei Dai, Princeton University (Equal contributions);
(2) Rui Pan, Princeton University (Equal contributions);
(3) Anand Iyer, Georgia Institute of Technology;
(4) Ravi Netraval...