Efficient Inference
Model Compression, Efficient Inference
Introduction
In existing research, efficient inference (often discussed under the umbrella of model compression) can be achieved through several families of methods:
- network pruning & sparse neural networks
- quantization
- neural architecture search
- knowledge distillation
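To make one of these methods concrete, below is a minimal sketch of quantization, assuming a simple symmetric (per-tensor) linear scheme mapping float weights to signed 8-bit integers; the function names are illustrative, not from any particular library:

```python
import numpy as np

def quantize_symmetric(x, num_bits=8):
    """Symmetric linear quantization: map floats to signed num_bits integers."""
    qmax = 2 ** (num_bits - 1) - 1          # 127 for int8
    scale = np.abs(x).max() / qmax          # width of one quantization step
    q = np.round(x / scale).clip(-qmax, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximation of the original floats."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)   # toy weight matrix
q, s = quantize_symmetric(w)
w_hat = dequantize(q, s)
# rounding error is at most half a quantization step
assert np.abs(w - w_hat).max() <= s / 2 + 1e-7
```

Storing `q` instead of `w` cuts memory 4x (int8 vs float32), and integer matrix multiplies are typically faster on hardware with int8 support; real frameworks add per-channel scales and calibration on top of this idea.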
Methods for efficient training include:
- gradient compression
- on-device training
- federated learning
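As an illustration of gradient compression, here is a minimal sketch of top-k sparsification, one common approach in which each worker transmits only the k largest-magnitude gradient entries (indices plus values) instead of the full dense tensor; the function names are hypothetical:

```python
import numpy as np

def topk_compress(grad, k):
    """Keep only the k largest-magnitude entries of a gradient tensor."""
    flat = grad.ravel()
    idx = np.argpartition(np.abs(flat), -k)[-k:]   # indices of top-k magnitudes
    return idx, flat[idx]                          # transmit indices + values only

def topk_decompress(idx, values, shape):
    """Rebuild a dense gradient, zero everywhere except the kept entries."""
    flat = np.zeros(int(np.prod(shape)), dtype=values.dtype)
    flat[idx] = values
    return flat.reshape(shape)

g = np.random.default_rng(1).standard_normal((8, 8)).astype(np.float32)
idx, vals = topk_compress(g, k=6)
g_sparse = topk_decompress(idx, vals, g.shape)
# only k entries survive; all of them match the original gradient
assert np.count_nonzero(g_sparse) == 6
```

Practical systems pair this with error feedback (accumulating the dropped entries locally and adding them to the next step's gradient) so that the discarded information is not lost permanently.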