Efficient Processing of Deep Neural Networks. Vivienne Sze

Чтение книги онлайн.

Читать онлайн книгу Efficient Processing of Deep Neural Networks - Vivienne Sze страница 17

Efficient Processing of Deep Neural Networks - Vivienne Sze Synthesis Lectures on Computer Architecture

Скачать книгу

requirements, the number of datasets continues to grow at a rapid pace.

       PART II

       Design of Hardware for Processing DNNs

      CHAPTER 3

       Key Metrics and Design Objectives

      Over the past few years, there has been a significant amount of research on efficient processing of DNNs. Accordingly, it is important to discuss the key metrics that one should consider when comparing and evaluating the strengths and weaknesses of different designs and proposed techniques and that should be incorporated into design considerations. While efficiency is often only associated with the number of operations per second per Watt (e.g., floating-point operations per second per Watt as FLOPS/W or tera-operations per second per Watt as TOPS/W), it is actually composed of many more metrics including accuracy, throughput, latency, energy consumption, power consumption, cost, flexibility, and scalability. Reporting a comprehensive set of these metrics is important in order to provide a complete picture of the trade-offs made by a proposed design or technique.

      In this chapter, we will

      • discuss the importance of each of these metrics;

      • breakdown the factors that affect each metric. When feasible, present equations that describe the relationship between the factors and the metrics;

      • describe how these metrics can be incorporated into design considerations for both the DNN hardware and the DNN model (i.e., workload); and

      •

Скачать книгу