Skip to content

Latest commit

 

History

History
22 lines (17 loc) · 1.64 KB

README.md

File metadata and controls

22 lines (17 loc) · 1.64 KB

Section 4 - Performance Measurement & Vector Programming

Now that you've had a chance to walk through the components of compiling and running a program on the Ryzen™ AI hardware in section-3, we will start looking at how we measure performance and utilize vector programming technqiues to fully leverage the power of the AI Engines for parallel compute.

It is helpful to first examine performance measurement before we delve into vector programming in order to get a baseline for where our application performance is. There are many factors that contribute to performance including latency, throughput and power efficiency. Performance measurement is an active area of research to provide more powerful tools for users to measure the speedup of their application on AIEs. In section-4a and section-4b, we look at performance from the perspective of timers and trace. Then in section-4c, we look more closely at how to vectorize AIE kernel code.


[Prev - Section 3] [Top] [Next - Section 5]