Summary:
HOW TO OPTIMIZE GEMM: an introduction to some common optimization approaches. Reference: https://github.com/flame/how-to-optimize-gemm/wiki baseline /* Create macros so that the matrices are stored in co...
Summary:
Problem 1: How many bytes is the program? For the above x86 assembly code, how many bytes of instructions need to be fetched if x = 0x01020304 and n = 5...
Summary:
The report as finished the first time. 3.4 Note how the mix of different types of instructions varies between benchmarks. R...