Optimized single-core layernorm
Changed files:
- dnn_kernels/benchmarks/layernorm-raw-fp64-sdma-ssr-frep-50-50.c (6 additions, 0 deletions)
- dnn_kernels/benchmarks/templates/layernorm_raw.tpl.c (93 additions, 0 deletions)
- dnn_kernels/include/layernorm.h (10 additions, 0 deletions)
- dnn_kernels/src/layernorm.c (274 additions, 4 deletions)
- dnn_kernels/src/matmul.c (2 additions, 2 deletions)