treewide: Add NoC collective operations + MLSys experiments#99
treewide: Add NoC collective operations + MLSys experiments#99
Conversation
69ff120 to
0796dc5
Compare
d0b9142 to
2b4f86b
Compare
c0cc426 to
879d079
Compare
f4eb08d to
6a5cb8d
Compare
9180b8f to
f476df8
Compare
There was a problem hiding this comment.
Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit
verible-verilog-format
[verible-verilog-format] reported by reviewdog 🐶
Lines 79 to 80 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Lines 82 to 83 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Line 97 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Lines 495 to 508 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Lines 89 to 101 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Lines 107 to 116 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Lines 118 to 119 in f476df8
[verible-verilog-format] reported by reviewdog 🐶
Line 133 in f476df8
f476df8 to
ec0ef69
Compare
There was a problem hiding this comment.
This PR includes all the developments that were done for the MLSys paper. This includes 1) the hardware extensions required to support (performant) multicast and reduction, 2) software benchmarks and tests and 3) an experiments' framework for Picobello derived from Snitch's, used to develop the experiments for the paper.
In detail:
experimentsin PYTHONPATH, alternative to creating a proper Picobello Python package.PB_ROOT(this is IMO the best method for ensuring all simulation artifacts are collected under a different directory).src/contains potentially reusable (across 2D tile-based accelerators) sources,impl/contains a Picobello-specific implementation of the runtime/library (providing a Picobello-specific HAL and stitching together a Picobello-specific selection of the reusable sources, including Snitch's).sync) for 2D tile-based accelerators.teamAPI.barrier_benchmark.c,reduction_benchmark.canddma_multicast_v2.cbenchmarks.overlapping_barriers.c). This test used to fail when this feature was not supported, as desired.parallel_row_col_barriers.c).sn-prefix).TODOs
peakrdl-rawheaderreleasereduction_benchmark_hyperbank.c?summa_gemm.candgemm_2d.cimplementations?