Tri Dao
|
ea38d3d261
|
Fix race condition in backward pass (smem_dq)
|
2022-06-25 18:02:30 -07:00 |
|
Tri Dao
|
5d07483bbc
|
Refactor Gmem code to store q, k, v pointers separately
|
2022-06-12 16:37:32 -07:00 |
|
Tri Dao
|
d3e6440958
|
Implement bwd for head dim 128
|
2022-06-11 17:52:36 -07:00 |
|
Tri Dao
|
d380e87fb6
|
Don't use Smem_dp_sum in backward pass
To reduce smem usage for SM75
|
2022-06-04 16:01:36 -07:00 |
|
Tri Dao
|
14dc326e59
|
Use Cutlass gemm as WarpMma
|
2022-06-02 10:33:32 -07:00 |
|
Tri Dao
|
9dbc491aa5
|
Rename, add benchmarking script
|
2022-05-26 13:57:38 -07:00 |
|