| Tri Dao | 59594f2a67 | Bump to v2.6.2 | 2024-07-23 02:30:05 -07:00 |
| Tri Dao | 7551202cb2 | Bump to v2.6.1 | 2024-07-11 08:28:32 -07:00 |
| Tri Dao | 116b05f9b0 | [CI] Compile with pytorch 2.4.0.dev20240514 | 2024-07-11 02:53:30 -07:00 |
| Tri Dao | da11d1b853 | Bump v2.6.0 | 2024-07-10 21:34:58 -07:00 |
| Tri Dao | e2e4333c95 | Limit to MAX_JOBS=1 with CUDA 12.2 | 2024-05-26 15:35:49 -07:00 |
| Tri Dao | ce73503578 | Bump to 2.5.9 | 2024-05-26 14:02:11 -07:00 |
| Tri Dao | 9a11f440d3 | Bump to v2.5.8 | 2024-04-26 10:54:52 -07:00 |
| Tri Dao | 85881f547f | Bump to v2.5.7 | 2024-04-07 20:13:05 -07:00 |
| Tri Dao | 6c9e60de56 | Bump to v2.5.6 | 2024-03-01 22:09:56 -08:00 |
| Tri Dao | 87a1277653 | Bump to v2.5.5 | 2024-02-21 15:58:23 -08:00 |
| Tri Dao | 43950dda45 | Bump to v2.5.4 | 2024-02-20 16:30:16 -08:00 |
| Tri Dao | 5cdabc2809 | Bump to v2.5.3 | 2024-02-10 01:06:27 -08:00 |
| Tri Dao | 61a7772479 | Bump to v2.5.2 | 2024-01-31 02:44:24 -08:00 |
| Tri Dao | dc72d960a7 | [CI] Install torch 2.3 using index | 2024-01-30 14:32:29 -08:00 |
| Tri Dao | daf37a9d8a | Bump to v2.5.1 | 2024-01-29 21:03:38 -08:00 |
| Tri Dao | 197f2083a2 | Bump to v2.5.0 | 2024-01-22 23:40:10 -08:00 |
| Tri Dao | e43a4ceaab | [CI] Fix CUDA 12.2.2 compilation | 2024-01-21 17:23:39 -08:00 |
| Tri Dao | f9d7376126 | Bump to v2.4.3 | 2024-01-21 17:14:37 -08:00 |
| Tri Dao | 1a2c3e8c25 | Bump to v2.4.2 | 2023-12-25 16:28:57 -08:00 |
| Tri Dao | f844852485 | Bump to v2.4.1 | 2023-12-23 21:00:39 -08:00 |
| Tri Dao | 68f178aa4b | [CI] Don't compile for python 3.7 pytorch 2.2 | 2023-12-22 10:10:02 -08:00 |
| Tri Dao | 7316277303 | Bump to v2.4.0 | 2023-12-22 00:09:53 -08:00 |
| Tri Dao | 92dd5703ec | Bump to v2.3.6 | 2023-11-27 16:23:39 -08:00 |
| Tri Dao | 23b77c8148 | Bump to v2.3.5 | 2023-11-26 19:08:28 -08:00 |
| Tri Dao | 2c3baba4a6 | Bump to v2.3.4 | 2023-11-19 23:21:31 -08:00 |
| Tri Dao | 83aef842be | Bump to v2.3.3 | 2023-10-24 00:24:07 -07:00 |
| Tri Dao | 7f31e7c16a | Bump to v2.3.2 | 2023-10-08 17:21:29 -07:00 |
| Tri Dao | 5e525a8dc8 | [CI] Use official Pytorch 2.1, add CUDA 11.8 for Pytorch 2.1 | 2023-10-03 22:20:30 -07:00 |
| Tri Dao | 21c3b0d8f6 | Bump to v2.3.1 | 2023-10-03 19:56:45 -07:00 |
| Tri Dao | 601b4dc48d | Bump to v2.3.0 | 2023-09-26 22:08:29 -07:00 |
| Tri Dao | 0a1d03c7ea | Bump to v2.2.5 | 2023-09-24 00:54:03 -07:00 |
| Tri Dao | bff3147175 | Re-enable compilation for Hopper | 2023-09-21 23:55:25 -07:00 |
| Tri Dao | 229080b9d2 | Bump to v2.2.4 | 2023-09-20 23:39:38 -07:00 |
| Tri Dao | 799f56fa90 | Don't compile for Pytorch 2.1 on CUDA 12.1 due to nvcc segfaults | 2023-09-17 22:15:38 -07:00 |
| Tri Dao | c984208ddb | Set block size to 64 x 64 for kvcache to avoid nvcc segfaults | 2023-09-17 16:14:58 -07:00 |
| Tri Dao | 8c8b4d36e1 | Bump to v2.2.3 | 2023-09-16 01:47:01 -07:00 |
| Tri Dao | 08c295c043 | Bump to v2.2.2 | 2023-09-10 23:48:12 -07:00 |
| Tri Dao | a1576ad1e8 | Bump to v2.2.1 | 2023-09-06 02:19:55 -07:00 |
| Tri Dao | 6d673cd961 | Bump to v2.2.0 | 2023-09-05 11:34:13 -07:00 |
| Tri Dao | 37c6e05406 | Implement flash_attn_with_kvcache | 2023-09-04 00:11:44 -07:00 |
| Tri Dao | 4976650f74 | Set single threaded compilation for CUDA 12.2 so CI doesn't OOM | 2023-09-03 23:42:55 -07:00 |
| Tri Dao | 6a89b2f121 | Remove constexpr in launch template to fix CI compilation | 2023-09-03 22:59:41 -07:00 |
| Tri Dao | 97ba7a62e9 | Try switching back to Cutlass 3.2.0 | 2023-09-03 22:45:35 -07:00 |
| Tri Dao | 1dc1b6c8f2 | Bump to v2.1.2 | 2023-09-03 22:23:05 -07:00 |
| Tri Dao | 757058d4d3 | Update Cutlass to v3.2.0 | 2023-08-27 23:47:28 -07:00 |
| Tri Dao | 9e5e8bc91e | Change causal mask to be aligned to bottom-right instead of top-left | 2023-08-24 23:41:07 -07:00 |
| Tri Dao | 6711b3bc40 | Bump version to 2.0.9 | 2023-08-22 00:21:14 -07:00 |
| Tri Dao | f1a73d0740 | Run isort and black on python files | 2023-08-18 14:22:11 -07:00 |
| Tri Dao | c65b5106ac | Fix Bwd NaN for varlen when seqlen_q >> seqlen_k and causal | 2023-08-16 15:12:36 -07:00 |
| Tri Dao | c60851a825 | Bump to v2.0.7 | 2023-08-14 14:55:35 -07:00 |