Commit Graph

671 Commits

Author SHA1 Message Date
Tri Dao
2800efc71f [FT] rotary_cos/sin should have batch_size dimension 2023-07-06 15:33:33 -07:00
Tri Dao
d2f4324f4c [LayerNorm] Make sure memory addresses are aligned to 16 bytes 2023-07-04 14:53:12 -07:00
Tri Dao
3a9bfd076f [FT] rotary_cos/sin should have shape (dim) instead of (seqlen, dim) 2023-07-03 09:41:04 -07:00
Tri Dao
e8a0b4acdd [Doc] Change total -> total_q 2023-07-02 17:23:52 -07:00
Tri Dao
9610114ce8 Bump to v1.0.8 2023-07-02 17:04:54 -07:00
Tri Dao
a5d8714c26 [Build] Remove pyproject.toml
I haven't found an easy way to add torch as a build dependency in
pyproject.toml.
If we add torch in pyproject.toml, for some setup it would download a
different version of Pytorch before building.
If we don't add torch, lots of users report they get error when installing.
2023-07-02 17:02:49 -07:00
Tri Dao
62e9814466 [Rotary] Make sure frequency calculation is in fp32 2023-07-02 16:39:39 -07:00
Pierce Freeman
9af165c389 Clean setup.py imports 2023-06-07 17:27:36 -07:00
Pierce Freeman
eb812c205b Remove builder project 2023-06-07 17:20:13 -07:00
Pierce Freeman
6c730dc8c6 Bump version 2023-06-07 17:07:14 -07:00
Pierce Freeman
494b2aa486 Add notes to github action workflow 2023-06-07 17:06:12 -07:00
Pierce Freeman
8d60c373e4 Add torch dependency to final build 2023-06-04 06:14:42 -07:00
Pierce Freeman
1848d0004f Exclude cuda erroring builds 2023-06-04 06:14:42 -07:00
Pierce Freeman
84009fcc66 Exclude additional disallowed matrix params 2023-06-04 06:14:42 -07:00
Pierce Freeman
ac543b0e8d Full version matrix 2023-06-04 06:14:42 -07:00
Pierce Freeman
a372e2be1b Add CUDA 11.7 2023-06-04 06:14:42 -07:00
Pierce Freeman
18e100d312 Release is actually unsupported 2023-06-02 19:01:44 -07:00
Pierce Freeman
061470ae58 echo OS version 2023-06-02 18:59:09 -07:00
Pierce Freeman
2dadfdbbca Temp disable deploy 2023-06-02 18:48:02 -07:00
Pierce Freeman
a682252be7 OS version build numbers 2023-06-02 18:47:25 -07:00
Pierce Freeman
cd0c169eee Restore full build matrix 2023-06-02 18:28:00 -07:00
Pierce Freeman
ea2ed88623 Refactor and clean of setup.py 2023-06-02 18:25:07 -07:00
Pierce Freeman
9fc9820a5b Strip cuda name from torch version 2023-06-02 18:25:07 -07:00
Pierce Freeman
5e4699782a Allow fallback install 2023-06-02 18:25:07 -07:00
Pierce Freeman
dab99053e4 Bump build to use 116 for testing 2023-06-02 18:25:07 -07:00
Pierce Freeman
0e7769c813 Guessing wheel URL 2023-06-02 18:25:07 -07:00
Pierce Freeman
e1faefce9d Raise cuda error on build 2023-06-02 18:25:07 -07:00
Pierce Freeman
add4f0bc42 Scaffolding for wheel prototype 2023-06-02 18:25:07 -07:00
Tri Dao
9818f85fee
Merge pull request #255 from beginlner/main
Fix a bug
2023-06-02 02:23:25 -04:00
ljss
8e44c0eefb
Fix a bug 2023-06-02 13:46:19 +08:00
Tri Dao
85b51d61ee Bump version to 1.0.7 2023-05-30 14:18:44 -07:00
Tri Dao
27f8f890df [FusedDense] Allocate lt_workspace on input device 2023-05-30 14:17:26 -07:00
Tri Dao
48bc6eacd6 [Gen] Add rotary base as an argument to FT attention kernel 2023-05-30 13:38:34 -07:00
Tri Dao
7c766b1bbc
Merge pull request #243 from ksivaman/bump_version_to_v1_0_6
bump to v1.0.6
2023-05-26 22:48:08 -04:00
Kirthi Shankar Sivamani
dd9c3a1fc2 bump to v1.0.6
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-05-26 17:44:10 -07:00
Tri Dao
ce68305c84 Update installation instruction 2023-05-25 16:52:52 -07:00
Tri Dao
cf4f0a39f3
Merge pull request #241 from ksivaman/fix_compilation_time
Fix compilation time
2023-05-25 18:34:41 -04:00
Kirthi Shankar Sivamani
6d45d0bd6c Re-add ninja
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-05-25 21:22:50 +00:00
Kirthi Shankar Sivamani
852bc40b8c Remove torch from pyproject.toml
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-05-25 19:12:22 +00:00
Kirthi Shankar Sivamani
c1d117c2d0 Remove ninja from pyproject.toml
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-05-25 19:12:00 +00:00
Tri Dao
f0c40b7ddb Recommend Nvidia's Pytorch container 2023-05-19 09:41:14 -07:00
Tri Dao
3cad2ab35d
Merge pull request #229 from maxhgerlach/local-version
Allow adding an optional local version to the package version
2023-05-19 11:43:24 -04:00
Max H. Gerlach
31f78a9814 Allow adding an optional local version to the package version 2023-05-19 17:27:41 +02:00
Tri Dao
40a25c8ee7 Update roadmap 2023-05-17 08:32:26 -07:00
Tri Dao
eff9fe6b80 Add ninja to pyproject.toml build-system, bump to v1.0.5 2023-05-12 14:20:31 -07:00
Tri Dao
36d0a19f1e
Merge pull request #193 from anthonyhu/pyproject-build
Use pyproject.toml to specify build dependencies
2023-05-11 21:26:28 -04:00
Tri Dao
5bf7f57d47
Merge pull request #202 from fedebotu/main
[BugFix] avoid bug on ImportError
2023-05-06 14:15:02 -04:00
Federico Berto
69f5f7d0a2 [BugFix] cannot unpack non-iterable NoneType object 2023-05-07 03:07:44 +09:00
Federico Berto
3889ba168b [BugFix] cannot unpack non-iterable NoneType object 2023-05-07 03:07:30 +09:00
Tri Dao
a9a4b4e4f2 [LLaMa] Fix last norm layer to use RMSNorm instead of LayerNorm 2023-05-04 23:39:43 -07:00