21 Commits

Author SHA1 Message Date
93a3891178 add readme 2024-07-20 23:28:27 +02:00
8a2ef81d17 add copyright header 2024-07-20 22:57:16 +02:00
f035d51e42 add gpl3 text 2024-07-20 22:54:11 +02:00
5dcbfb1cce add example script 2024-07-20 21:48:17 +02:00
c38ac65d5b various optimizations 2024-07-20 21:47:18 +02:00
2f35689355 Add chat datamodules 2024-07-20 21:46:34 +02:00
0b39ba0843 Fix group_texts not grouping texts to a single length when the number of samples is less than the number of threads used 2024-05-08 22:37:42 +02:00
bc5321cb33 fix automatic train test spliting 2024-05-08 22:07:11 +02:00
Carl Philipp Klemm
65482b55a6 actually print eval response 2024-05-07 15:27:59 +02:00
Carl Philipp Klemm
2ee4f294af fix evaluate() invocation 2024-05-07 15:10:02 +02:00
Carl Philipp Klemm
ce2ada2617 remove stray breakpoint, fix mypy warning 2024-05-07 15:05:22 +02:00
Carl Philipp Klemm
68f748e99e Fix mypy warnings 2024-05-07 14:53:28 +02:00
a74ef976e4 add support for huggingfacehub datasets and for specificying a prompt for eval 2024-05-07 00:23:12 +02:00
8abea9ef89 fix checkDistance 2024-04-08 00:33:02 +02:00
6b38cfabf8 imlement 8bit quantization 2024-04-07 20:52:15 +02:00
c33964371c Inactive parameter quanitzation support 2024-04-07 19:15:42 +02:00
3fa1fc254f add qunatized linear, refactor model for it soon to be addition 2024-03-23 21:38:27 +01:00
38a7f7cfc4 add gpu memory rebalanceing 2024-03-17 22:54:33 +01:00
5acb6809ed wip refactor 2024-03-13 19:45:52 +01:00
11ea9eeaa7 working full training 2024-03-09 10:03:37 +01:00
7a47fcdcc0 Inital commit 2024-03-06 17:50:40 +01:00