- 07 Sep, 2022 1 commit
xuchen committed
- 06 Sep, 2022 2 commits
- 04 Sep, 2022 3 commits
- 02 Sep, 2022 1 commit
xuchen committed
- 31 Aug, 2022 1 commit
I also found some bugs in the dual architecture.
xuchen committed
- 30 Aug, 2022 1 commit
1. Use a different mixup probability for each sample; 2. support an arbitrary mixup ratio; 3. add a cross-entropy mixup consistency loss.
xuchen committed
- 26 Aug, 2022 1 commit
xuchen committed
- 22 Aug, 2022 1 commit
xuchen committed
- 27 Jul, 2022 1 commit
xuchen committed
- 26 Jul, 2022 1 commit
xuchen committed
- 25 Jul, 2022 3 commits
- 19 Jul, 2022 3 commits
- 12 Jul, 2022 2 commits
- 01 Jun, 2022 1 commit
xuchen committed
- 27 May, 2022 1 commit
Yeah, it is better.
xuchen committed
- 25 May, 2022 1 commit
xuchen committed
- 24 May, 2022 1 commit
Some problems still confuse me: 1. whether to scale in the input layer (I tried replacing it with layer specification); 2. the detailed setting of weight sharing between the output projection matrix and the embedding matrix in the adapter (I noticed that inconsistent variance leads to bad results); 3. most confusing of all, the variance increases layer by layer through the computation (I am not sure whether this is reasonable; I will compare the behavior against the latest code). Finally, implementation details matter a great deal to the final performance, even when the difference is subtle.
xuchen committed
- 13 May, 2022 1 commit
xuchen committed
- 12 May, 2022 1 commit
Big update! I integrated the latest shell script updates, optimized the implementation of sae, and fixed some bugs.
xuchen committed
- 06 May, 2022 1 commit
xuchen committed
- 06 Apr, 2022 1 commit
xuchen committed
- 30 Mar, 2022 1 commit
xuchen committed
- 18 Mar, 2022 1 commit
xuchen committed
- 13 Mar, 2022 1 commit
xuchen committed
- 10 Mar, 2022 3 commits
- 09 Mar, 2022 1 commit
xuchen committed
- 04 Mar, 2022 1 commit
xuchen committed
- 01 Mar, 2022 1 commit
xuchen committed
- 28 Feb, 2022 1 commit
xuchen committed
- 24 Feb, 2022 1 commit
xuchen committed