Commits · f9987d03c08b54ab59e509f9904f73447b6cff0c · xuchen / Fairseq-S2T

19 Jul, 2022 1 commit
- update the shell scripts · a598692d
  xuchen committed 2 years ago
  
  a598692d Browse File
24 May, 2022 1 commit

I optimized the implementation of S2T. · 380d7794

It must be said that some problems still confuse me:
1. Whether to scale in the input layer (I try to replace it with layer specification);
2. The detailed setting of weight sharing between output projection matrix and embedding matrix in the adapter (I notice that inconsistent variance will lead to bad results);
3. The biggest confusion is that the variance increases with the calculation layer by layer (I am not sure if this phenomenon is reasonable, I will compare the behavior on the latest code).
Finally, the detailed implementation is so important to the final performance, even if it is a subtle difference.

committed 2 years ago

380d7794 Browse File

13 May, 2022 1 commit
- fix the bugs · 03076942
  xuchen committed 2 years ago
  
  03076942 Browse File
30 Mar, 2022 1 commit
- fix the bugs and optimize the code · 8f084189
  xuchen committed 3 years ago
  
  8f084189 Browse File
13 Mar, 2022 1 commit
- up-sampling the representation for ctc calculation · b970c7df
  xuchen committed 3 years ago
  
  b970c7df Browse Directory
10 Mar, 2022 2 commits
- fix the bugs · 8b50c392
  xuchen committed 3 years ago
  
  8b50c392 Browse Directory
- fix the bug of circular import · 244e506e
  xuchen committed 3 years ago
  
  244e506e Browse Directory