Commits · f9987d03c08b54ab59e509f9904f73447b6cff0c · xuchen / Fairseq-S2T

08 Oct, 2022 1 commit
- report bleu and wer during validation · f9987d03
  xuchen committed Oct 08, 2022
  
  f9987d03 Browse Files
15 Sep, 2022 2 commits
- update the dual arch · 8f45faa2
  xuchen committed Sep 15, 2022
  
  8f45faa2 Browse Files
- update the mixup implementation · cbeb5521
  xuchen committed Sep 15, 2022
  
  cbeb5521 Browse Files
07 Sep, 2022 1 commit
- implement the w2v2-transformer arch · c7242ff4
  xuchen committed Sep 07, 2022
  
  c7242ff4 Browse Files
06 Sep, 2022 2 commits
- fix the bugs of mixup · afa5095d
  xuchen committed Sep 06, 2022
  
  afa5095d Browse Files
- fix the bugs of mixup and support the manifold mixup · c845197f
  xuchen committed Sep 06, 2022
  
  c845197f Browse Files
04 Sep, 2022 3 commits
- The initial implementation of dynamic encoding. · 37eaeb25
```
It only supports condensation based on the specific threshold then creates a new tensor.
We will merge it into the s2t_transformer.py finally.
```
  xuchen committed Sep 04, 2022
  37eaeb25 Browse Files
- update the shell scripts · f0d55b1f
  xuchen committed Sep 04, 2022
  
  f0d55b1f Browse Files
- fix the bugs of multibranch arch · e0da16dd
  xuchen committed Sep 04, 2022
  
  e0da16dd Browse Files
02 Sep, 2022 1 commit
- fix the bugs of multibranch arch · a2fd43e7
  xuchen committed Sep 02, 2022
  
  a2fd43e7 Browse Files
31 Aug, 2022 1 commit
- add the multibranch S2T architecture. · 47e0f6e0
```
I also find some bugs in the dual architecture.
```
  xuchen committed Aug 31, 2022
  47e0f6e0 Browse Files
30 Aug, 2022 1 commit

optimize the implementation of mixup: · 793f553a

1. using different mixup prob for each sample
2. arbitrary mixup ratio
3. cross entropy mixup consistency loss

committed Aug 30, 2022

793f553a Browse Files

26 Aug, 2022 1 commit
- Daily revision and add the consistency regularization for mixup · 444a1f46
  xuchen committed Aug 26, 2022
  
  444a1f46 Browse Files
22 Aug, 2022 1 commit
- Daily revision · cabfc4ea
  xuchen committed Aug 22, 2022
  
  cabfc4ea Browse Files
27 Jul, 2022 1 commit
- add the settings for the weight sharing of interleaved CTC · 0a70c5c5
  xuchen committed Jul 27, 2022
  
  0a70c5c5 Browse Files
26 Jul, 2022 1 commit
- fix the bugs · 21734086
  xuchen committed Jul 26, 2022
  
  21734086 Browse Files
25 Jul, 2022 3 commits
- update shell scripts · e1d3d2ed
  xuchen committed Jul 25, 2022
  
  e1d3d2ed Browse Files
- fix the bugs · de9ef921
  xuchen committed Jul 25, 2022
  
  de9ef921 Browse Files
- fix the bugs of shell scripts · b2031168
  xuchen committed Jul 25, 2022
  
  b2031168 Browse Files
19 Jul, 2022 3 commits
- update the shell scripts · 9452b069
  xuchen committed Jul 19, 2022
  
  9452b069 Browse Files
- update the shell scripts · a598692d
  xuchen committed Jul 19, 2022
  
  a598692d Browse Files
- fix some bugs · 9fe8cd1e
  xuchen committed Jul 19, 2022
  
  9fe8cd1e Browse Files
12 Jul, 2022 2 commits
- Try more settings of adapter · a201a883
  xuchen committed Jul 12, 2022
  
  a201a883 Browse Files
- enable the additional label for CTC learning · 5d84c743
  xuchen committed Jul 12, 2022
  
  5d84c743 Browse Files
01 Jun, 2022 1 commit
- optimize the information dump · e40eac14
  xuchen committed Jun 01, 2022
  
  e40eac14 Browse Files
27 May, 2022 1 commit
- I valid the results of embedding norm and no scale embedding for speech-to-text encoder. · d946bc3b
```
Yeah, it is better.
```
  xuchen committed May 27, 2022
  d946bc3b Browse Files
25 May, 2022 1 commit
- fix the bugs of sae for MT · 2de89089
  xuchen committed May 26, 2022
  
  2de89089 Browse Files
24 May, 2022 1 commit

I optimized the implementation of S2T. · 380d7794

It must be said that some problems still confuse me:
1. Whether to scale in the input layer (I try to replace it with layer specification);
2. The detailed setting of weight sharing between output projection matrix and embedding matrix in the adapter (I notice that inconsistent variance will lead to bad results);
3. The biggest confusion is that the variance increases with the calculation layer by layer (I am not sure if this phenomenon is reasonable, I will compare the behavior on the latest code).
Finally, the detailed implementation is so important to the final performance, even if it is a subtle difference.

committed May 24, 2022

380d7794 Browse Files

13 May, 2022 1 commit
- fix the bugs · 03076942
  xuchen committed May 13, 2022
  
  03076942 Browse Files
12 May, 2022 1 commit
- big update! I integrate the latest updates of shell scripts, optimize the… · 1d60b3a6
```
big update! I integrate the latest updates of shell scripts, optimize the implementation of sae and fix some bugs.
```
  xuchen committed May 12, 2022
  1d60b3a6 Browse Files
06 May, 2022 1 commit
- fix the bugs of checkpoint load and add the special kernel sizes for pds conformer · 1288e535
  xuchen committed May 06, 2022
  
  1288e535 Browse Files
06 Apr, 2022 1 commit
- fix the bugs during prepare and CTC decoding · 408e2b95
  xuchen committed Apr 06, 2022
  
  408e2b95 Browse Files
30 Mar, 2022 1 commit
- fix the bugs and optimize the code · 8f084189
  xuchen committed Mar 30, 2022
  
  8f084189 Browse Files
18 Mar, 2022 1 commit
- fix the bugs · 4f679c86
  xuchen committed Mar 18, 2022
  
  4f679c86 Browse Files
13 Mar, 2022 1 commit
- up-sampling the representation for ctc calculation · b970c7df
  xuchen committed Mar 13, 2022
  
  b970c7df Browse Files
10 Mar, 2022 3 commits
- fix the bugs · 8b50c392
  xuchen committed Mar 10, 2022
  
  8b50c392 Browse Files
- fix the bug of circular import · 244e506e
  xuchen committed Mar 10, 2022
  
  244e506e Browse Files
- implement the dual speech-to-text (need to optimize) and CTC loss for MT · 5fb50cc3
  xuchen committed Mar 10, 2022
  
  5fb50cc3 Browse Files
09 Mar, 2022 1 commit
- implement the mixup method for speech-to-text · 6cbfe851
  xuchen committed Mar 09, 2022
  
  6cbfe851 Browse Files
04 Mar, 2022 1 commit
- add target ctc · 67d8695f
  xuchen committed Mar 04, 2022
  
  67d8695f Browse Files