common_attention.py
32.7 KB
revise some transformer decoding configuration, support relative position representation training, add transformer_rpr_base · d6b1aadf
libei committed