2019-liunian Harold Li-Efficient Contextual Representation Learning Without Softmax Layer.pdf 226 KB