fix a bug in the softmax function. It executes backward comptuation incorrectly (in the condition of cpu computation and NOLOSS). I found this bug by chance, then I spent one night confirming that something must be wrong and I spent one night finding the location of bug. Finally, I fixed this bug in five minutes.