- 18 Mar, 2021 1 commit
-
-
xiaotong committed
-
- 13 Mar, 2021 3 commits
-
-
1. Add mutex when operating the memory pool. 2. Support CMake 3.19 and fix some CMake bugs. Note that CUDA_ROOT variable in CMake is modified as CUDA_TOOLKIT_ROOT. You can find this update in the README. 3. Fix some bugs on Ubuntu.
liyinqiao committed -
1. Add mutex when operating the memory pool. 2. Support CMake 3.19 and fix some CMake bugs. Note that CUDA_ROOT variable in CMake is modified as CUDA_TOOLKIT_ROOT. You can find this update in the README. 3. Fix some bugs on Ubuntu.
liyinqiao committed -
liyinqiao committed
-
- 08 Mar, 2021 1 commit
-
-
xiaotong committed
-
- 05 Mar, 2021 1 commit
-
-
xiaotong committed
-
- 28 Feb, 2021 1 commit
-
-
xiaotong committed
-
- 23 Feb, 2021 2 commits
- 21 Feb, 2021 2 commits
- 06 Feb, 2021 3 commits
- 15 Dec, 2020 1 commit
-
-
1. Support Inplace way for some functions. 2. Add new flexible constructor for XList. 3. Bug fixed.
liyinqiao committed
-
- 25 Sep, 2020 1 commit
-
-
Fix the minor bugs in IsSameShaped.
liyinqiao committed
-
- 14 Sep, 2020 1 commit
-
-
Find some fp16 bugs during decoding NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.
liyinqiao committed
-
- 12 Sep, 2020 1 commit
-
-
Find some bugs during training NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.
liyinqiao committed
-
- 10 Sep, 2020 1 commit
-
-
Fix the __half bugs in SetData.
liyinqiao committed
-
- 07 Sep, 2020 1 commit
-
-
1. Fix the bugs in backward process. 2. Support the float16 class. 3. Fix bugs. 4. Clean the codes. 5. Remove the makefile.
liyinqiao committed
-
- 02 Sep, 2020 1 commit
-
-
Merge with the branch of xuchen (NOT update the float16, this needs code review) and fix the bugs in Gather function. 1. Support Reciprocal fucntion. 2. Fix the safe delete bugs in XDevice. 3. Support new API to convert the data type of tensor. 4. Support to show the memory usage of buffer memory. 5. Fix minor errors.
liyinqiao committed
-
- 27 Aug, 2020 1 commit
-
-
1. Try to fix the bugs in destroy the stream of XDevice (Uncheck). 2. Fix the bug of memory leak in ReduceSumAll function. 2. Adjust the directory structure. 3. Fix the minor errors.
liyinqiao committed
-
- 07 Aug, 2020 1 commit
-
-
Fix the bugs in Sum and Sub when MKL or OpenBlas is used.
liyinqiao committed
-
- 06 Aug, 2020 1 commit
-
-
1. Update the codes of machine translation sample. The current version is the same with NiuTrans.NMT. 2. Update the XList class. 3. Bugs fix.
liyinqiao committed
-
- 29 Apr, 2020 1 commit
-
-
1. Offer macro to set whether use the half precision in cuda codes. 2. Update the manuals.
liyinqiao committed
-
- 26 Apr, 2020 2 commits
- 20 Apr, 2020 1 commit
-
-
Fix the bugs in ReduceMax and its related functions.
liyinqiao committed
-
- 18 Apr, 2020 2 commits
- 17 Apr, 2020 1 commit
-
-
Support fp16 data type for more operations and fix the minor errors. (Don't use this! It's an incomplete version)
liyinqiao committed
-
- 11 Apr, 2020 1 commit
-
-
1. Fix the mistakes in manual. 2. Fix the minor errors in Stack and LogSoftmax functions.
liyinqiao committed
-
- 09 Apr, 2020 1 commit
-
-
1. Fix the mistakes in manual. 2. Fix the minor errors in Normalize and ScaleAndShift functions.
liyinqiao committed
-
- 31 Mar, 2020 1 commit
-
-
1. Fix the mistakes in manual. By the way, I have to say there are so many mistakes in the manual. I'm shocked it has been checked a lot of times, but why they are still be there. No one care about that? Really??? 2. Clean the codes.
liyinqiao committed
-
- 26 Mar, 2020 1 commit
-
-
1. Call the Scale or Shift functions when the parameter shift is 0.0F or scale is 1.0F. 2. Add the unit test for ScaleAndShift for the cases which parameter shift is 0.0F or scale is 1.0F.
liyinqiao committed
-
- 25 Mar, 2020 4 commits
-
-
Optimize the MultiplyMe and DivMe functions when operate the scalar tensors.
liyinqiao committed -
1. Fix the bug in DivMe function which cannot handle the scalar tensor and broadcast case. 2. Clean the codes. 3. Fix minor errors.
liyinqiao committed -
1. Fix the bug in SubMe function which cannot handle the scalar tensor and broadcast case. 2. Clean the codes.
liyinqiao committed -
Fix the bug in SumMe function which cannot handle the scalar tensor and broadcast case.
liyinqiao committed
-
- 24 Mar, 2020 1 commit
-
-
1. Fix the bug in MultiplyMe function which cannot handle the scalar tensor and broadcast case. 2. Add broadcast multiply unit test.
liyinqiao committed
-