- 31 Aug, 2020 1 commit
-
-
Highlight that users must install MKL, OpenBLAS, and CUDA themselves.
liyinqiao committed
-
- 30 Aug, 2020 1 commit
-
-
1. Support generating the dynamic link library on GPU. 2. Support compiling the project with MKL and OpenBLAS. 3. Update the manual to cover the new CMake file. 4. Fix minor errors.
liyinqiao committed
-
- 29 Aug, 2020 2 commits
- 27 Aug, 2020 1 commit
-
-
1. Try to fix the bugs in destroying the stream of XDevice (unverified). 2. Fix the memory leak in the ReduceSumAll function. 3. Adjust the directory structure. 4. Fix minor errors.
liyinqiao committed
-
- 07 Aug, 2020 1 commit
-
-
Fix the bugs in Sum and Sub when MKL or OpenBLAS is used.
liyinqiao committed
-
- 06 Aug, 2020 1 commit
-
-
1. Update the code of the machine translation sample. The current version is the same as NiuTrans.NMT. 2. Update the XList class. 3. Fix bugs.
liyinqiao committed
-
- 29 Apr, 2020 1 commit
-
-
1. Add a macro to control whether half precision is used in the CUDA code. 2. Update the manuals.
liyinqiao committed
-
- 26 Apr, 2020 2 commits
- 20 Apr, 2020 1 commit
-
-
Fix the bugs in ReduceMax and its related functions.
liyinqiao committed
-
- 18 Apr, 2020 3 commits
- 17 Apr, 2020 2 commits
- 14 Apr, 2020 2 commits
- 13 Apr, 2020 1 commit
-
-
1. Support KB sizes for the memory pool. 2. Clean up the code: 1000000 -> 1024 * 1024.
liyinqiao committed
-
- 11 Apr, 2020 2 commits
- 09 Apr, 2020 1 commit
-
-
1. Fix mistakes in the manual. 2. Fix minor errors in the Normalize and ScaleAndShift functions.
liyinqiao committed
-
- 01 Apr, 2020 1 commit
-
-
1. Fix mistakes in the manual. 2. Fix the bug in the linking of the ConvertDataType function.
liyinqiao committed
-
- 31 Mar, 2020 1 commit
-
-
1. Fix mistakes in the manual. By the way, I have to say there are so many mistakes in the manual. I'm shocked: it has been checked many times, so why are they still there? Does no one care about that? Really??? 2. Clean up the code.
liyinqiao committed
-
- 28 Mar, 2020 1 commit
-
-
1. Add CMakeLists.txt for compiling NiuTensor. 2. Update the manuals for CMake.
liyinqiao committed
-
- 26 Mar, 2020 1 commit
-
-
1. Call the Scale or Shift function when the shift parameter is 0.0F or the scale parameter is 1.0F. 2. Add unit tests for ScaleAndShift covering the cases where shift is 0.0F or scale is 1.0F.
liyinqiao committed
-
- 25 Mar, 2020 6 commits
-
-
Use _CheckData without tolerance in some unit tests to speed up the test process.
liyinqiao committed -
1. Use _CheckData with tolerance in unit tests to avoid the bug in the "0.0F != -0.0F" case. 2. Clean up the code.
liyinqiao committed -
Optimize the MultiplyMe and DivMe functions when operating on scalar tensors.
liyinqiao committed -
1. Fix the bug where the DivMe function cannot handle the scalar tensor and broadcast cases. 2. Clean up the code. 3. Fix minor errors.
liyinqiao committed -
1. Fix the bug where the SubMe function cannot handle the scalar tensor and broadcast cases. 2. Clean up the code.
liyinqiao committed -
Fix the bug where the SumMe function cannot handle the scalar tensor and broadcast cases.
liyinqiao committed
-
- 24 Mar, 2020 1 commit
-
-
1. Fix the bug where the MultiplyMe function cannot handle the scalar tensor and broadcast cases. 2. Add a unit test for broadcast multiplication.
liyinqiao committed
-
- 19 Mar, 2020 3 commits
- 18 Mar, 2020 4 commits