- 19 Sep, 2020 2 commits
- 14 Sep, 2020 3 commits
-
-
liyinqiao committed
-
Find some fp16 bugs during decoding NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.
liyinqiao committed -
1. Update the installing methods in manual. 2. Remove the traditional installing way of Windows in manual. 3. Fix the bugs in CMake.
liyinqiao committed
-
- 12 Sep, 2020 1 commit
-
-
Find some bugs during training NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.
liyinqiao committed
-
- 11 Sep, 2020 1 commit
-
-
Support to check whether to compile half precision parts in CUDA codes.
liyinqiao committed
-
- 10 Sep, 2020 1 commit
-
-
Fix the __half bugs in SetData.
liyinqiao committed
-
- 07 Sep, 2020 1 commit
-
-
1. Fix the bugs in backward process. 2. Support the float16 class. 3. Fix bugs. 4. Clean the codes. 5. Remove the makefile.
liyinqiao committed
-
- 02 Sep, 2020 2 commits
-
-
Merge with the branch of xuchen (NOT update the float16, this needs code review) and fix the bugs in Gather function. 1. Support Reciprocal fucntion. 2. Fix the safe delete bugs in XDevice. 3. Support new API to convert the data type of tensor. 4. Support to show the memory usage of buffer memory. 5. Fix minor errors.
liyinqiao committed -
Fix minor errors.
liyinqiao committed
-
- 01 Sep, 2020 1 commit
-
-
Fix the bugs in compiling NiuTensor with OpenBLAS (the version is not 0.3.10).
liyinqiao committed
-
- 31 Aug, 2020 2 commits
- 30 Aug, 2020 1 commit
-
-
1. Support to generate the dynamic link library on GPU. 2. Support to compile the project on the MKL and OpenBLAS. 3. Update the manual to support the new CMake file. 4. Fix minor errors.
liyinqiao committed
-
- 29 Aug, 2020 2 commits
- 27 Aug, 2020 1 commit
-
-
1. Try to fix the bugs in destroy the stream of XDevice (Uncheck). 2. Fix the bug of memory leak in ReduceSumAll function. 2. Adjust the directory structure. 3. Fix the minor errors.
liyinqiao committed
-
- 07 Aug, 2020 1 commit
-
-
Fix the bugs in Sum and Sub when MKL or OpenBlas is used.
liyinqiao committed
-
- 06 Aug, 2020 1 commit
-
-
1. Update the codes of machine translation sample. The current version is the same with NiuTrans.NMT. 2. Update the XList class. 3. Bugs fix.
liyinqiao committed
-
- 29 Apr, 2020 1 commit
-
-
1. Offer macro to set whether use the half precision in cuda codes. 2. Update the manuals.
liyinqiao committed
-
- 26 Apr, 2020 2 commits
- 20 Apr, 2020 1 commit
-
-
Fix the bugs in ReduceMax and its related functions.
liyinqiao committed
-
- 18 Apr, 2020 3 commits
- 17 Apr, 2020 2 commits
- 14 Apr, 2020 2 commits
- 13 Apr, 2020 1 commit
-
-
1. Support KB size for memory pool. 2. Clean the codes: 1000000 -> 1024 * 1024.
liyinqiao committed
-
- 11 Apr, 2020 2 commits
- 09 Apr, 2020 1 commit
-
-
1. Fix the mistakes in manual. 2. Fix the minor errors in Normalize and ScaleAndShift functions.
liyinqiao committed
-
- 01 Apr, 2020 1 commit
-
-
1. Fix the mistakes in manual. 2. Fix the bug in the link process of ConvertDataType function.
liyinqiao committed
-
- 31 Mar, 2020 1 commit
-
-
1. Fix the mistakes in manual. By the way, I have to say there are so many mistakes in the manual. I'm shocked it has been checked a lot of times, but why they are still be there. No one care about that? Really??? 2. Clean the codes.
liyinqiao committed
-
- 28 Mar, 2020 1 commit
-
-
1. Offer CMakeLists.txt to compile the NiuTensor. 2. Update the manuals for CMake.
liyinqiao committed
-
- 26 Mar, 2020 1 commit
-
-
1. Call the Scale or Shift functions when the parameter shift is 0.0F or scale is 1.0F. 2. Add the unit test for ScaleAndShift for the cases which parameter shift is 0.0F or scale is 1.0F.
liyinqiao committed
-
- 25 Mar, 2020 1 commit
-
-
Use the _CheckData without tolerance in some unit test to speedup the test process.
liyinqiao committed
-