Commits · master · NiuTrans / NiuTrans.Tensor

18 Mar, 2021 1 commit
- bug fixes in ReduceSumAll · f408c730
  xiaotong committed 4 years ago
  
  f408c730 Browse Directory
13 Mar, 2021 3 commits

1. Add mutex when operating the memory pool.
2. Support CMake 3.19 and fix some CMake bugs. Note that CUDA_ROOT variable in CMake is modified as CUDA_TOOLKIT_ROOT. You can find this update in the README.
3. Fix some bugs on Ubuntu.

committed 4 years ago

ae6e43fd Browse Directory

Merge with liyinqiao branch. · 8f368e73

committed 4 years ago

8f368e73 Browse Directory

Merge with xiaotong branch and add mutex when operating the memory pool. · a79523f9
liyinqiao committed 4 years ago

a79523f9 Browse Directory

08 Mar, 2021 1 commit
- fix the bug in Merge · 4bcf6c54
  xiaotong committed 4 years ago
  
  4bcf6c54 Browse Directory
05 Mar, 2021 1 commit
- wording · e6c92495
  xiaotong committed 4 years ago
  
  e6c92495 Browse Directory
28 Feb, 2021 1 commit
- code clean · 4a87ecc0
  xiaotong committed 4 years ago
  
  4a87ecc0 Browse Directory
23 Feb, 2021 2 commits
- updates of XThead · 2b03a447
  xiaotong committed 4 years ago
  
  2b03a447 Browse Directory
- remove XStream · dd7a67bc
  xiaotong committed 4 years ago
  
  dd7a67bc Browse Directory
21 Feb, 2021 2 commits
- bug fixes · 9fd6a28f
  xiaotong committed 4 years ago
  
  9fd6a28f Browse Directory
- bug fixes and removing warnings · 02b6c379
  xiaotong committed 4 years ago
  
  02b6c379 Browse Directory
06 Feb, 2021 3 commits
- Merge branch liyinqiao. · d291f56a
  liyinqiao committed 4 years ago
  
  d291f56a Browse Directory
- Merge branch liyinqiao. · dee31741
  liyinqiao committed 4 years ago
  
  dee31741 Browse Directory
- Merge with the branch of huchi and fix bugs. · 5bfbd041
  liyinqiao committed 4 years ago
  
  5bfbd041 Browse Directory
15 Dec, 2020 1 commit

Merge with the branch of huchi. · 64973687

1. Support Inplace way for some functions.
2. Add new flexible constructor for XList.
3. Bug fixed.

committed 4 years ago

64973687 Browse Directory

25 Sep, 2020 1 commit
- Bug fixed. · 893a0938
```
Fix the minor bugs in IsSameShaped.
```
  liyinqiao committed 4 years ago
  893a0938 Browse Directory
14 Sep, 2020 1 commit

Roll back some codes. · 7d8fedae

Find some fp16 bugs during decoding NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.

committed 4 years ago

7d8fedae Browse Directory

12 Sep, 2020 1 commit

Roll back some codes. · 65bb83d8

Find some bugs during training NMT system. We decide to roll back some codes to the last version. These codes need to be reviewed.

committed 4 years ago

65bb83d8 Browse Directory

10 Sep, 2020 1 commit
- Bug fixed. · 80da434c
```
Fix the __half bugs in SetData.
```
  liyinqiao committed 4 years ago
  80da434c Browse Directory
07 Sep, 2020 1 commit

Merge with the branch of xuchen and huchi. · 55dd6a78

1. Fix the bugs in backward process.
2. Support the float16 class.
3. Fix bugs.
4. Clean the codes.
5. Remove the makefile.

committed 4 years ago

55dd6a78 Browse Directory

02 Sep, 2020 1 commit

Merge with the branch of xuchen (NOT update the float16, this needs code review)… · 2f7adb8c

Merge with the branch of xuchen (NOT update the float16, this needs code review) and fix the bugs in Gather function.
1. Support Reciprocal fucntion.
2. Fix the safe delete bugs in XDevice.
3. Support new API to convert the data type of tensor.
4. Support to show the memory usage of buffer memory.
5. Fix minor errors.

committed 4 years ago

2f7adb8c Browse Directory

27 Aug, 2020 1 commit

Bug fixed and clean the codes. · 2cc0a82d

1. Try to fix the bugs in destroy the stream of XDevice (Uncheck).
2. Fix the bug of memory leak in ReduceSumAll function.
2. Adjust the directory structure.
3. Fix the minor errors.

committed 4 years ago

2cc0a82d Browse Directory

07 Aug, 2020 1 commit
- Bug fixed. · 9f12ebd2
```
Fix the bugs in Sum and Sub when MKL or OpenBlas is used.
```
  liyinqiao committed 4 years ago
  9f12ebd2 Browse Directory
06 Aug, 2020 1 commit

Update the codes of Transformer sample and XList class. · b801df51

1. Update the codes of machine translation sample. The current version is the same with NiuTrans.NMT.
2. Update the XList class.
3. Bugs fix.

committed 4 years ago

b801df51 Browse Directory

29 Apr, 2020 1 commit
- Support run NiuTensor on the older GPUs with Maxwell or more previous architectures. · 8178ba40
```
1. Offer macro to set whether use the half precision in cuda codes.
2. Update the manuals.
```
  liyinqiao committed 4 years ago
  8178ba40 Browse Directory
26 Apr, 2020 2 commits
- Bug fixed. (This version cannot run on the GPUs with pascal and much older framework). · bb5eb7db
```
1. Fix the memory leak bugs in XList.
2. Replace the half with the unsigned short data type.
```
  liyinqiao committed 4 years ago
  bb5eb7db Browse Directory
- Clean the codes. (This version cannot run on the GPUs with pascal and much older framework). · 148b2577
  liyinqiao committed 4 years ago
  
  148b2577 Browse Directory
20 Apr, 2020 1 commit
- Bug fixed. (This version cannot run on the GPUs with pascal and much older framework). · a095188b
```
Fix the bugs in ReduceMax and its related functions.
```
  liyinqiao committed 4 years ago
  a095188b Browse Directory
18 Apr, 2020 2 commits
- Support fp16 data type for more operations and fix the minor errors. (Don't use… · f1792ca4
```
Support fp16 data type for more operations and fix the minor errors.  (Don't use this! It's an incomplete version)
```
  liyinqiao committed 4 years ago
  f1792ca4 Browse Directory
- Support fp16 data type for more operations and fix the minor errors. (Don't use… · 1f1413ca
```
Support fp16 data type for more operations and fix the minor errors.  (Don't use this! It's an incomplete version)
```
  liyinqiao committed 4 years ago
  1f1413ca Browse Directory
17 Apr, 2020 1 commit
- Support fp16 data type for more operations and fix the minor errors. (Don't use… · 22cc1218
```
Support fp16 data type for more operations and fix the minor errors.  (Don't use this! It's an incomplete version)
```
  liyinqiao committed 4 years ago
  22cc1218 Browse Directory
11 Apr, 2020 1 commit
- Fix the mistakes in manual and minor errors. · 3c15686c
```
1. Fix the mistakes in manual.
2. Fix the minor errors in Stack and LogSoftmax functions.
```
  liyinqiao committed 5 years ago
  3c15686c Browse Directory
09 Apr, 2020 1 commit
- Fix the mistakes in manual and minor errors. · 0c396646
```
1. Fix the mistakes in manual.
2. Fix the minor errors in Normalize and ScaleAndShift functions.
```
  liyinqiao committed 5 years ago
  0c396646 Browse Directory
31 Mar, 2020 1 commit

Fix the mistakes in manual and clean the codes. · 7d4ab222

1. Fix the mistakes in manual. By the way, I have to say there are so many mistakes in the manual. I'm shocked it has been checked a lot of times, but why they are still be there. No one care about that? Really???
2. Clean the codes.

committed 5 years ago

7d4ab222 Browse Directory

26 Mar, 2020 1 commit

Optimize the ScaleAndShift function and add more unit tests. · 6d137345

1. Call the Scale or Shift functions when the parameter shift is 0.0F or scale is 1.0F.
2. Add the unit test for ScaleAndShift for the cases which parameter shift is 0.0F or scale is 1.0F.

committed 5 years ago

6d137345 Browse Directory

25 Mar, 2020 4 commits

Optimize the functions. · dcd3a86b
```
Optimize the MultiplyMe and DivMe functions when operate the scalar tensors.
```
liyinqiao committed 5 years ago
dcd3a86b Browse Directory

Bug fixed and clean the codes. · a7d832bc

1. Fix the bug in DivMe function which cannot handle the scalar tensor and broadcast case.
2. Clean the codes.
3. Fix minor errors.

committed 5 years ago

a7d832bc Browse Directory

Bug fixed and clean the codes. · a4b98ac6

1. Fix the bug in SubMe function which cannot handle the scalar tensor and broadcast case.
2. Clean the codes.

committed 5 years ago

a4b98ac6 Browse Directory

Bug fixed. · 14ec9fad

Fix the bug in SumMe function which cannot handle the scalar tensor and broadcast case.

committed 5 years ago

14ec9fad Browse Directory

24 Mar, 2020 1 commit

Bug fixed and add new unit test. · b0f2bbbf

1. Fix the bug in MultiplyMe function which cannot handle the scalar tensor and broadcast case.
2. Add broadcast multiply unit test.

committed 5 years ago

b0f2bbbf Browse Directory