1) fix the bug of unexpected breakpoints in tensor graph 2) add devID to XTensor constructor 3) avoid unneccessary data allocation in XLink