============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.11.0, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_004/sault/config/pytest.ini plugins: anyio-3.7.1, forked-1.1.3, xdist-1.32.0 [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:17.876.041 [trace_attr.c:105](tid:119067) platform is 1. [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:17.876.239 [trace_recorder.c:114](tid:119067) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:17.876.272 [trace_signal.c:133](tid:119067) register signal handler for signo 2 succeed. [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:17.876.287 [trace_signal.c:133](tid:119067) register signal handler for signo 15 succeed. [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:18.315.616 [runtime.cc:1159] 119067 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:18.315.673 [runtime.cc:4719] 119067 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_tile.py [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.723.505 [process_mode_manager.cpp:109][OpenProcess][tid:119067] [ProcessModeManager] enter into open process deviceId[3] rankSize[0] [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.724.663 [process_mode_manager.cpp:379][InitTsdClient][tid:119067] [TsdClient] deviceId[3] begin to init hdc client [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.724.824 [version_verify.cpp:34][SetVersionInfo][tid:119067] VersionVerify: send client version to server [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.724.919 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.724.938 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.540 [version_verify.cpp:66][PeerVersionCheck][tid:119067] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.562 [version_verify.cpp:87][ParseVersionInfo][tid:119067] VersionVerify: pass client version info success [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.574 [hdc_client.cpp:276][CheckHdcConnection][tid:119067] Service[2] create hdc success [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.592 [version_verify.cpp:120][SpecialFeatureCheck][tid:119067] VersionVerify: new type[35], supported [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.643 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:119067] [TsdClient][deviceId=3] [sessionId=1] wait package info respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.786 [process_mode_manager.cpp:379][InitTsdClient][tid:119067] [TsdClient] deviceId[3] begin to init hdc client [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.744.992 [version_verify.cpp:34][SetVersionInfo][tid:119067] VersionVerify: send client version to server [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.008 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.024 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.177 [version_verify.cpp:66][PeerVersionCheck][tid:119067] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.192 [version_verify.cpp:87][ParseVersionInfo][tid:119067] VersionVerify: pass client version info success [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.204 [hdc_client.cpp:276][CheckHdcConnection][tid:119067] Service[2] create hdc success [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.218 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:119067] [TsdClient] tsd get process sign successfully, procpid[119067] signSize[48] [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.249 [version_verify.cpp:112][SpecialFeatureCheck][tid:119067] VersionVerify: previous type[6], supported [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.745.273 [process_mode_manager.cpp:126][OpenProcess][tid:119067] [ProcessModeManager] deviceId[3] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.972.553 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:119067] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.972.586 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:119067] enter into OpenInHost deviceid[3] [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.972.599 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:119067] host cpu not support [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.972.607 [process_mode_manager.cpp:156][OpenProcess][tid:119067] [TsdClient][deviceId=3] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:22.975.405 [device.cc:340] 119067 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:22.991.158 [npu_driver.cc:5428] 119653 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:22.991.244 [atrace_api.c:28](tid:119067) AtraceCreate start [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:22.991.359 [trace_rb_log.c:84](tid:119067) [RUNTIME_ATRACE_DEV3_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:22.991.380 [atrace_api.c:32](tid:119067) AtraceCreate end [INFO] TDT(119067,python3.7):2024-01-11-05:59:22.991.407 [client_manager.cpp:157][SetProfilingCallback][tid:119067] [TsdClient] set profiling callback success [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.003.172 [process_mode_manager.cpp:184][Close][tid:119067] [TsdClient] Close [deviceId=3][sessionId=1] hccp and computer enter [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.003.211 [version_verify.cpp:112][SpecialFeatureCheck][tid:119067] VersionVerify: previous type[7], supported [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.003.252 [process_mode_manager.cpp:192][Close][tid:119067] [TsdClient][deviceId=3] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.024.976 [process_mode_manager.cpp:197][Close][tid:119067] [TsdClient][logicDeviceId_=3]has recv close hccp and computer process respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.024.996 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:119067] enter into CloseInHost deviceid[3] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.025.008 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:119067] host cpu not support [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.025.044 [process_mode_manager.cpp:208][Close][tid:119067] [TsdClient][deviceId=3] [sessionId=1] close hccp and computer process success [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.025.059 [atrace_api.c:93](tid:119067) AtraceDestroy start [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.025.075 [atrace_api.c:95](tid:119067) AtraceDestroy end F[INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.229 [process_mode_manager.cpp:109][OpenProcess][tid:119067] [ProcessModeManager] enter into open process deviceId[3] rankSize[0] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.730 [process_mode_manager.cpp:705][GetDeviceCheckCode][tid:119067] [ProcessModeManager][deviceId=3] aicpu package already exist in device [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.780 [process_mode_manager.cpp:379][InitTsdClient][tid:119067] [TsdClient] deviceId[3] begin to init hdc client [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.903 [version_verify.cpp:34][SetVersionInfo][tid:119067] VersionVerify: send client version to server [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.926 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.605.951 [version_verify.cpp:50][SetVersionInfo][tid:119067] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.221 [version_verify.cpp:66][PeerVersionCheck][tid:119067] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.239 [version_verify.cpp:87][ParseVersionInfo][tid:119067] VersionVerify: pass client version info success [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.250 [hdc_client.cpp:276][CheckHdcConnection][tid:119067] Service[2] create hdc success [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.266 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:119067] [TsdClient] tsd get process sign successfully, procpid[119067] signSize[48] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.284 [version_verify.cpp:112][SpecialFeatureCheck][tid:119067] VersionVerify: previous type[6], supported [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.606.309 [process_mode_manager.cpp:126][OpenProcess][tid:119067] [ProcessModeManager] deviceId[3] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.886.870 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:119067] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.886.894 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:119067] enter into OpenInHost deviceid[3] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.886.907 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:119067] host cpu not support [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.886.918 [process_mode_manager.cpp:156][OpenProcess][tid:119067] [TsdClient][deviceId=3] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:23.889.530 [device.cc:340] 119067 Init: isDoubledie:0, topologytype:0 [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.904.581 [atrace_api.c:28](tid:119067) AtraceCreate start [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.904.646 [trace_rb_log.c:84](tid:119067) [RUNTIME_ATRACE_DEV3_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.904.662 [atrace_api.c:32](tid:119067) AtraceCreate end [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.904.675 [client_manager.cpp:157][SetProfilingCallback][tid:119067] [TsdClient] set profiling callback success [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.914.492 [process_mode_manager.cpp:184][Close][tid:119067] [TsdClient] Close [deviceId=3][sessionId=1] hccp and computer enter [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.914.523 [version_verify.cpp:112][SpecialFeatureCheck][tid:119067] VersionVerify: previous type[7], supported [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.914.568 [process_mode_manager.cpp:192][Close][tid:119067] [TsdClient][deviceId=3] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.935.893 [process_mode_manager.cpp:197][Close][tid:119067] [TsdClient][logicDeviceId_=3]has recv close hccp and computer process respond [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.935.913 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:119067] enter into CloseInHost deviceid[3] [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.935.925 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:119067] host cpu not support [INFO] TDT(119067,python3.7):2024-01-11-05:59:23.935.963 [process_mode_manager.cpp:208][Close][tid:119067] [TsdClient][deviceId=3] [sessionId=1] close hccp and computer process success [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.935.979 [atrace_api.c:93](tid:119067) AtraceDestroy start [INFO] ATRACE(119067,python3.7):2024-01-11-05:59:23.935.995 [atrace_api.c:95](tid:119067) AtraceDestroy end F =================================== FAILURES =================================== ____________________________ test_tile_backward[0] _____________________________ mode = 0 @pytest.mark.level1 @pytest.mark.env_onecard @pytest.mark.platform_x86_cpu @pytest.mark.platform_x86_gpu_training @pytest.mark.platform_arm_ascend_training @pytest.mark.parametrize('mode', [ms.context.GRAPH_MODE, ms.context.PYNATIVE_MODE]) def test_tile_backward(mode): """ Feature: Auto grad. Description: test auto grad of op tile. Expectation: expect correct result. """ ms.context.set_context(mode=mode) x = Tensor(np.random.rand(2, 3, 4, 5).astype(np.float32)) mul = (1, 1, 2, 2) > grads = tile_backward_func(x, mul) test_tile.py:103: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ test_utils.py:42: in wrapper cell_obj = Net(fn) test_utils.py:29: in __init__ super().__init__() _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ self = Net<>, auto_prefix = True, flags = None def __init__(self, auto_prefix=True, flags=None): Cell_.__init__(self, self._cell_tag) self._params = OrderedDict() self._cells = OrderedDict() self._params_list = OrderedDict() self._tensor_list = OrderedDict() self._primitives = OrderedDict() self.training = False self.requires_grad = False self.pynative = False self._attr_synced = False self._param_prefix = '' self._auto_prefix = auto_prefix self._scope = None self._phase = 'train' self._parameter_layout_dict = {} self._parallel_parameter_name_list = () self._parallel_parameter_merge_net_dict = {} self._create_time = int(time.time() * 1e9) self.arguments_key = "" self.compile_cache = set() cells_compile_cache[id(self)] = self.compile_cache self.parameter_broadcast_done = False self._id = 1 self.exist_names = set("") self.exist_objs = set() > init_pipeline() E RuntimeError: Ascend kernel runtime initialization failed. The details refer to 'Ascend Error Message'. E E ---------------------------------------------------- E - Framework Error Message: E ---------------------------------------------------- E Malloc device memory failed, free memory size is less than half of total memory size.Device 3 Device HBM total size:34359738368 Device HBM free size:1602916352 may be other processes occupying this card, check as: ps -ef|grep python E E ---------------------------------------------------- E - C++ Call Stack: (For framework developers) E ---------------------------------------------------- E mindspore/ccsrc/plugin/device/ascend/hal/device/ascend_kernel_runtime.cc:357 Init E mindspore/ccsrc/plugin/device/ascend/hal/device/ascend_memory_adapter.cc:73 Initialize /home/jenkins/.local/lib/python3.7/site-packages/mindspore/nn/cell.py:134: RuntimeError ____________________________ test_tile_backward[1] _____________________________ mode = 1 @pytest.mark.level1 @pytest.mark.env_onecard @pytest.mark.platform_x86_cpu @pytest.mark.platform_x86_gpu_training @pytest.mark.platform_arm_ascend_training @pytest.mark.parametrize('mode', [ms.context.GRAPH_MODE, ms.context.PYNATIVE_MODE]) def test_tile_backward(mode): """ Feature: Auto grad. Description: test auto grad of op tile. Expectation: expect correct result. """ ms.context.set_context(mode=mode) x = Tensor(np.random.rand(2, 3, 4, 5).astype(np.float32)) mul = (1, 1, 2, 2) > grads = tile_backward_func(x, mul) test_tile.py:103: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ test_utils.py:42: in wrapper cell_obj = Net(fn) test_utils.py:29: in __init__ super().__init__() _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ self = Net<>, auto_prefix = True, flags = None def __init__(self, auto_prefix=True, flags=None): Cell_.__init__(self, self._cell_tag) self._params = OrderedDict() self._cells = OrderedDict() self._params_list = OrderedDict() self._tensor_list = OrderedDict() self._primitives = OrderedDict() self.training = False self.requires_grad = False self.pynative = False self._attr_synced = False self._param_prefix = '' self._auto_prefix = auto_prefix self._scope = None self._phase = 'train' self._parameter_layout_dict = {} self._parallel_parameter_name_list = () self._parallel_parameter_merge_net_dict = {} self._create_time = int(time.time() * 1e9) self.arguments_key = "" self.compile_cache = set() cells_compile_cache[id(self)] = self.compile_cache self.parameter_broadcast_done = False self._id = 1 self.exist_names = set("") self.exist_objs = set() > init_pipeline() E RuntimeError: Ascend kernel runtime initialization failed. The details refer to 'Ascend Error Message'. E E ---------------------------------------------------- E - Framework Error Message: E ---------------------------------------------------- E Malloc device memory failed, free memory size is less than half of total memory size.Device 3 Device HBM total size:34359738368 Device HBM free size:1602883584 may be other processes occupying this card, check as: ps -ef|grep python E E ---------------------------------------------------- E - C++ Call Stack: (For framework developers) E ---------------------------------------------------- E mindspore/ccsrc/plugin/device/ascend/hal/device/ascend_kernel_runtime.cc:357 Init E mindspore/ccsrc/plugin/device/ascend/hal/device/ascend_memory_adapter.cc:73 Initialize /home/jenkins/.local/lib/python3.7/site-packages/mindspore/nn/cell.py:134: RuntimeError =========================== short test summary info ============================ FAILED test_tile.py::test_tile_backward[0] - RuntimeError: Ascend kernel runt... FAILED test_tile.py::test_tile_backward[1] - RuntimeError: Ascend kernel runt... ============================== 2 failed in 7.86s =============================== [INFO] RUNTIME(119067,python3.7):2024-01-11-05:59:25.540.039 [runtime.cc:1737] 119067 ~Runtime: deconstruct runtime.