============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_001/sault/config/pytest.ini plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3 [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:31.383.096 [trace_attr.c:105](tid:11300) platform is 1. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:31.383.270 [trace_recorder.c:114](tid:11300) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:31.383.300 [trace_signal.c:133](tid:11300) register signal handler for signo 2 succeed. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:31.383.311 [trace_signal.c:133](tid:11300) register signal handler for signo 15 succeed. [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:31.816.479 [runtime.cc:1159] 11300 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:31.816.558 [runtime.cc:4719] 11300 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 1 item test_transpose.py [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.347.955 [process_mode_manager.cpp:109][OpenProcess][tid:11300] [ProcessModeManager] enter into open process deviceId[0] rankSize[0] [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.349.697 [process_mode_manager.cpp:379][InitTsdClient][tid:11300] [TsdClient] deviceId[0] begin to init hdc client [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.349.858 [version_verify.cpp:34][SetVersionInfo][tid:11300] VersionVerify: send client version to server [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.349.888 [version_verify.cpp:50][SetVersionInfo][tid:11300] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.349.900 [version_verify.cpp:50][SetVersionInfo][tid:11300] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.353 [version_verify.cpp:66][PeerVersionCheck][tid:11300] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.369 [version_verify.cpp:87][ParseVersionInfo][tid:11300] VersionVerify: pass client version info success [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.378 [hdc_client.cpp:276][CheckHdcConnection][tid:11300] Service[2] create hdc success [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.394 [version_verify.cpp:120][SpecialFeatureCheck][tid:11300] VersionVerify: new type[35], supported [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.441 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:11300] [TsdClient][deviceId=0] [sessionId=1] wait package info respond [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.577 [process_mode_manager.cpp:379][InitTsdClient][tid:11300] [TsdClient] deviceId[0] begin to init hdc client [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.747 [version_verify.cpp:34][SetVersionInfo][tid:11300] VersionVerify: send client version to server [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.759 [version_verify.cpp:50][SetVersionInfo][tid:11300] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.769 [version_verify.cpp:50][SetVersionInfo][tid:11300] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.905 [version_verify.cpp:66][PeerVersionCheck][tid:11300] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.917 [version_verify.cpp:87][ParseVersionInfo][tid:11300] VersionVerify: pass client version info success [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.925 [hdc_client.cpp:276][CheckHdcConnection][tid:11300] Service[2] create hdc success [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.937 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:11300] [TsdClient] tsd get process sign successfully, procpid[11300] signSize[48] [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.972 [version_verify.cpp:112][SpecialFeatureCheck][tid:11300] VersionVerify: previous type[6], supported [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.350.994 [process_mode_manager.cpp:126][OpenProcess][tid:11300] [ProcessModeManager] deviceId[0] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.583.264 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:11300] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.583.306 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:11300] enter into OpenInHost deviceid[0] [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.583.316 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:11300] host cpu not support [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.583.325 [process_mode_manager.cpp:156][OpenProcess][tid:11300] [TsdClient][deviceId=0] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:36.585.989 [device.cc:340] 11300 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:36.600.525 [npu_driver.cc:5428] 12724 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:36.600.574 [atrace_api.c:28](tid:11300) AtraceCreate start [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:36.600.675 [trace_rb_log.c:84](tid:11300) [RUNTIME_ATRACE_DEV0_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:36.600.690 [atrace_api.c:32](tid:11300) AtraceCreate end [INFO] TDT(11300,python3.7):2024-01-11-05:42:36.600.704 [client_manager.cpp:157][SetProfilingCallback][tid:11300] [TsdClient] set profiling callback success [TRACE] GE(11300,python3.7):2024-01-11-05:42:36.674.646 [status:INIT] [ge_api.cc:144]11300 GEInitializeImpl:GEInitialize start [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:36.884.400 [msprofiler_impl.cpp:156] >>> (tid:11300) ProfNotifySetDevice called, is open: 1, devId: 0 [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:36.884.548 [platform.cpp:38] >>> (tid:11300) Profiling platform version: 1.0. [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:36.884.565 [ai_drv_dev_api.cpp:384] >>> (tid:11300) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(11300,python3.7):2024-01-11-05:42:36.934.937 [status:RUNNING] [ge_api.cc:211]11300 GEInitializeImpl:Initializing environment [INFO] GE(11300,python3.7):2024-01-11-05:42:36.935.012 [gelib.cc:98][EVENT]11300 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(11300,python3.7):2024-01-11-05:42:36.935.289 [gelib.cc:307][EVENT]11300 SystemInitialize:Online infer init GELib success, device id :0 [INFO] DVPP(11300,python3.7):2024-01-11-05:42:37.301.452 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:11300]dvpp engine do not support [INFO] TUNE(11300,python3.7):2024-01-11-05:42:37.305.030 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:11300]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(11300,python3.7):2024-01-11-05:42:37.305.068 [handle_manager.cpp:115][CANNKB][Tid:11300]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(11300,python3.7):2024-01-11-05:42:37.305.130 [handle_manager.cpp:407][CANNKB][Tid:11300]"Init functions of loading dynamic python lib end!" [INFO] TUNE(11300,python3.7):2024-01-11-05:42:37.305.140 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:11300]"CANN_KB_Py has already been initialized." [INFO] TUNE(11300,python3.7):2024-01-11-05:42:37.305.215 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:11300]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(11300,python3.7):2024-01-11-05:42:49.253.445 [plugin_manager.cc:42][11300]hcom running normal mode. [INFO] DVPP(11300,python3.7):2024-01-11-05:42:49.254.133 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:11300]dvpp ops kernel info store do not support [INFO] DVPP(11300,python3.7):2024-01-11-05:42:49.254.298 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:11300]dvpp graph optimizer do not support [INFO] DVPP(11300,python3.7):2024-01-11-05:42:49.774.625 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:11300]dvpp ops kernel builder do not support [INFO] GE(11300,python3.7):2024-01-11-05:42:49.783.714 [gelib.cc:169][EVENT]11300 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12848647] micro second. [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.872.326 [status:STOP] [ge_api.cc:255]11300 GEInitializeImpl:GEInitialize finished [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.872.507 [status:INIT] [ge_api.cc:398]11300 Session:Start to construct session. [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.872.526 [status:RUNNING] [ge_api.cc:408]11300 Session:Creating session [INFO] GE(11300,python3.7):2024-01-11-05:42:49.872.948 [graph_var_manager.cc:1445][EVENT]11300 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(11300,python3.7):2024-01-11-05:42:49.872.966 [graph_var_manager.cc:1424][EVENT]11300 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:49.873.296 [msprofiler_impl.cpp:156] >>> (tid:11300) ProfNotifySetDevice called, is open: 1, devId: 0 [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.874.163 [status:RUNNING] [ge_api.cc:411]11300 Session:Session id is 0 [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.874.185 [status:STOP] [ge_api.cc:420]11300 Session:Session Constructor finished [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:49.883.934 [platform.cpp:38] >>> (tid:11300) Profiling platform version: 1.0. [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:49.883.965 [ai_drv_dev_api.cpp:384] >>> (tid:11300) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(11300,python3.7):2024-01-11-05:42:49.884.201 [status:INIT] [ge_api.cc:144]11300 GEInitializeImpl:GEInitialize start TotalTime = 0.414217, [20] [parse]: 0.227845 [symbol_resolve]: 0.0299563, [1] [Cycle 1]: 0.0298785, [1] [resolve]: 0.0298509 [combine_like_graphs]: 9.09997e-07 [graph_reusing]: 2.58e-06 [meta_unpack_prepare]: 0.00015682 [pre_cconv]: 4e-06 [abstract_specialize]: 0.00481445 [pack_expand]: 1.513e-05 [auto_monad]: 0.00012462 [inline]: 1.43e-06 [pre_auto_parallel]: 2.141e-05 [pipeline_split]: 3.13e-06 [optimize]: 0.144341, [35] [py_interpret_to_execute]: 3.6e-06 [rewriter_before_opt_a]: 0.00017215 [opt_a]: 0.143063, [4] [Cycle 1]: 0.0967662, [30] [expand_dump_flag]: 4.81e-06 [switch_simplify]: 2.402e-05 [a_1]: 0.00040563 [recompute_prepare]: 8.97e-06 [updatestate_depend_eliminate]: 1.117e-05 [updatestate_assign_eliminate]: 7.57e-06 [updatestate_loads_eliminate]: 6.51e-06 [parameter_eliminate]: 5.7e-06 [a_2]: 8.99e-05 [accelerated_algorithm]: 5.8e-06 [pynative_shard]: 1.66e-06 [auto_parallel]: 3.36e-06 [parallel]: 1.634e-05 [merge_comm]: 9.81e-06 [allreduce_fusion]: 2.41e-06 [virtual_dataset]: 5.28e-06 [get_grad_eliminate_]: 4.36e-06 [virtual_output]: 3.87e-06 [merge_forward]: 9.15e-06 [cell_reuse_recompute_pass]: 9.80006e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.24e-05 [meta_fg_expand]: 0.0382277, [1] [Cycle 1]: 0.00542309, [1] [resolve]: 0.00540162 [after_resolve]: 4.428e-05 [a_after_grad]: 0.00011913 [renormalize]: 0.0570231 [real_op_eliminate]: 3.06e-05 [auto_monad_grad]: 6.515e-05 [auto_monad_eliminator]: 6.038e-05 [cse]: 0.00016665 [a_3]: 0.00020726 [Cycle 2]: 0.0366181, [30] [expand_dump_flag]: 2.91e-06 [switch_simplify]: 9.05e-05 [a_1]: 0.00051829 [recompute_prepare]: 1.145e-05 [updatestate_depend_eliminate]: 1.24e-05 [updatestate_assign_eliminate]: 8.54e-06 [updatestate_loads_eliminate]: 8.09e-06 [parameter_eliminate]: 3.53e-06 [a_2]: 0.00011983 [accelerated_algorithm]: 1.273e-05 [pynative_shard]: 1.56e-06 [auto_parallel]: 5.2e-06 [parallel]: 5.02e-06 [merge_comm]: 2.63e-06 [allreduce_fusion]: 1.38e-06 [virtual_dataset]: 7.27e-06 [get_grad_eliminate_]: 5.73e-06 [virtual_output]: 5.84e-06 [merge_forward]: 9.67e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.51e-05 [meta_fg_expand]: 0.0105787, [3] [Cycle 1]: 0.00034272, [1] [resolve]: 0.00032341 [Cycle 1]: 0.00164415, [1] [resolve]: 0.00162224 [Cycle 1]: 0.00033238, [1] [resolve]: 0.00031373 [after_resolve]: 5.752e-05 [a_after_grad]: 0.00013597 [renormalize]: 0.0242944 [real_op_eliminate]: 3.116e-05 [auto_monad_grad]: 4.072e-05 [auto_monad_eliminator]: 6.325e-05 [cse]: 0.00014267 [a_3]: 0.00025427 [Cycle 3]: 0.00281857, [30] [expand_dump_flag]: 2.78e-06 [switch_simplify]: 9.027e-05 [a_1]: 0.00060541 [recompute_prepare]: 1.196e-05 [updatestate_depend_eliminate]: 1.334e-05 [updatestate_assign_eliminate]: 9.75e-06 [updatestate_loads_eliminate]: 9.58e-06 [parameter_eliminate]: 3.43e-06 [a_2]: 0.00015052 [accelerated_algorithm]: 1.469e-05 [pynative_shard]: 1.06e-06 [auto_parallel]: 4.09e-06 [parallel]: 3.85e-06 [merge_comm]: 3.01e-06 [allreduce_fusion]: 1.69e-06 [virtual_dataset]: 8.65e-06 [get_grad_eliminate_]: 7.7e-06 [virtual_output]: 7.04e-06 [merge_forward]: 1.129e-05 [cell_reuse_recompute_pass]: 4.1e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.958e-05 [meta_fg_expand]: 3.133e-05 [after_resolve]: 1.167e-05 [a_after_grad]: 1.374e-05 [renormalize]: 0.00141588 [real_op_eliminate]: 1.345e-05 [auto_monad_grad]: 5.31e-06 [auto_monad_eliminator]: 2.339e-05 [cse]: 9.754e-05 [a_3]: 7.277e-05 [Cycle 4]: 0.00073354, [30] [expand_dump_flag]: 1.31e-06 [switch_simplify]: 8.96e-06 [a_1]: 0.00015379 [recompute_prepare]: 1.047e-05 [updatestate_depend_eliminate]: 1.371e-05 [updatestate_assign_eliminate]: 1.008e-05 [updatestate_loads_eliminate]: 9.68e-06 [parameter_eliminate]: 1.94e-06 [a_2]: 0.00014809 [accelerated_algorithm]: 1.472e-05 [pynative_shard]: 1.39e-06 [auto_parallel]: 3.87e-06 [parallel]: 3.61e-06 [merge_comm]: 2.54e-06 [allreduce_fusion]: 1.67e-06 [virtual_dataset]: 8.78e-06 [get_grad_eliminate_]: 7.51e-06 [virtual_output]: 6.92e-06 [merge_forward]: 1.178e-05 [cell_reuse_recompute_pass]: 3.99996e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.888e-05 [meta_fg_expand]: 7.92e-06 [after_resolve]: 1.088e-05 [a_after_grad]: 1.395e-05 [renormalize]: 7.0002e-08 [real_op_eliminate]: 7.47e-06 [auto_monad_grad]: 2.16e-06 [auto_monad_eliminator]: 2.087e-05 [cse]: 5.2e-05 [a_3]: 6.328e-05 [py_interpret_to_execute_after_opt_a]: 3.97e-06 [slice_cell_reuse_recomputed_activation]: 2.69e-06 [rewriter_after_opt_a]: 8.081e-05 [convert_after_rewriter]: 1.752e-05 [order_py_execute_after_rewriter]: 1.15e-05 [opt_b]: 0.00058635, [2] [Cycle 1]: 0.00049574, [7] [b_1]: 0.00043495 [b_2]: 4.03e-06 [updatestate_depend_eliminate]: 3.7e-06 [updatestate_assign_eliminate]: 2.61e-06 [updatestate_loads_eliminate]: 2.53e-06 [renormalize]: 3.90006e-07 [cse]: 1.37e-05 [Cycle 2]: 8.088e-05, [7] [b_1]: 3.764e-05 [b_2]: 2.54e-06 [updatestate_depend_eliminate]: 2.53e-06 [updatestate_assign_eliminate]: 2.27e-06 [updatestate_loads_eliminate]: 1.96e-06 [renormalize]: 7.99992e-08 [cse]: 8.34e-06 [cconv]: 2.224e-05 [opt_after_cconv]: 5.188e-05, [1] [Cycle 1]: 4.772e-05, [7] [c_1]: 5.74e-06 [parameter_eliminate]: 2.01e-06 [updatestate_depend_eliminate]: 2.39e-06 [updatestate_assign_eliminate]: 1.91e-06 [updatestate_loads_eliminate]: 1.81e-06 [cse]: 8.28e-06 [renormalize]: 2.59999e-07 [remove_dup_value]: 1.117e-05 [tuple_transform]: 4.088e-05, [1] [Cycle 1]: 3.64e-05, [3] [d_1]: 1.695e-05 [d_2]: 6.84e-06 [renormalize]: 1.90004e-07 [add_cache_embedding]: 1.085e-05 [add_recomputation]: 5.261e-05 [cse_after_recomputation]: 1.973e-05, [1] [Cycle 1]: 1.517e-05, [1] [cse]: 1.043e-05 [environ_conv]: 2.042e-05 [label_micro_interleaved_index]: 2.7e-06 [label_fine_grained_interleaved_index]: 2.87e-06 [assign_add_opt]: 3.35e-06 [slice_recompute_activation]: 2.05e-06 [micro_interleaved_order_control]: 2.15e-06 [full_micro_interleaved_order_control]: 2.39e-06 [comp_comm_scheduling]: 2.1e-06 [reorder_send_recv_between_fp_bp]: 1.94e-06 [comm_op_add_attrs]: 1.13e-06 [add_comm_op_reuse_tag]: 1e-06 [overlap_opt_shard_in_pipeline]: 1.26e-06 [grouped_pairwise_exchange_alltoall]: 1.49e-06 [overlap_recompute_and_grad_model_parallel]: 2.03e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.50006e-07 [split_matmul_comm_elemetwise]: 2.47001e-06 [split_layernorm_comm]: 2.4e-06 [process_send_recv_for_ge]: 2.55e-06 [handle_group_info]: 9.79999e-07 [auto_monad_reorder]: 2.366e-05 [get_jit_bprop_graph]: 4.29995e-07 [eliminate_special_op_node]: 0.0005002 [validate]: 5.49e-05 [distribtued_split]: 1.18e-06 [task_emit]: 0.00610255 [execute]: 7.47e-06 Sums parse : 0.227845s : 62.30% symbol_resolve.resolve : 0.029851s : 8.16% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000157s : 0.04% pre_cconv : 0.000004s : 0.00% abstract_specialize : 0.004814s : 1.32% pack_expand : 0.000015s : 0.00% auto_monad : 0.000125s : 0.03% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000021s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000172s : 0.05% optimize.opt_a.expand_dump_flag : 0.000012s : 0.00% optimize.opt_a.switch_simplify : 0.000214s : 0.06% optimize.opt_a.a_1 : 0.001683s : 0.46% optimize.opt_a.recompute_prepare : 0.000043s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000051s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000036s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000034s : 0.01% optimize.opt_a.parameter_eliminate : 0.000015s : 0.00% optimize.opt_a.a_2 : 0.000508s : 0.14% optimize.opt_a.accelerated_algorithm : 0.000048s : 0.01% optimize.opt_a.pynative_shard : 0.000006s : 0.00% optimize.opt_a.auto_parallel : 0.000017s : 0.00% optimize.opt_a.parallel : 0.000029s : 0.01% optimize.opt_a.merge_comm : 0.000018s : 0.00% optimize.opt_a.allreduce_fusion : 0.000007s : 0.00% optimize.opt_a.virtual_dataset : 0.000030s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000025s : 0.01% optimize.opt_a.virtual_output : 0.000024s : 0.01% optimize.opt_a.merge_forward : 0.000042s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000066s : 0.02% optimize.opt_a.meta_fg_expand : 0.000039s : 0.01% optimize.opt_a.meta_fg_expand.resolve : 0.007661s : 2.09% optimize.opt_a.after_resolve : 0.000124s : 0.03% optimize.opt_a.a_after_grad : 0.000283s : 0.08% optimize.opt_a.renormalize : 0.082733s : 22.62% optimize.opt_a.real_op_eliminate : 0.000083s : 0.02% optimize.opt_a.auto_monad_grad : 0.000113s : 0.03% optimize.opt_a.auto_monad_eliminator : 0.000168s : 0.05% optimize.opt_a.cse : 0.000459s : 0.13% optimize.opt_a.a_3 : 0.000598s : 0.16% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000081s : 0.02% optimize.convert_after_rewriter : 0.000018s : 0.00% optimize.order_py_execute_after_rewriter : 0.000012s : 0.00% optimize.opt_b.b_1 : 0.000473s : 0.13% optimize.opt_b.b_2 : 0.000007s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000022s : 0.01% optimize.cconv : 0.000022s : 0.01% optimize.opt_after_cconv.c_1 : 0.000006s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000008s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.00% optimize.tuple_transform.d_1 : 0.000017s : 0.00% optimize.tuple_transform.d_2 : 0.000007s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.00% optimize.add_recomputation : 0.000053s : 0.01% optimize.cse_after_recomputation.cse : 0.000010s : 0.00% optimize.environ_conv : 0.000020s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000003s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000003s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000024s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000500s : 0.14% validate : 0.000055s : 0.02% distribtued_split : 0.000001s : 0.00% task_emit : 0.006103s : 1.67% execute : 0.000007s : 0.00% Time group info: ------[substitution.] 0.037279 459 0.01% : 0.000004s : 5: substitution.float_depend_g_call 0.03% : 0.000011s : 14: substitution.float_tuple_getitem_switch 96.84% : 0.036100s : 50: substitution.getattr_setattr_resolve 0.01% : 0.000005s : 4: substitution.graph_param_transform 0.01% : 0.000003s : 3: substitution.incorporate_call 0.03% : 0.000013s : 3: substitution.incorporate_call_switch 2.02% : 0.000755s : 79: substitution.inline 0.02% : 0.000006s : 10: substitution.less_batch_normalization 0.13% : 0.000049s : 24: substitution.meta_unpack_prepare 0.03% : 0.000012s : 11: substitution.minmaximum_grad 0.01% : 0.000004s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 4: substitution.partial_unused_args_eliminate 0.02% : 0.000007s : 47: substitution.remove_not_recompute_node 0.19% : 0.000072s : 46: substitution.replace_applicator 0.03% : 0.000010s : 30: substitution.replace_old_param 0.01% : 0.000004s : 2: substitution.reset_defer_inline 0.02% : 0.000007s : 8: substitution.set_cell_output_no_recompute 0.02% : 0.000007s : 5: substitution.specialize_transform 0.03% : 0.000010s : 8: substitution.switch_simplify 0.08% : 0.000029s : 8: substitution.transpose_eliminate 0.11% : 0.000040s : 15: substitution.tuple_list_convert_item_index_to_positive 0.04% : 0.000016s : 15: substitution.tuple_list_get_item_const_eliminator 0.06% : 0.000022s : 15: substitution.tuple_list_get_item_depend_reorder 0.19% : 0.000071s : 33: substitution.tuple_list_get_item_eliminator 0.06% : 0.000022s : 15: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.082718 6 94.56% : 0.078222s : 3: renormalize.infer 5.44% : 0.004496s : 3: renormalize.specialize ------[replace.] 0.001021 99 58.46% : 0.000597s : 42: replace.getattr_setattr_resolve 23.14% : 0.000236s : 39: replace.inline 4.47% : 0.000046s : 2: replace.meta_unpack_prepare 8.74% : 0.000089s : 8: replace.switch_simplify 0.93% : 0.000009s : 2: replace.transpose_eliminate 4.26% : 0.000044s : 6: replace.tuple_list_get_item_eliminator ------[match.] 0.036564 99 98.20% : 0.035905s : 42: match.getattr_setattr_resolve 1.60% : 0.000585s : 39: match.inline 0.10% : 0.000036s : 2: match.meta_unpack_prepare 0.03% : 0.000010s : 8: match.switch_simplify 0.02% : 0.000009s : 2: match.transpose_eliminate 0.05% : 0.000020s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004525 85 66.09% : 0.002991s : 36: func_graph_cloner_run.FuncGraphClonerGraph 33.91% : 0.001534s : 49: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.041457 255 2.70% : 0.001121s : 104: opt.transform.opt_a 1.07% : 0.000442s : 92: opt.transform.opt_b 89.74% : 0.037203s : 10: opt.transform.opt_resolve 0.32% : 0.000132s : 1: opt.transforms.meta_unpack_prepare 6.07% : 0.002518s : 40: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000005s : 2: opt.transforms.opt_b 0.05% : 0.000022s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000010s : 3: opt.transforms.special_op_eliminate [INFO] GE(11300,python3.7):2024-01-11-05:42:50.404.381 [scalable_config.cc:55][EVENT]14719 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(11300,python3.7):2024-01-11-05:42:50.491.000 [graph_var_manager.cc:1424][EVENT]14719 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(11300,python3.7):2024-01-11-05:42:50.491.125 [graph_manager.cc:1248][EVENT]14719 PreRun:PreRun start: graph node size 4, session id 1, graph id 0, graph name online. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:50.492.089 [atrace_api.c:28](tid:14719) AtraceCreate start [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:50.492.236 [trace_rb_log.c:84](tid:14719) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:50.492.252 [atrace_api.c:32](tid:14719) AtraceCreate end [INFO] TDT(11300,python3.7):2024-01-11-05:42:50.492.284 [client_manager.cpp:157][SetProfilingCallback][tid:14719] [TsdClient] set profiling callback success [INFO] GE(11300,python3.7):2024-01-11-05:42:50.493.281 [parallel_partitioner.cc:165][EVENT]14719 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [29] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.493.327 [parallel_partitioner.cc:178][EVENT]14719 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [19] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.493.394 [graph_prepare.cc:1378][EVENT]14719 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [11] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.096 [graph_manager.cc:1050][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [733] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.127 [graph_manager.cc:1052][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.304 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.338 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [5] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.412 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [61] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.426 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.525 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [22] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.539 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.562 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [13] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.679 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.494.700 [graph_manager.cc:1054][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [560] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.502.125 [graph_manager.cc:1055][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7391] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.256 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of AssertPass is [4] micro second, call num is [8] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.286 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.298 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of MergePass is [7] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.308 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of InferShapePass is [352] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.317 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [17] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.326 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [4] micro second, call num is [8] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.334 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [27] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.342 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [19] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.503.350 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of InferValuePass is [7] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.505.650 [graph_manager.cc:1056][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3485] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.505.721 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of CondRemovePass is [5] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.505.740 [graph_prepare.cc:1982][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [52] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.194 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [8] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.220 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.231 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.241 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of InferShapePass is [264] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.249 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [9] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.257 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [1] micro second, call num is [8] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.265 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.274 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.282 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of InferValuePass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.319 [graph_prepare.cc:1983][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [565] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.345 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.357 [graph_prepare.cc:1984][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [22] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.371 [graph_prepare.cc:1985][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.392 [graph_prepare.cc:1986][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [9] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.404 [graph_prepare.cc:1987][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.418 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.430 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.443 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.542 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.555 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of CondPass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.564 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrintOpPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.572 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.581 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.589 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.597 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [9] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.605 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.613 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of StopGradientPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.621 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.629 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.637 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SnapshotPass is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.645 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.653 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.670 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.679 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of IdentityPass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.702 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [11] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.715 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.747 [graph_prepare.cc:1988][EVENT]14719 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [335] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.506.760 [graph_manager.cc:1065][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [1073] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.524.194 [graph_manager.cc:1077][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [17415] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.524.271 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [7] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.524.323 [graph_manager.cc:1080][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [88] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.654 [graph_manager.cc:1081][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [4313] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.698 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.714 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.726 [graph_manager.cc:1082][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [34] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.757 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.773 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.787 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.897 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [100] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.915 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.965 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [40] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.528.982 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.021 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [29] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.060 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [17] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.114 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [43] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.175 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [49] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.195 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [8] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.208 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.217 [graph_manager.cc:2700][EVENT]14719 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [465] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.357 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of EnterPass is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.372 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of AddNPass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.382 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.391 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.399 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.407 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of CastRemovePass is [23] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.415 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.424 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [5] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.432 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.440 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [4] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.448 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.456 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.464 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [10] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.472 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [6] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.480 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [2] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.489 [graph_manager.cc:2741][EVENT]14719 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [255] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.507 [graph_manager.cc:2752][EVENT]14719 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.531 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.543 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.560 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [8] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.575 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.587 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.599 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.626 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [18] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.641 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.653 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.663 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.676 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.687 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.705 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.718 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.726 [graph_manager.cc:2810][EVENT]14719 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [200] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.757 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.769 [graph_manager.cc:2821][EVENT]14719 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [35] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.799 [graph_manager.cc:1087][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [1056] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.938 [graph_manager.cc:1088][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [124] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.529.982 [graph_manager.cc:1089][EVENT]14719 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [24] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.530.001 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.530.030 [graph_manager.cc:1097][EVENT]14719 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.530.051 [graph_manager.cc:3325][EVENT]14719 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.049 [engine_place.cc:144][EVENT]14719 Run:The time cost of AIcoreEngine::CheckSupported is [870] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.079 [engine_place.cc:144][EVENT]14719 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [9] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.089 [engine_place.cc:144][EVENT]14719 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [10] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.186 [graph_manager.cc:3351][EVENT]14719 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [1122] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.205 [graph_manager.cc:3364][EVENT]14719 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.278 [engine_partitioner.cc:1139][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [19] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.296 [engine_partitioner.cc:1142][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [6] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.466 [engine_partitioner.cc:1148][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [160] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.511 [engine_partitioner.cc:1155][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [31] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.561 [engine_partitioner.cc:1164][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [39] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.598 [graph_manager.cc:3405][EVENT]14719 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [380] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.531.618 [graph_manager.cc:3412][EVENT]14719 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.545 [graph_manager.cc:3422][EVENT]14719 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [9914] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.583 [graph_manager.cc:3428][EVENT]14719 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.713 [graph_manager.cc:3467][EVENT]14719 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [108] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.731 [graph_manager.cc:3377][EVENT]14719 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [10513] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.747 [graph_manager.cc:1106][EVENT]14719 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [11702] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.758 [graph_manager.cc:1115][EVENT]14719 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.788 [graph_manager.cc:1130][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.829 [graph_manager.cc:1131][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [18] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.857 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [10] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.875 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.885 [graph_manager.cc:2837][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [40] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.968 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.981 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.990 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.541.999 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.007 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [6] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.016 [base_pass.cc:339][EVENT]14719 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [7] micro second, call num is [4] [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.025 [graph_manager.cc:2864][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [123] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.046 [graph_manager.cc:2872][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [12] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.066 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.081 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.096 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [6] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.110 [compile_nodes_pass.cc:88][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.120 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.129 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.235 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [97] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.267 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [19] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.281 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.301 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.314 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.322 [graph_manager.cc:2927][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [259] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.335 [graph_manager.cc:2937][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [4] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.350 [graph_manager.cc:2943][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [6] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.542.362 [graph_manager.cc:2950][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.552.996 [graph_manager.cc:2958][EVENT]14719 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [45] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.045 [graph_manager.cc:1132][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11201] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.121 [graph_manager.cc:1135][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [59] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.174 [graph_manager.cc:2975][EVENT]14719 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [34] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.219 [graph_manager.cc:2981][EVENT]14719 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [31] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.235 [pass_manager.cc:82][EVENT]14719 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.245 [graph_manager.cc:2986][EVENT]14719 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [14] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.255 [graph_manager.cc:1136][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [115] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.379 [graph_manager.cc:3555][EVENT]14719 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [87] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.488 [engine_partitioner.cc:1139][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.506 [engine_partitioner.cc:1142][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.637 [engine_partitioner.cc:1148][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [121] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.672 [engine_partitioner.cc:1155][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [21] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.714 [engine_partitioner.cc:1164][EVENT]14719 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [30] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.553.749 [graph_builder.cc:865][EVENT]14719 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [296] micro second. [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:50.554.267 [logger.cc:1071] 14719 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.554.311 [task_generator.cc:804][EVENT]14719 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [189] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.554.390 [task_generator.cc:805][EVENT]14719 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [64] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.179 [task_generator.cc:814][EVENT]14719 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [773] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.195 [task_generator.cc:954][EVENT]14719 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1074] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.267 [task_generator.cc:967][EVENT]14719 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [45] micro second. [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:50.555.286 [logger.cc:1084] 14719 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.673 [graph_manager.cc:1152][EVENT]14719 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2390] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.699 [graph_manager.cc:1164][EVENT]14719 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.744 [graph_manager.cc:1271][EVENT]14719 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [62594] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.555.756 [graph_manager.cc:1272][EVENT]14719 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:50.556.067 [atrace_api.c:93](tid:14719) AtraceDestroy start [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:50.556.091 [atrace_api.c:95](tid:14719) AtraceDestroy end [INFO] GE(11300,python3.7):2024-01-11-05:42:50.562.779 [graph_converter.cc:838][EVENT]14719 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [2049] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.562.958 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of ZeroCopy is [131] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.563.591 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CEM is [606] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.563.836 [copy_flow_launch_fuse.cc:395][EVENT]14719 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [210] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.563.859 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [242] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.148 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [277] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.189 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [17] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.229 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of ZeroCopy is [28] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.462 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CEM is [218] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.557 [copy_flow_launch_fuse.cc:395][EVENT]14719 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [76] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.572 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [92] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.614 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [22] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.625 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.654 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of ZeroCopy is [19] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.745 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CEM is [81] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.822 [copy_flow_launch_fuse.cc:395][EVENT]14719 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [66] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.833 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [77] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.861 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [20] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.870 [base_optimizer.cc:70][EVENT]14719 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.564.884 [graph_converter.cc:849][EVENT]14719 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [2063] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.565.142 [graph_converter.cc:853][EVENT]14719 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [249] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.565.977 [graph_converter.cc:857][EVENT]14719 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [820] micro second. [INFO] GE(11300,python3.7):2024-01-11-05:42:50.566.164 [graph_converter.cc:862][EVENT]14719 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [158] micro second. TotalTime = 0.108039, [20] [parse]: 0.00154489 [symbol_resolve]: 0.0131893, [1] [Cycle 1]: 0.0131148, [1] [resolve]: 0.0130935 [combine_like_graphs]: 9.20001e-07 [graph_reusing]: 3.43e-06 [meta_unpack_prepare]: 0.00014772 [pre_cconv]: 8.2e-07 [abstract_specialize]: 0.00427713 [pack_expand]: 1.768e-05 [auto_monad]: 8.997e-05 [inline]: 1.47e-06 [pre_auto_parallel]: 1.032e-05 [pipeline_split]: 3.27e-06 [optimize]: 0.0831452, [35] [py_interpret_to_execute]: 4.58e-06 [rewriter_before_opt_a]: 0.00026024 [opt_a]: 0.0818716, [4] [Cycle 1]: 0.0396001, [30] [expand_dump_flag]: 4.66e-06 [switch_simplify]: 2.371e-05 [a_1]: 0.00041656 [recompute_prepare]: 9.18e-06 [updatestate_depend_eliminate]: 1.094e-05 [updatestate_assign_eliminate]: 7.62e-06 [updatestate_loads_eliminate]: 6.5e-06 [parameter_eliminate]: 5.18e-06 [a_2]: 7.993e-05 [accelerated_algorithm]: 5.64e-06 [pynative_shard]: 1.67e-06 [auto_parallel]: 3.65e-06 [parallel]: 9.87e-06 [merge_comm]: 3.77e-06 [allreduce_fusion]: 2.1e-06 [virtual_dataset]: 5.39e-06 [get_grad_eliminate_]: 4.55e-06 [virtual_output]: 3.96001e-06 [merge_forward]: 9.26e-06 [cell_reuse_recompute_pass]: 9.29998e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.187e-05 [meta_fg_expand]: 0.0073568, [1] [Cycle 1]: 0.00316655, [1] [resolve]: 0.0031477 [after_resolve]: 4.192e-05 [a_after_grad]: 0.00011408 [renormalize]: 0.0308102 [real_op_eliminate]: 2.821e-05 [auto_monad_grad]: 4.294e-05 [auto_monad_eliminator]: 5.772e-05 [cse]: 0.00014171 [a_3]: 0.00020542 [Cycle 2]: 0.0362461, [30] [expand_dump_flag]: 2.48e-06 [switch_simplify]: 8.668e-05 [a_1]: 0.00049878 [recompute_prepare]: 1.015e-05 [updatestate_depend_eliminate]: 1.184e-05 [updatestate_assign_eliminate]: 8.34e-06 [updatestate_loads_eliminate]: 8.11e-06 [parameter_eliminate]: 3.98e-06 [a_2]: 0.00012092 [accelerated_algorithm]: 1.275e-05 [pynative_shard]: 1.34e-06 [auto_parallel]: 4.7e-06 [parallel]: 4.35e-06 [merge_comm]: 2.55999e-06 [allreduce_fusion]: 1.47001e-06 [virtual_dataset]: 7.47e-06 [get_grad_eliminate_]: 6.20999e-06 [virtual_output]: 5.79e-06 [merge_forward]: 9.54e-06 [cell_reuse_recompute_pass]: 4.00003e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.51e-05 [meta_fg_expand]: 0.0107939, [3] [Cycle 1]: 0.00032553, [1] [resolve]: 0.00030698 [Cycle 1]: 0.00164975, [1] [resolve]: 0.00162955 [Cycle 1]: 0.00032745, [1] [resolve]: 0.00030872 [after_resolve]: 5.817e-05 [a_after_grad]: 0.00013752 [renormalize]: 0.0237498 [real_op_eliminate]: 3.111e-05 [auto_monad_grad]: 4.193e-05 [auto_monad_eliminator]: 6.207e-05 [cse]: 0.00014334 [a_3]: 0.00023759 [Cycle 3]: 0.00290352, [30] [expand_dump_flag]: 2.67e-06 [switch_simplify]: 9.043e-05 [a_1]: 0.00063011 [recompute_prepare]: 1.239e-05 [updatestate_depend_eliminate]: 1.359e-05 [updatestate_assign_eliminate]: 1.004e-05 [updatestate_loads_eliminate]: 9.79e-06 [parameter_eliminate]: 3.41e-06 [a_2]: 0.00015125 [accelerated_algorithm]: 1.451e-05 [pynative_shard]: 9.70002e-07 [auto_parallel]: 4.34e-06 [parallel]: 4.73e-06 [merge_comm]: 3.19001e-06 [allreduce_fusion]: 1.64e-06 [virtual_dataset]: 8.43e-06 [get_grad_eliminate_]: 7.74e-06 [virtual_output]: 6.94e-06 [merge_forward]: 1.093e-05 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.906e-05 [meta_fg_expand]: 3.316e-05 [after_resolve]: 1.142e-05 [a_after_grad]: 1.373e-05 [renormalize]: 0.00147591 [real_op_eliminate]: 1.358e-05 [auto_monad_grad]: 5.41e-06 [auto_monad_eliminator]: 2.327e-05 [cse]: 9.86e-05 [a_3]: 7.1e-05 [Cycle 4]: 0.00073155, [30] [expand_dump_flag]: 1.3e-06 [switch_simplify]: 8.72e-06 [a_1]: 0.00015475 [recompute_prepare]: 1.022e-05 [updatestate_depend_eliminate]: 1.385e-05 [updatestate_assign_eliminate]: 9.88e-06 [updatestate_loads_eliminate]: 9.92e-06 [parameter_eliminate]: 1.86e-06 [a_2]: 0.00014829 [accelerated_algorithm]: 1.468e-05 [pynative_shard]: 1.37999e-06 [auto_parallel]: 3.41e-06 [parallel]: 3.61e-06 [merge_comm]: 2.61e-06 [allreduce_fusion]: 1.66e-06 [virtual_dataset]: 8.48e-06 [get_grad_eliminate_]: 7.47e-06 [virtual_output]: 7.19e-06 [merge_forward]: 1.136e-05 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.85e-05 [meta_fg_expand]: 8.01e-06 [after_resolve]: 1.067e-05 [a_after_grad]: 1.398e-05 [renormalize]: 6.99947e-08 [real_op_eliminate]: 7.78001e-06 [auto_monad_grad]: 2.14999e-06 [auto_monad_eliminator]: 2.097e-05 [cse]: 5.094e-05 [a_3]: 6.292e-05 [py_interpret_to_execute_after_opt_a]: 3.9e-06 [slice_cell_reuse_recomputed_activation]: 2.39e-06 [rewriter_after_opt_a]: 6.528e-05 [convert_after_rewriter]: 1.703e-05 [order_py_execute_after_rewriter]: 1.159e-05 [opt_b]: 0.000569, [2] [Cycle 1]: 0.00048033, [7] [b_1]: 0.00042203 [b_2]: 3.94999e-06 [updatestate_depend_eliminate]: 3.75e-06 [updatestate_assign_eliminate]: 2.72e-06 [updatestate_loads_eliminate]: 2.45e-06 [renormalize]: 3.40005e-07 [cse]: 1.236e-05 [Cycle 2]: 7.929e-05, [7] [b_1]: 3.78e-05 [b_2]: 2.39e-06 [updatestate_depend_eliminate]: 2.35999e-06 [updatestate_assign_eliminate]: 2.01e-06 [updatestate_loads_eliminate]: 1.96e-06 [renormalize]: 7.99992e-08 [cse]: 7.97e-06 [cconv]: 2.212e-05 [opt_after_cconv]: 5.08e-05, [1] [Cycle 1]: 4.657e-05, [7] [c_1]: 5.58e-06 [parameter_eliminate]: 1.70001e-06 [updatestate_depend_eliminate]: 2.32e-06 [updatestate_assign_eliminate]: 1.93001e-06 [updatestate_loads_eliminate]: 1.83001e-06 [cse]: 7.68e-06 [renormalize]: 3.09999e-07 [remove_dup_value]: 1.19e-05 [tuple_transform]: 3.857e-05, [1] [Cycle 1]: 3.477e-05, [3] [d_1]: 1.607e-05 [d_2]: 6.91001e-06 [renormalize]: 1.39997e-07 [add_cache_embedding]: 1.085e-05 [add_recomputation]: 4.133e-05 [cse_after_recomputation]: 1.747e-05, [1] [Cycle 1]: 1.318e-05, [1] [cse]: 8.68e-06 [environ_conv]: 7.88e-06 [label_micro_interleaved_index]: 2.56e-06 [label_fine_grained_interleaved_index]: 2.62e-06 [assign_add_opt]: 1.75e-06 [slice_recompute_activation]: 2.32999e-06 [micro_interleaved_order_control]: 2.31e-06 [full_micro_interleaved_order_control]: 2.21e-06 [comp_comm_scheduling]: 2.31e-06 [reorder_send_recv_between_fp_bp]: 2.2e-06 [comm_op_add_attrs]: 1.08e-06 [add_comm_op_reuse_tag]: 9.80006e-07 [overlap_opt_shard_in_pipeline]: 1.24001e-06 [grouped_pairwise_exchange_alltoall]: 1.34e-06 [overlap_recompute_and_grad_model_parallel]: 1.85e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.7e-07 [split_matmul_comm_elemetwise]: 2.63999e-06 [split_layernorm_comm]: 1.89e-06 [process_send_recv_for_ge]: 1.12e-06 [handle_group_info]: 1.06001e-06 [auto_monad_reorder]: 1.534e-05 [get_jit_bprop_graph]: 4.29995e-07 [eliminate_special_op_node]: 0.00050045 [validate]: 3.25e-05 [distribtued_split]: 1.41e-06 [task_emit]: 0.00485574 [execute]: 7.43999e-06 Sums parse : 0.001545s : 1.68% symbol_resolve.resolve : 0.013093s : 14.27% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000148s : 0.16% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.004277s : 4.66% pack_expand : 0.000018s : 0.02% auto_monad : 0.000090s : 0.10% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000010s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000005s : 0.00% optimize.rewriter_before_opt_a : 0.000260s : 0.28% optimize.opt_a.expand_dump_flag : 0.000011s : 0.01% optimize.opt_a.switch_simplify : 0.000210s : 0.23% optimize.opt_a.a_1 : 0.001700s : 1.85% optimize.opt_a.recompute_prepare : 0.000042s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000050s : 0.05% optimize.opt_a.updatestate_assign_eliminate : 0.000036s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000034s : 0.04% optimize.opt_a.parameter_eliminate : 0.000014s : 0.02% optimize.opt_a.a_2 : 0.000500s : 0.55% optimize.opt_a.accelerated_algorithm : 0.000048s : 0.05% optimize.opt_a.pynative_shard : 0.000005s : 0.01% optimize.opt_a.auto_parallel : 0.000016s : 0.02% optimize.opt_a.parallel : 0.000023s : 0.02% optimize.opt_a.merge_comm : 0.000012s : 0.01% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 0.000030s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000026s : 0.03% optimize.opt_a.virtual_output : 0.000024s : 0.03% optimize.opt_a.merge_forward : 0.000041s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000065s : 0.07% optimize.opt_a.meta_fg_expand : 0.000041s : 0.04% optimize.opt_a.meta_fg_expand.resolve : 0.005393s : 5.88% optimize.opt_a.after_resolve : 0.000122s : 0.13% optimize.opt_a.a_after_grad : 0.000279s : 0.30% optimize.opt_a.renormalize : 0.056036s : 61.06% optimize.opt_a.real_op_eliminate : 0.000081s : 0.09% optimize.opt_a.auto_monad_grad : 0.000092s : 0.10% optimize.opt_a.auto_monad_eliminator : 0.000164s : 0.18% optimize.opt_a.cse : 0.000435s : 0.47% optimize.opt_a.a_3 : 0.000577s : 0.63% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000065s : 0.07% optimize.convert_after_rewriter : 0.000017s : 0.02% optimize.order_py_execute_after_rewriter : 0.000012s : 0.01% optimize.opt_b.b_1 : 0.000460s : 0.50% optimize.opt_b.b_2 : 0.000006s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000020s : 0.02% optimize.cconv : 0.000022s : 0.02% optimize.opt_after_cconv.c_1 : 0.000006s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000008s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.01% optimize.tuple_transform.d_1 : 0.000016s : 0.02% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000041s : 0.05% optimize.cse_after_recomputation.cse : 0.000009s : 0.01% optimize.environ_conv : 0.000008s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000015s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000500s : 0.55% validate : 0.000033s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.004856s : 5.29% execute : 0.000007s : 0.01% Time group info: ------[substitution.] 0.018292 459 0.02% : 0.000004s : 5: substitution.float_depend_g_call 0.06% : 0.000010s : 14: substitution.float_tuple_getitem_switch 93.70% : 0.017139s : 50: substitution.getattr_setattr_resolve 0.03% : 0.000005s : 4: substitution.graph_param_transform 0.02% : 0.000003s : 3: substitution.incorporate_call 0.01% : 0.000002s : 3: substitution.incorporate_call_switch 4.13% : 0.000755s : 79: substitution.inline 0.03% : 0.000006s : 10: substitution.less_batch_normalization 0.19% : 0.000034s : 24: substitution.meta_unpack_prepare 0.07% : 0.000012s : 11: substitution.minmaximum_grad 0.02% : 0.000004s : 5: substitution.partial_eliminate 0.01% : 0.000001s : 4: substitution.partial_unused_args_eliminate 0.03% : 0.000006s : 47: substitution.remove_not_recompute_node 0.38% : 0.000070s : 46: substitution.replace_applicator 0.05% : 0.000009s : 30: substitution.replace_old_param 0.02% : 0.000004s : 2: substitution.reset_defer_inline 0.03% : 0.000006s : 8: substitution.set_cell_output_no_recompute 0.04% : 0.000008s : 5: substitution.specialize_transform 0.06% : 0.000011s : 8: substitution.switch_simplify 0.15% : 0.000027s : 8: substitution.transpose_eliminate 0.22% : 0.000041s : 15: substitution.tuple_list_convert_item_index_to_positive 0.09% : 0.000016s : 15: substitution.tuple_list_get_item_const_eliminator 0.12% : 0.000023s : 15: substitution.tuple_list_get_item_depend_reorder 0.39% : 0.000072s : 33: substitution.tuple_list_get_item_eliminator 0.12% : 0.000022s : 15: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.056022 6 92.29% : 0.051702s : 3: renormalize.infer 7.71% : 0.004320s : 3: renormalize.specialize ------[replace.] 0.001018 99 58.39% : 0.000595s : 42: replace.getattr_setattr_resolve 22.97% : 0.000234s : 39: replace.inline 5.12% : 0.000052s : 2: replace.meta_unpack_prepare 8.43% : 0.000086s : 8: replace.switch_simplify 0.91% : 0.000009s : 2: replace.transpose_eliminate 4.17% : 0.000042s : 6: replace.tuple_list_get_item_eliminator ------[match.] 0.017631 99 96.33% : 0.016984s : 42: match.getattr_setattr_resolve 3.32% : 0.000586s : 39: match.inline 0.11% : 0.000020s : 2: match.meta_unpack_prepare 0.06% : 0.000011s : 8: match.switch_simplify 0.05% : 0.000008s : 2: match.transpose_eliminate 0.12% : 0.000021s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004472 85 67.20% : 0.003005s : 36: func_graph_cloner_run.FuncGraphClonerGraph 32.80% : 0.001467s : 49: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.022439 255 4.87% : 0.001092s : 104: opt.transform.opt_a 1.91% : 0.000428s : 92: opt.transform.opt_b 81.25% : 0.018232s : 10: opt.transform.opt_resolve 0.55% : 0.000124s : 1: opt.transforms.meta_unpack_prepare 11.25% : 0.002524s : 40: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.02% : 0.000005s : 2: opt.transforms.opt_b 0.10% : 0.000021s : 2: opt.transforms.opt_trans_graph 0.04% : 0.000010s : 3: opt.transforms.special_op_eliminate . ============================== 1 passed in 20.88s ============================== [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.392.908 [status:INIT] [ge_api.cc:463]11300 ~Session:Start to destruct session. [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.392.987 [status:RUNNING] [ge_api.cc:475]11300 ~Session:Session id is 0 [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.392.999 [status:RUNNING] [ge_api.cc:476]11300 ~Session:Destroying session [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.393.869 [status:STOP] [ge_api.cc:491]11300 ~Session:Session Destructor finished [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.393.899 [status:INIT] [ge_api.cc:301]11300 GEFinalize:GEFinalize start [INFO] GE(11300,python3.7):2024-01-11-05:42:52.393.965 [execution_runtime.cc:80][EVENT]11300 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(11300,python3.7):2024-01-11-05:42:52.393.983 [execution_runtime.cc:92][EVENT]11300 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.393.995 [status:RUNNING] [ge_api.cc:313]11300 GEFinalize:Finalizing environment [INFO] TUNE(11300,python3.7):2024-01-11-05:42:52.679.505 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:11300]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(11300,python3.7):2024-01-11-05:42:52.679.563 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:11300]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(11300,python3.7):2024-01-11-05:42:52.681.022 [gelib.cc:324][EVENT]11300 SystemFinalize:Online infer finalize GELib success. [TRACE] GE(11300,python3.7):2024-01-11-05:42:52.786.535 [status:STOP] [ge_api.cc:341]11300 GEFinalize:GEFinalize finished [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.218.625 [process_mode_manager.cpp:184][Close][tid:11300] [TsdClient] Close [deviceId=0][sessionId=1] hccp and computer enter [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.218.664 [version_verify.cpp:112][SpecialFeatureCheck][tid:11300] VersionVerify: previous type[7], supported [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.218.708 [process_mode_manager.cpp:192][Close][tid:11300] [TsdClient][deviceId=0] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.249.647 [process_mode_manager.cpp:197][Close][tid:11300] [TsdClient][logicDeviceId_=0]has recv close hccp and computer process respond [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.249.661 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:11300] enter into CloseInHost deviceid[0] [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.249.673 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:11300] host cpu not support [INFO] TDT(11300,python3.7):2024-01-11-05:42:53.249.708 [process_mode_manager.cpp:208][Close][tid:11300] [TsdClient][deviceId=0] [sessionId=1] close hccp and computer process success [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:53.249.722 [atrace_api.c:93](tid:11300) AtraceDestroy start [INFO] ATRACE(11300,python3.7):2024-01-11-05:42:53.249.738 [atrace_api.c:95](tid:11300) AtraceDestroy end [INFO] PROFILING(11300,python3.7):2024-01-11-05:42:53.249.761 [msprofiler_impl.cpp:156] >>> (tid:11300) ProfNotifySetDevice called, is open: 0, devId: 0 [INFO] RUNTIME(11300,python3.7):2024-01-11-05:42:54.821.968 [runtime.cc:1737] 11300 ~Runtime: deconstruct runtime.