============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_002/sault/config/pytest.ini plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3 [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:05.098.789 [trace_attr.c:105](tid:65168) platform is 1. [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:05.098.940 [trace_recorder.c:114](tid:65168) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:05.098.963 [trace_signal.c:133](tid:65168) register signal handler for signo 2 succeed. [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:05.098.974 [trace_signal.c:133](tid:65168) register signal handler for signo 15 succeed. [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:05.476.286 [runtime.cc:1159] 65168 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:05.476.335 [runtime.cc:4719] 65168 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_hshrink.py [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.704.946 [process_mode_manager.cpp:109][OpenProcess][tid:65168] [ProcessModeManager] enter into open process deviceId[1] rankSize[0] [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.707.243 [process_mode_manager.cpp:379][InitTsdClient][tid:65168] [TsdClient] deviceId[1] begin to init hdc client [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.202 [version_verify.cpp:34][SetVersionInfo][tid:65168] VersionVerify: send client version to server [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.235 [version_verify.cpp:50][SetVersionInfo][tid:65168] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.248 [version_verify.cpp:50][SetVersionInfo][tid:65168] send feature_info:{msg_type:37, 
features:{check before send open qs message,}} [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.533 [version_verify.cpp:66][PeerVersionCheck][tid:65168] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.549 [version_verify.cpp:87][ParseVersionInfo][tid:65168] VersionVerify: pass client version info success [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.558 [hdc_client.cpp:276][CheckHdcConnection][tid:65168] Service[2] create hdc success [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.570 [version_verify.cpp:120][SpecialFeatureCheck][tid:65168] VersionVerify: new type[35], supported [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.624 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:65168] [TsdClient][deviceId=1] [sessionId=1] wait package info respond [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.716.836 [process_mode_manager.cpp:379][InitTsdClient][tid:65168] [TsdClient] deviceId[1] begin to init hdc client [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.240 [version_verify.cpp:34][SetVersionInfo][tid:65168] VersionVerify: send client version to server [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.270 [version_verify.cpp:50][SetVersionInfo][tid:65168] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.280 [version_verify.cpp:50][SetVersionInfo][tid:65168] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.429 [version_verify.cpp:66][PeerVersionCheck][tid:65168] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.442 [version_verify.cpp:87][ParseVersionInfo][tid:65168] VersionVerify: pass client version info success [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.450 [hdc_client.cpp:276][CheckHdcConnection][tid:65168] Service[2] create 
hdc success [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.462 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:65168] [TsdClient] tsd get process sign successfully, procpid[65168] signSize[48] [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.488 [version_verify.cpp:112][SpecialFeatureCheck][tid:65168] VersionVerify: previous type[6], supported [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.740.510 [process_mode_manager.cpp:126][OpenProcess][tid:65168] [ProcessModeManager] deviceId[1] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.951.261 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:65168] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.951.294 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:65168] enter into OpenInHost deviceid[1] [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.951.304 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:65168] host cpu not support [INFO] TDT(65168,python3.7):2024-01-11-05:47:09.951.312 [process_mode_manager.cpp:156][OpenProcess][tid:65168] [TsdClient][deviceId=1] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:09.953.965 [device.cc:340] 65168 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:09.970.770 [npu_driver.cc:5428] 65912 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:09.970.814 [atrace_api.c:28](tid:65168) AtraceCreate start [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:09.970.923 [trace_rb_log.c:84](tid:65168) [RUNTIME_ATRACE_DEV1_TS0] create ring buffer success, buffer size : 131152. 
[INFO] ATRACE(65168,python3.7):2024-01-11-05:47:09.970.938 [atrace_api.c:32](tid:65168) AtraceCreate end
[INFO] TDT(65168,python3.7):2024-01-11-05:47:09.970.952 [client_manager.cpp:157][SetProfilingCallback][tid:65168] [TsdClient] set profiling callback success
[TRACE] GE(65168,python3.7):2024-01-11-05:47:10.123.857 [status:INIT] [ge_api.cc:144]65168 GEInitializeImpl:GEInitialize start
[INFO] PROFILING(65168,python3.7):2024-01-11-05:47:10.322.914 [msprofiler_impl.cpp:156] >>> (tid:65168) ProfNotifySetDevice called, is open: 1, devId: 1
[INFO] PROFILING(65168,python3.7):2024-01-11-05:47:10.323.014 [platform.cpp:38] >>> (tid:65168) Profiling platform version: 1.0.
[INFO] PROFILING(65168,python3.7):2024-01-11-05:47:10.323.026 [ai_drv_dev_api.cpp:384] >>> (tid:65168) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(65168,python3.7):2024-01-11-05:47:10.369.163 [status:RUNNING] [ge_api.cc:211]65168 GEInitializeImpl:Initializing environment
[INFO] GE(65168,python3.7):2024-01-11-05:47:10.369.223 [gelib.cc:98][EVENT]65168 Initialize:[GEPERFTRACE] GE Init Start
[INFO] GE(65168,python3.7):2024-01-11-05:47:10.369.443 [gelib.cc:307][EVENT]65168 SystemInitialize:Online infer init GELib success, device id :1
[INFO] DVPP(65168,python3.7):2024-01-11-05:47:10.694.634 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:65168]dvpp engine do not support
[INFO] TUNE(65168,python3.7):2024-01-11-05:47:10.697.536 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:65168]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!"
[INFO] TUNE(65168,python3.7):2024-01-11-05:47:10.697.570 [handle_manager.cpp:115][CANNKB][Tid:65168]"Start to run init functions to load dynamic python lib!"
[INFO] TUNE(65168,python3.7):2024-01-11-05:47:10.697.627 [handle_manager.cpp:407][CANNKB][Tid:65168]"Init functions of loading dynamic python lib end!"
[INFO] TUNE(65168,python3.7):2024-01-11-05:47:10.697.638 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:65168]"CANN_KB_Py has already been initialized."
[INFO] TUNE(65168,python3.7):2024-01-11-05:47:10.697.702 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:65168]"CannKbPyfuncMgr: Run PyObjectInit successfully!"
[INFO] HCCL(65168,python3.7):2024-01-11-05:47:22.625.356 [plugin_manager.cc:42][65168]hcom running normal mode.
[INFO] DVPP(65168,python3.7):2024-01-11-05:47:22.625.801 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:65168]dvpp ops kernel info store do not support
[INFO] DVPP(65168,python3.7):2024-01-11-05:47:22.625.922 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:65168]dvpp graph optimizer do not support
[INFO] DVPP(65168,python3.7):2024-01-11-05:47:23.143.138 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:65168]dvpp ops kernel builder do not support
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.151.384 [gelib.cc:169][EVENT]65168 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12782117] micro second.
[TRACE] GE(65168,python3.7):2024-01-11-05:47:23.234.376 [status:STOP] [ge_api.cc:255]65168 GEInitializeImpl:GEInitialize finished
[TRACE] GE(65168,python3.7):2024-01-11-05:47:23.234.478 [status:INIT] [ge_api.cc:398]65168 Session:Start to construct session.
[TRACE] GE(65168,python3.7):2024-01-11-05:47:23.234.493 [status:RUNNING] [ge_api.cc:408]65168 Session:Creating session [INFO] GE(65168,python3.7):2024-01-11-05:47:23.234.859 [graph_var_manager.cc:1445][EVENT]65168 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(65168,python3.7):2024-01-11-05:47:23.234.874 [graph_var_manager.cc:1424][EVENT]65168 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(65168,python3.7):2024-01-11-05:47:23.235.138 [msprofiler_impl.cpp:156] >>> (tid:65168) ProfNotifySetDevice called, is open: 1, devId: 1 [TRACE] GE(65168,python3.7):2024-01-11-05:47:23.235.931 [status:RUNNING] [ge_api.cc:411]65168 Session:Session id is 0 [TRACE] GE(65168,python3.7):2024-01-11-05:47:23.235.950 [status:STOP] [ge_api.cc:420]65168 Session:Session Constructor finished [INFO] PROFILING(65168,python3.7):2024-01-11-05:47:23.245.559 [platform.cpp:38] >>> (tid:65168) Profiling platform version: 1.0. [INFO] PROFILING(65168,python3.7):2024-01-11-05:47:23.245.604 [ai_drv_dev_api.cpp:384] >>> (tid:65168) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(65168,python3.7):2024-01-11-05:47:23.245.767 [status:INIT] [ge_api.cc:144]65168 GEInitializeImpl:GEInitialize start TotalTime = 0.38699, [20] [parse]: 0.221798 [symbol_resolve]: 0.0287705, [1] [Cycle 1]: 0.0287039, [1] [resolve]: 0.0286718 [combine_like_graphs]: 8.49999e-07 [graph_reusing]: 2.62e-06 [meta_unpack_prepare]: 0.00014575 [pre_cconv]: 3.1e-06 [abstract_specialize]: 0.00489802 [pack_expand]: 1.321e-05 [auto_monad]: 9.378e-05 [inline]: 1.71e-06 [pre_auto_parallel]: 1.347e-05 [pipeline_split]: 1.86e-06 [optimize]: 0.125443, [35] [py_interpret_to_execute]: 3.95e-06 [rewriter_before_opt_a]: 0.00018766 [opt_a]: 0.124188, [4] [Cycle 1]: 0.0852955, [30] [expand_dump_flag]: 3.41e-06 [switch_simplify]: 3.034e-05 [a_1]: 0.00043894 [recompute_prepare]: 8.92e-06 [updatestate_depend_eliminate]: 1.117e-05 [updatestate_assign_eliminate]: 
7.38e-06 [updatestate_loads_eliminate]: 6.53e-06 [parameter_eliminate]: 4.53e-06 [a_2]: 8.905e-05 [accelerated_algorithm]: 6.22e-06 [pynative_shard]: 1.02e-06 [auto_parallel]: 4.06e-06 [parallel]: 1.195e-05 [merge_comm]: 7.19e-06 [allreduce_fusion]: 1.79e-06 [virtual_dataset]: 5.57e-06 [get_grad_eliminate_]: 4.86e-06 [virtual_output]: 4.71e-06 [merge_forward]: 7.35e-06 [cell_reuse_recompute_pass]: 1.03e-06 [cell_reuse_handle_not_recompute_node_pass]: 1.249e-05 [meta_fg_expand]: 0.0265087, [1] [Cycle 1]: 0.0005824, [1] [resolve]: 0.00055909 [after_resolve]: 2.845e-05 [a_after_grad]: 4.439e-05 [renormalize]: 0.0574125 [real_op_eliminate]: 5.885e-05 [auto_monad_grad]: 3.627e-05 [auto_monad_eliminator]: 4.903e-05 [cse]: 0.00011185 [a_3]: 0.00018835 [Cycle 2]: 0.029129, [30] [expand_dump_flag]: 2.47001e-06 [switch_simplify]: 6.601e-05 [a_1]: 0.00045754 [recompute_prepare]: 1.086e-05 [updatestate_depend_eliminate]: 1.187e-05 [updatestate_assign_eliminate]: 9.62e-06 [updatestate_loads_eliminate]: 8.89e-06 [parameter_eliminate]: 3.74e-06 [a_2]: 0.00013486 [accelerated_algorithm]: 1.275e-05 [pynative_shard]: 1.29e-06 [auto_parallel]: 6.21e-06 [parallel]: 5.61e-06 [merge_comm]: 2.72e-06 [allreduce_fusion]: 2.07e-06 [virtual_dataset]: 7.81e-06 [get_grad_eliminate_]: 6.75e-06 [virtual_output]: 6.62e-06 [merge_forward]: 1.04e-05 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.63e-05 [meta_fg_expand]: 0.00703101, [3] [Cycle 1]: 0.00034659, [1] [resolve]: 0.0003258 [Cycle 1]: 0.0005013, [1] [resolve]: 0.00048271 [Cycle 1]: 0.00034305, [1] [resolve]: 0.00032657 [after_resolve]: 3.669e-05 [a_after_grad]: 5.9e-05 [renormalize]: 0.0205474 [real_op_eliminate]: 3.342e-05 [auto_monad_grad]: 4.348e-05 [auto_monad_eliminator]: 5.691e-05 [cse]: 0.00014107 [a_3]: 0.0002231 [Cycle 3]: 0.00296848, [30] [expand_dump_flag]: 3.04e-06 [switch_simplify]: 6.984e-05 [a_1]: 0.00058579 [recompute_prepare]: 1.217e-05 [updatestate_depend_eliminate]: 1.364e-05 
[updatestate_assign_eliminate]: 1.159e-05 [updatestate_loads_eliminate]: 1.078e-05 [parameter_eliminate]: 3.61e-06 [a_2]: 0.00017019 [accelerated_algorithm]: 1.638e-05 [pynative_shard]: 1.11e-06 [auto_parallel]: 6.47e-06 [parallel]: 4.74e-06 [merge_comm]: 3.66e-06 [allreduce_fusion]: 2.06001e-06 [virtual_dataset]: 9.39e-06 [get_grad_eliminate_]: 9.14e-06 [virtual_output]: 8.27e-06 [merge_forward]: 1.209e-05 [cell_reuse_recompute_pass]: 3.99996e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.025e-05 [meta_fg_expand]: 3.777e-05 [after_resolve]: 1.288e-05 [a_after_grad]: 1.505e-05 [renormalize]: 0.00156035 [real_op_eliminate]: 1.395e-05 [auto_monad_grad]: 5.22e-06 [auto_monad_eliminator]: 2.282e-05 [cse]: 9.391e-05 [a_3]: 7.954e-05 [Cycle 4]: 0.00077711, [30] [expand_dump_flag]: 1.26e-06 [switch_simplify]: 9.50001e-06 [a_1]: 0.00015966 [recompute_prepare]: 1.093e-05 [updatestate_depend_eliminate]: 1.399e-05 [updatestate_assign_eliminate]: 1.108e-05 [updatestate_loads_eliminate]: 1.098e-05 [parameter_eliminate]: 1.7e-06 [a_2]: 0.00016599 [accelerated_algorithm]: 1.507e-05 [pynative_shard]: 1.27e-06 [auto_parallel]: 3.15e-06 [parallel]: 3.37001e-06 [merge_comm]: 2.02e-06 [allreduce_fusion]: 1.58e-06 [virtual_dataset]: 9.25e-06 [get_grad_eliminate_]: 8.41e-06 [virtual_output]: 8.08e-06 [merge_forward]: 1.202e-05 [cell_reuse_recompute_pass]: 3.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.958e-05 [meta_fg_expand]: 9.5e-06 [after_resolve]: 1.121e-05 [a_after_grad]: 1.528e-05 [renormalize]: 7.0002e-08 [real_op_eliminate]: 8.54e-06 [auto_monad_grad]: 1.94e-06 [auto_monad_eliminator]: 2.038e-05 [cse]: 5.174e-05 [a_3]: 7.132e-05 [py_interpret_to_execute_after_opt_a]: 4.59e-06 [slice_cell_reuse_recomputed_activation]: 1.27e-06 [rewriter_after_opt_a]: 7.012e-05 [convert_after_rewriter]: 1.706e-05 [order_py_execute_after_rewriter]: 1.171e-05 [opt_b]: 0.00061098, [2] [Cycle 1]: 0.00051784, [7] [b_1]: 0.00046119 [b_2]: 3.92e-06 [updatestate_depend_eliminate]: 
3.48999e-06 [updatestate_assign_eliminate]: 2.61e-06 [updatestate_loads_eliminate]: 2.62e-06 [renormalize]: 3.00002e-07 [cse]: 1.103e-05 [Cycle 2]: 8.452e-05, [7] [b_1]: 4.212e-05 [b_2]: 2.39e-06 [updatestate_depend_eliminate]: 2.92e-06 [updatestate_assign_eliminate]: 2.16e-06 [updatestate_loads_eliminate]: 2e-06 [renormalize]: 7.0002e-08 [cse]: 7.78e-06 [cconv]: 1.456e-05 [opt_after_cconv]: 5.311e-05, [1] [Cycle 1]: 4.901e-05, [7] [c_1]: 5.65e-06 [parameter_eliminate]: 1.68e-06 [updatestate_depend_eliminate]: 2.72e-06 [updatestate_assign_eliminate]: 2.57e-06 [updatestate_loads_eliminate]: 1.96e-06 [cse]: 7.82e-06 [renormalize]: 2.50002e-07 [remove_dup_value]: 7.93e-06 [tuple_transform]: 3.485e-05, [1] [Cycle 1]: 3.116e-05, [3] [d_1]: 1.242e-05 [d_2]: 6.54e-06 [renormalize]: 1.90004e-07 [add_cache_embedding]: 7.81e-06 [add_recomputation]: 3.78e-05 [cse_after_recomputation]: 1.78e-05, [1] [Cycle 1]: 1.378e-05, [1] [cse]: 8.93e-06 [environ_conv]: 1.63e-05 [label_micro_interleaved_index]: 1.99e-06 [label_fine_grained_interleaved_index]: 1.44e-06 [assign_add_opt]: 1.89e-06 [slice_recompute_activation]: 1.2e-06 [micro_interleaved_order_control]: 1.02e-06 [full_micro_interleaved_order_control]: 1.28e-06 [comp_comm_scheduling]: 1.3e-06 [reorder_send_recv_between_fp_bp]: 1.55e-06 [comm_op_add_attrs]: 6.00005e-07 [add_comm_op_reuse_tag]: 5.70006e-07 [overlap_opt_shard_in_pipeline]: 9.5e-07 [grouped_pairwise_exchange_alltoall]: 7.00005e-07 [overlap_recompute_and_grad_model_parallel]: 1.03001e-06 [overlap_grad_matmul_and_grad_allreduce]: 4.69998e-07 [split_matmul_comm_elemetwise]: 1.32e-06 [split_layernorm_comm]: 9.5e-07 [process_send_recv_for_ge]: 1.8e-06 [handle_group_info]: 5.4e-07 [auto_monad_reorder]: 1.721e-05 [get_jit_bprop_graph]: 2.80001e-07 [eliminate_special_op_node]: 0.00050427 [validate]: 4.139e-05 [distribtued_split]: 8.49999e-07 [task_emit]: 0.00501605 [execute]: 6.46e-06 Sums parse : 0.221798s : 63.75% symbol_resolve.resolve : 0.028672s : 8.24% 
combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000146s : 0.04% pre_cconv : 0.000003s : 0.00% abstract_specialize : 0.004898s : 1.41% pack_expand : 0.000013s : 0.00% auto_monad : 0.000094s : 0.03% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000013s : 0.00% pipeline_split : 0.000002s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000188s : 0.05% optimize.opt_a.expand_dump_flag : 0.000010s : 0.00% optimize.opt_a.switch_simplify : 0.000176s : 0.05% optimize.opt_a.a_1 : 0.001642s : 0.47% optimize.opt_a.recompute_prepare : 0.000043s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000051s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000040s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000037s : 0.01% optimize.opt_a.parameter_eliminate : 0.000014s : 0.00% optimize.opt_a.a_2 : 0.000560s : 0.16% optimize.opt_a.accelerated_algorithm : 0.000050s : 0.01% optimize.opt_a.pynative_shard : 0.000005s : 0.00% optimize.opt_a.auto_parallel : 0.000020s : 0.01% optimize.opt_a.parallel : 0.000026s : 0.01% optimize.opt_a.merge_comm : 0.000016s : 0.00% optimize.opt_a.allreduce_fusion : 0.000008s : 0.00% optimize.opt_a.virtual_dataset : 0.000032s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000029s : 0.01% optimize.opt_a.virtual_output : 0.000028s : 0.01% optimize.opt_a.merge_forward : 0.000042s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000069s : 0.02% optimize.opt_a.meta_fg_expand : 0.000047s : 0.01% optimize.opt_a.meta_fg_expand.resolve : 0.001694s : 0.49% optimize.opt_a.after_resolve : 0.000089s : 0.03% optimize.opt_a.a_after_grad : 0.000134s : 0.04% optimize.opt_a.renormalize : 0.079520s : 22.86% optimize.opt_a.real_op_eliminate : 0.000115s : 0.03% optimize.opt_a.auto_monad_grad : 0.000087s : 0.02% optimize.opt_a.auto_monad_eliminator : 0.000149s : 0.04% 
optimize.opt_a.cse : 0.000399s : 0.11% optimize.opt_a.a_3 : 0.000562s : 0.16% optimize.py_interpret_to_execute_after_opt_a : 0.000005s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000001s : 0.00% optimize.rewriter_after_opt_a : 0.000070s : 0.02% optimize.convert_after_rewriter : 0.000017s : 0.00% optimize.order_py_execute_after_rewriter : 0.000012s : 0.00% optimize.opt_b.b_1 : 0.000503s : 0.14% optimize.opt_b.b_2 : 0.000006s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000019s : 0.01% optimize.cconv : 0.000015s : 0.00% optimize.opt_after_cconv.c_1 : 0.000006s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000008s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000008s : 0.00% optimize.tuple_transform.d_1 : 0.000012s : 0.00% optimize.tuple_transform.d_2 : 0.000007s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000008s : 0.00% optimize.add_recomputation : 0.000038s : 0.01% optimize.cse_after_recomputation.cse : 0.000009s : 0.00% optimize.environ_conv : 0.000016s : 0.00% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000001s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000001s : 0.00% optimize.micro_interleaved_order_control : 0.000001s : 0.00% optimize.full_micro_interleaved_order_control : 0.000001s : 0.00% optimize.comp_comm_scheduling : 0.000001s : 
0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000001s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000000s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000001s : 0.00% optimize.split_layernorm_comm : 0.000001s : 0.00% optimize.process_send_recv_for_ge : 0.000002s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000017s : 0.00% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000504s : 0.14% validate : 0.000041s : 0.01% distribtued_split : 0.000001s : 0.00% task_emit : 0.005016s : 1.44% execute : 0.000006s : 0.00% Time group info: ------[substitution.] 0.030634 389 0.01% : 0.000003s : 5: substitution.float_depend_g_call 0.03% : 0.000009s : 14: substitution.float_tuple_getitem_switch 96.79% : 0.029650s : 27: substitution.getattr_setattr_resolve 0.01% : 0.000003s : 3: substitution.graph_param_transform 0.01% : 0.000003s : 3: substitution.incorporate_call 0.00% : 0.000001s : 3: substitution.incorporate_call_switch 2.07% : 0.000633s : 59: substitution.inline 0.02% : 0.000006s : 10: substitution.less_batch_normalization 0.13% : 0.000040s : 23: substitution.meta_unpack_prepare 0.04% : 0.000011s : 11: substitution.minmaximum_grad 0.03% : 0.000009s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.02% : 0.000006s : 47: substitution.remove_not_recompute_node 0.17% : 0.000052s : 38: substitution.replace_applicator 0.03% : 0.000008s : 24: substitution.replace_old_param 0.01% : 0.000003s : 2: substitution.reset_defer_inline 0.02% : 0.000005s : 8: substitution.set_cell_output_no_recompute 0.02% : 0.000006s : 5: substitution.specialize_transform 0.02% : 0.000007s : 
4: substitution.switch_simplify 0.04% : 0.000011s : 2: substitution.transpose_eliminate 0.13% : 0.000039s : 15: substitution.tuple_list_convert_item_index_to_positive 0.05% : 0.000016s : 15: substitution.tuple_list_get_item_const_eliminator 0.07% : 0.000022s : 15: substitution.tuple_list_get_item_depend_reorder 0.22% : 0.000068s : 33: substitution.tuple_list_get_item_eliminator 0.07% : 0.000021s : 15: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.079506 6 94.64% : 0.075243s : 3: renormalize.infer 5.36% : 0.004263s : 3: renormalize.specialize ------[replace.] 0.000757 70 45.43% : 0.000344s : 25: replace.getattr_setattr_resolve 30.88% : 0.000234s : 31: replace.inline 6.60% : 0.000050s : 2: replace.meta_unpack_prepare 8.62% : 0.000065s : 4: replace.switch_simplify 1.57% : 0.000012s : 2: replace.transpose_eliminate 6.90% : 0.000052s : 6: replace.tuple_list_get_item_eliminator ------[match.] 0.030200 70 97.92% : 0.029572s : 25: match.getattr_setattr_resolve 1.87% : 0.000564s : 31: match.inline 0.10% : 0.000029s : 2: match.meta_unpack_prepare 0.02% : 0.000007s : 4: match.switch_simplify 0.04% : 0.000011s : 2: match.transpose_eliminate 0.06% : 0.000017s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004616 72 66.12% : 0.003052s : 31: func_graph_cloner_run.FuncGraphClonerGraph 33.88% : 0.001564s : 41: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.034317 255 3.33% : 0.001142s : 104: opt.transform.opt_a 1.37% : 0.000471s : 92: opt.transform.opt_b 88.12% : 0.030241s : 10: opt.transform.opt_resolve 0.37% : 0.000127s : 1: opt.transforms.meta_unpack_prepare 6.70% : 0.002300s : 40: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000004s : 2: opt.transforms.opt_b 0.05% : 0.000017s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000009s : 3: opt.transforms.special_op_eliminate [INFO] GE(65168,python3.7):2024-01-11-05:47:23.752.697 [scalable_config.cc:55][EVENT]69564 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(65168,python3.7):2024-01-11-05:47:23.829.280 [graph_var_manager.cc:1424][EVENT]69564 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(65168,python3.7):2024-01-11-05:47:23.829.356 [graph_manager.cc:1248][EVENT]69564 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online. [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:23.830.261 [atrace_api.c:28](tid:69564) AtraceCreate start [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:23.830.335 [trace_rb_log.c:84](tid:69564) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:23.830.347 [atrace_api.c:32](tid:69564) AtraceCreate end [INFO] TDT(65168,python3.7):2024-01-11-05:47:23.830.371 [client_manager.cpp:157][SetProfilingCallback][tid:69564] [TsdClient] set profiling callback success [INFO] GE(65168,python3.7):2024-01-11-05:47:23.831.314 [parallel_partitioner.cc:165][EVENT]69564 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [20] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.831.354 [parallel_partitioner.cc:178][EVENT]69564 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [15] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.831.402 [graph_prepare.cc:1378][EVENT]69564 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.082 [graph_manager.cc:1050][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [696] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.137 [graph_manager.cc:1052][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [36] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.266 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [5] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.291 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.350 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [46] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.363 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [0] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.445 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [15] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.459 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.474 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [5] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.551 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.832.569 [graph_manager.cc:1054][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [412] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.840.044 [graph_manager.cc:1055][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7447] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.840.969 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.840.994 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [4] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.005 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of MergePass is [5] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.015 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of InferShapePass is [258] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.024 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [15] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.033 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.041 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [18] 
micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.049 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [21] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.841.057 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of InferValuePass is [4] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.842.612 [graph_manager.cc:1056][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [2525] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.842.673 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of CondRemovePass is [5] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.842.691 [graph_prepare.cc:1982][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [49] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.021 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.041 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.051 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.060 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of InferShapePass is [175] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.069 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.077 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of 
SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.085 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.093 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.101 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.138 [graph_prepare.cc:1983][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [434] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.161 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [4] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.173 [graph_prepare.cc:1984][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [19] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.185 [graph_prepare.cc:1985][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.210 [graph_prepare.cc:1986][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [14] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.224 [graph_prepare.cc:1987][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.239 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [4] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.251 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.263 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.330 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.343 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of CondPass is [4] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.352 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrintOpPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.360 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.369 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DropOutPass is [0] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.377 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of AssertPass is [3] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.386 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [0] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.394 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.402 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.410 [base_pass.cc:339][EVENT]69564 
Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.419 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [0] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.427 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SnapshotPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.435 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.443 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [4] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.457 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.466 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of IdentityPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.484 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.496 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.526 [graph_prepare.cc:1988][EVENT]69564 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [293] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.843.538 [graph_manager.cc:1065][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [896] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.855.796 [graph_manager.cc:1077][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [12237] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.855.878 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.855.920 [graph_manager.cc:1080][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [76] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.120 [graph_manager.cc:1081][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3183] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.166 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.181 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.193 [graph_manager.cc:1082][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [33] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.220 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.234 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.247 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [2] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.276 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [20] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.289 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.304 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [4] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.316 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.352 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [27] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.384 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.401 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.443 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [31] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.460 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.472 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.481 [graph_manager.cc:2700][EVENT]69564 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [266] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.591 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.605 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.615 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.623 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [0] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.632 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.640 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of CastRemovePass is [7] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.648 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.657 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.665 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.673 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [2] micro second, call num is [3] [INFO] 
GE(65168,python3.7):2024-01-11-05:47:23.859.681 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [9] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.689 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.697 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [11] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.705 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.713 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.723 [graph_manager.cc:2741][EVENT]69564 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [226] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.738 [graph_manager.cc:2752][EVENT]69564 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.759 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.771 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.789 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [8] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.803 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [3] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.813 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.825 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.846 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [10] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.858 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.870 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.881 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.893 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.904 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.920 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [6] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.931 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [2] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.939 [graph_manager.cc:2810][EVENT]69564 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [185] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.965 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.977 [graph_manager.cc:2821][EVENT]69564 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [30] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.859.998 [graph_manager.cc:1087][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [788] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.141 [graph_manager.cc:1088][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [131] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.178 [graph_manager.cc:1089][EVENT]69564 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [15] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.196 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.218 [graph_manager.cc:1097][EVENT]69564 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.237 [graph_manager.cc:3325][EVENT]69564 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.616 [engine_place.cc:144][EVENT]69564 Run:The time cost of AIcoreEngine::CheckSupported is [266] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.638 [engine_place.cc:144][EVENT]69564 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [5] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.648 [engine_place.cc:144][EVENT]69564 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.717 [graph_manager.cc:3351][EVENT]69564 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [466] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.734 [graph_manager.cc:3364][EVENT]69564 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.798 [engine_partitioner.cc:1139][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.813 [engine_partitioner.cc:1142][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.930 [engine_partitioner.cc:1148][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [108] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.860.961 [engine_partitioner.cc:1155][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [19] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.861.006 [engine_partitioner.cc:1164][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [34] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.861.029 [graph_manager.cc:3405][EVENT]69564 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [282] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.861.046 [graph_manager.cc:3412][EVENT]69564 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.650 [graph_manager.cc:3422][EVENT]69564 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [11590] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.695 [graph_manager.cc:3428][EVENT]69564 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [10] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.809 [graph_manager.cc:3467][EVENT]69564 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [94] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.827 [graph_manager.cc:3377][EVENT]69564 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [12082] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.842 [graph_manager.cc:1106][EVENT]69564 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [12609] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.854 [graph_manager.cc:1115][EVENT]69564 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.874 [graph_manager.cc:1130][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [5] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.917 [graph_manager.cc:1131][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [18] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.943 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [9] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.958 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [4] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.872.968 [graph_manager.cc:2837][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [36] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.041 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.054 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.063 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of CondRemovePass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.072 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.080 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [3] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.088 [base_pass.cc:339][EVENT]69564 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.098 [graph_manager.cc:2864][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of 
OptimizeStage2::MergedGraphNameToPasses is [115] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.108 [graph_manager.cc:2872][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.127 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.141 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [3] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.155 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [4] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.168 [compile_nodes_pass.cc:88][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.179 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [13] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.188 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.267 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [67] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.296 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [16] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.309 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.327 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [2] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.340 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [4] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.349 [graph_manager.cc:2927][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [226] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.366 [graph_manager.cc:2937][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [8] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.383 [graph_manager.cc:2943][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [6] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.873.393 [graph_manager.cc:2950][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.095 [graph_manager.cc:2958][EVENT]69564 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [48] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.154 [graph_manager.cc:1132][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [9223] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.249 [graph_manager.cc:1135][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [78] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.295 [graph_manager.cc:2975][EVENT]69564 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [27] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.333 [graph_manager.cc:2981][EVENT]69564 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [23] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.349 [pass_manager.cc:82][EVENT]69564 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [0] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.360 [graph_manager.cc:2986][EVENT]69564 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.370 [graph_manager.cc:1136][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [102] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.505 [graph_manager.cc:3555][EVENT]69564 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [101] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.598 [engine_partitioner.cc:1139][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.614 [engine_partitioner.cc:1142][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [2] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.719 [engine_partitioner.cc:1148][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [95] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.752 [engine_partitioner.cc:1155][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [20] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.791 [engine_partitioner.cc:1164][EVENT]69564 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [29] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.882.828 [graph_builder.cc:865][EVENT]69564 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [265] micro second. [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:23.883.264 [logger.cc:1071] 69564 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.883.305 [task_generator.cc:804][EVENT]69564 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [185] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.883.366 [task_generator.cc:805][EVENT]69564 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [49] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.221 [task_generator.cc:814][EVENT]69564 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [841] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.238 [task_generator.cc:954][EVENT]69564 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1117] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.280 [task_generator.cc:967][EVENT]69564 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [21] micro second. 
[INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:23.884.296 [logger.cc:1084] 69564 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.435 [graph_manager.cc:1152][EVENT]69564 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2043] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.454 [graph_manager.cc:1164][EVENT]69564 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.482 [graph_manager.cc:1271][EVENT]69564 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [53277] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.884.494 [graph_manager.cc:1272][EVENT]69564 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:23.884.805 [atrace_api.c:93](tid:69564) AtraceDestroy start [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:23.884.825 [atrace_api.c:95](tid:69564) AtraceDestroy end [INFO] GE(65168,python3.7):2024-01-11-05:47:23.889.457 [graph_converter.cc:838][EVENT]69564 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1324] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.889.612 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of ZeroCopy is [106] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.062 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CEM is [429] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.236 [copy_flow_launch_fuse.cc:395][EVENT]69564 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [151] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.253 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [170] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.462 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [197] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.485 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [6] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.519 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of ZeroCopy is [22] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.697 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CEM is [165] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.774 [copy_flow_launch_fuse.cc:395][EVENT]69564 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [61] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.786 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [74] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.827 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [18] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.838 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.863 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.931 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CEM is [59] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.890.995 [copy_flow_launch_fuse.cc:395][EVENT]69564 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [52] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.006 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [63] micro second. 
[INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.031 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [16] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.040 [base_optimizer.cc:70][EVENT]69564 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.051 [graph_converter.cc:849][EVENT]69564 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1548] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.257 [graph_converter.cc:853][EVENT]69564 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [198] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.891.901 [graph_converter.cc:857][EVENT]69564 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [630] micro second. [INFO] GE(65168,python3.7):2024-01-11-05:47:23.892.032 [graph_converter.cc:862][EVENT]69564 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [110] micro second. . 
TotalTime = 0.0889905, [20] [parse]: 0.00139702 [symbol_resolve]: 0.0123357, [1] [Cycle 1]: 0.012273, [1] [resolve]: 0.0122504 [combine_like_graphs]: 9.79999e-07 [graph_reusing]: 3.12e-06 [meta_unpack_prepare]: 0.00016075 [pre_cconv]: 4.49996e-07 [abstract_specialize]: 0.00433031 [pack_expand]: 1.349e-05 [auto_monad]: 6.924e-05 [inline]: 1.34e-06 [pre_auto_parallel]: 8.55e-06 [pipeline_split]: 1.89e-06 [optimize]: 0.0672004, [35] [py_interpret_to_execute]: 4.04e-06 [rewriter_before_opt_a]: 0.00018338 [opt_a]: 0.0659794, [4] [Cycle 1]: 0.032146, [30] [expand_dump_flag]: 3.11e-06 [switch_simplify]: 2.826e-05 [a_1]: 0.00081041 [recompute_prepare]: 8.51e-06 [updatestate_depend_eliminate]: 1e-05 [updatestate_assign_eliminate]: 7.78e-06 [updatestate_loads_eliminate]: 7.14e-06 [parameter_eliminate]: 3.93e-06 [a_2]: 8.456e-05 [accelerated_algorithm]: 5.83e-06 [pynative_shard]: 1.36e-06 [auto_parallel]: 3.69e-06 [parallel]: 6.25e-06 [merge_comm]: 2.63999e-06 [allreduce_fusion]: 1.72e-06 [virtual_dataset]: 5.83e-06 [get_grad_eliminate_]: 5.53e-06 [virtual_output]: 5.03e-06 [merge_forward]: 7.74e-06 [cell_reuse_recompute_pass]: 4.69998e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.175e-05 [meta_fg_expand]: 0.00203405, [1] [Cycle 1]: 0.00047832, [1] [resolve]: 0.00046024 [after_resolve]: 2.529e-05 [a_after_grad]: 4.286e-05 [renormalize]: 0.0284284 [real_op_eliminate]: 3.022e-05 [auto_monad_grad]: 3.495e-05 [auto_monad_eliminator]: 4.92e-05 [cse]: 0.00011394 [a_3]: 0.00018526 [Cycle 2]: 0.0268045, [30] [expand_dump_flag]: 2.58999e-06 [switch_simplify]: 8.031e-05 [a_1]: 0.00100751 [recompute_prepare]: 1.038e-05 [updatestate_depend_eliminate]: 1.214e-05 [updatestate_assign_eliminate]: 9.66001e-06 [updatestate_loads_eliminate]: 9.44e-06 [parameter_eliminate]: 3.21e-06 [a_2]: 0.00012933 [accelerated_algorithm]: 1.252e-05 [pynative_shard]: 1.16001e-06 [auto_parallel]: 4.95e-06 [parallel]: 4.37e-06 [merge_comm]: 2.55e-06 [allreduce_fusion]: 1.50999e-06 [virtual_dataset]: 
8.36e-06 [get_grad_eliminate_]: 7.44e-06 [virtual_output]: 6.93e-06 [merge_forward]: 1.033e-05 [cell_reuse_recompute_pass]: 7.60003e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.58e-05 [meta_fg_expand]: 0.00503227, [3] [Cycle 1]: 0.00031716, [1] [resolve]: 0.00029935 [Cycle 1]: 0.00047361, [1] [resolve]: 0.00045561 [Cycle 1]: 0.00030816, [1] [resolve]: 0.00028978 [after_resolve]: 3.483e-05 [a_after_grad]: 6.702e-05 [renormalize]: 0.0196849 [real_op_eliminate]: 3.387e-05 [auto_monad_grad]: 3.919e-05 [auto_monad_eliminator]: 5.674e-05 [cse]: 0.00013117 [a_3]: 0.00021822 [Cycle 3]: 0.00355651, [30] [expand_dump_flag]: 2.69e-06 [switch_simplify]: 8.769e-05 [a_1]: 0.00130011 [recompute_prepare]: 1.225e-05 [updatestate_depend_eliminate]: 1.377e-05 [updatestate_assign_eliminate]: 1.162e-05 [updatestate_loads_eliminate]: 1.1e-05 [parameter_eliminate]: 3.68e-06 [a_2]: 0.00016488 [accelerated_algorithm]: 1.591e-05 [pynative_shard]: 1.62001e-06 [auto_parallel]: 5.23e-06 [parallel]: 4.2e-06 [merge_comm]: 3.01e-06 [allreduce_fusion]: 2.05e-06 [virtual_dataset]: 9.67e-06 [get_grad_eliminate_]: 9.25e-06 [virtual_output]: 8.8e-06 [merge_forward]: 1.194e-05 [cell_reuse_recompute_pass]: 5.10001e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.02e-05 [meta_fg_expand]: 3.734e-05 [after_resolve]: 1.277e-05 [a_after_grad]: 2.157e-05 [renormalize]: 0.00142182 [real_op_eliminate]: 1.411e-05 [auto_monad_grad]: 4.88e-06 [auto_monad_eliminator]: 2.351e-05 [cse]: 9.122e-05 [a_3]: 7.837e-05 [Cycle 4]: 0.00108475, [30] [expand_dump_flag]: 1.2e-06 [switch_simplify]: 9.17e-06 [a_1]: 0.00044041 [recompute_prepare]: 1.118e-05 [updatestate_depend_eliminate]: 1.391e-05 [updatestate_assign_eliminate]: 1.18e-05 [updatestate_loads_eliminate]: 1.078e-05 [parameter_eliminate]: 1.49e-06 [a_2]: 0.00016437 [accelerated_algorithm]: 1.482e-05 [pynative_shard]: 1.2e-06 [auto_parallel]: 3.14999e-06 [parallel]: 3.24e-06 [merge_comm]: 2.08e-06 [allreduce_fusion]: 1.68e-06 [virtual_dataset]: 9.77e-06 
[get_grad_eliminate_]: 9.07e-06 [virtual_output]: 8.83e-06 [merge_forward]: 1.187e-05 [cell_reuse_recompute_pass]: 3.09999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.912e-05 [meta_fg_expand]: 9.26e-06 [after_resolve]: 1.206e-05 [a_after_grad]: 2.119e-05 [renormalize]: 7.0002e-08 [real_op_eliminate]: 8.95e-06 [auto_monad_grad]: 1.84e-06 [auto_monad_eliminator]: 2.111e-05 [cse]: 5.254e-05 [a_3]: 9.065e-05 [py_interpret_to_execute_after_opt_a]: 4.69e-06 [slice_cell_reuse_recomputed_activation]: 1.31e-06 [rewriter_after_opt_a]: 6.551e-05 [convert_after_rewriter]: 1.724e-05 [order_py_execute_after_rewriter]: 1.162e-05 [opt_b]: 0.00061569, [2] [Cycle 1]: 0.00052275, [7] [b_1]: 0.00046565 [b_2]: 3.46e-06 [updatestate_depend_eliminate]: 3.43e-06 [updatestate_assign_eliminate]: 2.62e-06 [updatestate_loads_eliminate]: 2.29e-06 [renormalize]: 2.50002e-07 [cse]: 1.091e-05 [Cycle 2]: 8.376e-05, [7] [b_1]: 4.149e-05 [b_2]: 2.22e-06 [updatestate_depend_eliminate]: 2.42001e-06 [updatestate_assign_eliminate]: 2.25e-06 [updatestate_loads_eliminate]: 2.48e-06 [renormalize]: 7.99992e-08 [cse]: 7.58001e-06 [cconv]: 1.459e-05 [opt_after_cconv]: 6.09e-05, [1] [Cycle 1]: 5.681e-05, [7] [c_1]: 1.506e-05 [parameter_eliminate]: 1.43e-06 [updatestate_depend_eliminate]: 2.33e-06 [updatestate_assign_eliminate]: 2.04e-06 [updatestate_loads_eliminate]: 1.97e-06 [cse]: 7.19e-06 [renormalize]: 2.3e-07 [remove_dup_value]: 7.93e-06 [tuple_transform]: 4.508e-05, [1] [Cycle 1]: 4.133e-05, [3] [d_1]: 2.203e-05 [d_2]: 6.22001e-06 [renormalize]: 1.59998e-07 [add_cache_embedding]: 7.87e-06 [add_recomputation]: 3.182e-05 [cse_after_recomputation]: 1.722e-05, [1] [Cycle 1]: 1.303e-05, [1] [cse]: 8.54e-06 [environ_conv]: 5.04e-06 [label_micro_interleaved_index]: 1.79e-06 [label_fine_grained_interleaved_index]: 1.41e-06 [assign_add_opt]: 1.12e-06 [slice_recompute_activation]: 1.32e-06 [micro_interleaved_order_control]: 1.06001e-06 [full_micro_interleaved_order_control]: 1.09e-06 [comp_comm_scheduling]: 
1.25e-06 [reorder_send_recv_between_fp_bp]: 1.1e-06 [comm_op_add_attrs]: 6.00005e-07 [add_comm_op_reuse_tag]: 5.69999e-07 [overlap_opt_shard_in_pipeline]: 5.80003e-07 [grouped_pairwise_exchange_alltoall]: 6.79996e-07 [overlap_recompute_and_grad_model_parallel]: 1.66e-06 [overlap_grad_matmul_and_grad_allreduce]: 4.50003e-07 [split_matmul_comm_elemetwise]: 1.47001e-06 [split_layernorm_comm]: 1.17e-06 [process_send_recv_for_ge]: 7.09995e-07 [handle_group_info]: 6.40001e-07 [auto_monad_reorder]: 1.17e-05 [get_jit_bprop_graph]: 2.62e-06 [eliminate_special_op_node]: 0.00058644 [validate]: 2.617e-05 [distribtued_split]: 9.29998e-07 [task_emit]: 0.00264675 [execute]: 4.13e-06 Sums parse : 0.001397s : 1.75% symbol_resolve.resolve : 0.012250s : 15.33% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000161s : 0.20% pre_cconv : 0.000000s : 0.00% abstract_specialize : 0.004330s : 5.42% pack_expand : 0.000013s : 0.02% auto_monad : 0.000069s : 0.09% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000009s : 0.01% pipeline_split : 0.000002s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000183s : 0.23% optimize.opt_a.expand_dump_flag : 0.000010s : 0.01% optimize.opt_a.switch_simplify : 0.000205s : 0.26% optimize.opt_a.a_1 : 0.003558s : 4.45% optimize.opt_a.recompute_prepare : 0.000042s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000050s : 0.06% optimize.opt_a.updatestate_assign_eliminate : 0.000041s : 0.05% optimize.opt_a.updatestate_loads_eliminate : 0.000038s : 0.05% optimize.opt_a.parameter_eliminate : 0.000012s : 0.02% optimize.opt_a.a_2 : 0.000543s : 0.68% optimize.opt_a.accelerated_algorithm : 0.000049s : 0.06% optimize.opt_a.pynative_shard : 0.000005s : 0.01% optimize.opt_a.auto_parallel : 0.000017s : 0.02% optimize.opt_a.parallel : 0.000018s : 0.02% optimize.opt_a.merge_comm : 0.000010s : 0.01% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% 
optimize.opt_a.virtual_dataset : 0.000034s : 0.04% optimize.opt_a.get_grad_eliminate_ : 0.000031s : 0.04% optimize.opt_a.virtual_output : 0.000030s : 0.04% optimize.opt_a.merge_forward : 0.000042s : 0.05% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000067s : 0.08% optimize.opt_a.meta_fg_expand : 0.000047s : 0.06% optimize.opt_a.meta_fg_expand.resolve : 0.001505s : 1.88% optimize.opt_a.after_resolve : 0.000085s : 0.11% optimize.opt_a.a_after_grad : 0.000153s : 0.19% optimize.opt_a.renormalize : 0.049535s : 61.98% optimize.opt_a.real_op_eliminate : 0.000087s : 0.11% optimize.opt_a.auto_monad_grad : 0.000081s : 0.10% optimize.opt_a.auto_monad_eliminator : 0.000151s : 0.19% optimize.opt_a.cse : 0.000389s : 0.49% optimize.opt_a.a_3 : 0.000572s : 0.72% optimize.py_interpret_to_execute_after_opt_a : 0.000005s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000001s : 0.00% optimize.rewriter_after_opt_a : 0.000066s : 0.08% optimize.convert_after_rewriter : 0.000017s : 0.02% optimize.order_py_execute_after_rewriter : 0.000012s : 0.01% optimize.opt_b.b_1 : 0.000507s : 0.63% optimize.opt_b.b_2 : 0.000006s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000018s : 0.02% optimize.cconv : 0.000015s : 0.02% optimize.opt_after_cconv.c_1 : 0.000015s : 0.02% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000007s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000008s 
: 0.01% optimize.tuple_transform.d_1 : 0.000022s : 0.03% optimize.tuple_transform.d_2 : 0.000006s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000008s : 0.01% optimize.add_recomputation : 0.000032s : 0.04% optimize.cse_after_recomputation.cse : 0.000009s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000001s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000001s : 0.00% optimize.micro_interleaved_order_control : 0.000001s : 0.00% optimize.full_micro_interleaved_order_control : 0.000001s : 0.00% optimize.comp_comm_scheduling : 0.000001s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000001s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000000s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000001s : 0.00% optimize.split_layernorm_comm : 0.000001s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000012s : 0.01% get_jit_bprop_graph : 0.000003s : 0.00% eliminate_special_op_node : 0.000586s : 0.73% validate : 0.000026s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.002647s : 3.31% execute : 0.000004s : 0.01% Time group info: ------[substitution.] 
0.014057 452 0.02% : 0.000003s : 6: substitution.float_depend_g_call 0.06% : 0.000009s : 14: substitution.float_tuple_getitem_switch 93.03% : 0.013077s : 27: substitution.getattr_setattr_resolve 0.02% : 0.000003s : 3: substitution.graph_param_transform 0.02% : 0.000002s : 3: substitution.incorporate_call 0.01% : 0.000001s : 3: substitution.incorporate_call_switch 4.28% : 0.000601s : 65: substitution.inline 0.04% : 0.000005s : 10: substitution.less_batch_normalization 0.25% : 0.000035s : 42: substitution.meta_unpack_prepare 0.11% : 0.000015s : 16: substitution.minmaximum_grad 0.03% : 0.000004s : 6: substitution.partial_eliminate 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.04% : 0.000006s : 47: substitution.remove_not_recompute_node 0.40% : 0.000056s : 44: substitution.replace_applicator 0.06% : 0.000008s : 24: substitution.replace_old_param 0.02% : 0.000003s : 2: substitution.reset_defer_inline 0.04% : 0.000005s : 8: substitution.set_cell_output_no_recompute 0.04% : 0.000006s : 5: substitution.specialize_transform 0.05% : 0.000007s : 4: substitution.switch_simplify 0.06% : 0.000008s : 2: substitution.transpose_eliminate 0.33% : 0.000046s : 20: substitution.tuple_list_convert_item_index_to_positive 0.15% : 0.000021s : 20: substitution.tuple_list_get_item_const_eliminator 0.19% : 0.000027s : 20: substitution.tuple_list_get_item_depend_reorder 0.57% : 0.000080s : 38: substitution.tuple_list_get_item_eliminator 0.20% : 0.000028s : 20: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.049521 6 92.09% : 0.045606s : 3: renormalize.infer 7.91% : 0.003915s : 3: renormalize.specialize ------[replace.] 0.000734 70 45.33% : 0.000333s : 25: replace.getattr_setattr_resolve 31.21% : 0.000229s : 31: replace.inline 6.65% : 0.000049s : 2: replace.meta_unpack_prepare 7.95% : 0.000058s : 4: replace.switch_simplify 1.90% : 0.000014s : 2: replace.transpose_eliminate 6.95% : 0.000051s : 6: replace.tuple_list_get_item_eliminator ------[match.] 
0.013614 70 95.69% : 0.013028s : 25: match.getattr_setattr_resolve 3.97% : 0.000540s : 31: match.inline 0.12% : 0.000016s : 2: match.meta_unpack_prepare 0.05% : 0.000007s : 4: match.switch_simplify 0.06% : 0.000008s : 2: match.transpose_eliminate 0.11% : 0.000015s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004315 72 67.32% : 0.002905s : 31: func_graph_cloner_run.FuncGraphClonerGraph 32.68% : 0.001410s : 41: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.019578 585 0.72% : 0.000141s : 2: opt.transform.meta_unpack_prepare 26.90% : 0.005266s : 461: opt.transform.opt_a 0.06% : 0.000011s : 7: opt.transform.opt_after_cconv 2.44% : 0.000477s : 94: opt.transform.opt_b 69.72% : 0.013650s : 10: opt.transform.opt_resolve 0.12% : 0.000024s : 8: opt.transform.opt_trans_graph 0.05% : 0.000010s : 3: opt.transform.special_op_eliminate . ============================== 2 passed in 20.91s ============================== [TRACE] GE(65168,python3.7):2024-01-11-05:47:25.801.452 [status:INIT] [ge_api.cc:463]65168 ~Session:Start to destruct session. [TRACE] GE(65168,python3.7):2024-01-11-05:47:25.801.504 [status:RUNNING] [ge_api.cc:475]65168 ~Session:Session id is 0 [TRACE] GE(65168,python3.7):2024-01-11-05:47:25.801.515 [status:RUNNING] [ge_api.cc:476]65168 ~Session:Destroying session [TRACE] GE(65168,python3.7):2024-01-11-05:47:25.802.436 [status:STOP] [ge_api.cc:491]65168 ~Session:Session Destructor finished [TRACE] GE(65168,python3.7):2024-01-11-05:47:25.802.461 [status:INIT] [ge_api.cc:301]65168 GEFinalize:GEFinalize start [INFO] GE(65168,python3.7):2024-01-11-05:47:25.802.510 [execution_runtime.cc:80][EVENT]65168 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(65168,python3.7):2024-01-11-05:47:25.802.527 [execution_runtime.cc:92][EVENT]65168 FinalizeExecutionRuntime:Execution runtime finalized. 
[TRACE] GE(65168,python3.7):2024-01-11-05:47:25.802.538 [status:RUNNING] [ge_api.cc:313]65168 GEFinalize:Finalizing environment [INFO] TUNE(65168,python3.7):2024-01-11-05:47:26.093.106 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:65168]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(65168,python3.7):2024-01-11-05:47:26.093.152 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:65168]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(65168,python3.7):2024-01-11-05:47:26.094.396 [gelib.cc:324][EVENT]65168 SystemFinalize:Online infer finalize GELib success. [TRACE] GE(65168,python3.7):2024-01-11-05:47:26.825.703 [status:STOP] [ge_api.cc:341]65168 GEFinalize:GEFinalize finished [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.457.091 [process_mode_manager.cpp:184][Close][tid:65168] [TsdClient] Close [deviceId=1][sessionId=1] hccp and computer enter [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.457.147 [version_verify.cpp:112][SpecialFeatureCheck][tid:65168] VersionVerify: previous type[7], supported [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.457.189 [process_mode_manager.cpp:192][Close][tid:65168] [TsdClient][deviceId=1] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.488.263 [process_mode_manager.cpp:197][Close][tid:65168] [TsdClient][logicDeviceId_=1]has recv close hccp and computer process respond [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.488.289 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:65168] enter into CloseInHost deviceid[1] [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.488.300 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:65168] host cpu not support [INFO] TDT(65168,python3.7):2024-01-11-05:47:27.488.345 [process_mode_manager.cpp:208][Close][tid:65168] [TsdClient][deviceId=1] [sessionId=1] close hccp and computer process success [INFO] ATRACE(65168,python3.7):2024-01-11-05:47:27.488.357 [atrace_api.c:93](tid:65168) AtraceDestroy start [INFO] 
ATRACE(65168,python3.7):2024-01-11-05:47:27.488.373 [atrace_api.c:95](tid:65168) AtraceDestroy end [INFO] PROFILING(65168,python3.7):2024-01-11-05:47:27.488.395 [msprofiler_impl.cpp:156] >>> (tid:65168) ProfNotifySetDevice called, is open: 0, devId: 1 [INFO] RUNTIME(65168,python3.7):2024-01-11-05:47:29.051.758 [runtime.cc:1737] 65168 ~Runtime: deconstruct runtime.