============================= test session starts ==============================
platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1
rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_004/sault/config/pytest.ini
plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:42.035.587 [trace_attr.c:105](tid:64685) platform is 1.
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:42.035.782 [trace_recorder.c:114](tid:64685) use root path: /home/jenkins/ascend/atrace
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:42.035.812 [trace_signal.c:133](tid:64685) register signal handler for signo 2 succeed.
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:42.035.824 [trace_signal.c:133](tid:64685) register signal handler for signo 15 succeed.
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:30:42.466.145 [runtime.cc:1159] 64685 GetAicoreNumByLevel: workingDev_=0
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:30:42.466.232 [runtime.cc:4719] 64685 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set
collected 2 items

test_reshape.py [INFO] TDT(64685,python3.7):2024-01-11-05:30:46.828.018 [process_mode_manager.cpp:109][OpenProcess][tid:64685] [ProcessModeManager] enter into open process deviceId[3] rankSize[0]
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.084 [process_mode_manager.cpp:379][InitTsdClient][tid:64685] [TsdClient] deviceId[3] begin to init hdc client
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.222 [version_verify.cpp:34][SetVersionInfo][tid:64685] VersionVerify: send client version to server
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.250 [version_verify.cpp:50][SetVersionInfo][tid:64685] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.262 [version_verify.cpp:50][SetVersionInfo][tid:64685] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.578 [version_verify.cpp:66][PeerVersionCheck][tid:64685] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.595 [version_verify.cpp:87][ParseVersionInfo][tid:64685] VersionVerify: pass client version info success
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.604 [hdc_client.cpp:276][CheckHdcConnection][tid:64685] Service[2] create hdc success
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.619 [version_verify.cpp:120][SpecialFeatureCheck][tid:64685] VersionVerify: new type[35], supported
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.665 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:64685] [TsdClient][deviceId=3] [sessionId=1] wait package info respond
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.780 [process_mode_manager.cpp:379][InitTsdClient][tid:64685] [TsdClient] deviceId[3] begin to init hdc client
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.870 [version_verify.cpp:34][SetVersionInfo][tid:64685] VersionVerify: send client version to server
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.883 [version_verify.cpp:50][SetVersionInfo][tid:64685] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.829.894 [version_verify.cpp:50][SetVersionInfo][tid:64685] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.107 [version_verify.cpp:66][PeerVersionCheck][tid:64685] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.119 [version_verify.cpp:87][ParseVersionInfo][tid:64685] VersionVerify: pass client version info success
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.127 [hdc_client.cpp:276][CheckHdcConnection][tid:64685] Service[2] create hdc success
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.139 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:64685] [TsdClient] tsd get process sign successfully, procpid[64685] signSize[48]
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.171 [version_verify.cpp:112][SpecialFeatureCheck][tid:64685] VersionVerify: previous type[6], supported
[INFO] TDT(64685,python3.7):2024-01-11-05:30:46.830.193 [process_mode_manager.cpp:126][OpenProcess][tid:64685] [ProcessModeManager] deviceId[3] sessionId[1] rankSize[0], wait sub process start respond
[INFO] TDT(64685,python3.7):2024-01-11-05:30:47.431.107 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:64685] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd
[INFO] TDT(64685,python3.7):2024-01-11-05:30:47.431.137 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:64685] enter into OpenInHost deviceid[3]
[INFO] TDT(64685,python3.7):2024-01-11-05:30:47.431.147 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:64685] host cpu not support
[INFO] TDT(64685,python3.7):2024-01-11-05:30:47.431.154 [process_mode_manager.cpp:156][OpenProcess][tid:64685] [TsdClient][deviceId=3] [sessionId=1] start hccp and computer process success
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:30:47.433.844 [device.cc:340] 64685 Init: isDoubledie:0, topologytype:0
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:30:47.448.013 [npu_driver.cc:5428] 66520 GetDeviceStatus: GetDeviceStatus status=1.
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:47.448.050 [atrace_api.c:28](tid:64685) AtraceCreate start
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:47.448.183 [trace_rb_log.c:84](tid:64685) [RUNTIME_ATRACE_DEV3_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(64685,python3.7):2024-01-11-05:30:47.448.200 [atrace_api.c:32](tid:64685) AtraceCreate end
[INFO] TDT(64685,python3.7):2024-01-11-05:30:47.448.214 [client_manager.cpp:157][SetProfilingCallback][tid:64685] [TsdClient] set profiling callback success
[TRACE] GE(64685,python3.7):2024-01-11-05:30:47.598.245 [status:INIT] [ge_api.cc:144]64685 GEInitializeImpl:GEInitialize start
[INFO] PROFILING(64685,python3.7):2024-01-11-05:30:47.807.964 [msprofiler_impl.cpp:156] >>> (tid:64685) ProfNotifySetDevice called, is open: 1, devId: 3
[INFO] PROFILING(64685,python3.7):2024-01-11-05:30:47.808.148 [platform.cpp:38] >>> (tid:64685) Profiling platform version: 1.0.
[INFO] PROFILING(64685,python3.7):2024-01-11-05:30:47.808.168 [ai_drv_dev_api.cpp:384] >>> (tid:64685) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(64685,python3.7):2024-01-11-05:30:47.858.403 [status:RUNNING] [ge_api.cc:211]64685 GEInitializeImpl:Initializing environment
[INFO] GE(64685,python3.7):2024-01-11-05:30:47.858.464 [gelib.cc:98][EVENT]64685 Initialize:[GEPERFTRACE] GE Init Start
[INFO] GE(64685,python3.7):2024-01-11-05:30:47.858.735 [gelib.cc:307][EVENT]64685 SystemInitialize:Online infer init GELib success, device id :3
[INFO] DVPP(64685,python3.7):2024-01-11-05:30:48.223.046 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:64685]dvpp engine do not support
[INFO] TUNE(64685,python3.7):2024-01-11-05:30:48.226.679 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:64685]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!"
[INFO] TUNE(64685,python3.7):2024-01-11-05:30:48.226.720 [handle_manager.cpp:115][CANNKB][Tid:64685]"Start to run init functions to load dynamic python lib!"
[INFO] TUNE(64685,python3.7):2024-01-11-05:30:48.226.782 [handle_manager.cpp:407][CANNKB][Tid:64685]"Init functions of loading dynamic python lib end!"
[INFO] TUNE(64685,python3.7):2024-01-11-05:30:48.226.793 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:64685]"CANN_KB_Py has already been initialized."
[INFO] TUNE(64685,python3.7):2024-01-11-05:30:48.226.881 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:64685]"CannKbPyfuncMgr: Run PyObjectInit successfully!"
[INFO] HCCL(64685,python3.7):2024-01-11-05:30:59.989.807 [plugin_manager.cc:42][64685]hcom running normal mode.
[INFO] DVPP(64685,python3.7):2024-01-11-05:30:59.990.472 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:64685]dvpp ops kernel info store do not support
[INFO] DVPP(64685,python3.7):2024-01-11-05:30:59.990.629 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:64685]dvpp graph optimizer do not support
[INFO] DVPP(64685,python3.7):2024-01-11-05:31:00.507.495 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:64685]dvpp ops kernel builder do not support
[INFO] GE(64685,python3.7):2024-01-11-05:31:00.516.387 [gelib.cc:169][EVENT]64685 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12657873] micro second.
[TRACE] GE(64685,python3.7):2024-01-11-05:31:00.603.562 [status:STOP] [ge_api.cc:255]64685 GEInitializeImpl:GEInitialize finished
[TRACE] GE(64685,python3.7):2024-01-11-05:31:00.603.732 [status:INIT] [ge_api.cc:398]64685 Session:Start to construct session.
[TRACE] GE(64685,python3.7):2024-01-11-05:31:00.603.750 [status:RUNNING] [ge_api.cc:408]64685 Session:Creating session [INFO] GE(64685,python3.7):2024-01-11-05:31:00.604.186 [graph_var_manager.cc:1445][EVENT]64685 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(64685,python3.7):2024-01-11-05:31:00.604.205 [graph_var_manager.cc:1424][EVENT]64685 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(64685,python3.7):2024-01-11-05:31:00.604.585 [msprofiler_impl.cpp:156] >>> (tid:64685) ProfNotifySetDevice called, is open: 1, devId: 3 [TRACE] GE(64685,python3.7):2024-01-11-05:31:00.605.429 [status:RUNNING] [ge_api.cc:411]64685 Session:Session id is 0 [TRACE] GE(64685,python3.7):2024-01-11-05:31:00.605.451 [status:STOP] [ge_api.cc:420]64685 Session:Session Constructor finished [INFO] PROFILING(64685,python3.7):2024-01-11-05:31:00.615.163 [platform.cpp:38] >>> (tid:64685) Profiling platform version: 1.0. [INFO] PROFILING(64685,python3.7):2024-01-11-05:31:00.615.193 [ai_drv_dev_api.cpp:384] >>> (tid:64685) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(64685,python3.7):2024-01-11-05:31:00.615.380 [status:INIT] [ge_api.cc:144]64685 GEInitializeImpl:GEInitialize start TotalTime = 0.0584299, [20] [parse]: 0.0125127 [symbol_resolve]: 0.028186, [1] [Cycle 1]: 0.0281189, [1] [resolve]: 0.0280963 [combine_like_graphs]: 1.08e-06 [graph_reusing]: 3.14e-06 [meta_unpack_prepare]: 6.215e-05 [pre_cconv]: 3.41e-06 [abstract_specialize]: 0.00291005 [pack_expand]: 1.084e-05 [auto_monad]: 7.502e-05 [inline]: 1.68e-06 [pre_auto_parallel]: 1.867e-05 [pipeline_split]: 3.24e-06 [optimize]: 0.00794798, [35] [py_interpret_to_execute]: 3.35e-06 [rewriter_before_opt_a]: 4.237e-05 [opt_a]: 0.00738385, [2] [Cycle 1]: 0.00092658, [30] [expand_dump_flag]: 4.05e-06 [switch_simplify]: 1.519e-05 [a_1]: 0.00020744 [recompute_prepare]: 2.73e-06 [updatestate_depend_eliminate]: 6.72e-06 [updatestate_assign_eliminate]: 
3.8e-06 [updatestate_loads_eliminate]: 3.29e-06 [parameter_eliminate]: 3.64e-06 [a_2]: 3.176e-05 [accelerated_algorithm]: 2.84e-06 [pynative_shard]: 1.70001e-06 [auto_parallel]: 3.32e-06 [parallel]: 1.67e-05 [merge_comm]: 9.18e-06 [allreduce_fusion]: 2.08e-06 [virtual_dataset]: 2.95e-06 [get_grad_eliminate_]: 2.14e-06 [virtual_output]: 1.92e-06 [merge_forward]: 5.34e-06 [cell_reuse_recompute_pass]: 8.2e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.19e-06 [meta_fg_expand]: 3.58e-06 [after_resolve]: 5.03e-06 [a_after_grad]: 3.35e-06 [renormalize]: 0.00037589 [real_op_eliminate]: 5.43e-06 [auto_monad_grad]: 3.91999e-06 [auto_monad_eliminator]: 1.147e-05 [cse]: 2.837e-05 [a_3]: 1.687e-05 [Cycle 2]: 0.00023657, [30] [expand_dump_flag]: 1.03001e-06 [switch_simplify]: 2.41e-06 [a_1]: 2.352e-05 [recompute_prepare]: 1.93e-06 [updatestate_depend_eliminate]: 3.15e-06 [updatestate_assign_eliminate]: 2.41e-06 [updatestate_loads_eliminate]: 2.17e-06 [parameter_eliminate]: 8.90002e-07 [a_2]: 2.846e-05 [accelerated_algorithm]: 2.39e-06 [pynative_shard]: 1.04e-06 [auto_parallel]: 3.1e-06 [parallel]: 3.32e-06 [merge_comm]: 1.81e-06 [allreduce_fusion]: 1.29e-06 [virtual_dataset]: 2.3e-06 [get_grad_eliminate_]: 1.88e-06 [virtual_output]: 1.78e-06 [merge_forward]: 2.62e-06 [cell_reuse_recompute_pass]: 3.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 4.77e-06 [meta_fg_expand]: 1.96e-06 [after_resolve]: 4.01e-06 [a_after_grad]: 2.64e-06 [renormalize]: 6.99947e-08 [real_op_eliminate]: 1.9e-06 [auto_monad_grad]: 9.19994e-07 [auto_monad_eliminator]: 4.52e-06 [cse]: 9.71e-06 [a_3]: 1.377e-05 [py_interpret_to_execute_after_opt_a]: 3.3e-06 [slice_cell_reuse_recomputed_activation]: 2.50999e-06 [rewriter_after_opt_a]: 2.878e-05 [convert_after_rewriter]: 5.8e-06 [order_py_execute_after_rewriter]: 4.78e-06 [opt_b]: 9.209e-05, [1] [Cycle 1]: 8.729e-05, [7] [b_1]: 4.122e-05 [b_2]: 3.08e-06 [updatestate_depend_eliminate]: 2.61e-06 [updatestate_assign_eliminate]: 2.43e-06 
[updatestate_loads_eliminate]: 2.23e-06 [renormalize]: 3.80001e-07 [cse]: 9.42e-06 [cconv]: 2.305e-05 [opt_after_cconv]: 5.113e-05, [1] [Cycle 1]: 4.74e-05, [7] [c_1]: 5.71999e-06 [parameter_eliminate]: 6.30003e-07 [updatestate_depend_eliminate]: 2.40999e-06 [updatestate_assign_eliminate]: 2.03001e-06 [updatestate_loads_eliminate]: 2.19e-06 [cse]: 8.19e-06 [renormalize]: 2.3e-07 [remove_dup_value]: 1.293e-05 [tuple_transform]: 3.643e-05, [1] [Cycle 1]: 3.311e-05, [3] [d_1]: 1.488e-05 [d_2]: 6.7e-06 [renormalize]: 1.8e-07 [add_cache_embedding]: 1.12e-05 [add_recomputation]: 4.713e-05 [cse_after_recomputation]: 1.728e-05, [1] [Cycle 1]: 1.297e-05, [1] [cse]: 8.7e-06 [environ_conv]: 2.27e-05 [label_micro_interleaved_index]: 2.2e-06 [label_fine_grained_interleaved_index]: 2.75e-06 [assign_add_opt]: 1.56e-06 [slice_recompute_activation]: 2.62e-06 [micro_interleaved_order_control]: 1.78999e-06 [full_micro_interleaved_order_control]: 1.83001e-06 [comp_comm_scheduling]: 2.09e-06 [reorder_send_recv_between_fp_bp]: 3.53e-06 [comm_op_add_attrs]: 1.09e-06 [add_comm_op_reuse_tag]: 9.70002e-07 [overlap_opt_shard_in_pipeline]: 1.17e-06 [grouped_pairwise_exchange_alltoall]: 1.29e-06 [overlap_recompute_and_grad_model_parallel]: 1.84e-06 [overlap_grad_matmul_and_grad_allreduce]: 8.49999e-07 [split_matmul_comm_elemetwise]: 2.83e-06 [split_layernorm_comm]: 1.98e-06 [process_send_recv_for_ge]: 2.31e-06 [handle_group_info]: 1.1e-06 [auto_monad_reorder]: 2.042e-05 [get_jit_bprop_graph]: 4.89999e-07 [eliminate_special_op_node]: 0.00046303 [validate]: 4.676e-05 [distribtued_split]: 1.21e-06 [task_emit]: 0.0059356 [execute]: 8.52e-06 Sums parse : 0.012513s : 24.32% symbol_resolve.resolve : 0.028096s : 54.61% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.01% meta_unpack_prepare : 0.000062s : 0.12% pre_cconv : 0.000003s : 0.01% abstract_specialize : 0.002910s : 5.66% pack_expand : 0.000011s : 0.02% auto_monad : 0.000075s : 0.15% inline : 0.000002s : 0.00% 
pre_auto_parallel : 0.000019s : 0.04% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000003s : 0.01% optimize.rewriter_before_opt_a : 0.000042s : 0.08% optimize.opt_a.expand_dump_flag : 0.000005s : 0.01% optimize.opt_a.switch_simplify : 0.000018s : 0.03% optimize.opt_a.a_1 : 0.000231s : 0.45% optimize.opt_a.recompute_prepare : 0.000005s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000010s : 0.02% optimize.opt_a.updatestate_assign_eliminate : 0.000006s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000005s : 0.01% optimize.opt_a.parameter_eliminate : 0.000005s : 0.01% optimize.opt_a.a_2 : 0.000060s : 0.12% optimize.opt_a.accelerated_algorithm : 0.000005s : 0.01% optimize.opt_a.pynative_shard : 0.000003s : 0.01% optimize.opt_a.auto_parallel : 0.000006s : 0.01% optimize.opt_a.parallel : 0.000020s : 0.04% optimize.opt_a.merge_comm : 0.000011s : 0.02% optimize.opt_a.allreduce_fusion : 0.000003s : 0.01% optimize.opt_a.virtual_dataset : 0.000005s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.01% optimize.opt_a.virtual_output : 0.000004s : 0.01% optimize.opt_a.merge_forward : 0.000008s : 0.02% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000011s : 0.02% optimize.opt_a.meta_fg_expand : 0.000006s : 0.01% optimize.opt_a.after_resolve : 0.000009s : 0.02% optimize.opt_a.a_after_grad : 0.000006s : 0.01% optimize.opt_a.renormalize : 0.000376s : 0.73% optimize.opt_a.real_op_eliminate : 0.000007s : 0.01% optimize.opt_a.auto_monad_grad : 0.000005s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000016s : 0.03% optimize.opt_a.cse : 0.000038s : 0.07% optimize.opt_a.a_3 : 0.000031s : 0.06% optimize.py_interpret_to_execute_after_opt_a : 0.000003s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000029s : 0.06% optimize.convert_after_rewriter : 0.000006s : 0.01% 
optimize.order_py_execute_after_rewriter : 0.000005s : 0.01% optimize.opt_b.b_1 : 0.000041s : 0.08% optimize.opt_b.b_2 : 0.000003s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000009s : 0.02% optimize.cconv : 0.000023s : 0.04% optimize.opt_after_cconv.c_1 : 0.000006s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000008s : 0.02% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000013s : 0.03% optimize.tuple_transform.d_1 : 0.000015s : 0.03% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.02% optimize.add_recomputation : 0.000047s : 0.09% optimize.cse_after_recomputation.cse : 0.000009s : 0.02% optimize.environ_conv : 0.000023s : 0.04% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.01% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000003s : 0.01% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000004s : 0.01% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% 
optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.01% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000002s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000020s : 0.04% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000463s : 0.90% validate : 0.000047s : 0.09% distribtued_split : 0.000001s : 0.00% task_emit : 0.005936s : 11.54% execute : 0.000009s : 0.02% Time group info: ------[substitution.] 0.028005 42 99.48% : 0.027860s : 8: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 4: substitution.graph_param_transform 0.33% : 0.000093s : 3: substitution.inline 0.07% : 0.000020s : 13: substitution.meta_unpack_prepare 0.01% : 0.000001s : 4: substitution.partial_unused_args_eliminate 0.01% : 0.000001s : 4: substitution.remove_not_recompute_node 0.01% : 0.000002s : 2: substitution.replace_old_param 0.06% : 0.000016s : 3: substitution.reshape_eliminate 0.02% : 0.000006s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.000370 2 63.19% : 0.000234s : 1: renormalize.infer 36.81% : 0.000136s : 1: renormalize.specialize ------[replace.] 0.000155 10 77.11% : 0.000119s : 6: replace.getattr_setattr_resolve 17.55% : 0.000027s : 3: replace.inline 5.33% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.027857 10 99.65% : 0.027758s : 6: match.getattr_setattr_resolve 0.33% : 0.000093s : 3: match.inline 0.02% : 0.000006s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.000491 10 66.78% : 0.000328s : 5: func_graph_cloner_run.FuncGraphClonerGraph 33.22% : 0.000163s : 5: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.028546 105 0.29% : 0.000082s : 52: opt.transform.opt_a 0.12% : 0.000033s : 23: opt.transform.opt_b 98.35% : 0.028076s : 2: opt.transform.opt_resolve 0.14% : 0.000039s : 1: opt.transforms.meta_unpack_prepare 0.98% : 0.000279s : 20: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000002s : 1: opt.transforms.opt_b 0.07% : 0.000020s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000009s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0598794, [20] [parse]: 0.00148871 [symbol_resolve]: 0.0298158, [1] [Cycle 1]: 0.0297366, [1] [resolve]: 0.0297152 [combine_like_graphs]: 1.21e-06 [graph_reusing]: 3.62e-06 [meta_unpack_prepare]: 0.00011527 [pre_cconv]: 6.60002e-07 [abstract_specialize]: 0.0140104 [pack_expand]: 1.93e-05 [auto_monad]: 0.00016602 [inline]: 1.59e-06 [pre_auto_parallel]: 1.157e-05 [pipeline_split]: 3.06e-06 [optimize]: 0.0135519, [35] [py_interpret_to_execute]: 4.07e-06 [rewriter_before_opt_a]: 0.00010527 [opt_a]: 0.0130473, [2] [Cycle 1]: 0.0104518, [30] [expand_dump_flag]: 5.3e-06 [switch_simplify]: 9.16e-05 [a_1]: 0.00051447 [recompute_prepare]: 8.86e-06 [updatestate_depend_eliminate]: 1.02e-05 [updatestate_assign_eliminate]: 7.89e-06 [updatestate_loads_eliminate]: 6.67e-06 [parameter_eliminate]: 4.76e-06 [a_2]: 9.888e-05 [accelerated_algorithm]: 6.04e-06 [pynative_shard]: 1.72e-06 [auto_parallel]: 3.29001e-06 [parallel]: 8.61e-06 [merge_comm]: 7.92e-06 [allreduce_fusion]: 3.2e-06 [virtual_dataset]: 5.72e-06 [get_grad_eliminate_]: 4.66e-06 [virtual_output]: 4.38e-06 [merge_forward]: 9.76e-06 [cell_reuse_recompute_pass]: 8.29998e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.315e-05 [meta_fg_expand]: 0.0008407 [after_resolve]: 2.776e-05 [a_after_grad]: 3.788e-05 [renormalize]: 0.00848863 [real_op_eliminate]: 5.67e-06 [auto_monad_grad]: 4.35999e-06 [auto_monad_eliminator]: 1.024e-05 [cse]: 2.267e-05 [a_3]: 1.378e-05 [Cycle 2]: 0.00021938, [30] [expand_dump_flag]: 1.26e-06 [switch_simplify]: 2.27e-06 [a_1]: 
9.91e-06 [recompute_prepare]: 1.31e-06 [updatestate_depend_eliminate]: 2.58e-06 [updatestate_assign_eliminate]: 1.89e-06 [updatestate_loads_eliminate]: 1.55e-06 [parameter_eliminate]: 1.08e-06 [a_2]: 1.939e-05 [accelerated_algorithm]: 2.19e-06 [pynative_shard]: 1.39e-06 [auto_parallel]: 3.50999e-06 [parallel]: 3.37e-06 [merge_comm]: 2.06001e-06 [allreduce_fusion]: 1.22e-06 [virtual_dataset]: 1.84e-06 [get_grad_eliminate_]: 1.47001e-06 [virtual_output]: 1.35e-06 [merge_forward]: 2.51e-06 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.26e-06 [meta_fg_expand]: 1.34e-05 [after_resolve]: 1.69e-06 [a_after_grad]: 1.82e-06 [renormalize]: 7.99992e-08 [real_op_eliminate]: 1.33e-06 [auto_monad_grad]: 1.05e-06 [auto_monad_eliminator]: 3.57e-06 [cse]: 8.61e-06 [a_3]: 1.003e-05 [py_interpret_to_execute_after_opt_a]: 3.84e-06 [slice_cell_reuse_recomputed_activation]: 2.68e-06 [rewriter_after_opt_a]: 1.844e-05 [convert_after_rewriter]: 4.92e-06 [order_py_execute_after_rewriter]: 3.66e-06 [opt_b]: 7.283e-05, [1] [Cycle 1]: 6.773e-05, [7] [b_1]: 2.82e-05 [b_2]: 2.53e-06 [updatestate_depend_eliminate]: 1.84e-06 [updatestate_assign_eliminate]: 1.75e-06 [updatestate_loads_eliminate]: 1.63e-06 [renormalize]: 4.1e-07 [cse]: 5.7e-06 [cconv]: 1.873e-05 [opt_after_cconv]: 4.303e-05, [1] [Cycle 1]: 3.919e-05, [7] [c_1]: 3.59e-06 [parameter_eliminate]: 5.60001e-07 [updatestate_depend_eliminate]: 1.5e-06 [updatestate_assign_eliminate]: 1.37e-06 [updatestate_loads_eliminate]: 1.38e-06 [cse]: 5.3e-06 [renormalize]: 2.00002e-07 [remove_dup_value]: 1.102e-05 [tuple_transform]: 2.801e-05, [1] [Cycle 1]: 2.464e-05, [3] [d_1]: 9.25e-06 [d_2]: 3.87e-06 [renormalize]: 1.40004e-07 [add_cache_embedding]: 9.82e-06 [add_recomputation]: 2.829e-05 [cse_after_recomputation]: 1.323e-05, [1] [Cycle 1]: 9.34e-06, [1] [cse]: 5.07e-06 [environ_conv]: 4.64e-06 [label_micro_interleaved_index]: 2.27e-06 [label_fine_grained_interleaved_index]: 2.92e-06 [assign_add_opt]: 
1.52e-06 [slice_recompute_activation]: 2.17e-06 [micro_interleaved_order_control]: 1.83e-06 [full_micro_interleaved_order_control]: 1.75001e-06 [comp_comm_scheduling]: 2.76e-06 [reorder_send_recv_between_fp_bp]: 2.26e-06 [comm_op_add_attrs]: 1.13e-06 [add_comm_op_reuse_tag]: 1e-06 [overlap_opt_shard_in_pipeline]: 1.18e-06 [grouped_pairwise_exchange_alltoall]: 1.45e-06 [overlap_recompute_and_grad_model_parallel]: 1.82e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.79997e-07 [split_matmul_comm_elemetwise]: 2.69e-06 [split_layernorm_comm]: 1.98e-06 [process_send_recv_for_ge]: 8.90002e-07 [handle_group_info]: 1.05e-06 [auto_monad_reorder]: 1.255e-05 [get_jit_bprop_graph]: 4.39999e-07 [eliminate_special_op_node]: 0.00046032 [validate]: 2.103e-05 [distribtued_split]: 1.24e-06 [task_emit]: 1.06e-06 [execute]: 9.10004e-07 Sums parse : 0.001489s : 2.62% symbol_resolve.resolve : 0.029715s : 52.38% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.01% meta_unpack_prepare : 0.000115s : 0.20% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.014010s : 24.69% pack_expand : 0.000019s : 0.03% auto_monad : 0.000166s : 0.29% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000012s : 0.02% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000105s : 0.19% optimize.opt_a.expand_dump_flag : 0.000007s : 0.01% optimize.opt_a.switch_simplify : 0.000094s : 0.17% optimize.opt_a.a_1 : 0.000524s : 0.92% optimize.opt_a.recompute_prepare : 0.000010s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000013s : 0.02% optimize.opt_a.updatestate_assign_eliminate : 0.000010s : 0.02% optimize.opt_a.updatestate_loads_eliminate : 0.000008s : 0.01% optimize.opt_a.parameter_eliminate : 0.000006s : 0.01% optimize.opt_a.a_2 : 0.000118s : 0.21% optimize.opt_a.accelerated_algorithm : 0.000008s : 0.01% optimize.opt_a.pynative_shard : 0.000003s : 0.01% optimize.opt_a.auto_parallel : 0.000007s : 0.01% 
optimize.opt_a.parallel : 0.000012s : 0.02% optimize.opt_a.merge_comm : 0.000010s : 0.02% optimize.opt_a.allreduce_fusion : 0.000004s : 0.01% optimize.opt_a.virtual_dataset : 0.000008s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000006s : 0.01% optimize.opt_a.virtual_output : 0.000006s : 0.01% optimize.opt_a.merge_forward : 0.000012s : 0.02% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000018s : 0.03% optimize.opt_a.meta_fg_expand : 0.000854s : 1.51% optimize.opt_a.after_resolve : 0.000029s : 0.05% optimize.opt_a.a_after_grad : 0.000040s : 0.07% optimize.opt_a.renormalize : 0.008489s : 14.96% optimize.opt_a.real_op_eliminate : 0.000007s : 0.01% optimize.opt_a.auto_monad_grad : 0.000005s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000014s : 0.02% optimize.opt_a.cse : 0.000031s : 0.06% optimize.opt_a.a_3 : 0.000024s : 0.04% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000018s : 0.03% optimize.convert_after_rewriter : 0.000005s : 0.01% optimize.order_py_execute_after_rewriter : 0.000004s : 0.01% optimize.opt_b.b_1 : 0.000028s : 0.05% optimize.opt_b.b_2 : 0.000003s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000006s : 0.01% optimize.cconv : 0.000019s : 0.03% optimize.opt_after_cconv.c_1 : 0.000004s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.cse : 0.000005s : 0.01% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.02% optimize.tuple_transform.d_1 : 0.000009s : 0.02% optimize.tuple_transform.d_2 : 0.000004s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000010s : 0.02% optimize.add_recomputation : 0.000028s : 0.05% optimize.cse_after_recomputation.cse : 0.000005s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.01% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000003s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000013s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000460s : 0.81% validate : 0.000021s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.000001s : 0.00% execute : 0.000001s : 0.00% Time group info: ------[substitution.] 
0.029538 150 0.01% : 0.000003s : 8: substitution.float_depend_g_call 0.01% : 0.000003s : 2: substitution.float_tuple_getitem_switch 98.86% : 0.029200s : 19: substitution.getattr_setattr_resolve 0.01% : 0.000004s : 1: substitution.graph_param_transform 0.01% : 0.000002s : 2: substitution.incorporate_call 0.00% : 0.000001s : 2: substitution.incorporate_call_switch 0.76% : 0.000225s : 14: substitution.inline 0.09% : 0.000027s : 56: substitution.meta_unpack_prepare 0.01% : 0.000003s : 2: substitution.minmaximum_grad 0.05% : 0.000014s : 8: substitution.partial_eliminate 0.00% : 0.000001s : 1: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 9: substitution.remove_not_recompute_node 0.01% : 0.000002s : 1: substitution.replace_applicator 0.01% : 0.000003s : 7: substitution.replace_old_param 0.02% : 0.000007s : 1: substitution.reshape_eliminate 0.01% : 0.000002s : 1: substitution.set_cell_output_no_recompute 0.02% : 0.000006s : 3: substitution.switch_simplify 0.02% : 0.000007s : 2: substitution.tuple_list_convert_item_index_to_positive 0.01% : 0.000003s : 2: substitution.tuple_list_get_item_const_eliminator 0.01% : 0.000004s : 2: substitution.tuple_list_get_item_depend_reorder 0.05% : 0.000015s : 5: substitution.tuple_list_get_item_eliminator 0.01% : 0.000004s : 2: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.008483 2 97.27% : 0.008251s : 1: renormalize.infer 2.73% : 0.000231s : 1: renormalize.specialize ------[replace.] 0.000384 32 68.34% : 0.000262s : 16: replace.getattr_setattr_resolve 19.84% : 0.000076s : 12: replace.inline 9.71% : 0.000037s : 3: replace.switch_simplify 2.11% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.029284 32 99.24% : 0.029062s : 16: match.getattr_setattr_resolve 0.71% : 0.000208s : 12: match.inline 0.02% : 0.000006s : 3: match.switch_simplify 0.03% : 0.000008s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 
0.002397 36 85.77% : 0.002056s : 21: func_graph_cloner_run.FuncGraphClonerGraph 14.23% : 0.000341s : 15: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.030697 105 0.45% : 0.000139s : 52: opt.transform.opt_a 0.06% : 0.000020s : 23: opt.transform.opt_b 96.78% : 0.029708s : 2: opt.transform.opt_resolve 0.30% : 0.000092s : 1: opt.transforms.meta_unpack_prepare 2.33% : 0.000716s : 20: opt.transforms.opt_a 0.01% : 0.000002s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000002s : 1: opt.transforms.opt_b 0.04% : 0.000012s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000006s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0241106, [20] [parse]: 0.00144221 [symbol_resolve]: 0.0108025, [1] [Cycle 1]: 0.0107478, [1] [resolve]: 0.0107274 [combine_like_graphs]: 9.5e-07 [graph_reusing]: 3.56e-06 [meta_unpack_prepare]: 6.548e-05 [pre_cconv]: 6.69999e-07 [abstract_specialize]: 0.00223073 [pack_expand]: 1.112e-05 [auto_monad]: 5.078e-05 [inline]: 1.35e-06 [pre_auto_parallel]: 1.052e-05 [pipeline_split]: 3.02e-06 [optimize]: 0.00404303, [35] [py_interpret_to_execute]: 4.21e-06 [rewriter_before_opt_a]: 4.105e-05 [opt_a]: 0.00353865, [2] [Cycle 1]: 0.00091364, [30] [expand_dump_flag]: 3.66e-06 [switch_simplify]: 1.427e-05 [a_1]: 0.00020391 [recompute_prepare]: 3.06e-06 [updatestate_depend_eliminate]: 6.78e-06 [updatestate_assign_eliminate]: 4.21e-06 [updatestate_loads_eliminate]: 3.34e-06 [parameter_eliminate]: 3.69e-06 [a_2]: 3.252e-05 [accelerated_algorithm]: 2.94999e-06 [pynative_shard]: 2e-06 [auto_parallel]: 3.3e-06 [parallel]: 8.19e-06 [merge_comm]: 3.84e-06 [allreduce_fusion]: 2.02e-06 [virtual_dataset]: 2.71e-06 [get_grad_eliminate_]: 2.14999e-06 [virtual_output]: 1.82e-06 [merge_forward]: 4.56e-06 [cell_reuse_recompute_pass]: 1.28e-06 [cell_reuse_handle_not_recompute_node_pass]: 5.85e-06 [meta_fg_expand]: 3.76e-06 [after_resolve]: 4.46e-06 [a_after_grad]: 2.7e-06 
[renormalize]: 0.00037887 [real_op_eliminate]: 5.93e-06 [auto_monad_grad]: 4.27999e-06 [auto_monad_eliminator]: 1.286e-05 [cse]: 2.753e-05 [a_3]: 1.74e-05 [Cycle 2]: 0.00024036, [30] [expand_dump_flag]: 1.05e-06 [switch_simplify]: 2.49e-06 [a_1]: 2.468e-05 [recompute_prepare]: 2.01e-06 [updatestate_depend_eliminate]: 3.27999e-06 [updatestate_assign_eliminate]: 2.5e-06 [updatestate_loads_eliminate]: 2.33e-06 [parameter_eliminate]: 1.02e-06 [a_2]: 2.899e-05 [accelerated_algorithm]: 2.43e-06 [pynative_shard]: 1.12e-06 [auto_parallel]: 3.29001e-06 [parallel]: 3.03e-06 [merge_comm]: 1.88e-06 [allreduce_fusion]: 1.31e-06 [virtual_dataset]: 2.42e-06 [get_grad_eliminate_]: 1.99e-06 [virtual_output]: 1.89e-06 [merge_forward]: 2.91e-06 [cell_reuse_recompute_pass]: 3.09999e-07 [cell_reuse_handle_not_recompute_node_pass]: 4.98e-06 [meta_fg_expand]: 1.97e-06 [after_resolve]: 3.94e-06 [a_after_grad]: 2.63e-06 [renormalize]: 6.00048e-08 [real_op_eliminate]: 2.01001e-06 [auto_monad_grad]: 8.30005e-07 [auto_monad_eliminator]: 4.75e-06 [cse]: 1.02e-05 [a_3]: 1.396e-05 [py_interpret_to_execute_after_opt_a]: 3.53e-06 [slice_cell_reuse_recomputed_activation]: 2.47e-06 [rewriter_after_opt_a]: 2.05e-05 [convert_after_rewriter]: 5.79999e-06 [order_py_execute_after_rewriter]: 4.32001e-06 [opt_b]: 9.361e-05, [1] [Cycle 1]: 8.892e-05, [7] [b_1]: 4.454e-05 [b_2]: 3.25e-06 [updatestate_depend_eliminate]: 2.48e-06 [updatestate_assign_eliminate]: 2.34e-06 [updatestate_loads_eliminate]: 2.09e-06 [renormalize]: 3.49995e-07 [cse]: 8.52e-06 [cconv]: 2.284e-05 [opt_after_cconv]: 4.941e-05, [1] [Cycle 1]: 4.517e-05, [7] [c_1]: 5.56e-06 [parameter_eliminate]: 6.40001e-07 [updatestate_depend_eliminate]: 2.36e-06 [updatestate_assign_eliminate]: 1.93e-06 [updatestate_loads_eliminate]: 2.07e-06 [cse]: 7.58e-06 [renormalize]: 1.89997e-07 [remove_dup_value]: 1.225e-05 [tuple_transform]: 3.604e-05, [1] [Cycle 1]: 3.246e-05, [3] [d_1]: 1.469e-05 [d_2]: 6.77e-06 [renormalize]: 1.50001e-07 [add_cache_embedding]: 
1.151e-05 [add_recomputation]: 3.994e-05 [cse_after_recomputation]: 1.623e-05, [1] [Cycle 1]: 1.203e-05, [1] [cse]: 7.74e-06 [environ_conv]: 7.86e-06 [label_micro_interleaved_index]: 2.54e-06 [label_fine_grained_interleaved_index]: 2.28e-06 [assign_add_opt]: 1.75e-06 [slice_recompute_activation]: 2.29e-06 [micro_interleaved_order_control]: 2.12e-06 [full_micro_interleaved_order_control]: 2.09e-06 [comp_comm_scheduling]: 2.03e-06 [reorder_send_recv_between_fp_bp]: 2e-06 [comm_op_add_attrs]: 1.11e-06 [add_comm_op_reuse_tag]: 9.79999e-07 [overlap_opt_shard_in_pipeline]: 1.21e-06 [grouped_pairwise_exchange_alltoall]: 1.42e-06 [overlap_recompute_and_grad_model_parallel]: 1.88e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.60003e-07 [split_matmul_comm_elemetwise]: 2.35e-06 [split_layernorm_comm]: 1.79e-06 [process_send_recv_for_ge]: 1.49001e-06 [handle_group_info]: 1.3e-06 [auto_monad_reorder]: 1.57e-05 [get_jit_bprop_graph]: 4.60001e-07 [eliminate_special_op_node]: 0.0004611 [validate]: 2.793e-05 [distribtued_split]: 1.14999e-06 [task_emit]: 0.00473418 [execute]: 8.14e-06 Sums parse : 0.001442s : 6.86% symbol_resolve.resolve : 0.010727s : 51.02% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.02% meta_unpack_prepare : 0.000065s : 0.31% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.002231s : 10.61% pack_expand : 0.000011s : 0.05% auto_monad : 0.000051s : 0.24% inline : 0.000001s : 0.01% pre_auto_parallel : 0.000011s : 0.05% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000004s : 0.02% optimize.rewriter_before_opt_a : 0.000041s : 0.20% optimize.opt_a.expand_dump_flag : 0.000005s : 0.02% optimize.opt_a.switch_simplify : 0.000017s : 0.08% optimize.opt_a.a_1 : 0.000229s : 1.09% optimize.opt_a.recompute_prepare : 0.000005s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000010s : 0.05% optimize.opt_a.updatestate_assign_eliminate : 0.000007s : 0.03% optimize.opt_a.updatestate_loads_eliminate : 0.000006s : 0.03% 
optimize.opt_a.parameter_eliminate : 0.000005s : 0.02% optimize.opt_a.a_2 : 0.000062s : 0.29% optimize.opt_a.accelerated_algorithm : 0.000005s : 0.03% optimize.opt_a.pynative_shard : 0.000003s : 0.01% optimize.opt_a.auto_parallel : 0.000007s : 0.03% optimize.opt_a.parallel : 0.000011s : 0.05% optimize.opt_a.merge_comm : 0.000006s : 0.03% optimize.opt_a.allreduce_fusion : 0.000003s : 0.02% optimize.opt_a.virtual_dataset : 0.000005s : 0.02% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.02% optimize.opt_a.virtual_output : 0.000004s : 0.02% optimize.opt_a.merge_forward : 0.000007s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.01% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000011s : 0.05% optimize.opt_a.meta_fg_expand : 0.000006s : 0.03% optimize.opt_a.after_resolve : 0.000008s : 0.04% optimize.opt_a.a_after_grad : 0.000005s : 0.03% optimize.opt_a.renormalize : 0.000379s : 1.80% optimize.opt_a.real_op_eliminate : 0.000008s : 0.04% optimize.opt_a.auto_monad_grad : 0.000005s : 0.02% optimize.opt_a.auto_monad_eliminator : 0.000018s : 0.08% optimize.opt_a.cse : 0.000038s : 0.18% optimize.opt_a.a_3 : 0.000031s : 0.15% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.02% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.01% optimize.rewriter_after_opt_a : 0.000020s : 0.10% optimize.convert_after_rewriter : 0.000006s : 0.03% optimize.order_py_execute_after_rewriter : 0.000004s : 0.02% optimize.opt_b.b_1 : 0.000045s : 0.21% optimize.opt_b.b_2 : 0.000003s : 0.02% optimize.opt_b.updatestate_depend_eliminate : 0.000002s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000009s : 0.04% optimize.cconv : 0.000023s : 0.11% optimize.opt_after_cconv.c_1 : 0.000006s : 0.03% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% 
optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.cse : 0.000008s : 0.04% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.06% optimize.tuple_transform.d_1 : 0.000015s : 0.07% optimize.tuple_transform.d_2 : 0.000007s : 0.03% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.05% optimize.add_recomputation : 0.000040s : 0.19% optimize.cse_after_recomputation.cse : 0.000008s : 0.04% optimize.environ_conv : 0.000008s : 0.04% optimize.label_micro_interleaved_index : 0.000003s : 0.01% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.01% optimize.assign_add_opt : 0.000002s : 0.01% optimize.slice_recompute_activation : 0.000002s : 0.01% optimize.micro_interleaved_order_control : 0.000002s : 0.01% optimize.full_micro_interleaved_order_control : 0.000002s : 0.01% optimize.comp_comm_scheduling : 0.000002s : 0.01% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.01% optimize.comm_op_add_attrs : 0.000001s : 0.01% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.01% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.01% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.01% optimize.split_layernorm_comm : 0.000002s : 0.01% optimize.process_send_recv_for_ge : 0.000001s : 0.01% optimize.handle_group_info : 0.000001s : 0.01% auto_monad_reorder : 0.000016s : 0.07% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000461s : 2.19% validate : 0.000028s : 0.13% distribtued_split : 0.000001s : 0.01% task_emit : 0.004734s : 22.51% execute : 0.000008s : 
0.04% Time group info: ------[substitution.] 0.010640 42 98.76% : 0.010508s : 8: substitution.getattr_setattr_resolve 0.05% : 0.000005s : 4: substitution.graph_param_transform 0.84% : 0.000090s : 3: substitution.inline 0.10% : 0.000011s : 13: substitution.meta_unpack_prepare 0.01% : 0.000001s : 4: substitution.partial_unused_args_eliminate 0.01% : 0.000001s : 4: substitution.remove_not_recompute_node 0.02% : 0.000002s : 2: substitution.replace_old_param 0.15% : 0.000016s : 3: substitution.reshape_eliminate 0.05% : 0.000005s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.000373 2 61.50% : 0.000230s : 1: renormalize.infer 38.50% : 0.000144s : 1: renormalize.specialize ------[replace.] 0.000150 10 76.27% : 0.000115s : 6: replace.getattr_setattr_resolve 18.43% : 0.000028s : 3: replace.inline 5.30% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.010536 10 99.10% : 0.010441s : 6: match.getattr_setattr_resolve 0.85% : 0.000090s : 3: match.inline 0.05% : 0.000005s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.000465 10 68.85% : 0.000320s : 5: func_graph_cloner_run.FuncGraphClonerGraph 31.15% : 0.000145s : 5: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.011181 105 0.75% : 0.000084s : 52: opt.transform.opt_a 0.32% : 0.000035s : 23: opt.transform.opt_b 95.88% : 0.010720s : 2: opt.transform.opt_resolve 0.27% : 0.000030s : 1: opt.transforms.meta_unpack_prepare 2.47% : 0.000276s : 20: opt.transforms.opt_a 0.04% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.02% : 0.000002s : 1: opt.transforms.opt_b 0.18% : 0.000020s : 2: opt.transforms.opt_trans_graph 0.08% : 0.000009s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0430702, [20] [parse]: 0.00148685 [symbol_resolve]: 0.0194552, [1] [Cycle 1]: 0.0193775, [1] [resolve]: 0.0193561 [combine_like_graphs]: 8.70001e-07 [graph_reusing]: 3.26e-06 [meta_unpack_prepare]: 0.00011618 [pre_cconv]: 7.30004e-07 [abstract_specialize]: 0.00971859 [pack_expand]: 1.871e-05 [auto_monad]: 0.00016664 [inline]: 1.42e-06 [pre_auto_parallel]: 1.108e-05 [pipeline_split]: 2.6e-06 [optimize]: 0.0113897, [35] [py_interpret_to_execute]: 4.56e-06 [rewriter_before_opt_a]: 0.00010598 [opt_a]: 0.0108285, [2] [Cycle 1]: 0.00833277, [30] [expand_dump_flag]: 4.59e-06 [switch_simplify]: 9.065e-05 [a_1]: 0.00050691 [recompute_prepare]: 8.65e-06 [updatestate_depend_eliminate]: 1.05e-05 [updatestate_assign_eliminate]: 7.82e-06 [updatestate_loads_eliminate]: 6.84999e-06 [parameter_eliminate]: 4.34e-06 [a_2]: 9.912e-05 [accelerated_algorithm]: 6.19e-06 [pynative_shard]: 1.89e-06 [auto_parallel]: 3.91e-06 [parallel]: 9.03e-06 [merge_comm]: 7.98e-06 [allreduce_fusion]: 3.2e-06 [virtual_dataset]: 6.21e-06 [get_grad_eliminate_]: 5.03e-06 [virtual_output]: 4.54e-06 [merge_forward]: 1.018e-05 [cell_reuse_recompute_pass]: 8.30005e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.382e-05 [meta_fg_expand]: 0.00072838 [after_resolve]: 2.64e-05 [a_after_grad]: 3.737e-05 [renormalize]: 0.00646577 [real_op_eliminate]: 5.65e-06 [auto_monad_grad]: 4.53e-06 [auto_monad_eliminator]: 9.97e-06 [cse]: 2.209e-05 [a_3]: 1.427e-05 [Cycle 2]: 0.00022122, [30] [expand_dump_flag]: 1.13e-06 [switch_simplify]: 2.09e-06 [a_1]: 
1.077e-05 [recompute_prepare]: 1.73e-06 [updatestate_depend_eliminate]: 2.41e-06 [updatestate_assign_eliminate]: 2e-06 [updatestate_loads_eliminate]: 1.58e-06 [parameter_eliminate]: 1.22e-06 [a_2]: 1.968e-05 [accelerated_algorithm]: 1.94e-06 [pynative_shard]: 1.96e-06 [auto_parallel]: 3.77e-06 [parallel]: 3.65e-06 [merge_comm]: 1.91999e-06 [allreduce_fusion]: 1.15e-06 [virtual_dataset]: 1.82e-06 [get_grad_eliminate_]: 1.52e-06 [virtual_output]: 1.38e-06 [merge_forward]: 2.27e-06 [cell_reuse_recompute_pass]: 3.80001e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.37e-06 [meta_fg_expand]: 1.356e-05 [after_resolve]: 1.58e-06 [a_after_grad]: 1.78e-06 [renormalize]: 7.0002e-08 [real_op_eliminate]: 1.43e-06 [auto_monad_grad]: 1.19e-06 [auto_monad_eliminator]: 3.56e-06 [cse]: 8.59e-06 [a_3]: 9.87e-06 [py_interpret_to_execute_after_opt_a]: 3.85e-06 [slice_cell_reuse_recomputed_activation]: 2.69e-06 [rewriter_after_opt_a]: 1.803e-05 [convert_after_rewriter]: 4.88e-06 [order_py_execute_after_rewriter]: 4.11e-06 [opt_b]: 7.355e-05, [1] [Cycle 1]: 6.84e-05, [7] [b_1]: 2.863e-05 [b_2]: 2.24e-06 [updatestate_depend_eliminate]: 1.91999e-06 [updatestate_assign_eliminate]: 1.78e-06 [updatestate_loads_eliminate]: 1.57e-06 [renormalize]: 4.29995e-07 [cse]: 6.05e-06 [cconv]: 1.992e-05 [opt_after_cconv]: 4.338e-05, [1] [Cycle 1]: 3.947e-05, [7] [c_1]: 3.69e-06 [parameter_eliminate]: 6.99998e-07 [updatestate_depend_eliminate]: 1.69e-06 [updatestate_assign_eliminate]: 1.24e-06 [updatestate_loads_eliminate]: 1.46e-06 [cse]: 4.96e-06 [renormalize]: 2.40005e-07 [remove_dup_value]: 1.129e-05 [tuple_transform]: 2.763e-05, [1] [Cycle 1]: 2.416e-05, [3] [d_1]: 8.94e-06 [d_2]: 4.13e-06 [renormalize]: 1.59998e-07 [add_cache_embedding]: 1.021e-05 [add_recomputation]: 2.728e-05 [cse_after_recomputation]: 6.529e-05, [1] [Cycle 1]: 6.099e-05, [1] [cse]: 5.582e-05 [environ_conv]: 4.58e-06 [label_micro_interleaved_index]: 2.16e-06 [label_fine_grained_interleaved_index]: 2.67e-06 [assign_add_opt]: 
1.5e-06 [slice_recompute_activation]: 2.21e-06 [micro_interleaved_order_control]: 2.03e-06 [full_micro_interleaved_order_control]: 2.03e-06 [comp_comm_scheduling]: 2.01e-06 [reorder_send_recv_between_fp_bp]: 2.3e-06 [comm_op_add_attrs]: 1.09e-06 [add_comm_op_reuse_tag]: 9.70002e-07 [overlap_opt_shard_in_pipeline]: 1.18e-06 [grouped_pairwise_exchange_alltoall]: 1.39001e-06 [overlap_recompute_and_grad_model_parallel]: 1.86999e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.80004e-07 [split_matmul_comm_elemetwise]: 2.39e-06 [split_layernorm_comm]: 1.82e-06 [process_send_recv_for_ge]: 1.45e-06 [handle_group_info]: 1.24e-06 [auto_monad_reorder]: 1.381e-05 [get_jit_bprop_graph]: 4.19997e-07 [eliminate_special_op_node]: 0.0004636 [validate]: 2.045e-05 [distribtued_split]: 1.18e-06 [task_emit]: 1.01999e-06 [execute]: 9.29998e-07 Sums parse : 0.001487s : 3.72% symbol_resolve.resolve : 0.019356s : 48.39% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.01% meta_unpack_prepare : 0.000116s : 0.29% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.009719s : 24.30% pack_expand : 0.000019s : 0.05% auto_monad : 0.000167s : 0.42% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000011s : 0.03% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000005s : 0.01% optimize.rewriter_before_opt_a : 0.000106s : 0.26% optimize.opt_a.expand_dump_flag : 0.000006s : 0.01% optimize.opt_a.switch_simplify : 0.000093s : 0.23% optimize.opt_a.a_1 : 0.000518s : 1.29% optimize.opt_a.recompute_prepare : 0.000010s : 0.03% optimize.opt_a.updatestate_depend_eliminate : 0.000013s : 0.03% optimize.opt_a.updatestate_assign_eliminate : 0.000010s : 0.02% optimize.opt_a.updatestate_loads_eliminate : 0.000008s : 0.02% optimize.opt_a.parameter_eliminate : 0.000006s : 0.01% optimize.opt_a.a_2 : 0.000119s : 0.30% optimize.opt_a.accelerated_algorithm : 0.000008s : 0.02% optimize.opt_a.pynative_shard : 0.000004s : 0.01% optimize.opt_a.auto_parallel : 0.000008s : 0.02% 
optimize.opt_a.parallel : 0.000013s : 0.03% optimize.opt_a.merge_comm : 0.000010s : 0.02% optimize.opt_a.allreduce_fusion : 0.000004s : 0.01% optimize.opt_a.virtual_dataset : 0.000008s : 0.02% optimize.opt_a.get_grad_eliminate_ : 0.000007s : 0.02% optimize.opt_a.virtual_output : 0.000006s : 0.01% optimize.opt_a.merge_forward : 0.000012s : 0.03% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000019s : 0.05% optimize.opt_a.meta_fg_expand : 0.000742s : 1.86% optimize.opt_a.after_resolve : 0.000028s : 0.07% optimize.opt_a.a_after_grad : 0.000039s : 0.10% optimize.opt_a.renormalize : 0.006466s : 16.17% optimize.opt_a.real_op_eliminate : 0.000007s : 0.02% optimize.opt_a.auto_monad_grad : 0.000006s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000014s : 0.03% optimize.opt_a.cse : 0.000031s : 0.08% optimize.opt_a.a_3 : 0.000024s : 0.06% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.01% optimize.rewriter_after_opt_a : 0.000018s : 0.05% optimize.convert_after_rewriter : 0.000005s : 0.01% optimize.order_py_execute_after_rewriter : 0.000004s : 0.01% optimize.opt_b.b_1 : 0.000029s : 0.07% optimize.opt_b.b_2 : 0.000002s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000006s : 0.02% optimize.cconv : 0.000020s : 0.05% optimize.opt_after_cconv.c_1 : 0.000004s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.cse : 0.000005s : 0.01% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.03% optimize.tuple_transform.d_1 : 0.000009s : 0.02% optimize.tuple_transform.d_2 : 0.000004s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000010s : 0.03% optimize.add_recomputation : 0.000027s : 0.07% optimize.cse_after_recomputation.cse : 0.000056s : 0.14% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.01% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.01% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.01% optimize.micro_interleaved_order_control : 0.000002s : 0.01% optimize.full_micro_interleaved_order_control : 0.000002s : 0.01% optimize.comp_comm_scheduling : 0.000002s : 0.01% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.01% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.01% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000014s : 0.03% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000464s : 1.16% validate : 0.000020s : 0.05% distribtued_split : 0.000001s : 0.00% task_emit : 0.000001s : 0.00% execute : 0.000001s : 0.00% Time group info: ------[substitution.] 
0.019171 150 0.02% : 0.000003s : 8: substitution.float_depend_g_call 0.01% : 0.000003s : 2: substitution.float_tuple_getitem_switch 98.29% : 0.018842s : 19: substitution.getattr_setattr_resolve 0.02% : 0.000004s : 1: substitution.graph_param_transform 0.01% : 0.000002s : 2: substitution.incorporate_call 0.01% : 0.000001s : 2: substitution.incorporate_call_switch 1.17% : 0.000225s : 14: substitution.inline 0.15% : 0.000028s : 56: substitution.meta_unpack_prepare 0.02% : 0.000003s : 2: substitution.minmaximum_grad 0.02% : 0.000003s : 8: substitution.partial_eliminate 0.01% : 0.000001s : 1: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 9: substitution.remove_not_recompute_node 0.01% : 0.000002s : 1: substitution.replace_applicator 0.01% : 0.000003s : 7: substitution.replace_old_param 0.03% : 0.000006s : 1: substitution.reshape_eliminate 0.01% : 0.000002s : 1: substitution.set_cell_output_no_recompute 0.04% : 0.000007s : 3: substitution.switch_simplify 0.04% : 0.000007s : 2: substitution.tuple_list_convert_item_index_to_positive 0.01% : 0.000003s : 2: substitution.tuple_list_get_item_const_eliminator 0.02% : 0.000005s : 2: substitution.tuple_list_get_item_depend_reorder 0.08% : 0.000015s : 5: substitution.tuple_list_get_item_eliminator 0.02% : 0.000004s : 2: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.006459 2 96.20% : 0.006214s : 1: renormalize.infer 3.80% : 0.000246s : 1: renormalize.specialize ------[replace.] 0.000381 32 67.89% : 0.000259s : 16: replace.getattr_setattr_resolve 20.23% : 0.000077s : 12: replace.inline 9.78% : 0.000037s : 3: replace.switch_simplify 2.09% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.018929 32 98.83% : 0.018706s : 16: match.getattr_setattr_resolve 1.10% : 0.000208s : 12: match.inline 0.04% : 0.000007s : 3: match.switch_simplify 0.04% : 0.000008s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 
0.002360 36 86.07% : 0.002032s : 21: func_graph_cloner_run.FuncGraphClonerGraph 13.93% : 0.000329s : 15: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.020331 105 0.69% : 0.000139s : 52: opt.transform.opt_a 0.10% : 0.000020s : 23: opt.transform.opt_b 95.17% : 0.019349s : 2: opt.transform.opt_resolve 0.46% : 0.000094s : 1: opt.transforms.meta_unpack_prepare 3.48% : 0.000707s : 20: opt.transforms.opt_a 0.01% : 0.000003s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000001s : 1: opt.transforms.opt_b 0.06% : 0.000012s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000007s : 3: opt.transforms.special_op_eliminate .[INFO] GE(64685,python3.7):2024-01-11-05:31:02.041.947 [scalable_config.cc:55][EVENT]64685 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(64685,python3.7):2024-01-11-05:31:02.120.262 [graph_var_manager.cc:1424][EVENT]64685 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(64685,python3.7):2024-01-11-05:31:02.120.386 [graph_manager.cc:1248][EVENT]64685 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online. [INFO] ATRACE(64685,python3.7):2024-01-11-05:31:02.121.330 [atrace_api.c:28](tid:64685) AtraceCreate start [INFO] ATRACE(64685,python3.7):2024-01-11-05:31:02.121.408 [trace_rb_log.c:84](tid:64685) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. 
[INFO] ATRACE(64685,python3.7):2024-01-11-05:31:02.121.422 [atrace_api.c:32](tid:64685) AtraceCreate end [INFO] TDT(64685,python3.7):2024-01-11-05:31:02.121.453 [client_manager.cpp:157][SetProfilingCallback][tid:64685] [TsdClient] set profiling callback success [INFO] GE(64685,python3.7):2024-01-11-05:31:02.122.302 [parallel_partitioner.cc:165][EVENT]64685 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [24] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.122.350 [parallel_partitioner.cc:178][EVENT]64685 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [19] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.122.410 [graph_prepare.cc:1378][EVENT]64685 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.122.953 [graph_manager.cc:1050][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [566] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.122.982 [graph_manager.cc:1052][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.118 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.150 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.230 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [68] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.244 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.330 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [17] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.348 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.365 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.473 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.123.493 [graph_manager.cc:1054][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [498] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.130.857 [graph_manager.cc:1055][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7349] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.131.924 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.131.954 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of MergePass is [4] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.131.965 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of InferShapePass is [328] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.131.975 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [16] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.131.997 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.132.006 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [18] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.132.015 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [21] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.132.023 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of InferValuePass is [7] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.127 [graph_manager.cc:1056][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3231] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.194 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.213 [graph_prepare.cc:1982][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [51] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.567 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.592 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.602 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of InferShapePass is [186] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.611 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [9] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.620 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [0] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.628 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.637 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.645 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of InferValuePass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.671 [graph_prepare.cc:1983][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [444] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.696 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [6] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.708 [graph_prepare.cc:1984][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [21] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.721 [graph_prepare.cc:1985][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.753 [graph_prepare.cc:1986][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [20] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.766 [graph_prepare.cc:1987][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.781 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.806 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.820 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.898 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.911 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of CondPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.920 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrintOpPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.928 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.936 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.945 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.953 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.961 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.969 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.977 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SnapshotPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.985 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.134.993 
[base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.001 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.010 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.033 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [10] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.048 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.081 [graph_prepare.cc:1988][EVENT]64685 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [305] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.135.094 [graph_manager.cc:1065][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [933] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.148.276 [graph_manager.cc:1077][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13161] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.148.347 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.148.382 [graph_manager.cc:1080][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [73] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.226 [graph_manager.cc:1081][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3827] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.269 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.284 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.296 [graph_manager.cc:1082][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [35] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.328 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.344 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.359 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.454 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [85] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.471 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.518 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [35] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.533 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.581 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [36] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.602 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.622 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [8] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.681 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [49] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.703 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.716 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.725 [graph_manager.cc:2700][EVENT]64685 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [401] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.833 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of EnterPass is [0] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.849 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.858 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.876 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.885 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.894 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of CastRemovePass is [10] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.902 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.911 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [4] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.919 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.927 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.935 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [6] micro second, call num is [3] [INFO] 
GE(64685,python3.7):2024-01-11-05:31:02.152.943 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.951 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.959 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.967 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.977 [graph_manager.cc:2741][EVENT]64685 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [233] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.152.985 [graph_manager.cc:2752][EVENT]64685 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.010 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.023 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.040 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.054 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.066 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [3] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.077 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.106 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [17] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.121 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [5] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.135 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.152 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.165 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [5] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.176 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.193 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.205 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.213 [graph_manager.cc:2810][EVENT]64685 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [208] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.241 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.253 [graph_manager.cc:2821][EVENT]64685 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [31] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.282 [graph_manager.cc:1087][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [967] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.420 [graph_manager.cc:1088][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [125] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.462 [graph_manager.cc:1089][EVENT]64685 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [23] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.482 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.496 [graph_manager.cc:1097][EVENT]64685 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.516 [graph_manager.cc:3325][EVENT]64685 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.900 [engine_place.cc:144][EVENT]64685 Run:The time cost of AIcoreEngine::CheckSupported is [256] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.927 [engine_place.cc:144][EVENT]64685 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [8] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.153.936 [engine_place.cc:144][EVENT]64685 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [9] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.025 [graph_manager.cc:3351][EVENT]64685 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [495] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.046 [graph_manager.cc:3364][EVENT]64685 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.126 [engine_partitioner.cc:1139][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [22] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.143 [engine_partitioner.cc:1142][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.292 [engine_partitioner.cc:1148][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [131] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.338 [engine_partitioner.cc:1155][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [33] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.385 [engine_partitioner.cc:1164][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [36] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.418 [graph_manager.cc:3405][EVENT]64685 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [358] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.154.436 [graph_manager.cc:3412][EVENT]64685 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.274 [graph_manager.cc:3422][EVENT]64685 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [11822] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.309 [graph_manager.cc:3428][EVENT]64685 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [7] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.436 [graph_manager.cc:3467][EVENT]64685 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [107] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.455 [graph_manager.cc:3377][EVENT]64685 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [12396] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.470 [graph_manager.cc:1106][EVENT]64685 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [12960] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.482 [graph_manager.cc:1115][EVENT]64685 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.505 [graph_manager.cc:1130][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.534 [graph_manager.cc:1131][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [17] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.562 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [9] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.579 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [5] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.588 [graph_manager.cc:2837][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [37] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.659 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.672 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.681 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.689 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.698 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [7] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.716 [base_pass.cc:339][EVENT]64685 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [3] micro second, call num is [3] [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.727 [graph_manager.cc:2864][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [122] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.739 [graph_manager.cc:2872][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.760 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::FlowCtrlPass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.772 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.784 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.799 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [5] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.813 [compile_nodes_pass.cc:88][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.823 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.833 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.918 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [76] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.940 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [9] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.952 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.964 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.977 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.166.986 [graph_manager.cc:2927][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [231] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.167.006 [graph_manager.cc:2937][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [11] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.167.022 [graph_manager.cc:2943][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [6] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.167.034 [graph_manager.cc:2950][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.422 [graph_manager.cc:2958][EVENT]64685 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [43] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.475 [graph_manager.cc:1132][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [10926] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.561 [graph_manager.cc:1135][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [58] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.609 [graph_manager.cc:2975][EVENT]64685 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [29] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.651 [graph_manager.cc:2981][EVENT]64685 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [26] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.668 [pass_manager.cc:82][EVENT]64685 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.678 [graph_manager.cc:2986][EVENT]64685 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.688 [graph_manager.cc:1136][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [109] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.806 [graph_manager.cc:3555][EVENT]64685 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [83] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.912 [engine_partitioner.cc:1139][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [15] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.177.929 [engine_partitioner.cc:1142][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.035 [engine_partitioner.cc:1148][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [96] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.064 [engine_partitioner.cc:1155][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [17] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.105 [engine_partitioner.cc:1164][EVENT]64685 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [29] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.127 [graph_builder.cc:865][EVENT]64685 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [249] micro second. [INFO] RUNTIME(64685,python3.7):2024-01-11-05:31:02.178.599 [logger.cc:1071] 64685 ModelBindStream: model_id=1344, stream_id=1601, flag=0. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.643 [task_generator.cc:804][EVENT]64685 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [178] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.178.716 [task_generator.cc:805][EVENT]64685 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [60] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.473 [task_generator.cc:814][EVENT]64685 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [741] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.489 [task_generator.cc:954][EVENT]64685 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1024] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.560 [task_generator.cc:967][EVENT]64685 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [43] micro second. 
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:31:02.179.579 [logger.cc:1084] 64685 ModelUnbindStream: model_id=1344, stream_id=1601, [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.768 [graph_manager.cc:1152][EVENT]64685 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2053] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.797 [graph_manager.cc:1164][EVENT]64685 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.832 [graph_manager.cc:1271][EVENT]64685 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [57660] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.179.843 [graph_manager.cc:1272][EVENT]64685 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(64685,python3.7):2024-01-11-05:31:02.180.181 [atrace_api.c:93](tid:64685) AtraceDestroy start [INFO] ATRACE(64685,python3.7):2024-01-11-05:31:02.180.206 [atrace_api.c:95](tid:64685) AtraceDestroy end [INFO] GE(64685,python3.7):2024-01-11-05:31:02.185.187 [graph_converter.cc:838][EVENT]64685 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1448] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.185.358 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of ZeroCopy is [125] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.185.835 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CEM is [455] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.031 [copy_flow_launch_fuse.cc:395][EVENT]64685 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [171] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.052 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [194] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.291 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [226] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.319 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [9] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.358 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of ZeroCopy is [26] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.547 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CEM is [174] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.628 [copy_flow_launch_fuse.cc:395][EVENT]64685 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [63] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.642 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [79] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.670 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [18] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.680 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.704 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.773 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CEM is [59] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.837 [copy_flow_launch_fuse.cc:395][EVENT]64685 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [53] micro second. [INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.848 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [64] micro second. 
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.872 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [15] micro second.
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.882 [base_optimizer.cc:70][EVENT]64685 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second.
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.186.894 [graph_converter.cc:849][EVENT]64685 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1668] micro second.
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.187.110 [graph_converter.cc:853][EVENT]64685 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [197] micro second.
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.187.798 [graph_converter.cc:857][EVENT]64685 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [671] micro second.
[INFO] GE(64685,python3.7):2024-01-11-05:31:02.187.949 [graph_converter.cc:862][EVENT]64685 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [126] micro second.
.
============================== 2 passed in 21.65s ==============================
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.015.309 [status:INIT] [ge_api.cc:463]64685 ~Session:Start to destruct session.
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.015.519 [status:RUNNING] [ge_api.cc:475]64685 ~Session:Session id is 0
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.015.539 [status:RUNNING] [ge_api.cc:476]64685 ~Session:Destroying session
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.016.451 [status:STOP] [ge_api.cc:491]64685 ~Session:Session Destructor finished
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.016.482 [status:INIT] [ge_api.cc:301]64685 GEFinalize:GEFinalize start
[INFO] GE(64685,python3.7):2024-01-11-05:31:04.016.555 [execution_runtime.cc:80][EVENT]64685 FinalizeExecutionRuntime:Execution runtime finalize begin.
[INFO] GE(64685,python3.7):2024-01-11-05:31:04.016.574 [execution_runtime.cc:92][EVENT]64685 FinalizeExecutionRuntime:Execution runtime finalized.
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.016.586 [status:RUNNING] [ge_api.cc:313]64685 GEFinalize:Finalizing environment
[INFO] TUNE(64685,python3.7):2024-01-11-05:31:04.304.156 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:64685]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]"
[INFO] TUNE(64685,python3.7):2024-01-11-05:31:04.304.217 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:64685]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!"
[INFO] GE(64685,python3.7):2024-01-11-05:31:04.305.566 [gelib.cc:324][EVENT]64685 SystemFinalize:Online infer finalize GELib success.
[TRACE] GE(64685,python3.7):2024-01-11-05:31:04.672.724 [status:STOP] [ge_api.cc:341]64685 GEFinalize:GEFinalize finished
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.830.620 [process_mode_manager.cpp:184][Close][tid:64685] [TsdClient] Close [deviceId=3][sessionId=1] hccp and computer enter
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.830.658 [version_verify.cpp:112][SpecialFeatureCheck][tid:64685] VersionVerify: previous type[7], supported
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.830.699 [process_mode_manager.cpp:192][Close][tid:64685] [TsdClient][deviceId=3] [sessionId=1] wait hccp and computer process close respond
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.852.334 [process_mode_manager.cpp:197][Close][tid:64685] [TsdClient][logicDeviceId_=3]has recv close hccp and computer process respond
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.852.349 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:64685] enter into CloseInHost deviceid[3]
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.852.360 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:64685] host cpu not support
[INFO] TDT(64685,python3.7):2024-01-11-05:31:04.852.399 [process_mode_manager.cpp:208][Close][tid:64685] [TsdClient][deviceId=3] [sessionId=1] close hccp and computer process success
[INFO] ATRACE(64685,python3.7):2024-01-11-05:31:04.852.412 [atrace_api.c:93](tid:64685) AtraceDestroy start
[INFO] ATRACE(64685,python3.7):2024-01-11-05:31:04.852.428 [atrace_api.c:95](tid:64685) AtraceDestroy end
[INFO] PROFILING(64685,python3.7):2024-01-11-05:31:04.852.450 [msprofiler_impl.cpp:156] >>> (tid:64685) ProfNotifySetDevice called, is open: 0, devId: 3
[INFO] RUNTIME(64685,python3.7):2024-01-11-05:31:06.355.852 [runtime.cc:1737] 64685 ~Runtime: deconstruct runtime.