============================= test session starts ==============================
platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1
rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_008/sault/config/pytest.ini
plugins: anyio-3.7.1, forked-1.1.3, xdist-1.32.0
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:34.425.932 [trace_attr.c:105](tid:28287) platform is 1.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:34.426.097 [trace_recorder.c:114](tid:28287) use root path: /home/jenkins/ascend/atrace
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:34.426.122 [trace_signal.c:133](tid:28287) register signal handler for signo 2 succeed.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:34.426.133 [trace_signal.c:133](tid:28287) register signal handler for signo 15 succeed.
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:34.816.049 [runtime.cc:1159] 28287 GetAicoreNumByLevel: workingDev_=0
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:34.816.118 [runtime.cc:4719] 28287 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set
collected 2 items

test_round.py
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.217.725 [process_mode_manager.cpp:109][OpenProcess][tid:28287] [ProcessModeManager] enter into open process deviceId[7] rankSize[0]
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.218.724 [process_mode_manager.cpp:379][InitTsdClient][tid:28287] [TsdClient] deviceId[7] begin to init hdc client
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.218.872 [version_verify.cpp:34][SetVersionInfo][tid:28287] VersionVerify: send client version to server
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.218.917 [version_verify.cpp:50][SetVersionInfo][tid:28287] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.218.929 [version_verify.cpp:50][SetVersionInfo][tid:28287] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.280 [version_verify.cpp:66][PeerVersionCheck][tid:28287] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.296 [version_verify.cpp:87][ParseVersionInfo][tid:28287] VersionVerify: pass client version info success
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.305 [hdc_client.cpp:276][CheckHdcConnection][tid:28287] Service[2] create hdc success
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.321 [version_verify.cpp:120][SpecialFeatureCheck][tid:28287] VersionVerify: new type[35], supported
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.368 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:28287] [TsdClient][deviceId=7] [sessionId=1] wait package info respond
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.502 [process_mode_manager.cpp:379][InitTsdClient][tid:28287] [TsdClient] deviceId[7] begin to init hdc client
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.646 [version_verify.cpp:34][SetVersionInfo][tid:28287] VersionVerify: send client version to server
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.658 [version_verify.cpp:50][SetVersionInfo][tid:28287] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.667 [version_verify.cpp:50][SetVersionInfo][tid:28287] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.818 [version_verify.cpp:66][PeerVersionCheck][tid:28287] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.831 [version_verify.cpp:87][ParseVersionInfo][tid:28287] VersionVerify: pass client version info success
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.840 [hdc_client.cpp:276][CheckHdcConnection][tid:28287] Service[2] create hdc success
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.851 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:28287] [TsdClient] tsd get process sign successfully, procpid[28287] signSize[48]
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.873 [version_verify.cpp:112][SpecialFeatureCheck][tid:28287] VersionVerify: previous type[6], supported
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.219.894 [process_mode_manager.cpp:126][OpenProcess][tid:28287] [ProcessModeManager] deviceId[7] sessionId[1] rankSize[0], wait sub process start respond
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.450.381 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:28287] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.450.427 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:28287] enter into OpenInHost deviceid[7]
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.450.438 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:28287] host cpu not support
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.450.446 [process_mode_manager.cpp:156][OpenProcess][tid:28287] [TsdClient][deviceId=7] [sessionId=1] start hccp and computer process success
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:39.453.222 [device.cc:340] 28287 Init: isDoubledie:0, topologytype:0
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:39.465.939 [npu_driver.cc:5428] 29731 GetDeviceStatus: GetDeviceStatus status=1.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:39.466.658 [atrace_api.c:28](tid:28287) AtraceCreate start
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:39.466.785 [trace_rb_log.c:84](tid:28287) [RUNTIME_ATRACE_DEV7_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:39.466.800 [atrace_api.c:32](tid:28287) AtraceCreate end
[INFO] TDT(28287,python3.7):2024-01-11-05:33:39.466.833 [client_manager.cpp:157][SetProfilingCallback][tid:28287] [TsdClient] set profiling callback success
[TRACE] GE(28287,python3.7):2024-01-11-05:33:39.614.185 [status:INIT] [ge_api.cc:144]28287 GEInitializeImpl:GEInitialize start
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:39.841.354 [msprofiler_impl.cpp:156] >>> (tid:28287) ProfNotifySetDevice called, is open: 1, devId: 7
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:39.841.500 [platform.cpp:38] >>> (tid:28287) Profiling platform version: 1.0.
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:39.841.518 [ai_drv_dev_api.cpp:384] >>> (tid:28287) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(28287,python3.7):2024-01-11-05:33:39.893.204 [status:RUNNING] [ge_api.cc:211]28287 GEInitializeImpl:Initializing environment
[INFO] GE(28287,python3.7):2024-01-11-05:33:39.893.271 [gelib.cc:98][EVENT]28287 Initialize:[GEPERFTRACE] GE Init Start
[INFO] GE(28287,python3.7):2024-01-11-05:33:39.893.593 [gelib.cc:307][EVENT]28287 SystemInitialize:Online infer init GELib success, device id :7
[INFO] DVPP(28287,python3.7):2024-01-11-05:33:40.263.531 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:28287]dvpp engine do not support
[INFO] TUNE(28287,python3.7):2024-01-11-05:33:40.268.405 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:28287]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!"
[INFO] TUNE(28287,python3.7):2024-01-11-05:33:40.268.450 [handle_manager.cpp:115][CANNKB][Tid:28287]"Start to run init functions to load dynamic python lib!"
[INFO] TUNE(28287,python3.7):2024-01-11-05:33:40.268.511 [handle_manager.cpp:407][CANNKB][Tid:28287]"Init functions of loading dynamic python lib end!"
[INFO] TUNE(28287,python3.7):2024-01-11-05:33:40.268.522 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:28287]"CANN_KB_Py has already been initialized."
[INFO] TUNE(28287,python3.7):2024-01-11-05:33:40.268.591 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:28287]"CannKbPyfuncMgr: Run PyObjectInit successfully!"
[INFO] HCCL(28287,python3.7):2024-01-11-05:33:52.048.767 [plugin_manager.cc:42][28287]hcom running normal mode.
[INFO] DVPP(28287,python3.7):2024-01-11-05:33:52.049.600 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:28287]dvpp ops kernel info store do not support
[INFO] DVPP(28287,python3.7):2024-01-11-05:33:52.049.770 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:28287]dvpp graph optimizer do not support
[INFO] DVPP(28287,python3.7):2024-01-11-05:33:52.564.063 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:28287]dvpp ops kernel builder do not support
[INFO] GE(28287,python3.7):2024-01-11-05:33:52.573.266 [gelib.cc:169][EVENT]28287 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12679898] micro second.
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.664.664 [status:STOP] [ge_api.cc:255]28287 GEInitializeImpl:GEInitialize finished
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.664.821 [status:INIT] [ge_api.cc:398]28287 Session:Start to construct session.
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.664.838 [status:RUNNING] [ge_api.cc:408]28287 Session:Creating session
[INFO] GE(28287,python3.7):2024-01-11-05:33:52.665.305 [graph_var_manager.cc:1445][EVENT]28287 SetMemoryMallocSize:Total memory size is 34359738368
[INFO] GE(28287,python3.7):2024-01-11-05:33:52.665.323 [graph_var_manager.cc:1424][EVENT]28287 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:52.665.696 [msprofiler_impl.cpp:156] >>> (tid:28287) ProfNotifySetDevice called, is open: 1, devId: 7
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.666.537 [status:RUNNING] [ge_api.cc:411]28287 Session:Session id is 0
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.666.556 [status:STOP] [ge_api.cc:420]28287 Session:Session Constructor finished
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:52.676.280 [platform.cpp:38] >>> (tid:28287) Profiling platform version: 1.0.
[INFO] PROFILING(28287,python3.7):2024-01-11-05:33:52.676.305 [ai_drv_dev_api.cpp:384] >>> (tid:28287) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(28287,python3.7):2024-01-11-05:33:52.676.557 [status:INIT] [ge_api.cc:144]28287 GEInitializeImpl:GEInitialize start
TotalTime = 0.0566461, [20]
    [parse]: 0.0119703
    [symbol_resolve]: 0.0267766, [1]
        [Cycle 1]: 0.0267071, [1]
            [resolve]: 0.026684
    [combine_like_graphs]: 1.58e-06
    [graph_reusing]: 3.27e-06
    [meta_unpack_prepare]: 7.296e-05
    [pre_cconv]: 3.81e-06
    [abstract_specialize]: 0.00279071
    [pack_expand]: 1.091e-05
    [auto_monad]: 8.31e-05
    [inline]: 1.49e-06
    [pre_auto_parallel]: 1.46e-05
    [pipeline_split]: 3.24e-06
    [optimize]: 0.00795633, [35]
        [py_interpret_to_execute]: 3.88e-06
        [rewriter_before_opt_a]: 4.621e-05
        [opt_a]: 0.00735784, [2]
            [Cycle 1]: 0.00082464, [30]
                [expand_dump_flag]: 3.38e-06
                [switch_simplify]: 1.304e-05
                [a_1]: 0.00018221
                [recompute_prepare]: 2.3e-06
                [updatestate_depend_eliminate]: 6.45e-06
                [updatestate_assign_eliminate]: 3.25e-06
                [updatestate_loads_eliminate]: 3.3e-06
                [parameter_eliminate]: 3.43e-06
                [a_2]: 2.856e-05
                [accelerated_algorithm]: 2.74e-06
                [pynative_shard]: 2.01001e-06
                [auto_parallel]: 3.94e-06
                [parallel]: 2.383e-05
                [merge_comm]: 1.469e-05
                [allreduce_fusion]: 1.85e-06
                [virtual_dataset]: 2.35e-06
                [get_grad_eliminate_]: 1.94e-06
                [virtual_output]: 1.63e-06
                [merge_forward]: 5.17e-06
                [cell_reuse_recompute_pass]: 9e-07
                [cell_reuse_handle_not_recompute_node_pass]: 1.994e-05
                [meta_fg_expand]: 3.04e-06
                [after_resolve]: 4.67e-06
                [a_after_grad]: 2.2e-06
                [renormalize]: 0.00028404
                [real_op_eliminate]: 4.49e-06
                [auto_monad_grad]: 4.07e-06
                [auto_monad_eliminator]: 1.001e-05
                [cse]: 2.506e-05
                [a_3]: 1.444e-05
            [Cycle 2]: 0.00022678, [30]
                [expand_dump_flag]: 1.09e-06
                [switch_simplify]: 2.16e-06
                [a_1]: 1.386e-05
                [recompute_prepare]: 1.51e-06
                [updatestate_depend_eliminate]: 2.71e-06
                [updatestate_assign_eliminate]: 2.22e-06
                [updatestate_loads_eliminate]: 2e-06
                [parameter_eliminate]: 8.59996e-07
                [a_2]: 2.562e-05
                [accelerated_algorithm]: 2.05e-06
                [pynative_shard]: 9.10004e-07
                [auto_parallel]: 3.55001e-06
                [parallel]: 3.26e-06
                [merge_comm]: 1.55e-06
                [allreduce_fusion]: 1.18e-06
                [virtual_dataset]: 1.89e-06
                [get_grad_eliminate_]: 1.64e-06
                [virtual_output]: 1.55999e-06
                [merge_forward]: 2.63e-06
                [cell_reuse_recompute_pass]: 4.1e-07
                [cell_reuse_handle_not_recompute_node_pass]: 5.74e-06
                [meta_fg_expand]: 1.58e-06
                [after_resolve]: 3.73e-06
                [a_after_grad]: 2.18e-06
                [renormalize]: 5.99975e-08
                [real_op_eliminate]: 1.72e-06
                [auto_monad_grad]: 7.00005e-07
                [auto_monad_eliminator]: 3.89e-06
                [cse]: 7.59e-06
                [a_3]: 1.233e-05
        [py_interpret_to_execute_after_opt_a]: 3.09999e-06
        [slice_cell_reuse_recomputed_activation]: 2.45999e-06
        [rewriter_after_opt_a]: 3.796e-05
        [convert_after_rewriter]: 6.89999e-06
        [order_py_execute_after_rewriter]: 5.41e-06
        [opt_b]: 8.892e-05, [1]
            [Cycle 1]: 8.407e-05, [7]
                [b_1]: 3.811e-05
                [b_2]: 3.09e-06
                [updatestate_depend_eliminate]: 2.24001e-06
                [updatestate_assign_eliminate]: 2.22e-06
                [updatestate_loads_eliminate]: 2.1e-06
                [renormalize]: 3.20004e-07
                [cse]: 7.22001e-06
        [cconv]: 2.326e-05
        [opt_after_cconv]: 4.917e-05, [1]
            [Cycle 1]: 4.538e-05, [7]
                [c_1]: 4.58e-06
                [parameter_eliminate]: 6.40001e-07
                [updatestate_depend_eliminate]: 1.96e-06
                [updatestate_assign_eliminate]: 1.72e-06
                [updatestate_loads_eliminate]: 2.12e-06
                [cse]: 6.36e-06
                [renormalize]: 1.89997e-07
        [remove_dup_value]: 1.129e-05
        [tuple_transform]: 3.425e-05, [1]
            [Cycle 1]: 3.065e-05, [3]
                [d_1]: 1.256e-05
                [d_2]: 5.57e-06
                [renormalize]: 1.59998e-07
        [add_cache_embedding]: 1.102e-05
        [add_recomputation]: 4.892e-05
        [cse_after_recomputation]: 1.498e-05, [1]
            [Cycle 1]: 1.102e-05, [1]
                [cse]: 6.52e-06
        [environ_conv]: 1.632e-05
        [label_micro_interleaved_index]: 2.43e-06
        [label_fine_grained_interleaved_index]: 2.78e-06
        [assign_add_opt]: 2.05e-06
        [slice_recompute_activation]: 2.11e-06
        [micro_interleaved_order_control]: 2.07e-06
        [full_micro_interleaved_order_control]: 2.19001e-06
        [comp_comm_scheduling]: 2.54e-06
        [reorder_send_recv_between_fp_bp]: 3.17e-06
        [comm_op_add_attrs]: 1.07e-06
        [add_comm_op_reuse_tag]: 8.90002e-07
        [overlap_opt_shard_in_pipeline]: 1.52e-06
        [grouped_pairwise_exchange_alltoall]: 1.59e-06
        [overlap_recompute_and_grad_model_parallel]: 1.73e-06
        [overlap_grad_matmul_and_grad_allreduce]: 9.89996e-07
        [split_matmul_comm_elemetwise]: 2.68e-06
        [split_layernorm_comm]: 1.84e-06
        [process_send_recv_for_ge]: 2.26e-06
        [handle_group_info]: 9.79999e-07
    [auto_monad_reorder]: 1.93e-05
    [get_jit_bprop_graph]: 3.89999e-07
    [eliminate_special_op_node]: 0.00045506
    [validate]: 5.757e-05
    [distribtued_split]: 1.25e-06
    [task_emit]: 0.00617984
    [execute]: 7.58e-06
Sums
    parse : 0.011970s : 24.18%
    symbol_resolve.resolve : 0.026684s : 53.91%
    combine_like_graphs : 0.000002s : 0.00%
    graph_reusing : 0.000003s : 0.01%
    meta_unpack_prepare : 0.000073s : 0.15%
    pre_cconv : 0.000004s : 0.01%
    abstract_specialize : 0.002791s : 5.64%
    pack_expand : 0.000011s : 0.02%
    auto_monad : 0.000083s : 0.17%
    inline : 0.000001s : 0.00%
    pre_auto_parallel : 0.000015s : 0.03%
    pipeline_split : 0.000003s : 0.01%
    optimize.py_interpret_to_execute : 0.000004s : 0.01%
    optimize.rewriter_before_opt_a : 0.000046s : 0.09%
    optimize.opt_a.expand_dump_flag : 0.000004s : 0.01%
    optimize.opt_a.switch_simplify : 0.000015s : 0.03%
    optimize.opt_a.a_1 : 0.000196s : 0.40%
    optimize.opt_a.recompute_prepare : 0.000004s : 0.01%
    optimize.opt_a.updatestate_depend_eliminate : 0.000009s : 0.02%
    optimize.opt_a.updatestate_assign_eliminate : 0.000005s : 0.01%
    optimize.opt_a.updatestate_loads_eliminate : 0.000005s : 0.01%
    optimize.opt_a.parameter_eliminate : 0.000004s : 0.01%
    optimize.opt_a.a_2 : 0.000054s : 0.11%
    optimize.opt_a.accelerated_algorithm : 0.000005s : 0.01%
    optimize.opt_a.pynative_shard : 0.000003s : 0.01%
    optimize.opt_a.auto_parallel : 0.000007s : 0.02%
    optimize.opt_a.parallel : 0.000027s : 0.05%
    optimize.opt_a.merge_comm : 0.000016s : 0.03%
    optimize.opt_a.allreduce_fusion : 0.000003s : 0.01%
    optimize.opt_a.virtual_dataset : 0.000004s : 0.01%
    optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.01%
    optimize.opt_a.virtual_output : 0.000003s : 0.01%
    optimize.opt_a.merge_forward : 0.000008s : 0.02%
    optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00%
    optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000026s : 0.05%
    optimize.opt_a.meta_fg_expand : 0.000005s : 0.01%
    optimize.opt_a.after_resolve : 0.000008s : 0.02%
    optimize.opt_a.a_after_grad : 0.000004s : 0.01%
    optimize.opt_a.renormalize : 0.000284s : 0.57%
    optimize.opt_a.real_op_eliminate : 0.000006s : 0.01%
    optimize.opt_a.auto_monad_grad : 0.000005s : 0.01%
    optimize.opt_a.auto_monad_eliminator : 0.000014s : 0.03%
    optimize.opt_a.cse : 0.000033s : 0.07%
    optimize.opt_a.a_3 : 0.000027s : 0.05%
    optimize.py_interpret_to_execute_after_opt_a : 0.000003s : 0.01%
    optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00%
    optimize.rewriter_after_opt_a : 0.000038s : 0.08%
    optimize.convert_after_rewriter : 0.000007s : 0.01%
    optimize.order_py_execute_after_rewriter : 0.000005s : 0.01%
    optimize.opt_b.b_1 : 0.000038s : 0.08%
    optimize.opt_b.b_2 : 0.000003s : 0.01%
    optimize.opt_b.updatestate_depend_eliminate : 0.000002s : 0.00%
    optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00%
    optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00%
    optimize.opt_b.renormalize : 0.000000s : 0.00%
    optimize.opt_b.cse : 0.000007s : 0.01%
    optimize.cconv : 0.000023s : 0.05%
    optimize.opt_after_cconv.c_1 : 0.000005s : 0.01%
    optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00%
    optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00%
    optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00%
    optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00%
    optimize.opt_after_cconv.cse : 0.000006s : 0.01%
    optimize.opt_after_cconv.renormalize : 0.000000s : 0.00%
    optimize.remove_dup_value : 0.000011s : 0.02%
    optimize.tuple_transform.d_1 : 0.000013s : 0.03%
    optimize.tuple_transform.d_2 : 0.000006s : 0.01%
    optimize.tuple_transform.renormalize : 0.000000s : 0.00%
    optimize.add_cache_embedding : 0.000011s : 0.02%
    optimize.add_recomputation : 0.000049s : 0.10%
    optimize.cse_after_recomputation.cse : 0.000007s : 0.01%
    optimize.environ_conv : 0.000016s : 0.03%
    optimize.label_micro_interleaved_index : 0.000002s : 0.00%
    optimize.label_fine_grained_interleaved_index : 0.000003s : 0.01%
    optimize.assign_add_opt : 0.000002s : 0.00%
    optimize.slice_recompute_activation : 0.000002s : 0.00%
    optimize.micro_interleaved_order_control : 0.000002s : 0.00%
    optimize.full_micro_interleaved_order_control : 0.000002s : 0.00%
    optimize.comp_comm_scheduling : 0.000003s : 0.01%
    optimize.reorder_send_recv_between_fp_bp : 0.000003s : 0.01%
    optimize.comm_op_add_attrs : 0.000001s : 0.00%
    optimize.add_comm_op_reuse_tag : 0.000001s : 0.00%
    optimize.overlap_opt_shard_in_pipeline : 0.000002s : 0.00%
    optimize.grouped_pairwise_exchange_alltoall : 0.000002s : 0.00%
    optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00%
    optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00%
    optimize.split_matmul_comm_elemetwise : 0.000003s : 0.01%
    optimize.split_layernorm_comm : 0.000002s : 0.00%
    optimize.process_send_recv_for_ge : 0.000002s : 0.00%
    optimize.handle_group_info : 0.000001s : 0.00%
    auto_monad_reorder : 0.000019s : 0.04%
    get_jit_bprop_graph : 0.000000s : 0.00%
    eliminate_special_op_node : 0.000455s : 0.92%
    validate : 0.000058s : 0.12%
    distribtued_split : 0.000001s : 0.00%
    task_emit : 0.006180s : 12.48%
    execute : 0.000008s : 0.02%
Time group info:
------[substitution.] 0.026576 37
    99.42% : 0.026423s : 8: substitution.getattr_setattr_resolve
    0.02% : 0.000005s : 3: substitution.graph_param_transform
    0.33% : 0.000089s : 3: substitution.inline
    0.12% : 0.000032s : 13: substitution.meta_unpack_prepare
    0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate
    0.06% : 0.000015s : 4: substitution.remove_not_recompute_node
    0.01% : 0.000003s : 2: substitution.replace_old_param
    0.03% : 0.000008s : 1: substitution.tuple_list_get_item_eliminator
------[renormalize.] 0.000278 2
    59.57% : 0.000166s : 1: renormalize.infer
    40.43% : 0.000113s : 1: renormalize.specialize
------[replace.] 0.000152 10
    80.28% : 0.000122s : 6: replace.getattr_setattr_resolve
    15.56% : 0.000024s : 3: replace.inline
    4.16% : 0.000006s : 1: replace.tuple_list_get_item_eliminator
------[match.] 0.026432 10
    99.63% : 0.026335s : 6: match.getattr_setattr_resolve
    0.34% : 0.000089s : 3: match.inline
    0.03% : 0.000008s : 1: match.tuple_list_get_item_eliminator
------[func_graph_cloner_run.] 0.000461 10
    68.08% : 0.000314s : 5: func_graph_cloner_run.FuncGraphClonerGraph
    31.92% : 0.000147s : 5: func_graph_cloner_run.FuncGraphSpecializer
------[meta_graph.] 0.000000 0
------[manager.] 0.000000 0
------[pynative] 0.000000 0
------[others.] 0.027073 105
    0.32% : 0.000087s : 52: opt.transform.opt_a
    0.11% : 0.000030s : 23: opt.transform.opt_b
    98.40% : 0.026639s : 2: opt.transform.opt_resolve
    0.19% : 0.000051s : 1: opt.transforms.meta_unpack_prepare
    0.87% : 0.000236s : 20: opt.transforms.opt_a
    0.01% : 0.000003s : 1: opt.transforms.opt_after_cconv
    0.01% : 0.000002s : 1: opt.transforms.opt_b
    0.06% : 0.000017s : 2: opt.transforms.opt_trans_graph
    0.03% : 0.000007s : 3: opt.transforms.special_op_eliminate
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.033.635 [scalable_config.cc:55][EVENT]31309 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.116.142 [graph_var_manager.cc:1424][EVENT]31309 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.116.270 [graph_manager.cc:1248][EVENT]31309 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:53.117.148 [atrace_api.c:28](tid:31309) AtraceCreate start
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:53.117.229 [trace_rb_log.c:84](tid:31309) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(28287,python3.7):2024-01-11-05:33:53.117.243 [atrace_api.c:32](tid:31309) AtraceCreate end
[INFO] TDT(28287,python3.7):2024-01-11-05:33:53.117.274 [client_manager.cpp:157][SetProfilingCallback][tid:31309] [TsdClient] set profiling callback success
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.118.332 [parallel_partitioner.cc:165][EVENT]31309 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [38] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.118.379 [parallel_partitioner.cc:178][EVENT]31309 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [19] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.118.459 [graph_prepare.cc:1378][EVENT]31309 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [15] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.093 [graph_manager.cc:1050][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [677] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.125 [graph_manager.cc:1052][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.287 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [3] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.320 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.456 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [123] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.471 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.588 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [39] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.602 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [2] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.628 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [14] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.735 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.119.756 [graph_manager.cc:1054][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [617] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.127.112 [graph_manager.cc:1055][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7342] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.540 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.564 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [4] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.586 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of MergePass is [17] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.596 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of InferShapePass is [351] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.605 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [42] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.614 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.623 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [149] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.631 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [18] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.128.639 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of InferValuePass is [11] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.130.766 [graph_manager.cc:1056][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3618] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.130.828 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of CondRemovePass is [5] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.130.846 [graph_prepare.cc:1982][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [50] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.193 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.214 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.224 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.233 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of InferShapePass is [174] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.242 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.250 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.258 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [7] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.266 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [11] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.274 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of InferValuePass is [4] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.299 [graph_prepare.cc:1983][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [439] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.323 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.335 [graph_prepare.cc:1984][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [21] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.350 [graph_prepare.cc:1985][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [4] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.378 [graph_prepare.cc:1986][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [9] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.391 [graph_prepare.cc:1987][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.409 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.421 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.435 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [5] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.519 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of EnterPass is [3] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.532 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of CondPass is [3] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.540 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrintOpPass is [4] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.549 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.557 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.565 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.573 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.581 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.590 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.598 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.606 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.614 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.622 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.630 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [5] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.638 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [2] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.646 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3]
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.669 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [10] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.683 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.719 [graph_prepare.cc:1988][EVENT]31309 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [317] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.131.733 [graph_manager.cc:1065][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [937] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.144.875 [graph_manager.cc:1077][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13122] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.144.937 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.144.985 [graph_manager.cc:1080][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [79] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.186 [graph_manager.cc:1081][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [4186] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.225 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.239 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.251 [graph_manager.cc:1082][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [35] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.283 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.300 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.314 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.424 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [100] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.442 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.493 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [38] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.510 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.549 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [28] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.577 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [15] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.608 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [20] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.675 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [57] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.716 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [28] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.735 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.745 [graph_manager.cc:2700][EVENT]31309 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [467] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.893 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.908 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of AddNPass is [0] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.917 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.927 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.936 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of MergePass is [0] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.945 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of CastRemovePass is [10] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.953 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.961 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [20] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.969 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [20] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.977 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [1] micro second, call num is [3] [INFO] 
GE(28287,python3.7):2024-01-11-05:33:53.149.986 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.149.994 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.002 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.010 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.018 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.027 [graph_manager.cc:2741][EVENT]31309 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [264] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.039 [graph_manager.cc:2752][EVENT]31309 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.064 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.077 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.093 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.109 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.128 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.142 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.169 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [18] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.185 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.199 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.211 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.225 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.237 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.256 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.268 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.278 [graph_manager.cc:2810][EVENT]31309 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [219] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.306 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.318 [graph_manager.cc:2821][EVENT]31309 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [32] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.347 [graph_manager.cc:1087][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [1076] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.489 [graph_manager.cc:1088][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [126] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.545 [graph_manager.cc:1089][EVENT]31309 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [37] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.565 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.581 [graph_manager.cc:1097][EVENT]31309 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.150.603 [graph_manager.cc:3325][EVENT]31309 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.014 [engine_place.cc:144][EVENT]31309 Run:The time cost of AIcoreEngine::CheckSupported is [269] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.037 [engine_place.cc:144][EVENT]31309 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.047 [engine_place.cc:144][EVENT]31309 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.186 [graph_manager.cc:3351][EVENT]31309 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [570] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.205 [graph_manager.cc:3364][EVENT]31309 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.274 [engine_partitioner.cc:1139][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [23] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.291 [engine_partitioner.cc:1142][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.429 [engine_partitioner.cc:1148][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [129] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.470 [engine_partitioner.cc:1155][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [29] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.520 [engine_partitioner.cc:1164][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [38] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.554 [graph_manager.cc:3405][EVENT]31309 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [337] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.151.574 [graph_manager.cc:3412][EVENT]31309 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.618 [graph_manager.cc:3422][EVENT]31309 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [12029] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.652 [graph_manager.cc:3428][EVENT]31309 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.781 [graph_manager.cc:3467][EVENT]31309 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [107] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.798 [graph_manager.cc:3377][EVENT]31309 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [12581] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.814 [graph_manager.cc:1106][EVENT]31309 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [13218] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.827 [graph_manager.cc:1115][EVENT]31309 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.860 [graph_manager.cc:1130][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.894 [graph_manager.cc:1131][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [19] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.955 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [41] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.974 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.163.991 [graph_manager.cc:2837][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [79] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.080 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.093 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.102 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of CondRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.110 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.119 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [18] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.127 [base_pass.cc:339][EVENT]31309 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.137 [graph_manager.cc:2864][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of 
OptimizeStage2::MergedGraphNameToPasses is [129] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.158 [graph_manager.cc:2872][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [12] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.178 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.194 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.210 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.225 [compile_nodes_pass.cc:88][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.236 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.246 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.339 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [83] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.387 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [35] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.400 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.415 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.428 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.437 [graph_manager.cc:2927][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [263] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.466 [graph_manager.cc:2937][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [20] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.515 [graph_manager.cc:2943][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [35] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.164.529 [graph_manager.cc:2950][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.264 [graph_manager.cc:2958][EVENT]31309 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [45] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.308 [graph_manager.cc:1132][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11399] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.385 [graph_manager.cc:1135][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [60] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.437 [graph_manager.cc:2975][EVENT]31309 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [33] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.512 [graph_manager.cc:2981][EVENT]31309 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [60] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.529 [pass_manager.cc:82][EVENT]31309 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.540 [graph_manager.cc:2986][EVENT]31309 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.550 [graph_manager.cc:1136][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [147] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.694 [graph_manager.cc:3555][EVENT]31309 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [111] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.835 [engine_partitioner.cc:1139][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [16] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.851 [engine_partitioner.cc:1142][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [2] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.959 [engine_partitioner.cc:1148][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [99] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.175.990 [engine_partitioner.cc:1155][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [17] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.176.034 [engine_partitioner.cc:1164][EVENT]31309 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [33] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.176.055 [graph_builder.cc:865][EVENT]31309 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [253] micro second. [INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:53.176.577 [logger.cc:1071] 31309 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.176.619 [task_generator.cc:804][EVENT]31309 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [177] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.176.708 [task_generator.cc:805][EVENT]31309 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [75] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.514 [task_generator.cc:814][EVENT]31309 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [783] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.539 [task_generator.cc:954][EVENT]31309 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1097] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.610 [task_generator.cc:967][EVENT]31309 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [43] micro second. 
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:53.177.630 [logger.cc:1084] 31309 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.845 [graph_manager.cc:1152][EVENT]31309 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2269] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.864 [graph_manager.cc:1164][EVENT]31309 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.907 [graph_manager.cc:1271][EVENT]31309 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [59771] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.177.920 [graph_manager.cc:1272][EVENT]31309 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:53.178.235 [atrace_api.c:93](tid:31309) AtraceDestroy start [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:53.178.259 [atrace_api.c:95](tid:31309) AtraceDestroy end [INFO] GE(28287,python3.7):2024-01-11-05:33:53.183.592 [graph_converter.cc:838][EVENT]31309 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1672] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.183.769 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of ZeroCopy is [136] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.267 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CEM is [477] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.479 [copy_flow_launch_fuse.cc:395][EVENT]31309 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [188] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.499 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [210] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.753 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [243] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.793 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [22] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.184.829 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of ZeroCopy is [23] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.017 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CEM is [175] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.097 [copy_flow_launch_fuse.cc:395][EVENT]31309 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [64] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.111 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [78] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.162 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [19] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.175 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.201 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.272 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CEM is [61] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.340 [copy_flow_launch_fuse.cc:395][EVENT]31309 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [56] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.360 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [77] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.388 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [18] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.398 [base_optimizer.cc:70][EVENT]31309 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.411 [graph_converter.cc:849][EVENT]31309 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1784] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.185.616 [graph_converter.cc:853][EVENT]31309 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [196] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.186.365 [graph_converter.cc:857][EVENT]31309 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [735] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:53.186.521 [graph_converter.cc:862][EVENT]31309 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [133] micro second. 
TotalTime = 0.0532908, [20] [parse]: 0.00142941 [symbol_resolve]: 0.0288228, [1] [Cycle 1]: 0.0287435, [1] [resolve]: 0.0287218 [combine_like_graphs]: 1.16e-06 [graph_reusing]: 4.15e-06 [meta_unpack_prepare]: 0.00011335 [pre_cconv]: 6.99998e-07 [abstract_specialize]: 0.0137593 [pack_expand]: 1.783e-05 [auto_monad]: 0.000163 [inline]: 1.55e-06 [pre_auto_parallel]: 1.188e-05 [pipeline_split]: 2.79e-06 [optimize]: 0.00824639, [35] [py_interpret_to_execute]: 3.87e-06 [rewriter_before_opt_a]: 9.859e-05 [opt_a]: 0.00761547, [3] [Cycle 1]: 0.00402348, [30] [expand_dump_flag]: 5.13e-06 [switch_simplify]: 8.979e-05 [a_1]: 0.00046789 [recompute_prepare]: 8.55e-06 [updatestate_depend_eliminate]: 9.82e-06 [updatestate_assign_eliminate]: 7.76e-06 [updatestate_loads_eliminate]: 6.71e-06 [parameter_eliminate]: 4.5e-06 [a_2]: 9.07e-05 [accelerated_algorithm]: 5.77e-06 [pynative_shard]: 1.62e-06 [auto_parallel]: 3.72e-06 [parallel]: 8.65e-06 [merge_comm]: 8.49e-06 [allreduce_fusion]: 3.17e-06 [virtual_dataset]: 5.13e-06 [get_grad_eliminate_]: 4.45e-06 [virtual_output]: 3.94e-06 [merge_forward]: 9.16e-06 [cell_reuse_recompute_pass]: 1.09e-06 [cell_reuse_handle_not_recompute_node_pass]: 1.202e-05 [meta_fg_expand]: 0.00080061 [after_resolve]: 2.301e-05 [a_after_grad]: 3.589e-05 [renormalize]: 0.00187892 [real_op_eliminate]: 4.174e-05 [auto_monad_grad]: 1.205e-05 [auto_monad_eliminator]: 3.804e-05 [cse]: 0.00013495 [a_3]: 0.00011547 [Cycle 2]: 0.0010167, [30] [expand_dump_flag]: 1.57e-06 [switch_simplify]: 1.278e-05 [a_1]: 0.00041991 [recompute_prepare]: 2.32e-06 [updatestate_depend_eliminate]: 3.82e-06 [updatestate_assign_eliminate]: 2.38e-06 [updatestate_loads_eliminate]: 2.11e-06 [parameter_eliminate]: 2.19e-06 [a_2]: 2.76e-05 [accelerated_algorithm]: 2.48e-06 [pynative_shard]: 1.06e-06 [auto_parallel]: 3.6e-06 [parallel]: 3.88e-06 [merge_comm]: 2.09e-06 [allreduce_fusion]: 1.35e-06 [virtual_dataset]: 2.32e-06 [get_grad_eliminate_]: 1.98e-06 [virtual_output]: 1.82e-06 
[merge_forward]: 3.14e-06 [cell_reuse_recompute_pass]: 4.50003e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.17e-06 [meta_fg_expand]: 1.283e-05 [after_resolve]: 3.64e-06 [a_after_grad]: 2.60001e-06 [renormalize]: 0.000308 [real_op_eliminate]: 5e-06 [auto_monad_grad]: 3.37e-06 [auto_monad_eliminator]: 6.44e-06 [cse]: 1.497e-05 [a_3]: 1.477e-05 [Cycle 3]: 0.00023053, [30] [expand_dump_flag]: 1.17e-06 [switch_simplify]: 2.05e-06 [a_1]: 1.475e-05 [recompute_prepare]: 1.51e-06 [updatestate_depend_eliminate]: 2.81e-06 [updatestate_assign_eliminate]: 2.22e-06 [updatestate_loads_eliminate]: 1.95e-06 [parameter_eliminate]: 9.5e-07 [a_2]: 2.59e-05 [accelerated_algorithm]: 2.36e-06 [pynative_shard]: 1.14e-06 [auto_parallel]: 3.49e-06 [parallel]: 3.72e-06 [merge_comm]: 1.84e-06 [allreduce_fusion]: 1.41e-06 [virtual_dataset]: 1.96999e-06 [get_grad_eliminate_]: 1.63e-06 [virtual_output]: 1.51e-06 [merge_forward]: 2.73e-06 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.78e-06 [meta_fg_expand]: 1.76e-06 [after_resolve]: 3.66e-06 [a_after_grad]: 2.08e-06 [renormalize]: 7.0002e-08 [real_op_eliminate]: 1.56e-06 [auto_monad_grad]: 7.49998e-07 [auto_monad_eliminator]: 3.83e-06 [cse]: 7.96001e-06 [a_3]: 1.542e-05 [py_interpret_to_execute_after_opt_a]: 3.32e-06 [slice_cell_reuse_recomputed_activation]: 2.5e-06 [rewriter_after_opt_a]: 1.974e-05 [convert_after_rewriter]: 5.25e-06 [order_py_execute_after_rewriter]: 4.56e-06 [opt_b]: 0.00020534, [2] [Cycle 1]: 0.00013651, [7] [b_1]: 9.078e-05 [b_2]: 2.65e-06 [updatestate_depend_eliminate]: 2e-06 [updatestate_assign_eliminate]: 1.66e-06 [updatestate_loads_eliminate]: 1.45e-06 [renormalize]: 4.29995e-07 [cse]: 7.12e-06 [Cycle 2]: 6.153e-05, [7] [b_1]: 2.491e-05 [b_2]: 1.57e-06 [updatestate_depend_eliminate]: 1.46e-06 [updatestate_assign_eliminate]: 1.26e-06 [updatestate_loads_eliminate]: 1.29e-06 [renormalize]: 7.0002e-08 [cse]: 4.43e-06 [cconv]: 1.747e-05 [opt_after_cconv]: 4.523e-05, [1] [Cycle 
1]: 4.123e-05, [7] [c_1]: 3.21e-06 [parameter_eliminate]: 7.40001e-07 [updatestate_depend_eliminate]: 1.5e-06 [updatestate_assign_eliminate]: 1.26e-06 [updatestate_loads_eliminate]: 1.45e-06 [cse]: 4.82e-06 [renormalize]: 2.89998e-07 [remove_dup_value]: 1.028e-05 [tuple_transform]: 2.82e-05, [1] [Cycle 1]: 2.466e-05, [3] [d_1]: 8.37e-06 [d_2]: 3.9e-06 [renormalize]: 1.8e-07 [add_cache_embedding]: 9.93e-06 [add_recomputation]: 2.574e-05 [cse_after_recomputation]: 1.435e-05, [1] [Cycle 1]: 1.01e-05, [1] [cse]: 5.32e-06 [environ_conv]: 4.59e-06 [label_micro_interleaved_index]: 2.11e-06 [label_fine_grained_interleaved_index]: 2.93e-06 [assign_add_opt]: 1.47e-06 [slice_recompute_activation]: 2.45e-06 [micro_interleaved_order_control]: 1.86e-06 [full_micro_interleaved_order_control]: 1.78e-06 [comp_comm_scheduling]: 1.9e-06 [reorder_send_recv_between_fp_bp]: 2.68e-06 [comm_op_add_attrs]: 1.08e-06 [add_comm_op_reuse_tag]: 9.20001e-07 [overlap_opt_shard_in_pipeline]: 1.14e-06 [grouped_pairwise_exchange_alltoall]: 1.46e-06 [overlap_recompute_and_grad_model_parallel]: 1.77e-06 [overlap_grad_matmul_and_grad_allreduce]: 1.03e-06 [split_matmul_comm_elemetwise]: 2.45e-06 [split_layernorm_comm]: 2.1e-06 [process_send_recv_for_ge]: 7.99999e-07 [handle_group_info]: 1.25e-06 [auto_monad_reorder]: 1.202e-05 [get_jit_bprop_graph]: 6.59995e-07 [eliminate_special_op_node]: 0.00048612 [validate]: 1.993e-05 [distribtued_split]: 1.67e-06 [task_emit]: 9.30006e-07 [execute]: 7.80004e-07 Sums parse : 0.001429s : 2.86% symbol_resolve.resolve : 0.028722s : 57.45% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.01% meta_unpack_prepare : 0.000113s : 0.23% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.013759s : 27.52% pack_expand : 0.000018s : 0.04% auto_monad : 0.000163s : 0.33% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000012s : 0.02% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 
0.000099s : 0.20% optimize.opt_a.expand_dump_flag : 0.000008s : 0.02% optimize.opt_a.switch_simplify : 0.000105s : 0.21% optimize.opt_a.a_1 : 0.000903s : 1.81% optimize.opt_a.recompute_prepare : 0.000012s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000016s : 0.03% optimize.opt_a.updatestate_assign_eliminate : 0.000012s : 0.02% optimize.opt_a.updatestate_loads_eliminate : 0.000011s : 0.02% optimize.opt_a.parameter_eliminate : 0.000008s : 0.02% optimize.opt_a.a_2 : 0.000144s : 0.29% optimize.opt_a.accelerated_algorithm : 0.000011s : 0.02% optimize.opt_a.pynative_shard : 0.000004s : 0.01% optimize.opt_a.auto_parallel : 0.000011s : 0.02% optimize.opt_a.parallel : 0.000016s : 0.03% optimize.opt_a.merge_comm : 0.000012s : 0.02% optimize.opt_a.allreduce_fusion : 0.000006s : 0.01% optimize.opt_a.virtual_dataset : 0.000009s : 0.02% optimize.opt_a.get_grad_eliminate_ : 0.000008s : 0.02% optimize.opt_a.virtual_output : 0.000007s : 0.01% optimize.opt_a.merge_forward : 0.000015s : 0.03% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000023s : 0.05% optimize.opt_a.meta_fg_expand : 0.000815s : 1.63% optimize.opt_a.after_resolve : 0.000030s : 0.06% optimize.opt_a.a_after_grad : 0.000041s : 0.08% optimize.opt_a.renormalize : 0.002187s : 4.37% optimize.opt_a.real_op_eliminate : 0.000048s : 0.10% optimize.opt_a.auto_monad_grad : 0.000016s : 0.03% optimize.opt_a.auto_monad_eliminator : 0.000048s : 0.10% optimize.opt_a.cse : 0.000158s : 0.32% optimize.opt_a.a_3 : 0.000146s : 0.29% optimize.py_interpret_to_execute_after_opt_a : 0.000003s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.01% optimize.rewriter_after_opt_a : 0.000020s : 0.04% optimize.convert_after_rewriter : 0.000005s : 0.01% optimize.order_py_execute_after_rewriter : 0.000005s : 0.01% optimize.opt_b.b_1 : 0.000116s : 0.23% optimize.opt_b.b_2 : 0.000004s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 
0.000003s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000003s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000003s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000012s : 0.02% optimize.cconv : 0.000017s : 0.03% optimize.opt_after_cconv.c_1 : 0.000003s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.cse : 0.000005s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.02% optimize.tuple_transform.d_1 : 0.000008s : 0.02% optimize.tuple_transform.d_2 : 0.000004s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000010s : 0.02% optimize.add_recomputation : 0.000026s : 0.05% optimize.cse_after_recomputation.cse : 0.000005s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.01% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000003s : 0.01% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% 
optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000012s : 0.02% get_jit_bprop_graph : 0.000001s : 0.00% eliminate_special_op_node : 0.000486s : 0.97% validate : 0.000020s : 0.04% distribtued_split : 0.000002s : 0.00% task_emit : 0.000001s : 0.00% execute : 0.000001s : 0.00% Time group info: ------[substitution.] 0.028738 207 0.01% : 0.000004s : 10: substitution.float_depend_g_call 0.01% : 0.000003s : 2: substitution.float_tuple_getitem_switch 98.17% : 0.028213s : 19: substitution.getattr_setattr_resolve 0.01% : 0.000004s : 1: substitution.graph_param_transform 0.01% : 0.000002s : 2: substitution.incorporate_call 0.00% : 0.000001s : 2: substitution.incorporate_call_switch 1.17% : 0.000336s : 20: substitution.inline 0.10% : 0.000028s : 56: substitution.meta_unpack_prepare 0.02% : 0.000006s : 7: substitution.minmaximum_grad 0.02% : 0.000005s : 10: substitution.partial_eliminate 0.00% : 0.000001s : 1: substitution.partial_unused_args_eliminate 0.02% : 0.000006s : 1: substitution.real_op_eliminate 0.01% : 0.000002s : 12: substitution.remove_not_recompute_node 0.05% : 0.000013s : 9: substitution.replace_applicator 0.01% : 0.000004s : 7: substitution.replace_old_param 0.01% : 0.000002s : 1: substitution.set_cell_output_no_recompute 0.02% : 0.000006s : 3: substitution.switch_simplify 0.07% : 0.000020s : 7: substitution.tuple_list_convert_item_index_to_positive 0.03% : 0.000008s : 7: substitution.tuple_list_get_item_const_eliminator 0.04% : 0.000011s : 7: substitution.tuple_list_get_item_depend_reorder 0.11% : 0.000032s : 15: substitution.tuple_list_get_item_eliminator 0.04% : 0.000011s : 7: substitution.tuple_list_get_set_item_eliminator 0.07% : 0.000020s : 1: substitution.zero_like_fill_zero ------[renormalize.] 0.002178 4 59.04% : 0.001286s : 2: renormalize.infer 40.96% : 0.000892s : 2: renormalize.specialize ------[replace.] 
0.000476 45 54.67% : 0.000260s : 16: replace.getattr_setattr_resolve 23.15% : 0.000110s : 18: replace.inline 2.34% : 0.000011s : 1: replace.real_op_eliminate 8.02% : 0.000038s : 3: replace.switch_simplify 9.24% : 0.000044s : 6: replace.tuple_list_get_item_eliminator 2.58% : 0.000012s : 1: replace.zero_like_fill_zero ------[match.] 0.028457 45 98.71% : 0.028089s : 16: match.getattr_setattr_resolve 1.12% : 0.000318s : 18: match.inline 0.02% : 0.000006s : 1: match.real_op_eliminate 0.02% : 0.000006s : 3: match.switch_simplify 0.06% : 0.000017s : 6: match.tuple_list_get_item_eliminator 0.07% : 0.000020s : 1: match.zero_like_fill_zero ------[func_graph_cloner_run.] 0.001927 36 73.12% : 0.001409s : 14: func_graph_cloner_run.FuncGraphClonerGraph 26.88% : 0.000518s : 22: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.030355 188 0.92% : 0.000281s : 78: opt.transform.opt_a 0.31% : 0.000094s : 69: opt.transform.opt_b 94.60% : 0.028715s : 2: opt.transform.opt_resolve 0.30% : 0.000091s : 1: opt.transforms.meta_unpack_prepare 3.79% : 0.001152s : 30: opt.transforms.opt_a 0.01% : 0.000002s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000003s : 2: opt.transforms.opt_b 0.04% : 0.000011s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000006s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0229007, [20] [parse]: 0.00140708 [symbol_resolve]: 0.0105361, [1] [Cycle 1]: 0.0104785, [1] [resolve]: 0.0104591 [combine_like_graphs]: 1.06001e-06 [graph_reusing]: 2.99e-06 [meta_unpack_prepare]: 4.922e-05 [pre_cconv]: 8.2e-07 [abstract_specialize]: 0.00204553 [pack_expand]: 1.127e-05 [auto_monad]: 4.829e-05 [inline]: 1.52e-06 [pre_auto_parallel]: 1.064e-05 [pipeline_split]: 2.87e-06 [optimize]: 0.00379163, [35] [py_interpret_to_execute]: 4.44e-06 [rewriter_before_opt_a]: 3.388e-05 [opt_a]: 0.0033153, [2] [Cycle 1]: 0.00080174, [30] [expand_dump_flag]: 3.58e-06 [switch_simplify]: 1.227e-05 
[a_1]: 0.00017756 [recompute_prepare]: 2.32e-06 [updatestate_depend_eliminate]: 5.81e-06 [updatestate_assign_eliminate]: 3.5e-06 [updatestate_loads_eliminate]: 3.38e-06 [parameter_eliminate]: 3.39e-06 [a_2]: 2.853e-05 [accelerated_algorithm]: 2.72e-06 [pynative_shard]: 1.68e-06 [auto_parallel]: 3.68e-06 [parallel]: 9.25e-06 [merge_comm]: 3.62e-06 [allreduce_fusion]: 1.94e-06 [virtual_dataset]: 2.31e-06 [get_grad_eliminate_]: 1.99e-06 [virtual_output]: 1.57e-06 [merge_forward]: 4.5e-06 [cell_reuse_recompute_pass]: 8.49999e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.87e-06 [meta_fg_expand]: 3.34e-06 [after_resolve]: 5.04999e-06 [a_after_grad]: 2.57e-06 [renormalize]: 0.00030853 [real_op_eliminate]: 4.77e-06 [auto_monad_grad]: 4.08e-06 [auto_monad_eliminator]: 9.34e-06 [cse]: 2.424e-05 [a_3]: 1.462e-05 [Cycle 2]: 0.00022847, [30] [expand_dump_flag]: 1.03e-06 [switch_simplify]: 2.25e-06 [a_1]: 1.422e-05 [recompute_prepare]: 1.48e-06 [updatestate_depend_eliminate]: 2.79e-06 [updatestate_assign_eliminate]: 2.18e-06 [updatestate_loads_eliminate]: 2.04999e-06 [parameter_eliminate]: 8.10003e-07 [a_2]: 2.577e-05 [accelerated_algorithm]: 1.94e-06 [pynative_shard]: 1.03e-06 [auto_parallel]: 3.55e-06 [parallel]: 3.31e-06 [merge_comm]: 1.72e-06 [allreduce_fusion]: 1.15e-06 [virtual_dataset]: 3.14e-06 [get_grad_eliminate_]: 1.88e-06 [virtual_output]: 1.71e-06 [merge_forward]: 2.9e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.74e-06 [meta_fg_expand]: 1.6e-06 [after_resolve]: 3.43e-06 [a_after_grad]: 2.21e-06 [renormalize]: 5.99975e-08 [real_op_eliminate]: 1.68e-06 [auto_monad_grad]: 8.2e-07 [auto_monad_eliminator]: 3.86e-06 [cse]: 7.61e-06 [a_3]: 1.241e-05 [py_interpret_to_execute_after_opt_a]: 3.24e-06 [slice_cell_reuse_recomputed_activation]: 2.3e-06 [rewriter_after_opt_a]: 1.97e-05 [convert_after_rewriter]: 5.44e-06 [order_py_execute_after_rewriter]: 4.28e-06 [opt_b]: 8.46e-05, [1] [Cycle 1]: 8.044e-05, [7] [b_1]: 3.74e-05 
[b_2]: 2.55e-06 [updatestate_depend_eliminate]: 2.09e-06 [updatestate_assign_eliminate]: 1.98e-06 [updatestate_loads_eliminate]: 1.83e-06 [renormalize]: 3.29994e-07 [cse]: 6.52e-06 [cconv]: 2.375e-05 [opt_after_cconv]: 4.861e-05, [1] [Cycle 1]: 4.493e-05, [7] [c_1]: 4.58999e-06 [parameter_eliminate]: 6.99998e-07 [updatestate_depend_eliminate]: 2.09999e-06 [updatestate_assign_eliminate]: 1.72e-06 [updatestate_loads_eliminate]: 1.94e-06 [cse]: 5.9e-06 [renormalize]: 2.09999e-07 [remove_dup_value]: 1.04e-05 [tuple_transform]: 3.276e-05, [1] [Cycle 1]: 2.967e-05, [3] [d_1]: 1.214e-05 [d_2]: 5.4e-06 [renormalize]: 1.30007e-07 [add_cache_embedding]: 1.115e-05 [add_recomputation]: 3.966e-05 [cse_after_recomputation]: 1.477e-05, [1] [Cycle 1]: 1.098e-05, [1] [cse]: 6.46e-06 [environ_conv]: 5.04e-06 [label_micro_interleaved_index]: 1.99e-06 [label_fine_grained_interleaved_index]: 2.45e-06 [assign_add_opt]: 2.19e-06 [slice_recompute_activation]: 2.47e-06 [micro_interleaved_order_control]: 1.99e-06 [full_micro_interleaved_order_control]: 1.99e-06 [comp_comm_scheduling]: 2.29e-06 [reorder_send_recv_between_fp_bp]: 2.62e-06 [comm_op_add_attrs]: 1.06e-06 [add_comm_op_reuse_tag]: 9e-07 [overlap_opt_shard_in_pipeline]: 1.28e-06 [grouped_pairwise_exchange_alltoall]: 1.84e-06 [overlap_recompute_and_grad_model_parallel]: 1.79e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.60003e-07 [split_matmul_comm_elemetwise]: 2.53e-06 [split_layernorm_comm]: 2.02e-06 [process_send_recv_for_ge]: 8.70001e-07 [handle_group_info]: 1.29001e-06 [auto_monad_reorder]: 1.423e-05 [get_jit_bprop_graph]: 3.70004e-07 [eliminate_special_op_node]: 0.00045803 [validate]: 2.297e-05 [distribtued_split]: 1.27e-06 [task_emit]: 0.0042963 [execute]: 8.03e-06 Sums parse : 0.001407s : 7.07% symbol_resolve.resolve : 0.010459s : 52.55% combine_like_graphs : 0.000001s : 0.01% graph_reusing : 0.000003s : 0.02% meta_unpack_prepare : 0.000049s : 0.25% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.002046s : 10.28% 
pack_expand : 0.000011s : 0.06% auto_monad : 0.000048s : 0.24% inline : 0.000002s : 0.01% pre_auto_parallel : 0.000011s : 0.05% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000004s : 0.02% optimize.rewriter_before_opt_a : 0.000034s : 0.17% optimize.opt_a.expand_dump_flag : 0.000005s : 0.02% optimize.opt_a.switch_simplify : 0.000015s : 0.07% optimize.opt_a.a_1 : 0.000192s : 0.96% optimize.opt_a.recompute_prepare : 0.000004s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000009s : 0.04% optimize.opt_a.updatestate_assign_eliminate : 0.000006s : 0.03% optimize.opt_a.updatestate_loads_eliminate : 0.000005s : 0.03% optimize.opt_a.parameter_eliminate : 0.000004s : 0.02% optimize.opt_a.a_2 : 0.000054s : 0.27% optimize.opt_a.accelerated_algorithm : 0.000005s : 0.02% optimize.opt_a.pynative_shard : 0.000003s : 0.01% optimize.opt_a.auto_parallel : 0.000007s : 0.04% optimize.opt_a.parallel : 0.000013s : 0.06% optimize.opt_a.merge_comm : 0.000005s : 0.03% optimize.opt_a.allreduce_fusion : 0.000003s : 0.02% optimize.opt_a.virtual_dataset : 0.000005s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.02% optimize.opt_a.virtual_output : 0.000003s : 0.02% optimize.opt_a.merge_forward : 0.000007s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.01% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000012s : 0.06% optimize.opt_a.meta_fg_expand : 0.000005s : 0.02% optimize.opt_a.after_resolve : 0.000008s : 0.04% optimize.opt_a.a_after_grad : 0.000005s : 0.02% optimize.opt_a.renormalize : 0.000309s : 1.55% optimize.opt_a.real_op_eliminate : 0.000006s : 0.03% optimize.opt_a.auto_monad_grad : 0.000005s : 0.02% optimize.opt_a.auto_monad_eliminator : 0.000013s : 0.07% optimize.opt_a.cse : 0.000032s : 0.16% optimize.opt_a.a_3 : 0.000027s : 0.14% optimize.py_interpret_to_execute_after_opt_a : 0.000003s : 0.02% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.01% optimize.rewriter_after_opt_a : 0.000020s : 
0.10% optimize.convert_after_rewriter : 0.000005s : 0.03% optimize.order_py_execute_after_rewriter : 0.000004s : 0.02% optimize.opt_b.b_1 : 0.000037s : 0.19% optimize.opt_b.b_2 : 0.000003s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000002s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000007s : 0.03% optimize.cconv : 0.000024s : 0.12% optimize.opt_after_cconv.c_1 : 0.000005s : 0.02% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.01% optimize.opt_after_cconv.cse : 0.000006s : 0.03% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.05% optimize.tuple_transform.d_1 : 0.000012s : 0.06% optimize.tuple_transform.d_2 : 0.000005s : 0.03% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.06% optimize.add_recomputation : 0.000040s : 0.20% optimize.cse_after_recomputation.cse : 0.000006s : 0.03% optimize.environ_conv : 0.000005s : 0.03% optimize.label_micro_interleaved_index : 0.000002s : 0.01% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.01% optimize.assign_add_opt : 0.000002s : 0.01% optimize.slice_recompute_activation : 0.000002s : 0.01% optimize.micro_interleaved_order_control : 0.000002s : 0.01% optimize.full_micro_interleaved_order_control : 0.000002s : 0.01% optimize.comp_comm_scheduling : 0.000002s : 0.01% optimize.reorder_send_recv_between_fp_bp : 0.000003s : 0.01% optimize.comm_op_add_attrs : 0.000001s : 0.01% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01% 
optimize.grouped_pairwise_exchange_alltoall : 0.000002s : 0.01% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.01% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.01% optimize.split_layernorm_comm : 0.000002s : 0.01% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.01% auto_monad_reorder : 0.000014s : 0.07% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000458s : 2.30% validate : 0.000023s : 0.12% distribtued_split : 0.000001s : 0.01% task_emit : 0.004296s : 21.59% execute : 0.000008s : 0.04% Time group info: ------[substitution.] 0.010355 37 98.91% : 0.010243s : 8: substitution.getattr_setattr_resolve 0.04% : 0.000004s : 3: substitution.graph_param_transform 0.81% : 0.000084s : 3: substitution.inline 0.09% : 0.000010s : 13: substitution.meta_unpack_prepare 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.02% : 0.000002s : 4: substitution.remove_not_recompute_node 0.03% : 0.000003s : 2: substitution.replace_old_param 0.08% : 0.000009s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.000302 2 61.51% : 0.000186s : 1: renormalize.infer 38.49% : 0.000116s : 1: renormalize.specialize ------[replace.] 0.000144 10 79.56% : 0.000114s : 6: replace.getattr_setattr_resolve 16.06% : 0.000023s : 3: replace.inline 4.38% : 0.000006s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.010275 10 99.09% : 0.010182s : 6: match.getattr_setattr_resolve 0.82% : 0.000084s : 3: match.inline 0.09% : 0.000009s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.000428 10 69.64% : 0.000298s : 5: func_graph_cloner_run.FuncGraphClonerGraph 30.36% : 0.000130s : 5: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.010845 105 0.68% : 0.000073s : 52: opt.transform.opt_a 0.27% : 0.000030s : 23: opt.transform.opt_b 96.38% : 0.010452s : 2: opt.transform.opt_resolve 0.25% : 0.000027s : 1: opt.transforms.meta_unpack_prepare 2.16% : 0.000234s : 20: opt.transforms.opt_a 0.03% : 0.000003s : 1: opt.transforms.opt_after_cconv 0.02% : 0.000002s : 1: opt.transforms.opt_b 0.15% : 0.000016s : 2: opt.transforms.opt_trans_graph 0.07% : 0.000007s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0385798, [20] [parse]: 0.00133973 [symbol_resolve]: 0.0189202, [1] [Cycle 1]: 0.0188389, [1] [resolve]: 0.0188182 [combine_like_graphs]: 9.79999e-07 [graph_reusing]: 3.67e-06 [meta_unpack_prepare]: 0.00011257 [pre_cconv]: 7.7e-07 [abstract_specialize]: 0.0093811 [pack_expand]: 1.759e-05 [auto_monad]: 0.00016497 [inline]: 1.79e-06 [pre_auto_parallel]: 1.119e-05 [pipeline_split]: 3e-06 [optimize]: 0.00791954, [35] [py_interpret_to_execute]: 4.35e-06 [rewriter_before_opt_a]: 9.83e-05 [opt_a]: 0.00728312, [3] [Cycle 1]: 0.00372432, [30] [expand_dump_flag]: 4.79e-06 [switch_simplify]: 8.613e-05 [a_1]: 0.0004674 [recompute_prepare]: 8.35001e-06 [updatestate_depend_eliminate]: 9.62e-06 [updatestate_assign_eliminate]: 7.73e-06 [updatestate_loads_eliminate]: 6.39e-06 [parameter_eliminate]: 4.22e-06 [a_2]: 9.144e-05 [accelerated_algorithm]: 5.4e-06 [pynative_shard]: 1.86e-06 [auto_parallel]: 3.63999e-06 [parallel]: 8.74e-06 [merge_comm]: 8.24e-06 [allreduce_fusion]: 2.95e-06 [virtual_dataset]: 5.08e-06 [get_grad_eliminate_]: 4.62e-06 [virtual_output]: 4.07e-06 [merge_forward]: 9.39e-06 [cell_reuse_recompute_pass]: 8.30005e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.218e-05 [meta_fg_expand]: 0.00065135 [after_resolve]: 2.171e-05 [a_after_grad]: 3.454e-05 [renormalize]: 0.00174352 [real_op_eliminate]: 4.512e-05 [auto_monad_grad]: 1.208e-05 [auto_monad_eliminator]: 3.921e-05 [cse]: 0.00012337 [a_3]: 0.00011359 [Cycle 2]: 0.00101426, [30] [expand_dump_flag]: 1.53e-06 [switch_simplify]: 1.272e-05 [a_1]: 
0.00041866 [recompute_prepare]: 2.38e-06 [updatestate_depend_eliminate]: 4.05e-06 [updatestate_assign_eliminate]: 2.49e-06 [updatestate_loads_eliminate]: 2.03e-06 [parameter_eliminate]: 1.96e-06 [a_2]: 2.779e-05 [accelerated_algorithm]: 2.87e-06 [pynative_shard]: 9.40003e-07 [auto_parallel]: 3.65e-06 [parallel]: 3.76e-06 [merge_comm]: 2.31e-06 [allreduce_fusion]: 1.37e-06 [virtual_dataset]: 2.49e-06 [get_grad_eliminate_]: 1.91999e-06 [virtual_output]: 1.72e-06 [merge_forward]: 2.99e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.1e-06 [meta_fg_expand]: 1.292e-05 [after_resolve]: 3.85e-06 [a_after_grad]: 2.3e-06 [renormalize]: 0.00030742 [real_op_eliminate]: 4.71e-06 [auto_monad_grad]: 3.48e-06 [auto_monad_eliminator]: 6.12e-06 [cse]: 1.51e-05 [a_3]: 1.49e-05 [Cycle 3]: 0.00023122, [30] [expand_dump_flag]: 1.11e-06 [switch_simplify]: 2.07e-06 [a_1]: 1.483e-05 [recompute_prepare]: 1.66e-06 [updatestate_depend_eliminate]: 2.72e-06 [updatestate_assign_eliminate]: 2.21e-06 [updatestate_loads_eliminate]: 2.01e-06 [parameter_eliminate]: 8.80005e-07 [a_2]: 2.649e-05 [accelerated_algorithm]: 2.19999e-06 [pynative_shard]: 8.79998e-07 [auto_parallel]: 3.36e-06 [parallel]: 3.48e-06 [merge_comm]: 1.87e-06 [allreduce_fusion]: 1.26e-06 [virtual_dataset]: 1.97e-06 [get_grad_eliminate_]: 1.68999e-06 [virtual_output]: 1.58e-06 [merge_forward]: 2.81e-06 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.17e-06 [meta_fg_expand]: 1.76e-06 [after_resolve]: 3.46e-06 [a_after_grad]: 2.59e-06 [renormalize]: 5.99975e-08 [real_op_eliminate]: 1.59e-06 [auto_monad_grad]: 8.10003e-07 [auto_monad_eliminator]: 3.80001e-06 [cse]: 7.92e-06 [a_3]: 1.231e-05 [py_interpret_to_execute_after_opt_a]: 3.3e-06 [slice_cell_reuse_recomputed_activation]: 2.42001e-06 [rewriter_after_opt_a]: 2.025e-05 [convert_after_rewriter]: 5.55e-06 [order_py_execute_after_rewriter]: 4.55e-06 [opt_b]: 0.00020466, [2] [Cycle 1]: 0.0001356, [7] 
[b_1]: 9.036e-05 [b_2]: 2.53e-06 [updatestate_depend_eliminate]: 1.96999e-06 [updatestate_assign_eliminate]: 1.8e-06 [updatestate_loads_eliminate]: 1.6e-06 [renormalize]: 3.19997e-07 [cse]: 7.23e-06 [Cycle 2]: 6.178e-05, [7] [b_1]: 2.516e-05 [b_2]: 1.68e-06 [updatestate_depend_eliminate]: 1.41e-06 [updatestate_assign_eliminate]: 1.4e-06 [updatestate_loads_eliminate]: 1.35e-06 [renormalize]: 7.99992e-08 [cse]: 4.15e-06 [cconv]: 1.798e-05 [opt_after_cconv]: 4.548e-05, [1] [Cycle 1]: 4.172e-05, [7] [c_1]: 3.14999e-06 [parameter_eliminate]: 7.60003e-07 [updatestate_depend_eliminate]: 1.52e-06 [updatestate_assign_eliminate]: 1.24e-06 [updatestate_loads_eliminate]: 1.48e-06 [cse]: 4.6e-06 [renormalize]: 2.19996e-07 [remove_dup_value]: 1.036e-05 [tuple_transform]: 2.951e-05, [1] [Cycle 1]: 2.624e-05, [3] [d_1]: 9.68e-06 [d_2]: 3.82e-06 [renormalize]: 1.50001e-07 [add_cache_embedding]: 9.36e-06 [add_recomputation]: 2.707e-05 [cse_after_recomputation]: 1.386e-05, [1] [Cycle 1]: 9.88e-06, [1] [cse]: 5.22e-06 [environ_conv]: 5.12e-06 [label_micro_interleaved_index]: 2.2e-06 [label_fine_grained_interleaved_index]: 2.29e-06 [assign_add_opt]: 1.7e-06 [slice_recompute_activation]: 1.92e-06 [micro_interleaved_order_control]: 1.85e-06 [full_micro_interleaved_order_control]: 2.02e-06 [comp_comm_scheduling]: 2.14e-06 [reorder_send_recv_between_fp_bp]: 2.41e-06 [comm_op_add_attrs]: 1.07e-06 [add_comm_op_reuse_tag]: 1.29e-06 [overlap_opt_shard_in_pipeline]: 1.14e-06 [grouped_pairwise_exchange_alltoall]: 1.4e-06 [overlap_recompute_and_grad_model_parallel]: 1.76e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.40001e-07 [split_matmul_comm_elemetwise]: 2.44e-06 [split_layernorm_comm]: 1.81e-06 [process_send_recv_for_ge]: 7.99999e-07 [handle_group_info]: 1.4e-06 [auto_monad_reorder]: 1.223e-05 [get_jit_bprop_graph]: 4.20005e-07 [eliminate_special_op_node]: 0.00047205 [validate]: 2.036e-05 [distribtued_split]: 1.21e-06 [task_emit]: 1.01e-06 [execute]: 8.00006e-07 Sums parse : 0.001340s : 
3.79% symbol_resolve.resolve : 0.018818s : 53.30% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.01% meta_unpack_prepare : 0.000113s : 0.32% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.009381s : 26.57% pack_expand : 0.000018s : 0.05% auto_monad : 0.000165s : 0.47% inline : 0.000002s : 0.01% pre_auto_parallel : 0.000011s : 0.03% pipeline_split : 0.000003s : 0.01% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000098s : 0.28% optimize.opt_a.expand_dump_flag : 0.000007s : 0.02% optimize.opt_a.switch_simplify : 0.000101s : 0.29% optimize.opt_a.a_1 : 0.000901s : 2.55% optimize.opt_a.recompute_prepare : 0.000012s : 0.04% optimize.opt_a.updatestate_depend_eliminate : 0.000016s : 0.05% optimize.opt_a.updatestate_assign_eliminate : 0.000012s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000010s : 0.03% optimize.opt_a.parameter_eliminate : 0.000007s : 0.02% optimize.opt_a.a_2 : 0.000146s : 0.41% optimize.opt_a.accelerated_algorithm : 0.000010s : 0.03% optimize.opt_a.pynative_shard : 0.000004s : 0.01% optimize.opt_a.auto_parallel : 0.000011s : 0.03% optimize.opt_a.parallel : 0.000016s : 0.05% optimize.opt_a.merge_comm : 0.000012s : 0.04% optimize.opt_a.allreduce_fusion : 0.000006s : 0.02% optimize.opt_a.virtual_dataset : 0.000010s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000008s : 0.02% optimize.opt_a.virtual_output : 0.000007s : 0.02% optimize.opt_a.merge_forward : 0.000015s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000023s : 0.07% optimize.opt_a.meta_fg_expand : 0.000666s : 1.89% optimize.opt_a.after_resolve : 0.000029s : 0.08% optimize.opt_a.a_after_grad : 0.000039s : 0.11% optimize.opt_a.renormalize : 0.002051s : 5.81% optimize.opt_a.real_op_eliminate : 0.000051s : 0.15% optimize.opt_a.auto_monad_grad : 0.000016s : 0.05% optimize.opt_a.auto_monad_eliminator : 0.000049s : 0.14% optimize.opt_a.cse : 
0.000146s : 0.41% optimize.opt_a.a_3 : 0.000141s : 0.40% optimize.py_interpret_to_execute_after_opt_a : 0.000003s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.01% optimize.rewriter_after_opt_a : 0.000020s : 0.06% optimize.convert_after_rewriter : 0.000006s : 0.02% optimize.order_py_execute_after_rewriter : 0.000005s : 0.01% optimize.opt_b.b_1 : 0.000116s : 0.33% optimize.opt_b.b_2 : 0.000004s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000003s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000003s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000011s : 0.03% optimize.cconv : 0.000018s : 0.05% optimize.opt_after_cconv.c_1 : 0.000003s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.cse : 0.000005s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.03% optimize.tuple_transform.d_1 : 0.000010s : 0.03% optimize.tuple_transform.d_2 : 0.000004s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000009s : 0.03% optimize.add_recomputation : 0.000027s : 0.08% optimize.cse_after_recomputation.cse : 0.000005s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.01% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.01% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.01% optimize.micro_interleaved_order_control : 0.000002s : 0.01% optimize.full_micro_interleaved_order_control : 0.000002s : 0.01% optimize.comp_comm_scheduling : 0.000002s : 0.01% 
optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.01%
optimize.comm_op_add_attrs : 0.000001s : 0.00%
optimize.add_comm_op_reuse_tag : 0.000001s : 0.00%
optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00%
optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00%
optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00%
optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00%
optimize.split_matmul_comm_elemetwise : 0.000002s : 0.01%
optimize.split_layernorm_comm : 0.000002s : 0.01%
optimize.process_send_recv_for_ge : 0.000001s : 0.00%
optimize.handle_group_info : 0.000001s : 0.00%
auto_monad_reorder : 0.000012s : 0.03%
get_jit_bprop_graph : 0.000000s : 0.00%
eliminate_special_op_node : 0.000472s : 1.34%
validate : 0.000020s : 0.06%
distribtued_split : 0.000001s : 0.00%
task_emit : 0.000001s : 0.00%
execute : 0.000001s : 0.00%
Time group info:
------[substitution.] 0.018828 207
    0.02% : 0.000004s : 10: substitution.float_depend_g_call
    0.01% : 0.000003s : 2: substitution.float_tuple_getitem_switch
    97.25% : 0.018310s : 19: substitution.getattr_setattr_resolve
    0.02% : 0.000004s : 1: substitution.graph_param_transform
    0.01% : 0.000002s : 2: substitution.incorporate_call
    0.01% : 0.000001s : 2: substitution.incorporate_call_switch
    1.75% : 0.000330s : 20: substitution.inline
    0.14% : 0.000027s : 56: substitution.meta_unpack_prepare
    0.04% : 0.000007s : 7: substitution.minmaximum_grad
    0.02% : 0.000005s : 10: substitution.partial_eliminate
    0.01% : 0.000001s : 1: substitution.partial_unused_args_eliminate
    0.03% : 0.000006s : 1: substitution.real_op_eliminate
    0.01% : 0.000003s : 12: substitution.remove_not_recompute_node
    0.07% : 0.000012s : 9: substitution.replace_applicator
    0.02% : 0.000003s : 7: substitution.replace_old_param
    0.01% : 0.000002s : 1: substitution.set_cell_output_no_recompute
    0.03% : 0.000006s : 3: substitution.switch_simplify
    0.11% : 0.000020s : 7: substitution.tuple_list_convert_item_index_to_positive
    0.04% : 0.000008s : 7: substitution.tuple_list_get_item_const_eliminator
    0.06% : 0.000011s : 7: substitution.tuple_list_get_item_depend_reorder
    0.17% : 0.000032s : 15: substitution.tuple_list_get_item_eliminator
    0.06% : 0.000011s : 7: substitution.tuple_list_get_set_item_eliminator
    0.10% : 0.000020s : 1: substitution.zero_like_fill_zero
------[renormalize.] 0.002041 4
    55.70% : 0.001137s : 2: renormalize.infer
    44.30% : 0.000904s : 2: renormalize.specialize
------[replace.] 0.000471 45
    54.86% : 0.000259s : 16: replace.getattr_setattr_resolve
    23.43% : 0.000110s : 18: replace.inline
    2.34% : 0.000011s : 1: replace.real_op_eliminate
    7.63% : 0.000036s : 3: replace.switch_simplify
    9.19% : 0.000043s : 6: replace.tuple_list_get_item_eliminator
    2.55% : 0.000012s : 1: replace.zero_like_fill_zero
------[match.] 0.018549 45
    98.04% : 0.018186s : 16: match.getattr_setattr_resolve
    1.69% : 0.000313s : 18: match.inline
    0.03% : 0.000006s : 1: match.real_op_eliminate
    0.03% : 0.000006s : 3: match.switch_simplify
    0.09% : 0.000017s : 6: match.tuple_list_get_item_eliminator
    0.11% : 0.000020s : 1: match.zero_like_fill_zero
------[func_graph_cloner_run.] 0.001857 36
    72.81% : 0.001352s : 14: func_graph_cloner_run.FuncGraphClonerGraph
    27.19% : 0.000505s : 22: func_graph_cloner_run.FuncGraphSpecializer
------[meta_graph.] 0.000000 0
------[manager.] 0.000000 0
------[pynative] 0.000000 0
------[others.] 0.020445 188
    1.36% : 0.000279s : 78: opt.transform.opt_a
    0.46% : 0.000094s : 69: opt.transform.opt_b
    92.00% : 0.018811s : 2: opt.transform.opt_resolve
    0.45% : 0.000091s : 1: opt.transforms.meta_unpack_prepare
    5.61% : 0.001148s : 30: opt.transforms.opt_a
    0.01% : 0.000002s : 1: opt.transforms.opt_after_cconv
    0.01% : 0.000003s : 2: opt.transforms.opt_b
    0.06% : 0.000012s : 2: opt.transforms.opt_trans_graph
    0.03% : 0.000007s : 3: opt.transforms.special_op_eliminate
.[INFO] GE(28287,python3.7):2024-01-11-05:33:54.395.371 [graph_var_manager.cc:1424][EVENT]28287 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(28287,python3.7):2024-01-11-05:33:54.395.459 [graph_manager.cc:1248][EVENT]28287 PreRun:PreRun start: graph node size 3, session id 2, graph id 1, graph name online. [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:54.396.343 [atrace_api.c:28](tid:28287) AtraceCreate start [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:54.396.413 [trace_rb_log.c:84](tid:28287) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:54.396.427 [atrace_api.c:32](tid:28287) AtraceCreate end [INFO] TDT(28287,python3.7):2024-01-11-05:33:54.396.441 [client_manager.cpp:157][SetProfilingCallback][tid:28287] [TsdClient] set profiling callback success [INFO] GE(28287,python3.7):2024-01-11-05:33:54.397.235 [parallel_partitioner.cc:165][EVENT]28287 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [18] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.397.277 [parallel_partitioner.cc:178][EVENT]28287 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [14] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.397.327 [graph_prepare.cc:1378][EVENT]28287 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [4] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.397.876 [graph_manager.cc:1050][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [567] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.397.905 [graph_manager.cc:1052][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.030 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.062 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.116 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [40] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.130 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.181 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [11] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.195 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.214 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [7] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.321 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.356 [graph_manager.cc:1054][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [438] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.398.616 [graph_manager.cc:1055][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [245] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.592 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [5] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.616 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.627 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of InferShapePass is [286] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.636 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [12] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.645 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.654 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [67] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.662 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [17] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.399.670 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of 
InferValuePass is [8] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.401.682 [graph_manager.cc:1056][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3046] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.401.746 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of CondRemovePass is [5] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.401.764 [graph_prepare.cc:1982][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [51] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.112 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.133 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of MergePass is [0] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.143 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of InferShapePass is [175] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.153 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.161 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [0] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.170 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [4] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.178 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.186 [base_pass.cc:339][EVENT]28287 
Run:[GEPERFTRACE] The time cost of InferValuePass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.211 [graph_prepare.cc:1983][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [434] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.248 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.261 [graph_prepare.cc:1984][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [23] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.276 [graph_prepare.cc:1985][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.296 [graph_prepare.cc:1986][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.308 [graph_prepare.cc:1987][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.324 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.336 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.351 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.432 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.445 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of CondPass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.454 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrintOpPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.463 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.471 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.480 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.488 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of StopGradientPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.496 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.504 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.512 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SnapshotPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.520 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.529 
[base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [5] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.537 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.545 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.569 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [10] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.590 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.624 [graph_prepare.cc:1988][EVENT]28287 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [306] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.402.637 [graph_manager.cc:1065][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [925] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.415.061 [graph_manager.cc:1077][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [12402] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.415.130 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.415.155 [graph_manager.cc:1080][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [62] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.675 [graph_manager.cc:1081][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3506] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.714 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.730 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.741 [graph_manager.cc:1082][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [35] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.773 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.788 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.803 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.898 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [84] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.916 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.964 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [36] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.418.981 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.022 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [29] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.042 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.061 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [9] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.089 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [17] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.116 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.129 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.138 [graph_manager.cc:2700][EVENT]28287 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [371] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.246 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.260 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.269 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.278 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.286 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.295 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of CastRemovePass is [9] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.303 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.311 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [4] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.319 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.328 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [2] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.336 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [3] [INFO] 
GE(28287,python3.7):2024-01-11-05:33:54.419.344 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [4] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.352 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.361 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.369 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.379 [graph_manager.cc:2741][EVENT]28287 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [222] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.390 [graph_manager.cc:2752][EVENT]28287 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.415 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.428 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.451 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.467 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.480 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.493 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.514 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [11] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.528 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.543 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.554 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.568 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.580 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.599 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.613 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.623 [graph_manager.cc:2810][EVENT]28287 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [212] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.651 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of IdentityPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.664 [graph_manager.cc:2821][EVENT]28287 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [32] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.693 [graph_manager.cc:1087][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [933] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.832 [graph_manager.cc:1088][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [123] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.871 [graph_manager.cc:1089][EVENT]28287 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [20] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.890 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.905 [graph_manager.cc:1097][EVENT]28287 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.419.926 [graph_manager.cc:3325][EVENT]28287 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.278 [engine_place.cc:144][EVENT]28287 Run:The time cost of AIcoreEngine::CheckSupported is [260] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.307 [engine_place.cc:144][EVENT]28287 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [8] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.317 [engine_place.cc:144][EVENT]28287 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [7] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.395 [graph_manager.cc:3351][EVENT]28287 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [458] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.413 [graph_manager.cc:3364][EVENT]28287 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.483 [engine_partitioner.cc:1139][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [16] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.501 [engine_partitioner.cc:1142][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.636 [engine_partitioner.cc:1148][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [125] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.679 [engine_partitioner.cc:1155][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [29] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.726 [engine_partitioner.cc:1164][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [37] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.761 [graph_manager.cc:3405][EVENT]28287 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [333] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.420.780 [graph_manager.cc:3412][EVENT]28287 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.428.910 [graph_manager.cc:3422][EVENT]28287 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [8115] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.428.943 [graph_manager.cc:3428][EVENT]28287 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.072 [graph_manager.cc:3467][EVENT]28287 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [107] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.090 [graph_manager.cc:3377][EVENT]28287 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [8663] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.107 [graph_manager.cc:1106][EVENT]28287 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [9187] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.143 [graph_manager.cc:1115][EVENT]28287 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.176 [graph_manager.cc:1130][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.209 [graph_manager.cc:1131][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [19] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.233 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.259 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.269 [graph_manager.cc:2837][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [45] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.340 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.353 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.362 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.371 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.379 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [5] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.388 [base_pass.cc:339][EVENT]28287 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.397 [graph_manager.cc:2864][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [112] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.409 [graph_manager.cc:2872][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.432 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::FlowCtrlPass is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.444 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.460 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.475 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [6] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.491 [compile_nodes_pass.cc:88][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.501 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.512 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.587 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [65] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.607 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [8] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.621 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.634 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.647 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [2] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.663 [graph_manager.cc:2927][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [236] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.676 [graph_manager.cc:2937][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [3] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.692 [graph_manager.cc:2943][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [5] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.704 [graph_manager.cc:2950][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.885 [graph_manager.cc:2958][EVENT]28287 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [38] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.922 [graph_manager.cc:1132][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [699] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.429.994 [graph_manager.cc:1135][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [57] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.032 [graph_manager.cc:2975][EVENT]28287 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [22] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.066 [graph_manager.cc:2981][EVENT]28287 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [21] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.081 [pass_manager.cc:82][EVENT]28287 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.091 [graph_manager.cc:2986][EVENT]28287 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.100 [graph_manager.cc:1136][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [91] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.214 [graph_manager.cc:3555][EVENT]28287 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [81] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.303 [engine_partitioner.cc:1139][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [16] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.318 [engine_partitioner.cc:1142][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [2] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.414 [engine_partitioner.cc:1148][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [86] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.443 [engine_partitioner.cc:1155][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [16] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.481 [engine_partitioner.cc:1164][EVENT]28287 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [27] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.503 [graph_builder.cc:865][EVENT]28287 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [232] micro second. [INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:54.430.936 [logger.cc:1071] 28287 ModelBindStream: model_id=1856, stream_id=65, flag=0. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.430.975 [task_generator.cc:804][EVENT]28287 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [189] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.038 [task_generator.cc:805][EVENT]28287 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [50] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.691 [task_generator.cc:814][EVENT]28287 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [637] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.705 [task_generator.cc:954][EVENT]28287 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [919] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.768 [task_generator.cc:967][EVENT]28287 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [36] micro second. 
[INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:54.431.788 [logger.cc:1084] 28287 ModelUnbindStream: model_id=1856, stream_id=65, [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.970 [graph_manager.cc:1152][EVENT]28287 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [1845] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.431.989 [graph_manager.cc:1164][EVENT]28287 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.432.025 [graph_manager.cc:1271][EVENT]28287 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [34884] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.432.037 [graph_manager.cc:1272][EVENT]28287 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:54.432.343 [atrace_api.c:93](tid:28287) AtraceDestroy start [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:54.432.359 [atrace_api.c:95](tid:28287) AtraceDestroy end [INFO] GE(28287,python3.7):2024-01-11-05:33:54.437.163 [graph_converter.cc:838][EVENT]28287 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1443] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.437.328 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of ZeroCopy is [121] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.437.814 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CEM is [464] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.012 [copy_flow_launch_fuse.cc:395][EVENT]28287 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [175] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.031 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [195] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.251 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [208] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.269 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.301 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of ZeroCopy is [22] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.488 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CEM is [174] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.568 [copy_flow_launch_fuse.cc:395][EVENT]28287 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [63] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.581 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [77] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.610 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [18] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.622 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.658 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of ZeroCopy is [17] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.731 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CEM is [62] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.796 [copy_flow_launch_fuse.cc:395][EVENT]28287 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [54] micro second. [INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.807 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [66] micro second. 
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.833 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [18] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.844 [base_optimizer.cc:70][EVENT]28287 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.438.857 [graph_converter.cc:849][EVENT]28287 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1657] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.439.068 [graph_converter.cc:853][EVENT]28287 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [202] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.439.733 [graph_converter.cc:857][EVENT]28287 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [652] micro second.
[INFO] GE(28287,python3.7):2024-01-11-05:33:54.439.867 [graph_converter.cc:862][EVENT]28287 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [112] micro second.
.
============================== 2 passed in 21.37s ==============================
[TRACE] GE(28287,python3.7):2024-01-11-05:33:56.219.745 [status:INIT] [ge_api.cc:463]28287 ~Session:Start to destruct session.
[TRACE] GE(28287,python3.7):2024-01-11-05:33:56.219.809 [status:RUNNING] [ge_api.cc:475]28287 ~Session:Session id is 0 [TRACE] GE(28287,python3.7):2024-01-11-05:33:56.219.819 [status:RUNNING] [ge_api.cc:476]28287 ~Session:Destroying session [TRACE] GE(28287,python3.7):2024-01-11-05:33:56.220.699 [status:STOP] [ge_api.cc:491]28287 ~Session:Session Destructor finished [TRACE] GE(28287,python3.7):2024-01-11-05:33:56.220.727 [status:INIT] [ge_api.cc:301]28287 GEFinalize:GEFinalize start [INFO] GE(28287,python3.7):2024-01-11-05:33:56.220.815 [execution_runtime.cc:80][EVENT]28287 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(28287,python3.7):2024-01-11-05:33:56.220.835 [execution_runtime.cc:92][EVENT]28287 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(28287,python3.7):2024-01-11-05:33:56.220.847 [status:RUNNING] [ge_api.cc:313]28287 GEFinalize:Finalizing environment [INFO] TUNE(28287,python3.7):2024-01-11-05:33:56.578.103 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:28287]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(28287,python3.7):2024-01-11-05:33:56.578.161 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:28287]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(28287,python3.7):2024-01-11-05:33:56.579.669 [gelib.cc:324][EVENT]28287 SystemFinalize:Online infer finalize GELib success. 
[TRACE] GE(28287,python3.7):2024-01-11-05:33:57.468.634 [status:STOP] [ge_api.cc:341]28287 GEFinalize:GEFinalize finished [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.851.527 [process_mode_manager.cpp:184][Close][tid:28287] [TsdClient] Close [deviceId=7][sessionId=1] hccp and computer enter [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.851.561 [version_verify.cpp:112][SpecialFeatureCheck][tid:28287] VersionVerify: previous type[7], supported [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.851.603 [process_mode_manager.cpp:192][Close][tid:28287] [TsdClient][deviceId=7] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.883.825 [process_mode_manager.cpp:197][Close][tid:28287] [TsdClient][logicDeviceId_=7]has recv close hccp and computer process respond [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.883.839 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:28287] enter into CloseInHost deviceid[7] [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.883.850 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:28287] host cpu not support [INFO] TDT(28287,python3.7):2024-01-11-05:33:57.883.883 [process_mode_manager.cpp:208][Close][tid:28287] [TsdClient][deviceId=7] [sessionId=1] close hccp and computer process success [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:57.883.895 [atrace_api.c:93](tid:28287) AtraceDestroy start [INFO] ATRACE(28287,python3.7):2024-01-11-05:33:57.883.911 [atrace_api.c:95](tid:28287) AtraceDestroy end [INFO] PROFILING(28287,python3.7):2024-01-11-05:33:57.883.933 [msprofiler_impl.cpp:156] >>> (tid:28287) ProfNotifySetDevice called, is open: 0, devId: 7 [INFO] RUNTIME(28287,python3.7):2024-01-11-05:33:59.757.374 [runtime.cc:1737] 28287 ~Runtime: deconstruct runtime.