============================= test session starts ==============================
platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1
rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/ops/graph_kernel, inifile: /home/jenkins/sault/virtual_test/virtualenv_004/sault/config/pytest.ini
plugins: anyio-3.7.1, forked-1.1.3, xdist-1.32.0
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:49.886.486 [trace_attr.c:105](tid:191849) platform is 1.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:49.886.650 [trace_recorder.c:114](tid:191849) use root path: /home/jenkins/ascend/atrace
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:49.886.678 [trace_signal.c:133](tid:191849) register signal handler for signo 2 succeed.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:49.886.690 [trace_signal.c:133](tid:191849) register signal handler for signo 15 succeed.
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:30:50.313.153 [runtime.cc:1159] 191849 GetAicoreNumByLevel: workingDev_=0
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:30:50.313.225 [runtime.cc:4719] 191849 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set
collected 1 item
test_relu.py
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.085 [process_mode_manager.cpp:109][OpenProcess][tid:191849] [ProcessModeManager] enter into open process deviceId[3] rankSize[0]
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.610 [process_mode_manager.cpp:379][InitTsdClient][tid:191849] [TsdClient] deviceId[3] begin to init hdc client
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.731 [version_verify.cpp:34][SetVersionInfo][tid:191849] VersionVerify: send client version to server
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.759 [version_verify.cpp:50][SetVersionInfo][tid:191849] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.773 [version_verify.cpp:50][SetVersionInfo][tid:191849] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.558.998 [version_verify.cpp:66][PeerVersionCheck][tid:191849] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.016 [version_verify.cpp:87][ParseVersionInfo][tid:191849] VersionVerify: pass client version info success
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.026 [hdc_client.cpp:276][CheckHdcConnection][tid:191849] Service[2] create hdc success
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.041 [version_verify.cpp:120][SpecialFeatureCheck][tid:191849] VersionVerify: new type[35], supported
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.086 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:191849] [TsdClient][deviceId=3] [sessionId=1] wait package info respond
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.183 [process_mode_manager.cpp:379][InitTsdClient][tid:191849] [TsdClient] deviceId[3] begin to init hdc client
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.290 [version_verify.cpp:34][SetVersionInfo][tid:191849] VersionVerify: send client version to server
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.302 [version_verify.cpp:50][SetVersionInfo][tid:191849] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.313 [version_verify.cpp:50][SetVersionInfo][tid:191849] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.562 [version_verify.cpp:66][PeerVersionCheck][tid:191849] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.574 [version_verify.cpp:87][ParseVersionInfo][tid:191849] VersionVerify: pass client version info success
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.582 [hdc_client.cpp:276][CheckHdcConnection][tid:191849] Service[2] create hdc success
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.594 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:191849] [TsdClient] tsd get process sign successfully, procpid[191849] signSize[48]
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.620 [version_verify.cpp:112][SpecialFeatureCheck][tid:191849] VersionVerify: previous type[6], supported
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.559.641 [process_mode_manager.cpp:126][OpenProcess][tid:191849] [ProcessModeManager] deviceId[3] sessionId[1] rankSize[0], wait sub process start respond
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.787.809 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:191849] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.787.851 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:191849] enter into OpenInHost deviceid[3]
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.787.862 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:191849] host cpu not support
[INFO] TDT(191849,python3.7):2024-01-11-05:30:54.787.870 [process_mode_manager.cpp:156][OpenProcess][tid:191849] [TsdClient][deviceId=3] [sessionId=1] start hccp and computer process success
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:30:54.790.513 [device.cc:340] 191849 Init: isDoubledie:0, topologytype:0
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:30:54.803.794 [npu_driver.cc:5428] 192924 GetDeviceStatus: GetDeviceStatus status=1.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:54.803.820 [atrace_api.c:28](tid:191849) AtraceCreate start
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:54.803.946 [trace_rb_log.c:84](tid:191849) [RUNTIME_ATRACE_DEV3_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:30:54.803.962 [atrace_api.c:32](tid:191849) AtraceCreate end [INFO] TDT(191849,python3.7):2024-01-11-05:30:54.803.978 [client_manager.cpp:157][SetProfilingCallback][tid:191849] [TsdClient] set profiling callback success [TRACE] GE(191849,python3.7):2024-01-11-05:30:54.955.467 [status:INIT] [ge_api.cc:144]191849 GEInitializeImpl:GEInitialize start [INFO] PROFILING(191849,python3.7):2024-01-11-05:30:55.175.053 [msprofiler_impl.cpp:156] >>> (tid:191849) ProfNotifySetDevice called, is open: 1, devId: 3 [INFO] PROFILING(191849,python3.7):2024-01-11-05:30:55.175.183 [platform.cpp:38] >>> (tid:191849) Profiling platform version: 1.0. [INFO] PROFILING(191849,python3.7):2024-01-11-05:30:55.175.198 [ai_drv_dev_api.cpp:384] >>> (tid:191849) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(191849,python3.7):2024-01-11-05:30:55.225.386 [status:RUNNING] [ge_api.cc:211]191849 GEInitializeImpl:Initializing environment [INFO] GE(191849,python3.7):2024-01-11-05:30:55.225.459 [gelib.cc:98][EVENT]191849 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(191849,python3.7):2024-01-11-05:30:55.225.799 [gelib.cc:307][EVENT]191849 SystemInitialize:Online infer init GELib success, device id :3 [INFO] DVPP(191849,python3.7):2024-01-11-05:30:55.607.911 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:191849]dvpp engine do not support [INFO] TUNE(191849,python3.7):2024-01-11-05:30:55.612.102 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:191849]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(191849,python3.7):2024-01-11-05:30:55.612.142 [handle_manager.cpp:115][CANNKB][Tid:191849]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(191849,python3.7):2024-01-11-05:30:55.612.206 [handle_manager.cpp:407][CANNKB][Tid:191849]"Init functions of loading dynamic python lib end!" 
[INFO] TUNE(191849,python3.7):2024-01-11-05:30:55.612.217 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:191849]"CANN_KB_Py has already been initialized." [INFO] TUNE(191849,python3.7):2024-01-11-05:30:55.612.300 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:191849]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(191849,python3.7):2024-01-11-05:31:07.618.149 [plugin_manager.cc:42][191849]hcom running normal mode. [INFO] DVPP(191849,python3.7):2024-01-11-05:31:07.618.750 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:191849]dvpp ops kernel info store do not support [INFO] DVPP(191849,python3.7):2024-01-11-05:31:07.618.904 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:191849]dvpp graph optimizer do not support [INFO] DVPP(191849,python3.7):2024-01-11-05:31:08.279.182 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:191849]dvpp ops kernel builder do not support [INFO] GE(191849,python3.7):2024-01-11-05:31:08.287.769 [gelib.cc:169][EVENT]191849 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [13062256] micro second. [TRACE] GE(191849,python3.7):2024-01-11-05:31:08.371.666 [status:STOP] [ge_api.cc:255]191849 GEInitializeImpl:GEInitialize finished [TRACE] GE(191849,python3.7):2024-01-11-05:31:08.371.801 [status:INIT] [ge_api.cc:398]191849 Session:Start to construct session. 
[TRACE] GE(191849,python3.7):2024-01-11-05:31:08.371.818 [status:RUNNING] [ge_api.cc:408]191849 Session:Creating session [INFO] GE(191849,python3.7):2024-01-11-05:31:08.372.219 [graph_var_manager.cc:1445][EVENT]191849 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(191849,python3.7):2024-01-11-05:31:08.372.235 [graph_var_manager.cc:1424][EVENT]191849 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(191849,python3.7):2024-01-11-05:31:08.372.552 [msprofiler_impl.cpp:156] >>> (tid:191849) ProfNotifySetDevice called, is open: 1, devId: 3 [TRACE] GE(191849,python3.7):2024-01-11-05:31:08.373.442 [status:RUNNING] [ge_api.cc:411]191849 Session:Session id is 0 [TRACE] GE(191849,python3.7):2024-01-11-05:31:08.373.465 [status:STOP] [ge_api.cc:420]191849 Session:Session Constructor finished [INFO] PROFILING(191849,python3.7):2024-01-11-05:31:08.383.073 [platform.cpp:38] >>> (tid:191849) Profiling platform version: 1.0. 
[INFO] PROFILING(191849,python3.7):2024-01-11-05:31:08.383.103 [ai_drv_dev_api.cpp:384] >>> (tid:191849) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(191849,python3.7):2024-01-11-05:31:08.383.285 [status:INIT] [ge_api.cc:144]191849 GEInitializeImpl:GEInitialize start
TotalTime = 0.0270086, [20]
    [parse]: 0.0110568
    [symbol_resolve]: 0.00040211, [1]
        [Cycle 1]: 0.00035379, [1]
            [resolve]: 0.00033737
    [combine_like_graphs]: 1.11e-06
    [graph_reusing]: 3.62e-06
    [meta_unpack_prepare]: 4.121e-05
    [pre_cconv]: 4.07999e-06
    [abstract_specialize]: 0.00073176
    [pack_expand]: 9.15e-06
    [auto_monad]: 6.205e-05
    [inline]: 1.53e-06
    [pre_auto_parallel]: 1.508e-05
    [pipeline_split]: 3.26e-06
    [optimize]: 0.00789663, [35]
        [py_interpret_to_execute]: 9.56999e-06
        [rewriter_before_opt_a]: 1.931e-05
        [opt_a]: 0.0072572, [1]
            [Cycle 1]: 0.00056226, [30]
                [expand_dump_flag]: 2.68e-06
                [switch_simplify]: 8.24e-06
                [a_1]: 1.821e-05
                [recompute_prepare]: 1.74e-06
                [updatestate_depend_eliminate]: 6.96e-06
                [updatestate_assign_eliminate]: 3.66e-06
                [updatestate_loads_eliminate]: 2.8e-06
                [parameter_eliminate]: 2.86e-06
                [a_2]: 2.698e-05
                [accelerated_algorithm]: 2.68e-06
                [pynative_shard]: 1.8e-06
                [auto_parallel]: 3.55e-06
                [parallel]: 2.029e-05
                [merge_comm]: 1.295e-05
                [allreduce_fusion]: 1.8e-06
                [virtual_dataset]: 2.47001e-06
                [get_grad_eliminate_]: 1.78e-06
                [virtual_output]: 1.58e-06
                [merge_forward]: 4.67e-06
                [cell_reuse_recompute_pass]: 8.89995e-07
                [cell_reuse_handle_not_recompute_node_pass]: 8.27e-06
                [meta_fg_expand]: 3.2e-06
                [after_resolve]: 4.99999e-06
                [a_after_grad]: 2.57e-06
                [renormalize]: 0.00023355
                [real_op_eliminate]: 4.32e-06
                [auto_monad_grad]: 3.65001e-06
                [auto_monad_eliminator]: 1.058e-05
                [cse]: 2.247e-05
                [a_3]: 1.429e-05
        [py_interpret_to_execute_after_opt_a]: 6.75e-06
        [slice_cell_reuse_recomputed_activation]: 2.51e-06
        [rewriter_after_opt_a]: 0.00013107
        [convert_after_rewriter]: 7.04e-06
        [order_py_execute_after_rewriter]: 4.75e-06
        [opt_b]: 9.125e-05, [1]
            [Cycle 1]: 8.635e-05, [7]
                [b_1]: 3.72e-05
                [b_2]: 2.99e-06
                [updatestate_depend_eliminate]: 2.97e-06
                [updatestate_assign_eliminate]: 2.39e-06
                [updatestate_loads_eliminate]: 2.37999e-06
                [renormalize]: 3.19997e-07
                [cse]: 1.052e-05
        [cconv]: 2.411e-05
        [opt_after_cconv]: 4.851e-05, [1]
            [Cycle 1]: 4.443e-05, [7]
                [c_1]: 4.81e-06
                [parameter_eliminate]: 8.49999e-07
                [updatestate_depend_eliminate]: 2.51e-06
                [updatestate_assign_eliminate]: 2.04e-06
                [updatestate_loads_eliminate]: 1.85e-06
                [cse]: 7.22e-06
                [renormalize]: 5.10001e-07
        [remove_dup_value]: 1.069e-05
        [tuple_transform]: 3.31e-05, [1]
            [Cycle 1]: 2.98e-05, [3]
                [d_1]: 1.31e-05
                [d_2]: 5.64e-06
                [renormalize]: 2.19996e-07
        [add_cache_embedding]: 1.097e-05
        [add_recomputation]: 4.806e-05
        [cse_after_recomputation]: 1.525e-05, [1]
            [Cycle 1]: 1.135e-05, [1]
                [cse]: 7.19e-06
        [environ_conv]: 1.65e-05
        [label_micro_interleaved_index]: 2.2e-06
        [label_fine_grained_interleaved_index]: 2.42e-06
        [assign_add_opt]: 1.70001e-06
        [slice_recompute_activation]: 2.43e-06
        [micro_interleaved_order_control]: 1.71e-06
        [full_micro_interleaved_order_control]: 1.99e-06
        [comp_comm_scheduling]: 2.2e-06
        [reorder_send_recv_between_fp_bp]: 2.35e-06
        [comm_op_add_attrs]: 1.11e-06
        [add_comm_op_reuse_tag]: 9.09997e-07
        [overlap_opt_shard_in_pipeline]: 1.16e-06
        [grouped_pairwise_exchange_alltoall]: 1.43e-06
        [overlap_recompute_and_grad_model_parallel]: 1.82e-06
        [overlap_grad_matmul_and_grad_allreduce]: 1.01e-06
        [split_matmul_comm_elemetwise]: 2.27e-06
        [split_layernorm_comm]: 1.86999e-06
        [process_send_recv_for_ge]: 2.19e-06
        [handle_group_info]: 1.03e-06
    [auto_monad_reorder]: 2.192e-05
    [get_jit_bprop_graph]: 3.80001e-07
    [eliminate_special_op_node]: 0.00046298
    [validate]: 3.996e-05
    [distribtued_split]: 1.08e-06
    [task_emit]: 0.00605488
    [execute]: 8.77e-06
Sums
    parse : 0.011057s : 56.07%
    symbol_resolve.resolve : 0.000337s : 1.71%
    combine_like_graphs : 0.000001s : 0.01%
    graph_reusing : 0.000004s : 0.02%
    meta_unpack_prepare : 0.000041s : 0.21%
    pre_cconv : 0.000004s : 0.02%
    abstract_specialize : 0.000732s : 3.71%
    pack_expand : 0.000009s : 0.05%
    auto_monad : 0.000062s : 0.31%
    inline : 0.000002s : 0.01%
    pre_auto_parallel : 0.000015s : 0.08%
    pipeline_split : 0.000003s : 0.02%
    optimize.py_interpret_to_execute : 0.000010s : 0.05%
    optimize.rewriter_before_opt_a : 0.000019s : 0.10%
    optimize.opt_a.expand_dump_flag : 0.000003s : 0.01%
    optimize.opt_a.switch_simplify : 0.000008s : 0.04%
    optimize.opt_a.a_1 : 0.000018s : 0.09%
    optimize.opt_a.recompute_prepare : 0.000002s : 0.01%
    optimize.opt_a.updatestate_depend_eliminate : 0.000007s : 0.04%
    optimize.opt_a.updatestate_assign_eliminate : 0.000004s : 0.02%
    optimize.opt_a.updatestate_loads_eliminate : 0.000003s : 0.01%
    optimize.opt_a.parameter_eliminate : 0.000003s : 0.01%
    optimize.opt_a.a_2 : 0.000027s : 0.14%
    optimize.opt_a.accelerated_algorithm : 0.000003s : 0.01%
    optimize.opt_a.pynative_shard : 0.000002s : 0.01%
    optimize.opt_a.auto_parallel : 0.000004s : 0.02%
    optimize.opt_a.parallel : 0.000020s : 0.10%
    optimize.opt_a.merge_comm : 0.000013s : 0.07%
    optimize.opt_a.allreduce_fusion : 0.000002s : 0.01%
    optimize.opt_a.virtual_dataset : 0.000002s : 0.01%
    optimize.opt_a.get_grad_eliminate_ : 0.000002s : 0.01%
    optimize.opt_a.virtual_output : 0.000002s : 0.01%
    optimize.opt_a.merge_forward : 0.000005s : 0.02%
    optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00%
    optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000008s : 0.04%
    optimize.opt_a.meta_fg_expand : 0.000003s : 0.02%
    optimize.opt_a.after_resolve : 0.000005s : 0.03%
    optimize.opt_a.a_after_grad : 0.000003s : 0.01%
    optimize.opt_a.renormalize : 0.000234s : 1.18%
    optimize.opt_a.real_op_eliminate : 0.000004s : 0.02%
    optimize.opt_a.auto_monad_grad : 0.000004s : 0.02%
    optimize.opt_a.auto_monad_eliminator : 0.000011s : 0.05%
    optimize.opt_a.cse : 0.000022s : 0.11%
    optimize.opt_a.a_3 : 0.000014s : 0.07%
    optimize.py_interpret_to_execute_after_opt_a : 0.000007s : 0.03%
    optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.01%
    optimize.rewriter_after_opt_a : 0.000131s : 0.66%
    optimize.convert_after_rewriter : 0.000007s : 0.04%
    optimize.order_py_execute_after_rewriter : 0.000005s : 0.02%
    optimize.opt_b.b_1 : 0.000037s : 0.19%
    optimize.opt_b.b_2 : 0.000003s : 0.02%
    optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.02%
    optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.01%
    optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.01%
    optimize.opt_b.renormalize : 0.000000s : 0.00%
    optimize.opt_b.cse : 0.000011s : 0.05%
    optimize.cconv : 0.000024s : 0.12%
    optimize.opt_after_cconv.c_1 : 0.000005s : 0.02%
    optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00%
    optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.01%
    optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.01%
    optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.01%
    optimize.opt_after_cconv.cse : 0.000007s : 0.04%
    optimize.opt_after_cconv.renormalize : 0.000001s : 0.00%
    optimize.remove_dup_value : 0.000011s : 0.05%
    optimize.tuple_transform.d_1 : 0.000013s : 0.07%
    optimize.tuple_transform.d_2 : 0.000006s : 0.03%
    optimize.tuple_transform.renormalize : 0.000000s : 0.00%
    optimize.add_cache_embedding : 0.000011s : 0.06%
    optimize.add_recomputation : 0.000048s : 0.24%
    optimize.cse_after_recomputation.cse : 0.000007s : 0.04%
    optimize.environ_conv : 0.000017s : 0.08%
    optimize.label_micro_interleaved_index : 0.000002s : 0.01%
    optimize.label_fine_grained_interleaved_index : 0.000002s : 0.01%
    optimize.assign_add_opt : 0.000002s : 0.01%
    optimize.slice_recompute_activation : 0.000002s : 0.01%
    optimize.micro_interleaved_order_control : 0.000002s : 0.01%
    optimize.full_micro_interleaved_order_control : 0.000002s : 0.01%
    optimize.comp_comm_scheduling : 0.000002s : 0.01%
    optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.01%
    optimize.comm_op_add_attrs : 0.000001s : 0.01%
    optimize.add_comm_op_reuse_tag : 0.000001s : 0.00%
    optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01%
    optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.01%
    optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.01%
    optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.01%
    optimize.split_matmul_comm_elemetwise : 0.000002s : 0.01%
    optimize.split_layernorm_comm : 0.000002s : 0.01%
    optimize.process_send_recv_for_ge : 0.000002s : 0.01%
    optimize.handle_group_info : 0.000001s : 0.01%
    auto_monad_reorder : 0.000022s : 0.11%
    get_jit_bprop_graph : 0.000000s : 0.00%
    eliminate_special_op_node : 0.000463s : 2.35%
    validate : 0.000040s : 0.20%
    distribtued_split : 0.000001s : 0.01%
    task_emit : 0.006055s : 30.70%
    execute : 0.000009s : 0.04%
Time group info:
------[substitution.] 0.000274 12
    95.29% : 0.000261s : 1: substitution.getattr_setattr_resolve
    1.79% : 0.000005s : 3: substitution.graph_param_transform
    1.34% : 0.000004s : 2: substitution.meta_unpack_prepare
    0.51% : 0.000001s : 3: substitution.partial_unused_args_eliminate
    0.40% : 0.000001s : 2: substitution.remove_not_recompute_node
    0.65% : 0.000002s : 1: substitution.replace_old_param
------[renormalize.] 0.000227 2
    61.21% : 0.000139s : 1: renormalize.infer
    38.79% : 0.000088s : 1: renormalize.specialize
------[replace.] 0.000029 1
    100.00% : 0.000029s : 1: replace.getattr_setattr_resolve
------[match.] 0.000261 1
    100.00% : 0.000261s : 1: match.getattr_setattr_resolve
------[func_graph_cloner_run.] 0.000101 3
    17.49% : 0.000018s : 1: func_graph_cloner_run.FuncGraphClonerGraph
    82.51% : 0.000083s : 2: func_graph_cloner_run.FuncGraphSpecializer
------[meta_graph.] 0.000000 0
------[manager.] 0.000000 0
------[pynative] 0.000000 0
------[others.] 0.000468 69
    8.35% : 0.000039s : 26: opt.transform.opt_a
    6.20% : 0.000029s : 23: opt.transform.opt_b
    67.93% : 0.000318s : 2: opt.transform.opt_resolve
    2.26% : 0.000011s : 1: opt.transforms.meta_unpack_prepare
    8.61% : 0.000040s : 10: opt.transforms.opt_a
    0.78% : 0.000004s : 1: opt.transforms.opt_after_cconv
    0.44% : 0.000002s : 1: opt.transforms.opt_b
    3.70% : 0.000017s : 2: opt.transforms.opt_trans_graph
    1.74% : 0.000008s : 3: opt.transforms.special_op_eliminate
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.689.099 [scalable_config.cc:55][EVENT]195885 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.771.225 [graph_var_manager.cc:1424][EVENT]195885 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.771.330 [graph_manager.cc:1248][EVENT]195885 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:31:08.772.213 [atrace_api.c:28](tid:195885) AtraceCreate start
[INFO] ATRACE(191849,python3.7):2024-01-11-05:31:08.772.283 [trace_rb_log.c:84](tid:195885) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(191849,python3.7):2024-01-11-05:31:08.772.296 [atrace_api.c:32](tid:195885) AtraceCreate end
[INFO] TDT(191849,python3.7):2024-01-11-05:31:08.772.327 [client_manager.cpp:157][SetProfilingCallback][tid:195885] [TsdClient] set profiling callback success
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.773.304 [parallel_partitioner.cc:165][EVENT]195885 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [21] micro second.
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.773.354 [parallel_partitioner.cc:178][EVENT]195885 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [19] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.773.411 [graph_prepare.cc:1378][EVENT]195885 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.071 [graph_manager.cc:1050][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [682] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.100 [graph_manager.cc:1052][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.250 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.282 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.360 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [65] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.374 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.468 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.486 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [7] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.505 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.606 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.774.628 [graph_manager.cc:1054][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [513] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.782.330 [graph_manager.cc:1055][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7687] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.419 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.444 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.456 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of MergePass is [4] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.466 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of InferShapePass is [298] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.475 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [15] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.484 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.493 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of 
DimensionComputePass is [23] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.501 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [22] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.783.510 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of InferValuePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.785.604 [graph_manager.cc:1056][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3233] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.785.687 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.785.705 [graph_prepare.cc:1982][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [55] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.075 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.096 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.107 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.116 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of InferShapePass is [190] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.125 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.133 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.142 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.150 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [10] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.159 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.183 [graph_prepare.cc:1983][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [464] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.208 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.220 [graph_prepare.cc:1984][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [22] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.234 [graph_prepare.cc:1985][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.261 [graph_prepare.cc:1986][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [16] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.273 [graph_prepare.cc:1987][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.288 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.299 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.312 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [5] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.394 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.407 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of CondPass is [4] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.423 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrintOpPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.432 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.441 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.450 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.458 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [0] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.467 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.475 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.484 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.492 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.500 
[base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.509 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.517 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.525 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.533 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.556 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [9] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.569 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.604 [graph_prepare.cc:1988][EVENT]195885 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [321] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.786.616 [graph_manager.cc:1065][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [967] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.799.857 [graph_manager.cc:1077][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13219] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.799.938 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.799.987 [graph_manager.cc:1080][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [90] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.803.908 [graph_manager.cc:1081][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3905] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.803.960 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.803.976 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.803.989 [graph_manager.cc:1082][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [37] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.023 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.038 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.052 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.083 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [22] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.096 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.110 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.125 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.172 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [36] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.192 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [9] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.210 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.259 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [39] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.278 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.290 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.299 [graph_manager.cc:2700][EVENT]195885 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [282] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.442 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of EnterPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.457 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.467 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.476 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.485 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.499 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of CastRemovePass is [12] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.508 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.517 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [6] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.525 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.533 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.542 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [10] micro second, call num is [3] [INFO] 
GE(191849,python3.7):2024-01-11-05:31:08.804.550 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.558 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.567 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.575 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.586 [graph_manager.cc:2741][EVENT]195885 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [267] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.595 [graph_manager.cc:2752][EVENT]195885 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.617 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.629 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.650 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.665 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.675 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.687 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.713 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.727 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.740 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.750 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.763 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.778 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.796 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.810 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.820 [graph_manager.cc:2810][EVENT]195885 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [206] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.849 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.860 [graph_manager.cc:2821][EVENT]195885 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [32] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.804.890 [graph_manager.cc:1087][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [881] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.044 [graph_manager.cc:1088][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [141] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.088 [graph_manager.cc:1089][EVENT]195885 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [23] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.108 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.135 [graph_manager.cc:1097][EVENT]195885 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.164 [graph_manager.cc:3325][EVENT]195885 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.620 [engine_place.cc:144][EVENT]195885 Run:The time cost of AIcoreEngine::CheckSupported is [321] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.645 [engine_place.cc:144][EVENT]195885 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [8] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.654 [engine_place.cc:144][EVENT]195885 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.744 [graph_manager.cc:3351][EVENT]195885 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [565] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.762 [graph_manager.cc:3364][EVENT]195885 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.837 [engine_partitioner.cc:1139][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [21] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.805.854 [engine_partitioner.cc:1142][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.806.014 [engine_partitioner.cc:1148][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [151] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.806.056 [engine_partitioner.cc:1155][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [29] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.806.110 [engine_partitioner.cc:1164][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [38] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.806.145 [graph_manager.cc:3405][EVENT]195885 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [370] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.806.163 [graph_manager.cc:3412][EVENT]195885 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.489 [graph_manager.cc:3422][EVENT]195885 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [13309] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.551 [graph_manager.cc:3428][EVENT]195885 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [13] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.727 [graph_manager.cc:3467][EVENT]195885 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [152] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.746 [graph_manager.cc:3377][EVENT]195885 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [13971] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.764 [graph_manager.cc:1106][EVENT]195885 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [14608] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.778 [graph_manager.cc:1115][EVENT]195885 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.805 [graph_manager.cc:1130][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.839 [graph_manager.cc:1131][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [20] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.870 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.888 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.819.898 [graph_manager.cc:2837][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [43] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.006 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [21] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.019 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.029 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.038 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of BitcastPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.047 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.055 [base_pass.cc:339][EVENT]195885 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [9] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.079 [graph_manager.cc:2864][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [161] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.091 [graph_manager.cc:2872][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.113 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.127 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.144 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.158 [compile_nodes_pass.cc:88][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.167 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [14] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.178 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.266 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [81] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.300 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [21] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.314 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.326 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.341 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.350 [graph_manager.cc:2927][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [243] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.369 [graph_manager.cc:2937][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.385 [graph_manager.cc:2943][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.820.396 [graph_manager.cc:2950][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.295 [graph_manager.cc:2958][EVENT]195885 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [54] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.349 [graph_manager.cc:1132][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11496] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.460 [graph_manager.cc:1135][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [95] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.524 [graph_manager.cc:2975][EVENT]195885 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [37] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.568 [graph_manager.cc:2981][EVENT]195885 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [30] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.586 [pass_manager.cc:82][EVENT]195885 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.597 [graph_manager.cc:2986][EVENT]195885 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [16] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.606 [graph_manager.cc:1136][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [120] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.768 [graph_manager.cc:3555][EVENT]195885 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [125] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.886 [engine_partitioner.cc:1139][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [22] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.831.905 [engine_partitioner.cc:1142][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [5] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.044 [engine_partitioner.cc:1148][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [129] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.083 [engine_partitioner.cc:1155][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [26] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.132 [engine_partitioner.cc:1164][EVENT]195885 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [38] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.161 [graph_builder.cc:865][EVENT]195885 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [317] micro second. [INFO] RUNTIME(191849,python3.7):2024-01-11-05:31:08.832.668 [logger.cc:1071] 195885 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.707 [task_generator.cc:804][EVENT]195885 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [185] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.832.780 [task_generator.cc:805][EVENT]195885 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [60] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.833.732 [task_generator.cc:814][EVENT]195885 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [937] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.833.750 [task_generator.cc:954][EVENT]195885 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1229] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.833.825 [task_generator.cc:967][EVENT]195885 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [45] micro second. 
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:31:08.833.844 [logger.cc:1084] 195885 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(191849,python3.7):2024-01-11-05:31:08.834.040 [graph_manager.cc:1152][EVENT]195885 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2408] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.834.060 [graph_manager.cc:1164][EVENT]195885 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.834.099 [graph_manager.cc:1271][EVENT]195885 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [60926] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.834.117 [graph_manager.cc:1272][EVENT]195885 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:08.834.437 [atrace_api.c:93](tid:195885) AtraceDestroy start [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:08.834.461 [atrace_api.c:95](tid:195885) AtraceDestroy end [INFO] GE(191849,python3.7):2024-01-11-05:31:08.839.641 [graph_converter.cc:838][EVENT]195885 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1453] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.839.824 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of ZeroCopy is [137] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.325 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CEM is [479] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.522 [copy_flow_launch_fuse.cc:395][EVENT]195885 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [174] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.541 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [194] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.775 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [221] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.800 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.840.834 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of ZeroCopy is [22] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.022 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CEM is [176] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.103 [copy_flow_launch_fuse.cc:395][EVENT]195885 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [64] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.131 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [93] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.165 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [20] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.177 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.204 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.276 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CEM is [62] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.343 [copy_flow_launch_fuse.cc:395][EVENT]195885 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [54] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.354 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [66] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.380 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [16] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.390 [base_optimizer.cc:70][EVENT]195885 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.403 [graph_converter.cc:849][EVENT]195885 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1722] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.841.611 [graph_converter.cc:853][EVENT]195885 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [199] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.842.326 [graph_converter.cc:857][EVENT]195885 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [693] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:08.842.475 [graph_converter.cc:862][EVENT]195885 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [124] micro second. 
TotalTime = 0.011043, [20] [parse]: 0.00146872 [symbol_resolve]: 0.00031638, [1] [Cycle 1]: 0.0002788, [1] [resolve]: 0.00026204 [combine_like_graphs]: 1.04e-06 [graph_reusing]: 3.37001e-06 [meta_unpack_prepare]: 3.117e-05 [pre_cconv]: 6.99998e-07 [abstract_specialize]: 0.00050836 [pack_expand]: 8.66e-06 [auto_monad]: 3.767e-05 [inline]: 1.51e-06 [pre_auto_parallel]: 1.303e-05 [pipeline_split]: 3.88e-06 [optimize]: 0.00359823, [35] [py_interpret_to_execute]: 1.114e-05 [rewriter_before_opt_a]: 1.799e-05 [opt_a]: 0.00300676, [1] [Cycle 1]: 0.00058878, [30] [expand_dump_flag]: 2.96e-06 [switch_simplify]: 8.54e-06 [a_1]: 1.912e-05 [recompute_prepare]: 1.78e-06 [updatestate_depend_eliminate]: 8.38999e-06 [updatestate_assign_eliminate]: 3.22e-06 [updatestate_loads_eliminate]: 3.07e-06 [parameter_eliminate]: 2.55e-06 [a_2]: 2.804e-05 [accelerated_algorithm]: 3.12e-06 [pynative_shard]: 1.63e-06 [auto_parallel]: 3.79e-06 [parallel]: 9.62e-06 [merge_comm]: 3.40999e-06 [allreduce_fusion]: 1.86e-06 [virtual_dataset]: 2.53e-06 [get_grad_eliminate_]: 2.05e-06 [virtual_output]: 1.86e-06 [merge_forward]: 4.84e-06 [cell_reuse_recompute_pass]: 1.23e-06 [cell_reuse_handle_not_recompute_node_pass]: 9.74999e-06 [meta_fg_expand]: 3.31e-06 [after_resolve]: 5.11e-06 [a_after_grad]: 2.46e-06 [renormalize]: 0.00026551 [real_op_eliminate]: 4.26e-06 [auto_monad_grad]: 3.71e-06 [auto_monad_eliminator]: 9.61e-06 [cse]: 2.5e-05 [a_3]: 1.433e-05 [py_interpret_to_execute_after_opt_a]: 7.73e-06 [slice_cell_reuse_recomputed_activation]: 2.47e-06 [rewriter_after_opt_a]: 0.00011898 [convert_after_rewriter]: 7.67e-06 [order_py_execute_after_rewriter]: 5.77e-06 [opt_b]: 9.259e-05, [1] [Cycle 1]: 8.76e-05, [7] [b_1]: 3.981e-05 [b_2]: 3.32e-06 [updatestate_depend_eliminate]: 3.09e-06 [updatestate_assign_eliminate]: 2.4e-06 [updatestate_loads_eliminate]: 2.12e-06 [renormalize]: 3.80001e-07 [cse]: 9.74e-06 [cconv]: 2.541e-05 [opt_after_cconv]: 4.806e-05, [1] [Cycle 1]: 4.427e-05, [7] [c_1]: 4.93e-06 
[parameter_eliminate]: 6.99998e-07 [updatestate_depend_eliminate]: 2.41e-06 [updatestate_assign_eliminate]: 2.01001e-06 [updatestate_loads_eliminate]: 1.79e-06 [cse]: 6.73e-06 [renormalize]: 2.50002e-07 [remove_dup_value]: 1.008e-05 [tuple_transform]: 3.432e-05, [1] [Cycle 1]: 3.088e-05, [3] [d_1]: 1.303e-05 [d_2]: 6.27e-06 [renormalize]: 2.39997e-07 [add_cache_embedding]: 1.165e-05 [add_recomputation]: 3.91e-05 [cse_after_recomputation]: 1.543e-05, [1] [Cycle 1]: 1.135e-05, [1] [cse]: 7.02e-06 [environ_conv]: 4.79e-06 [label_micro_interleaved_index]: 2.39e-06 [label_fine_grained_interleaved_index]: 2.38e-06 [assign_add_opt]: 1.53e-06 [slice_recompute_activation]: 2.4e-06 [micro_interleaved_order_control]: 1.76e-06 [full_micro_interleaved_order_control]: 1.99999e-06 [comp_comm_scheduling]: 2.09e-06 [reorder_send_recv_between_fp_bp]: 2.40999e-06 [comm_op_add_attrs]: 1.36e-06 [add_comm_op_reuse_tag]: 8.90002e-07 [overlap_opt_shard_in_pipeline]: 1.04e-06 [grouped_pairwise_exchange_alltoall]: 1.25e-06 [overlap_recompute_and_grad_model_parallel]: 2.2e-06 [overlap_grad_matmul_and_grad_allreduce]: 1.14e-06 [split_matmul_comm_elemetwise]: 2.57e-06 [split_layernorm_comm]: 1.89e-06 [process_send_recv_for_ge]: 8.10003e-07 [handle_group_info]: 1.02e-06 [auto_monad_reorder]: 1.616e-05 [get_jit_bprop_graph]: 4.1e-07 [eliminate_special_op_node]: 0.00049309 [validate]: 2.301e-05 [distribtued_split]: 1.05e-06 [task_emit]: 0.00431769 [execute]: 8.15e-06 Sums parse : 0.001469s : 18.23% symbol_resolve.resolve : 0.000262s : 3.25% combine_like_graphs : 0.000001s : 0.01% graph_reusing : 0.000003s : 0.04% meta_unpack_prepare : 0.000031s : 0.39% pre_cconv : 0.000001s : 0.01% abstract_specialize : 0.000508s : 6.31% pack_expand : 0.000009s : 0.11% auto_monad : 0.000038s : 0.47% inline : 0.000002s : 0.02% pre_auto_parallel : 0.000013s : 0.16% pipeline_split : 0.000004s : 0.05% optimize.py_interpret_to_execute : 0.000011s : 0.14% optimize.rewriter_before_opt_a : 0.000018s : 0.22% 
optimize.opt_a.expand_dump_flag : 0.000003s : 0.04% optimize.opt_a.switch_simplify : 0.000009s : 0.11% optimize.opt_a.a_1 : 0.000019s : 0.24% optimize.opt_a.recompute_prepare : 0.000002s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000008s : 0.10% optimize.opt_a.updatestate_assign_eliminate : 0.000003s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000003s : 0.04% optimize.opt_a.parameter_eliminate : 0.000003s : 0.03% optimize.opt_a.a_2 : 0.000028s : 0.35% optimize.opt_a.accelerated_algorithm : 0.000003s : 0.04% optimize.opt_a.pynative_shard : 0.000002s : 0.02% optimize.opt_a.auto_parallel : 0.000004s : 0.05% optimize.opt_a.parallel : 0.000010s : 0.12% optimize.opt_a.merge_comm : 0.000003s : 0.04% optimize.opt_a.allreduce_fusion : 0.000002s : 0.02% optimize.opt_a.virtual_dataset : 0.000003s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000002s : 0.03% optimize.opt_a.virtual_output : 0.000002s : 0.02% optimize.opt_a.merge_forward : 0.000005s : 0.06% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.02% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000010s : 0.12% optimize.opt_a.meta_fg_expand : 0.000003s : 0.04% optimize.opt_a.after_resolve : 0.000005s : 0.06% optimize.opt_a.a_after_grad : 0.000002s : 0.03% optimize.opt_a.renormalize : 0.000266s : 3.30% optimize.opt_a.real_op_eliminate : 0.000004s : 0.05% optimize.opt_a.auto_monad_grad : 0.000004s : 0.05% optimize.opt_a.auto_monad_eliminator : 0.000010s : 0.12% optimize.opt_a.cse : 0.000025s : 0.31% optimize.opt_a.a_3 : 0.000014s : 0.18% optimize.py_interpret_to_execute_after_opt_a : 0.000008s : 0.10% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.03% optimize.rewriter_after_opt_a : 0.000119s : 1.48% optimize.convert_after_rewriter : 0.000008s : 0.10% optimize.order_py_execute_after_rewriter : 0.000006s : 0.07% optimize.opt_b.b_1 : 0.000040s : 0.49% optimize.opt_b.b_2 : 0.000003s : 0.04% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.04% 
optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.03% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.03% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000010s : 0.12% optimize.cconv : 0.000025s : 0.32% optimize.opt_after_cconv.c_1 : 0.000005s : 0.06% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.01% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.03% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.02% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.02% optimize.opt_after_cconv.cse : 0.000007s : 0.08% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.13% optimize.tuple_transform.d_1 : 0.000013s : 0.16% optimize.tuple_transform.d_2 : 0.000006s : 0.08% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.14% optimize.add_recomputation : 0.000039s : 0.49% optimize.cse_after_recomputation.cse : 0.000007s : 0.09% optimize.environ_conv : 0.000005s : 0.06% optimize.label_micro_interleaved_index : 0.000002s : 0.03% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.03% optimize.assign_add_opt : 0.000002s : 0.02% optimize.slice_recompute_activation : 0.000002s : 0.03% optimize.micro_interleaved_order_control : 0.000002s : 0.02% optimize.full_micro_interleaved_order_control : 0.000002s : 0.02% optimize.comp_comm_scheduling : 0.000002s : 0.03% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.03% optimize.comm_op_add_attrs : 0.000001s : 0.02% optimize.add_comm_op_reuse_tag : 0.000001s : 0.01% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.02% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.03% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.01% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.03% optimize.split_layernorm_comm : 
0.000002s : 0.02% optimize.process_send_recv_for_ge : 0.000001s : 0.01% optimize.handle_group_info : 0.000001s : 0.01% auto_monad_reorder : 0.000016s : 0.20% get_jit_bprop_graph : 0.000000s : 0.01% eliminate_special_op_node : 0.000493s : 6.12% validate : 0.000023s : 0.29% distribtued_split : 0.000001s : 0.01% task_emit : 0.004318s : 53.59% execute : 0.000008s : 0.10% Time group info: ------[substitution.] 0.000207 12 94.10% : 0.000195s : 1: substitution.getattr_setattr_resolve 2.09% : 0.000004s : 3: substitution.graph_param_transform 1.81% : 0.000004s : 2: substitution.meta_unpack_prepare 0.60% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.70% : 0.000001s : 2: substitution.remove_not_recompute_node 0.69% : 0.000001s : 1: substitution.replace_old_param ------[renormalize.] 0.000259 2 63.14% : 0.000164s : 1: renormalize.infer 36.86% : 0.000095s : 1: renormalize.specialize ------[replace.] 0.000031 1 100.00% : 0.000031s : 1: replace.getattr_setattr_resolve ------[match.] 0.000195 1 100.00% : 0.000195s : 1: match.getattr_setattr_resolve ------[func_graph_cloner_run.] 0.000104 3 17.85% : 0.000019s : 1: func_graph_cloner_run.FuncGraphClonerGraph 82.15% : 0.000085s : 2: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.000412 69 9.99% : 0.000041s : 26: opt.transform.opt_a 7.46% : 0.000031s : 23: opt.transform.opt_b 62.02% : 0.000256s : 2: opt.transform.opt_resolve 2.56% : 0.000011s : 1: opt.transforms.meta_unpack_prepare 10.10% : 0.000042s : 10: opt.transforms.opt_a 0.91% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.55% : 0.000002s : 1: opt.transforms.opt_b 4.30% : 0.000018s : 2: opt.transforms.opt_trans_graph 2.11% : 0.000009s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0108284, [20] [parse]: 0.00134193 [symbol_resolve]: 0.00031961, [1] [Cycle 1]: 0.00027965, [1] [resolve]: 0.00026285 [combine_like_graphs]: 1.26e-06 [graph_reusing]: 3.04999e-06 [meta_unpack_prepare]: 3.035e-05 [pre_cconv]: 7.10002e-07 [abstract_specialize]: 0.00047393 [pack_expand]: 8.62e-06 [auto_monad]: 3.772e-05 [inline]: 1.62e-06 [pre_auto_parallel]: 1.201e-05 [pipeline_split]: 3.17e-06 [optimize]: 0.00355364, [35] [py_interpret_to_execute]: 9.63e-06 [rewriter_before_opt_a]: 1.768e-05 [opt_a]: 0.00296391, [1] [Cycle 1]: 0.0005779, [30] [expand_dump_flag]: 3.3e-06 [switch_simplify]: 9.12e-06 [a_1]: 1.97e-05 [recompute_prepare]: 1.71e-06 [updatestate_depend_eliminate]: 7.47e-06 [updatestate_assign_eliminate]: 3.81e-06 [updatestate_loads_eliminate]: 2.91e-06 [parameter_eliminate]: 2.36e-06 [a_2]: 2.766e-05 [accelerated_algorithm]: 2.51e-06 [pynative_shard]: 1.62e-06 [auto_parallel]: 4.54e-06 [parallel]: 9.39e-06 [merge_comm]: 3.55e-06 [allreduce_fusion]: 1.74e-06 [virtual_dataset]: 2.3e-06 [get_grad_eliminate_]: 1.87e-06 [virtual_output]: 1.6e-06 [merge_forward]: 4.99e-06 [cell_reuse_recompute_pass]: 7.99999e-07 [cell_reuse_handle_not_recompute_node_pass]: 8.97e-06 [meta_fg_expand]: 3.4e-06 [after_resolve]: 4.50001e-06 [a_after_grad]: 2.51e-06 [renormalize]: 0.00025965 [real_op_eliminate]: 4.18e-06 [auto_monad_grad]: 3.81e-06 [auto_monad_eliminator]: 9.68e-06 [cse]: 2.673e-05 [a_3]: 1.437e-05 [py_interpret_to_execute_after_opt_a]: 8.14e-06 [slice_cell_reuse_recomputed_activation]: 2.45e-06 
[rewriter_after_opt_a]: 0.00012301 [convert_after_rewriter]: 6.75e-06 [order_py_execute_after_rewriter]: 4.7e-06 [opt_b]: 9.222e-05, [1] [Cycle 1]: 8.695e-05, [7] [b_1]: 3.804e-05 [b_2]: 2.95e-06 [updatestate_depend_eliminate]: 2.99e-06 [updatestate_assign_eliminate]: 2.37e-06 [updatestate_loads_eliminate]: 2.18e-06 [renormalize]: 5.10001e-07 [cse]: 1.038e-05 [cconv]: 2.472e-05 [opt_after_cconv]: 4.864e-05, [1] [Cycle 1]: 4.475e-05, [7] [c_1]: 4.73e-06 [parameter_eliminate]: 8.30005e-07 [updatestate_depend_eliminate]: 2.53e-06 [updatestate_assign_eliminate]: 2.1e-06 [updatestate_loads_eliminate]: 1.83e-06 [cse]: 7.18e-06 [renormalize]: 2.69996e-07 [remove_dup_value]: 1.208e-05 [tuple_transform]: 3.355e-05, [1] [Cycle 1]: 3.026e-05, [3] [d_1]: 1.299e-05 [d_2]: 6.44e-06 [renormalize]: 1.89997e-07 [add_cache_embedding]: 1.115e-05 [add_recomputation]: 3.931e-05 [cse_after_recomputation]: 1.516e-05, [1] [Cycle 1]: 1.112e-05, [1] [cse]: 7.01e-06 [environ_conv]: 5.21e-06 [label_micro_interleaved_index]: 2.21e-06 [label_fine_grained_interleaved_index]: 2.57e-06 [assign_add_opt]: 2.02e-06 [slice_recompute_activation]: 2.07e-06 [micro_interleaved_order_control]: 1.77e-06 [full_micro_interleaved_order_control]: 1.74e-06 [comp_comm_scheduling]: 2.26e-06 [reorder_send_recv_between_fp_bp]: 2.25e-06 [comm_op_add_attrs]: 1.11e-06 [add_comm_op_reuse_tag]: 9.69994e-07 [overlap_opt_shard_in_pipeline]: 1.04e-06 [grouped_pairwise_exchange_alltoall]: 1.42e-06 [overlap_recompute_and_grad_model_parallel]: 2.02e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.79997e-07 [split_matmul_comm_elemetwise]: 2.54e-06 [split_layernorm_comm]: 1.84e-06 [process_send_recv_for_ge]: 7.59996e-07 [handle_group_info]: 9.79999e-07 [auto_monad_reorder]: 1.491e-05 [get_jit_bprop_graph]: 4.30002e-07 [eliminate_special_op_node]: 0.00046475 [validate]: 2.326e-05 [distribtued_split]: 1.07e-06 [task_emit]: 0.00434004 [execute]: 9.23e-06 Sums parse : 0.001342s : 17.02% symbol_resolve.resolve : 0.000263s : 3.33% 
combine_like_graphs : 0.000001s : 0.02% graph_reusing : 0.000003s : 0.04% meta_unpack_prepare : 0.000030s : 0.39% pre_cconv : 0.000001s : 0.01% abstract_specialize : 0.000474s : 6.01% pack_expand : 0.000009s : 0.11% auto_monad : 0.000038s : 0.48% inline : 0.000002s : 0.02% pre_auto_parallel : 0.000012s : 0.15% pipeline_split : 0.000003s : 0.04% optimize.py_interpret_to_execute : 0.000010s : 0.12% optimize.rewriter_before_opt_a : 0.000018s : 0.22% optimize.opt_a.expand_dump_flag : 0.000003s : 0.04% optimize.opt_a.switch_simplify : 0.000009s : 0.12% optimize.opt_a.a_1 : 0.000020s : 0.25% optimize.opt_a.recompute_prepare : 0.000002s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000007s : 0.09% optimize.opt_a.updatestate_assign_eliminate : 0.000004s : 0.05% optimize.opt_a.updatestate_loads_eliminate : 0.000003s : 0.04% optimize.opt_a.parameter_eliminate : 0.000002s : 0.03% optimize.opt_a.a_2 : 0.000028s : 0.35% optimize.opt_a.accelerated_algorithm : 0.000003s : 0.03% optimize.opt_a.pynative_shard : 0.000002s : 0.02% optimize.opt_a.auto_parallel : 0.000005s : 0.06% optimize.opt_a.parallel : 0.000009s : 0.12% optimize.opt_a.merge_comm : 0.000004s : 0.05% optimize.opt_a.allreduce_fusion : 0.000002s : 0.02% optimize.opt_a.virtual_dataset : 0.000002s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000002s : 0.02% optimize.opt_a.virtual_output : 0.000002s : 0.02% optimize.opt_a.merge_forward : 0.000005s : 0.06% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.01% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000009s : 0.11% optimize.opt_a.meta_fg_expand : 0.000003s : 0.04% optimize.opt_a.after_resolve : 0.000005s : 0.06% optimize.opt_a.a_after_grad : 0.000003s : 0.03% optimize.opt_a.renormalize : 0.000260s : 3.29% optimize.opt_a.real_op_eliminate : 0.000004s : 0.05% optimize.opt_a.auto_monad_grad : 0.000004s : 0.05% optimize.opt_a.auto_monad_eliminator : 0.000010s : 0.12% optimize.opt_a.cse : 0.000027s : 0.34% optimize.opt_a.a_3 : 0.000014s : 
0.18% optimize.py_interpret_to_execute_after_opt_a : 0.000008s : 0.10% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.03% optimize.rewriter_after_opt_a : 0.000123s : 1.56% optimize.convert_after_rewriter : 0.000007s : 0.09% optimize.order_py_execute_after_rewriter : 0.000005s : 0.06% optimize.opt_b.b_1 : 0.000038s : 0.48% optimize.opt_b.b_2 : 0.000003s : 0.04% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.04% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.03% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.03% optimize.opt_b.renormalize : 0.000001s : 0.01% optimize.opt_b.cse : 0.000010s : 0.13% optimize.cconv : 0.000025s : 0.31% optimize.opt_after_cconv.c_1 : 0.000005s : 0.06% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.01% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.03% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.03% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.02% optimize.opt_after_cconv.cse : 0.000007s : 0.09% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.15% optimize.tuple_transform.d_1 : 0.000013s : 0.16% optimize.tuple_transform.d_2 : 0.000006s : 0.08% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.14% optimize.add_recomputation : 0.000039s : 0.50% optimize.cse_after_recomputation.cse : 0.000007s : 0.09% optimize.environ_conv : 0.000005s : 0.07% optimize.label_micro_interleaved_index : 0.000002s : 0.03% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.03% optimize.assign_add_opt : 0.000002s : 0.03% optimize.slice_recompute_activation : 0.000002s : 0.03% optimize.micro_interleaved_order_control : 0.000002s : 0.02% optimize.full_micro_interleaved_order_control : 0.000002s : 0.02% optimize.comp_comm_scheduling : 0.000002s : 0.03% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.03% 
optimize.comm_op_add_attrs : 0.000001s : 0.01% optimize.add_comm_op_reuse_tag : 0.000001s : 0.01% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.02% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.03% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.01% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.03% optimize.split_layernorm_comm : 0.000002s : 0.02% optimize.process_send_recv_for_ge : 0.000001s : 0.01% optimize.handle_group_info : 0.000001s : 0.01% auto_monad_reorder : 0.000015s : 0.19% get_jit_bprop_graph : 0.000000s : 0.01% eliminate_special_op_node : 0.000465s : 5.90% validate : 0.000023s : 0.30% distribtued_split : 0.000001s : 0.01% task_emit : 0.004340s : 55.06% execute : 0.000009s : 0.12% Time group info: ------[substitution.] 0.000209 12 93.93% : 0.000197s : 1: substitution.getattr_setattr_resolve 2.04% : 0.000004s : 3: substitution.graph_param_transform 1.72% : 0.000004s : 2: substitution.meta_unpack_prepare 1.05% : 0.000002s : 3: substitution.partial_unused_args_eliminate 0.55% : 0.000001s : 2: substitution.remove_not_recompute_node 0.70% : 0.000001s : 1: substitution.replace_old_param ------[renormalize.] 0.000253 2 62.73% : 0.000159s : 1: renormalize.infer 37.27% : 0.000094s : 1: renormalize.specialize ------[replace.] 0.000032 1 100.00% : 0.000032s : 1: replace.getattr_setattr_resolve ------[match.] 0.000197 1 100.00% : 0.000197s : 1: match.getattr_setattr_resolve ------[func_graph_cloner_run.] 0.000102 3 17.62% : 0.000018s : 1: func_graph_cloner_run.FuncGraphClonerGraph 82.38% : 0.000084s : 2: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.000411 69 9.83% : 0.000040s : 26: opt.transform.opt_a 7.23% : 0.000030s : 23: opt.transform.opt_b 62.51% : 0.000257s : 2: opt.transform.opt_resolve 2.53% : 0.000010s : 1: opt.transforms.meta_unpack_prepare 10.07% : 0.000041s : 10: opt.transforms.opt_a 0.89% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.50% : 0.000002s : 1: opt.transforms.opt_b 4.39% : 0.000018s : 2: opt.transforms.opt_trans_graph 2.05% : 0.000008s : 3: opt.transforms.special_op_eliminate [INFO] GE(191849,python3.7):2024-01-11-05:31:09.318.617 [graph_var_manager.cc:1424][EVENT]195887 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(191849,python3.7):2024-01-11-05:31:09.318.707 [graph_manager.cc:1248][EVENT]195887 PreRun:PreRun start: graph node size 3, session id 2, graph id 1, graph name online. [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:09.319.573 [atrace_api.c:28](tid:195887) AtraceCreate start [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:09.319.649 [trace_rb_log.c:84](tid:195887) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:09.319.664 [atrace_api.c:32](tid:195887) AtraceCreate end [INFO] TDT(191849,python3.7):2024-01-11-05:31:09.319.677 [client_manager.cpp:157][SetProfilingCallback][tid:195887] [TsdClient] set profiling callback success [INFO] GE(191849,python3.7):2024-01-11-05:31:09.320.583 [parallel_partitioner.cc:165][EVENT]195887 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.320.626 [parallel_partitioner.cc:178][EVENT]195887 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [14] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.320.677 [graph_prepare.cc:1378][EVENT]195887 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [5] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.391 [graph_manager.cc:1050][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [734] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.428 [graph_manager.cc:1052][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [9] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.567 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.601 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.652 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [38] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.665 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.712 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [11] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.726 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.745 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [8] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.864 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.321.885 [graph_manager.cc:1054][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [445] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.322.117 [graph_manager.cc:1055][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [217] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.034 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.062 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.073 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.082 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of InferShapePass is [278] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.091 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [12] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.100 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [1] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.109 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [11] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.118 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time 
cost of ConstantFoldingPass is [17] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.323.126 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of InferValuePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.257 [graph_manager.cc:1056][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3120] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.350 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of CondRemovePass is [5] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.366 [graph_prepare.cc:1982][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [57] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.766 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.789 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.800 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.810 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of InferShapePass is [203] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.819 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [10] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.827 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.836 
[base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [7] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.864 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [10] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.873 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of InferValuePass is [6] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.899 [graph_prepare.cc:1983][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [520] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.925 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.937 [graph_prepare.cc:1984][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [22] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.952 [graph_prepare.cc:1985][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.974 [graph_prepare.cc:1986][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.325.986 [graph_prepare.cc:1987][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.001 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.013 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.027 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.111 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.124 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of CondPass is [4] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.134 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrintOpPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.143 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.151 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.160 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.168 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.177 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.185 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.193 
[base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.202 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.219 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.228 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.236 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.244 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.252 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.275 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [11] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.288 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.324 [graph_prepare.cc:1988][EVENT]195887 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [327] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.326.337 [graph_manager.cc:1065][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [1028] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.339.003 [graph_manager.cc:1077][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [12643] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.339.104 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.339.155 [graph_manager.cc:1080][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [92] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.093 [graph_manager.cc:1081][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3922] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.156 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.172 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.184 [graph_manager.cc:1082][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [38] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.219 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.234 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.247 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.277 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [21] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.292 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.305 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.334 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.376 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [33] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.398 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [11] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.418 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.445 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [15] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.461 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [5] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.473 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.483 [graph_manager.cc:2700][EVENT]195887 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [270] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.620 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of EnterPass is [0] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.636 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.646 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.655 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.664 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.673 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of CastRemovePass is [11] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.681 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.690 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.698 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [2] micro second, call num is [3] [INFO] 
GE(191849,python3.7):2024-01-11-05:31:09.343.706 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.714 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [10] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.723 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.731 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [15] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.739 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.765 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.777 [graph_manager.cc:2741][EVENT]195887 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [274] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.786 [graph_manager.cc:2752][EVENT]195887 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.810 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.822 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.841 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.857 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.868 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.880 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.900 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [11] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.915 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.928 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.937 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.950 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.961 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [3] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.979 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.343.991 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.000 [graph_manager.cc:2810][EVENT]195887 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [195] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.029 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.041 [graph_manager.cc:2821][EVENT]195887 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [33] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.073 [graph_manager.cc:1087][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [868] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.242 [graph_manager.cc:1088][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [155] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.289 [graph_manager.cc:1089][EVENT]195887 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [20] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.309 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.325 [graph_manager.cc:1097][EVENT]195887 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.349 [graph_manager.cc:3325][EVENT]195887 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.811 [engine_place.cc:144][EVENT]195887 Run:The time cost of AIcoreEngine::CheckSupported is [356] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.836 [engine_place.cc:144][EVENT]195887 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.846 [engine_place.cc:144][EVENT]195887 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.923 [graph_manager.cc:3351][EVENT]195887 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [560] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.344.942 [graph_manager.cc:3364][EVENT]195887 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.013 [engine_partitioner.cc:1139][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.030 [engine_partitioner.cc:1142][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.211 [engine_partitioner.cc:1148][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [172] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.256 [engine_partitioner.cc:1155][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [30] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.310 [engine_partitioner.cc:1164][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [44] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.346 [graph_manager.cc:3405][EVENT]195887 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [391] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.345.365 [graph_manager.cc:3412][EVENT]195887 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [8] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.133 [graph_manager.cc:3422][EVENT]195887 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [8752] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.187 [graph_manager.cc:3428][EVENT]195887 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [10] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.341 [graph_manager.cc:3467][EVENT]195887 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [131] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.360 [graph_manager.cc:3377][EVENT]195887 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [9405] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.391 [graph_manager.cc:1106][EVENT]195887 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [10049] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.403 [graph_manager.cc:1115][EVENT]195887 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.428 [graph_manager.cc:1130][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.461 [graph_manager.cc:1131][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [21] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.488 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [7] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.506 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.516 [graph_manager.cc:2837][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [38] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.604 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [15] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.617 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.626 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.635 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of BitcastPass is [0] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.644 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [6] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.652 [base_pass.cc:339][EVENT]195887 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [7] micro second, call num is [3] [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.663 [graph_manager.cc:2864][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [129] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.675 [graph_manager.cc:2872][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.696 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.712 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.727 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [6] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.741 [compile_nodes_pass.cc:88][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.750 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [14] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.760 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [2] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.844 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [69] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.874 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [18] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.887 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.899 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.912 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [4] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.921 [graph_manager.cc:2927][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [229] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.933 [graph_manager.cc:2937][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [4] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.948 [graph_manager.cc:2943][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [5] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.354.959 [graph_manager.cc:2950][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.146 [graph_manager.cc:2958][EVENT]195887 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [35] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.179 [graph_manager.cc:1132][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [704] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.264 [graph_manager.cc:1135][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [72] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.303 [graph_manager.cc:2975][EVENT]195887 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [23] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.336 [graph_manager.cc:2981][EVENT]195887 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [19] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.349 [pass_manager.cc:82][EVENT]195887 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.359 [graph_manager.cc:2986][EVENT]195887 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [12] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.368 [graph_manager.cc:1136][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [88] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.496 [graph_manager.cc:3555][EVENT]195887 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [96] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.589 [engine_partitioner.cc:1139][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.604 [engine_partitioner.cc:1142][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.717 [engine_partitioner.cc:1148][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [103] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.758 [engine_partitioner.cc:1155][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [20] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.803 [engine_partitioner.cc:1164][EVENT]195887 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [32] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.355.827 [graph_builder.cc:865][EVENT]195887 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [272] micro second. 
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:31:09.356.287 [logger.cc:1071] 195887 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.356.319 [task_generator.cc:804][EVENT]195887 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [185] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.356.384 [task_generator.cc:805][EVENT]195887 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [53] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.175 [task_generator.cc:814][EVENT]195887 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [775] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.195 [task_generator.cc:954][EVENT]195887 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1062] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.260 [task_generator.cc:967][EVENT]195887 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [37] micro second. [INFO] RUNTIME(191849,python3.7):2024-01-11-05:31:09.357.282 [logger.cc:1084] 195887 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.471 [graph_manager.cc:1152][EVENT]195887 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2080] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.491 [graph_manager.cc:1164][EVENT]195887 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.528 [graph_manager.cc:1271][EVENT]195887 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [37040] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.357.539 [graph_manager.cc:1272][EVENT]195887 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:09.357.860 [atrace_api.c:93](tid:195887) AtraceDestroy start [INFO] ATRACE(191849,python3.7):2024-01-11-05:31:09.357.879 [atrace_api.c:95](tid:195887) AtraceDestroy end [INFO] GE(191849,python3.7):2024-01-11-05:31:09.362.789 [graph_converter.cc:838][EVENT]195887 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1318] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.362.984 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of ZeroCopy is [129] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.454 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CEM is [447] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.653 [copy_flow_launch_fuse.cc:395][EVENT]195887 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [175] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.672 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [196] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.895 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [211] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.913 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.363.964 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of ZeroCopy is [23] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.156 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CEM is [177] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.236 [copy_flow_launch_fuse.cc:395][EVENT]195887 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [63] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.250 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [78] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.278 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [19] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.290 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.315 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.386 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CEM is [60] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.449 [copy_flow_launch_fuse.cc:395][EVENT]195887 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [53] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.460 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [64] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.486 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [17] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.495 [base_optimizer.cc:70][EVENT]195887 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.508 [graph_converter.cc:849][EVENT]195887 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1660] micro second. 
[INFO] GE(191849,python3.7):2024-01-11-05:31:09.364.723 [graph_converter.cc:853][EVENT]195887 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [205] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.365.450 [graph_converter.cc:857][EVENT]195887 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [713] micro second. [INFO] GE(191849,python3.7):2024-01-11-05:31:09.365.586 [graph_converter.cc:862][EVENT]195887 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [110] micro second. TotalTime = 0.0108158, [20] [parse]: 0.00137454 [symbol_resolve]: 0.00031676, [1] [Cycle 1]: 0.00027695, [1] [resolve]: 0.0002605 [combine_like_graphs]: 1.01e-06 [graph_reusing]: 3.53e-06 [meta_unpack_prepare]: 3.152e-05 [pre_cconv]: 8.90002e-07 [abstract_specialize]: 0.00047753 [pack_expand]: 9.14e-06 [auto_monad]: 3.679e-05 [inline]: 1.08e-06 [pre_auto_parallel]: 1.292e-05 [pipeline_split]: 2.82e-06 [optimize]: 0.00355994, [35] [py_interpret_to_execute]: 9.5e-06 [rewriter_before_opt_a]: 1.852e-05 [opt_a]: 0.00297614, [1] [Cycle 1]: 0.00058115, [30] [expand_dump_flag]: 3.34e-06 [switch_simplify]: 8.9e-06 [a_1]: 1.871e-05 [recompute_prepare]: 1.78e-06 [updatestate_depend_eliminate]: 7.91e-06 [updatestate_assign_eliminate]: 3.57e-06 [updatestate_loads_eliminate]: 2.99e-06 [parameter_eliminate]: 2.82e-06 [a_2]: 2.914e-05 [accelerated_algorithm]: 2.59e-06 [pynative_shard]: 1.65001e-06 [auto_parallel]: 4.03e-06 [parallel]: 9.35e-06 [merge_comm]: 3.69e-06 [allreduce_fusion]: 1.86999e-06 [virtual_dataset]: 2.47e-06 [get_grad_eliminate_]: 1.8e-06 [virtual_output]: 1.67e-06 [merge_forward]: 4.7e-06 [cell_reuse_recompute_pass]: 8.40002e-07 [cell_reuse_handle_not_recompute_node_pass]: 8.48e-06 [meta_fg_expand]: 3.23e-06 [after_resolve]: 4.62e-06 [a_after_grad]: 2.38e-06 
[renormalize]: 0.00026046 [real_op_eliminate]: 4.6e-06 [auto_monad_grad]: 3.72e-06 [auto_monad_eliminator]: 9.47e-06 [cse]: 2.591e-05 [a_3]: 1.515e-05 [py_interpret_to_execute_after_opt_a]: 7.55e-06 [slice_cell_reuse_recomputed_activation]: 2.49e-06 [rewriter_after_opt_a]: 0.00011871 [convert_after_rewriter]: 6.2e-06 [order_py_execute_after_rewriter]: 5.17e-06 [opt_b]: 9.081e-05, [1] [Cycle 1]: 8.58e-05, [7] [b_1]: 3.823e-05 [b_2]: 3.25e-06 [updatestate_depend_eliminate]: 2.84e-06 [updatestate_assign_eliminate]: 2.37e-06 [updatestate_loads_eliminate]: 2.12e-06 [renormalize]: 4.39999e-07 [cse]: 1.016e-05 [cconv]: 2.499e-05 [opt_after_cconv]: 4.812e-05, [1] [Cycle 1]: 4.431e-05, [7] [c_1]: 4.96e-06 [parameter_eliminate]: 7.59996e-07 [updatestate_depend_eliminate]: 2.36e-06 [updatestate_assign_eliminate]: 2.09999e-06 [updatestate_loads_eliminate]: 1.82e-06 [cse]: 7.05e-06 [renormalize]: 3.30001e-07 [remove_dup_value]: 9.87999e-06 [tuple_transform]: 3.337e-05, [1] [Cycle 1]: 2.992e-05, [3] [d_1]: 1.327e-05 [d_2]: 5.44e-06 [renormalize]: 1.90004e-07 [add_cache_embedding]: 1.134e-05 [add_recomputation]: 4.064e-05 [cse_after_recomputation]: 1.515e-05, [1] [Cycle 1]: 1.098e-05, [1] [cse]: 6.88e-06 [environ_conv]: 5.1e-06 [label_micro_interleaved_index]: 2.12e-06 [label_fine_grained_interleaved_index]: 2.67e-06 [assign_add_opt]: 1.57e-06 [slice_recompute_activation]: 2.22e-06 [micro_interleaved_order_control]: 1.75001e-06 [full_micro_interleaved_order_control]: 1.86e-06 [comp_comm_scheduling]: 2.07e-06 [reorder_send_recv_between_fp_bp]: 2.32e-06 [comm_op_add_attrs]: 1.12e-06 [add_comm_op_reuse_tag]: 9.19994e-07 [overlap_opt_shard_in_pipeline]: 1.02e-06 [grouped_pairwise_exchange_alltoall]: 1.37e-06 [overlap_recompute_and_grad_model_parallel]: 2.02e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.7e-07 [split_matmul_comm_elemetwise]: 2.44e-06 [split_layernorm_comm]: 1.8e-06 [process_send_recv_for_ge]: 8.2e-07 [handle_group_info]: 1.02e-06 [auto_monad_reorder]: 1.486e-05 
[get_jit_bprop_graph]: 4.89999e-07 [eliminate_special_op_node]: 0.00045025 [validate]: 2.285e-05 [distribtued_split]: 1.08e-06 [task_emit]: 0.00430351 [execute]: 8.42e-06 Sums parse : 0.001375s : 17.49% symbol_resolve.resolve : 0.000260s : 3.31% combine_like_graphs : 0.000001s : 0.01% graph_reusing : 0.000004s : 0.04% meta_unpack_prepare : 0.000032s : 0.40% pre_cconv : 0.000001s : 0.01% abstract_specialize : 0.000478s : 6.08% pack_expand : 0.000009s : 0.12% auto_monad : 0.000037s : 0.47% inline : 0.000001s : 0.01% pre_auto_parallel : 0.000013s : 0.16% pipeline_split : 0.000003s : 0.04% optimize.py_interpret_to_execute : 0.000009s : 0.12% optimize.rewriter_before_opt_a : 0.000019s : 0.24% optimize.opt_a.expand_dump_flag : 0.000003s : 0.04% optimize.opt_a.switch_simplify : 0.000009s : 0.11% optimize.opt_a.a_1 : 0.000019s : 0.24% optimize.opt_a.recompute_prepare : 0.000002s : 0.02% optimize.opt_a.updatestate_depend_eliminate : 0.000008s : 0.10% optimize.opt_a.updatestate_assign_eliminate : 0.000004s : 0.05% optimize.opt_a.updatestate_loads_eliminate : 0.000003s : 0.04% optimize.opt_a.parameter_eliminate : 0.000003s : 0.04% optimize.opt_a.a_2 : 0.000029s : 0.37% optimize.opt_a.accelerated_algorithm : 0.000003s : 0.03% optimize.opt_a.pynative_shard : 0.000002s : 0.02% optimize.opt_a.auto_parallel : 0.000004s : 0.05% optimize.opt_a.parallel : 0.000009s : 0.12% optimize.opt_a.merge_comm : 0.000004s : 0.05% optimize.opt_a.allreduce_fusion : 0.000002s : 0.02% optimize.opt_a.virtual_dataset : 0.000002s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000002s : 0.02% optimize.opt_a.virtual_output : 0.000002s : 0.02% optimize.opt_a.merge_forward : 0.000005s : 0.06% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.01% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000008s : 0.11% optimize.opt_a.meta_fg_expand : 0.000003s : 0.04% optimize.opt_a.after_resolve : 0.000005s : 0.06% optimize.opt_a.a_after_grad : 0.000002s : 0.03% optimize.opt_a.renormalize : 
0.000260s : 3.31%
optimize.opt_a.real_op_eliminate : 0.000005s : 0.06%
optimize.opt_a.auto_monad_grad : 0.000004s : 0.05%
optimize.opt_a.auto_monad_eliminator : 0.000009s : 0.12%
optimize.opt_a.cse : 0.000026s : 0.33%
optimize.opt_a.a_3 : 0.000015s : 0.19%
optimize.py_interpret_to_execute_after_opt_a : 0.000008s : 0.10%
optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.03%
optimize.rewriter_after_opt_a : 0.000119s : 1.51%
optimize.convert_after_rewriter : 0.000006s : 0.08%
optimize.order_py_execute_after_rewriter : 0.000005s : 0.07%
optimize.opt_b.b_1 : 0.000038s : 0.49%
optimize.opt_b.b_2 : 0.000003s : 0.04%
optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.04%
optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.03%
optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.03%
optimize.opt_b.renormalize : 0.000000s : 0.01%
optimize.opt_b.cse : 0.000010s : 0.13%
optimize.cconv : 0.000025s : 0.32%
optimize.opt_after_cconv.c_1 : 0.000005s : 0.06%
optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.01%
optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.03%
optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.03%
optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.02%
optimize.opt_after_cconv.cse : 0.000007s : 0.09%
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00%
optimize.remove_dup_value : 0.000010s : 0.13%
optimize.tuple_transform.d_1 : 0.000013s : 0.17%
optimize.tuple_transform.d_2 : 0.000005s : 0.07%
optimize.tuple_transform.renormalize : 0.000000s : 0.00%
optimize.add_cache_embedding : 0.000011s : 0.14%
optimize.add_recomputation : 0.000041s : 0.52%
optimize.cse_after_recomputation.cse : 0.000007s : 0.09%
optimize.environ_conv : 0.000005s : 0.06%
optimize.label_micro_interleaved_index : 0.000002s : 0.03%
optimize.label_fine_grained_interleaved_index : 0.000003s : 0.03%
optimize.assign_add_opt : 0.000002s : 0.02%
optimize.slice_recompute_activation : 0.000002s : 0.03%
optimize.micro_interleaved_order_control : 0.000002s : 0.02%
optimize.full_micro_interleaved_order_control : 0.000002s : 0.02%
optimize.comp_comm_scheduling : 0.000002s : 0.03%
optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.03%
optimize.comm_op_add_attrs : 0.000001s : 0.01%
optimize.add_comm_op_reuse_tag : 0.000001s : 0.01%
optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.01%
optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.02%
optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.03%
optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.01%
optimize.split_matmul_comm_elemetwise : 0.000002s : 0.03%
optimize.split_layernorm_comm : 0.000002s : 0.02%
optimize.process_send_recv_for_ge : 0.000001s : 0.01%
optimize.handle_group_info : 0.000001s : 0.01%
auto_monad_reorder : 0.000015s : 0.19%
get_jit_bprop_graph : 0.000000s : 0.01%
eliminate_special_op_node : 0.000450s : 5.73%
validate : 0.000023s : 0.29%
distribtued_split : 0.000001s : 0.01%
task_emit : 0.004304s : 54.75%
execute : 0.000008s : 0.11%

Time group info:
------[substitution.] 0.000205 12
    94.16% : 0.000193s : 1: substitution.getattr_setattr_resolve
    2.10% : 0.000004s : 3: substitution.graph_param_transform
    1.97% : 0.000004s : 2: substitution.meta_unpack_prepare
    0.53% : 0.000001s : 3: substitution.partial_unused_args_eliminate
    0.48% : 0.000001s : 2: substitution.remove_not_recompute_node
    0.76% : 0.000002s : 1: substitution.replace_old_param
------[renormalize.] 0.000253 2
    62.13% : 0.000157s : 1: renormalize.infer
    37.87% : 0.000096s : 1: renormalize.specialize
------[replace.] 0.000032 1
    100.00% : 0.000032s : 1: replace.getattr_setattr_resolve
------[match.] 0.000193 1
    100.00% : 0.000193s : 1: match.getattr_setattr_resolve
------[func_graph_cloner_run.] 0.000102 3
    17.35% : 0.000018s : 1: func_graph_cloner_run.FuncGraphClonerGraph
    82.65% : 0.000084s : 2: func_graph_cloner_run.FuncGraphSpecializer
------[meta_graph.] 0.000000 0
------[manager.] 0.000000 0
------[pynative] 0.000000 0
------[others.] 0.000408 69
    10.25% : 0.000042s : 26: opt.transform.opt_a
    7.33% : 0.000030s : 23: opt.transform.opt_b
    62.29% : 0.000254s : 2: opt.transform.opt_resolve
    2.66% : 0.000011s : 1: opt.transforms.meta_unpack_prepare
    10.01% : 0.000041s : 10: opt.transforms.opt_a
    0.89% : 0.000004s : 1: opt.transforms.opt_after_cconv
    0.58% : 0.000002s : 1: opt.transforms.opt_b
    4.23% : 0.000017s : 2: opt.transforms.opt_trans_graph
    1.76% : 0.000007s : 3: opt.transforms.special_op_eliminate
.
============================== 1 passed in 21.13s ==============================
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.813.200 [status:INIT] [ge_api.cc:463]191849 ~Session:Start to destruct session.
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.813.264 [status:RUNNING] [ge_api.cc:475]191849 ~Session:Session id is 0
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.813.275 [status:RUNNING] [ge_api.cc:476]191849 ~Session:Destroying session
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.814.275 [status:STOP] [ge_api.cc:491]191849 ~Session:Session Destructor finished
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.814.305 [status:INIT] [ge_api.cc:301]191849 GEFinalize:GEFinalize start
[INFO] GE(191849,python3.7):2024-01-11-05:31:12.814.372 [execution_runtime.cc:80][EVENT]191849 FinalizeExecutionRuntime:Execution runtime finalize begin.
[INFO] GE(191849,python3.7):2024-01-11-05:31:12.814.390 [execution_runtime.cc:92][EVENT]191849 FinalizeExecutionRuntime:Execution runtime finalized.
[TRACE] GE(191849,python3.7):2024-01-11-05:31:12.814.416 [status:RUNNING] [ge_api.cc:313]191849 GEFinalize:Finalizing environment
[INFO] TUNE(191849,python3.7):2024-01-11-05:31:13.102.726 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:191849]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]"
[INFO] TUNE(191849,python3.7):2024-01-11-05:31:13.102.782 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:191849]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!"
[INFO] GE(191849,python3.7):2024-01-11-05:31:13.104.233 [gelib.cc:324][EVENT]191849 SystemFinalize:Online infer finalize GELib success.
[TRACE] GE(191849,python3.7):2024-01-11-05:31:13.567.462 [status:STOP] [ge_api.cc:341]191849 GEFinalize:GEFinalize finished
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.630.711 [process_mode_manager.cpp:184][Close][tid:191849] [TsdClient] Close [deviceId=3][sessionId=1] hccp and computer enter
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.630.738 [version_verify.cpp:112][SpecialFeatureCheck][tid:191849] VersionVerify: previous type[7], supported
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.630.767 [process_mode_manager.cpp:192][Close][tid:191849] [TsdClient][deviceId=3] [sessionId=1] wait hccp and computer process close respond
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.662.339 [process_mode_manager.cpp:197][Close][tid:191849] [TsdClient][logicDeviceId_=3]has recv close hccp and computer process respond
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.662.353 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:191849] enter into CloseInHost deviceid[3]
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.662.362 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:191849] host cpu not support
[INFO] TDT(191849,python3.7):2024-01-11-05:31:13.662.393 [process_mode_manager.cpp:208][Close][tid:191849] [TsdClient][deviceId=3] [sessionId=1] close hccp and computer process success
[INFO] ATRACE(191849,python3.7):2024-01-11-05:31:13.662.404 [atrace_api.c:93](tid:191849) AtraceDestroy start
[INFO] ATRACE(191849,python3.7):2024-01-11-05:31:13.662.418 [atrace_api.c:95](tid:191849) AtraceDestroy end
[INFO] PROFILING(191849,python3.7):2024-01-11-05:31:13.662.436 [msprofiler_impl.cpp:156] >>> (tid:191849) ProfNotifySetDevice called, is open: 0, devId: 3
[INFO] RUNTIME(191849,python3.7):2024-01-11-05:31:15.240.611 [runtime.cc:1737] 191849 ~Runtime: deconstruct runtime.