============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_003/sault/config/pytest.ini plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3 [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:48.624.037 [trace_attr.c:105](tid:14258) platform is 1. [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:48.624.230 [trace_recorder.c:114](tid:14258) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:48.624.259 [trace_signal.c:133](tid:14258) register signal handler for signo 2 succeed. [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:48.624.270 [trace_signal.c:133](tid:14258) register signal handler for signo 15 succeed. [INFO] RUNTIME(14258,python3.7):2024-01-11-05:42:49.037.454 [runtime.cc:1159] 14258 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(14258,python3.7):2024-01-11-05:42:49.037.534 [runtime.cc:4719] 14258 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_argmax.py [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.412.775 [process_mode_manager.cpp:109][OpenProcess][tid:14258] [ProcessModeManager] enter into open process deviceId[2] rankSize[0] [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.414.956 [process_mode_manager.cpp:379][InitTsdClient][tid:14258] [TsdClient] deviceId[2] begin to init hdc client [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.096 [version_verify.cpp:34][SetVersionInfo][tid:14258] VersionVerify: send client version to server [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.124 [version_verify.cpp:50][SetVersionInfo][tid:14258] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.136 [version_verify.cpp:50][SetVersionInfo][tid:14258] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.343 [version_verify.cpp:66][PeerVersionCheck][tid:14258] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.359 [version_verify.cpp:87][ParseVersionInfo][tid:14258] VersionVerify: pass client version info success [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.368 [hdc_client.cpp:276][CheckHdcConnection][tid:14258] Service[2] create hdc success [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.384 [version_verify.cpp:120][SpecialFeatureCheck][tid:14258] VersionVerify: new type[35], supported [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.423 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:14258] [TsdClient][deviceId=2] [sessionId=1] wait package info respond [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.522 [process_mode_manager.cpp:379][InitTsdClient][tid:14258] [TsdClient] deviceId[2] begin to init hdc client [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.623 [version_verify.cpp:34][SetVersionInfo][tid:14258] VersionVerify: send client version to server [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.635 [version_verify.cpp:50][SetVersionInfo][tid:14258] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.645 [version_verify.cpp:50][SetVersionInfo][tid:14258] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.874 [version_verify.cpp:66][PeerVersionCheck][tid:14258] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.886 [version_verify.cpp:87][ParseVersionInfo][tid:14258] VersionVerify: pass client version info success [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.894 [hdc_client.cpp:276][CheckHdcConnection][tid:14258] Service[2] create hdc success [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.906 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:14258] [TsdClient] tsd get process sign successfully, procpid[14258] signSize[48] [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.936 [version_verify.cpp:112][SpecialFeatureCheck][tid:14258] VersionVerify: previous type[6], supported [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.415.953 [process_mode_manager.cpp:126][OpenProcess][tid:14258] [ProcessModeManager] deviceId[2] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.650.330 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:14258] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.650.393 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:14258] enter into OpenInHost deviceid[2] [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.650.404 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:14258] host cpu not support [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.650.412 [process_mode_manager.cpp:156][OpenProcess][tid:14258] [TsdClient][deviceId=2] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(14258,python3.7):2024-01-11-05:42:53.653.095 [device.cc:340] 14258 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(14258,python3.7):2024-01-11-05:42:53.668.551 [npu_driver.cc:5428] 15126 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:53.668.601 [atrace_api.c:28](tid:14258) AtraceCreate start [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:53.668.773 [trace_rb_log.c:84](tid:14258) [RUNTIME_ATRACE_DEV2_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(14258,python3.7):2024-01-11-05:42:53.668.790 [atrace_api.c:32](tid:14258) AtraceCreate end [INFO] TDT(14258,python3.7):2024-01-11-05:42:53.668.806 [client_manager.cpp:157][SetProfilingCallback][tid:14258] [TsdClient] set profiling callback success [TRACE] GE(14258,python3.7):2024-01-11-05:42:53.820.316 [status:INIT] [ge_api.cc:144]14258 GEInitializeImpl:GEInitialize start [INFO] PROFILING(14258,python3.7):2024-01-11-05:42:54.041.598 [msprofiler_impl.cpp:156] >>> (tid:14258) ProfNotifySetDevice called, is open: 1, devId: 2 [INFO] PROFILING(14258,python3.7):2024-01-11-05:42:54.041.752 [platform.cpp:38] >>> (tid:14258) Profiling platform version: 1.0. [INFO] PROFILING(14258,python3.7):2024-01-11-05:42:54.041.768 [ai_drv_dev_api.cpp:384] >>> (tid:14258) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(14258,python3.7):2024-01-11-05:42:54.092.520 [status:RUNNING] [ge_api.cc:211]14258 GEInitializeImpl:Initializing environment [INFO] GE(14258,python3.7):2024-01-11-05:42:54.092.603 [gelib.cc:98][EVENT]14258 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(14258,python3.7):2024-01-11-05:42:54.092.884 [gelib.cc:307][EVENT]14258 SystemInitialize:Online infer init GELib success, device id :2 [INFO] DVPP(14258,python3.7):2024-01-11-05:42:54.469.594 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:14258]dvpp engine do not support [INFO] TUNE(14258,python3.7):2024-01-11-05:42:54.473.051 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:14258]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(14258,python3.7):2024-01-11-05:42:54.473.088 [handle_manager.cpp:115][CANNKB][Tid:14258]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(14258,python3.7):2024-01-11-05:42:54.473.151 [handle_manager.cpp:407][CANNKB][Tid:14258]"Init functions of loading dynamic python lib end!" [INFO] TUNE(14258,python3.7):2024-01-11-05:42:54.473.162 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:14258]"CANN_KB_Py has already been initialized." [INFO] TUNE(14258,python3.7):2024-01-11-05:42:54.473.230 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:14258]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(14258,python3.7):2024-01-11-05:43:06.712.319 [plugin_manager.cc:42][14258]hcom running normal mode. [INFO] DVPP(14258,python3.7):2024-01-11-05:43:06.712.977 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:14258]dvpp ops kernel info store do not support [INFO] DVPP(14258,python3.7):2024-01-11-05:43:06.713.141 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:14258]dvpp graph optimizer do not support [INFO] DVPP(14258,python3.7):2024-01-11-05:43:07.247.341 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:14258]dvpp ops kernel builder do not support [INFO] GE(14258,python3.7):2024-01-11-05:43:07.255.947 [gelib.cc:169][EVENT]14258 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [13163287] micro second. [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.342.595 [status:STOP] [ge_api.cc:255]14258 GEInitializeImpl:GEInitialize finished [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.342.731 [status:INIT] [ge_api.cc:398]14258 Session:Start to construct session. [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.342.748 [status:RUNNING] [ge_api.cc:408]14258 Session:Creating session [INFO] GE(14258,python3.7):2024-01-11-05:43:07.343.166 [graph_var_manager.cc:1445][EVENT]14258 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(14258,python3.7):2024-01-11-05:43:07.343.184 [graph_var_manager.cc:1424][EVENT]14258 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(14258,python3.7):2024-01-11-05:43:07.343.501 [msprofiler_impl.cpp:156] >>> (tid:14258) ProfNotifySetDevice called, is open: 1, devId: 2 [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.344.346 [status:RUNNING] [ge_api.cc:411]14258 Session:Session id is 0 [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.344.370 [status:STOP] [ge_api.cc:420]14258 Session:Session Constructor finished [INFO] PROFILING(14258,python3.7):2024-01-11-05:43:07.354.153 [platform.cpp:38] >>> (tid:14258) Profiling platform version: 1.0. [INFO] PROFILING(14258,python3.7):2024-01-11-05:43:07.354.188 [ai_drv_dev_api.cpp:384] >>> (tid:14258) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(14258,python3.7):2024-01-11-05:43:07.354.362 [status:INIT] [ge_api.cc:144]14258 GEInitializeImpl:GEInitialize start TotalTime = 0.382257, [20] [parse]: 0.22996 [symbol_resolve]: 0.0281092, [1] [Cycle 1]: 0.0280299, [1] [resolve]: 0.0280015 [combine_like_graphs]: 1.11e-06 [graph_reusing]: 3.59e-06 [meta_unpack_prepare]: 0.00012956 [pre_cconv]: 4.86e-06 [abstract_specialize]: 0.00777834 [pack_expand]: 1.272e-05 [auto_monad]: 0.0001037 [inline]: 1.83e-06 [pre_auto_parallel]: 1.377e-05 [pipeline_split]: 2.99e-06 [optimize]: 0.108987, [35] [py_interpret_to_execute]: 3.56e-06 [rewriter_before_opt_a]: 0.00011856 [opt_a]: 0.107939, [3] [Cycle 1]: 0.099022, [30] [expand_dump_flag]: 3.64e-06 [switch_simplify]: 1.945e-05 [a_1]: 0.00026585 [recompute_prepare]: 7.58001e-06 [updatestate_depend_eliminate]: 9.1e-06 [updatestate_assign_eliminate]: 5.57e-06 [updatestate_loads_eliminate]: 4.96e-06 [parameter_eliminate]: 4.51e-06 [a_2]: 6.757e-05 [accelerated_algorithm]: 6.64e-06 [pynative_shard]: 1.59e-06 [auto_parallel]: 3.36e-06 [parallel]: 1.437e-05 [merge_comm]: 4.523e-05 [allreduce_fusion]: 2.02e-06 [virtual_dataset]: 4.75e-06 [get_grad_eliminate_]: 3.5e-06 [virtual_output]: 3.1e-06 [merge_forward]: 8.02e-06 [cell_reuse_recompute_pass]: 9.59997e-07 [cell_reuse_handle_not_recompute_node_pass]: 9.35e-06 [meta_fg_expand]: 0.0373655, [1] [Cycle 1]: 0.00565883, [1] [resolve]: 0.00563732 [after_resolve]: 3.75e-05 [a_after_grad]: 8.975e-05 [renormalize]: 0.06031 [real_op_eliminate]: 3.191e-05 [auto_monad_grad]: 4.289e-05 [auto_monad_eliminator]: 5.184e-05 [cse]: 0.00021095 [a_3]: 0.00019118 [Cycle 2]: 0.00226907, [30] [expand_dump_flag]: 3.66e-06 [switch_simplify]: 7.901e-05 [a_1]: 0.00045499 [recompute_prepare]: 1.138e-05 [updatestate_depend_eliminate]: 1.052e-05 [updatestate_assign_eliminate]: 7.06e-06 [updatestate_loads_eliminate]: 7.3e-06 [parameter_eliminate]: 3.99e-06 [a_2]: 9.853e-05 [accelerated_algorithm]: 1.385e-05 [pynative_shard]: 1.62e-06 [auto_parallel]: 1.023e-05 [parallel]: 6.26e-06 [merge_comm]: 3.99e-06 [allreduce_fusion]: 2.22e-06 [virtual_dataset]: 6.09e-06 [get_grad_eliminate_]: 4.85e-06 [virtual_output]: 4.51e-06 [merge_forward]: 8.55e-06 [cell_reuse_recompute_pass]: 8.2e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.291e-05 [meta_fg_expand]: 2.32e-05 [after_resolve]: 8.74e-06 [a_after_grad]: 1.044e-05 [renormalize]: 0.00118248 [real_op_eliminate]: 1.078e-05 [auto_monad_grad]: 4.73001e-06 [auto_monad_eliminator]: 1.705e-05 [cse]: 5.472e-05 [a_3]: 4.687e-05 [Cycle 3]: 0.00050422, [30] [expand_dump_flag]: 1.3e-06 [switch_simplify]: 5.66e-06 [a_1]: 8.655e-05 [recompute_prepare]: 6.11e-06 [updatestate_depend_eliminate]: 8.87e-06 [updatestate_assign_eliminate]: 6.59e-06 [updatestate_loads_eliminate]: 5.97e-06 [parameter_eliminate]: 1.73e-06 [a_2]: 9.097e-05 [accelerated_algorithm]: 9.13e-06 [pynative_shard]: 1.26001e-06 [auto_parallel]: 3.48e-06 [parallel]: 3.66e-06 [merge_comm]: 2.69e-06 [allreduce_fusion]: 1.70001e-06 [virtual_dataset]: 5.30999e-06 [get_grad_eliminate_]: 4.91e-06 [virtual_output]: 4.56e-06 [merge_forward]: 7.71e-06 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.214e-05 [meta_fg_expand]: 5.76e-06 [after_resolve]: 7.16e-06 [a_after_grad]: 8.48e-06 [renormalize]: 7.0002e-08 [real_op_eliminate]: 4.62e-06 [auto_monad_grad]: 1.92e-06 [auto_monad_eliminator]: 1.391e-05 [cse]: 3.297e-05 [a_3]: 3.839e-05 [py_interpret_to_execute_after_opt_a]: 4.65e-06 [slice_cell_reuse_recomputed_activation]: 2.72e-06 [rewriter_after_opt_a]: 4.873e-05 [convert_after_rewriter]: 1.213e-05 [order_py_execute_after_rewriter]: 8.96e-06 [opt_b]: 0.00044956, [2] [Cycle 1]: 0.00035583, [7] [b_1]: 0.00029258 [b_2]: 6.84e-06 [updatestate_depend_eliminate]: 3.61e-06 [updatestate_assign_eliminate]: 2.71e-06 [updatestate_loads_eliminate]: 2.31e-06 [renormalize]: 3.6e-07 [cse]: 1.362e-05 [Cycle 2]: 8.372e-05, [7] [b_1]: 4e-05 [b_2]: 2.50999e-06 [updatestate_depend_eliminate]: 2.41e-06 [updatestate_assign_eliminate]: 2.09e-06 [updatestate_loads_eliminate]: 1.88e-06 [renormalize]: 7.0002e-08 [cse]: 8.84999e-06 [cconv]: 2.115e-05 [opt_after_cconv]: 5.631e-05, [1] [Cycle 1]: 5.172e-05, [7] [c_1]: 5e-06 [parameter_eliminate]: 2.11e-06 [updatestate_depend_eliminate]: 2.52001e-06 [updatestate_assign_eliminate]: 1.97e-06 [updatestate_loads_eliminate]: 2.15e-06 [cse]: 9.36e-06 [renormalize]: 3.19997e-07 [remove_dup_value]: 1.147e-05 [tuple_transform]: 3.827e-05, [1] [Cycle 1]: 3.425e-05, [3] [d_1]: 1.461e-05 [d_2]: 6.29e-06 [renormalize]: 2.20003e-07 [add_cache_embedding]: 1.157e-05 [add_recomputation]: 4.976e-05 [cse_after_recomputation]: 1.997e-05, [1] [Cycle 1]: 1.5e-05, [1] [cse]: 1.01e-05 [environ_conv]: 2.023e-05 [label_micro_interleaved_index]: 2.52e-06 [label_fine_grained_interleaved_index]: 2.07e-06 [assign_add_opt]: 1.77e-06 [slice_recompute_activation]: 2.2e-06 [micro_interleaved_order_control]: 1.73e-06 [full_micro_interleaved_order_control]: 2e-06 [comp_comm_scheduling]: 2.35e-06 [reorder_send_recv_between_fp_bp]: 1.97e-06 [comm_op_add_attrs]: 1.28e-06 [add_comm_op_reuse_tag]: 9e-07 [overlap_opt_shard_in_pipeline]: 1.04e-06 [grouped_pairwise_exchange_alltoall]: 1.09e-06 [overlap_recompute_and_grad_model_parallel]: 1.76e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.39994e-07 [split_matmul_comm_elemetwise]: 2.71e-06 [split_layernorm_comm]: 1.96e-06 [process_send_recv_for_ge]: 2.56e-06 [handle_group_info]: 9.70002e-07 [auto_monad_reorder]: 2.422e-05 [get_jit_bprop_graph]: 4.50003e-07 [eliminate_special_op_node]: 0.00054825 [validate]: 5.141e-05 [distribtued_split]: 1.35e-06 [task_emit]: 0.00625803 [execute]: 7.95e-06 Sums parse : 0.229960s : 66.99% symbol_resolve.resolve : 0.028001s : 8.16% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.00% meta_unpack_prepare : 0.000130s : 0.04% pre_cconv : 0.000005s : 0.00% abstract_specialize : 0.007778s : 2.27% pack_expand : 0.000013s : 0.00% auto_monad : 0.000104s : 0.03% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000014s : 0.00% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000119s : 0.03% optimize.opt_a.expand_dump_flag : 0.000009s : 0.00% optimize.opt_a.switch_simplify : 0.000104s : 0.03% optimize.opt_a.a_1 : 0.000807s : 0.24% optimize.opt_a.recompute_prepare : 0.000025s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000028s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000019s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000018s : 0.01% optimize.opt_a.parameter_eliminate : 0.000010s : 0.00% optimize.opt_a.a_2 : 0.000257s : 0.07% optimize.opt_a.accelerated_algorithm : 0.000030s : 0.01% optimize.opt_a.pynative_shard : 0.000004s : 0.00% optimize.opt_a.auto_parallel : 0.000017s : 0.00% optimize.opt_a.parallel : 0.000024s : 0.01% optimize.opt_a.merge_comm : 0.000052s : 0.02% optimize.opt_a.allreduce_fusion : 0.000006s : 0.00% optimize.opt_a.virtual_dataset : 0.000016s : 0.00% optimize.opt_a.get_grad_eliminate_ : 0.000013s : 0.00% optimize.opt_a.virtual_output : 0.000012s : 0.00% optimize.opt_a.merge_forward : 0.000024s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000034s : 0.01% optimize.opt_a.meta_fg_expand : 0.000029s : 0.01% optimize.opt_a.meta_fg_expand.resolve : 0.005637s : 1.64% optimize.opt_a.after_resolve : 0.000053s : 0.02% optimize.opt_a.a_after_grad : 0.000109s : 0.03% optimize.opt_a.renormalize : 0.061493s : 17.91% optimize.opt_a.real_op_eliminate : 0.000047s : 0.01% optimize.opt_a.auto_monad_grad : 0.000050s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000083s : 0.02% optimize.opt_a.cse : 0.000299s : 0.09% optimize.opt_a.a_3 : 0.000276s : 0.08% optimize.py_interpret_to_execute_after_opt_a : 0.000005s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000049s : 0.01% optimize.convert_after_rewriter : 0.000012s : 0.00% optimize.order_py_execute_after_rewriter : 0.000009s : 0.00% optimize.opt_b.b_1 : 0.000333s : 0.10% optimize.opt_b.b_2 : 0.000009s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000022s : 0.01% optimize.cconv : 0.000021s : 0.01% optimize.opt_after_cconv.c_1 : 0.000005s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000009s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.00% optimize.tuple_transform.d_1 : 0.000015s : 0.00% optimize.tuple_transform.d_2 : 0.000006s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.00% optimize.add_recomputation : 0.000050s : 0.01% optimize.cse_after_recomputation.cse : 0.000010s : 0.00% optimize.environ_conv : 0.000020s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000003s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000024s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000548s : 0.16% validate : 0.000051s : 0.01% distribtued_split : 0.000001s : 0.00% task_emit : 0.006258s : 1.82% execute : 0.000008s : 0.00% Time group info: ------[substitution.] 0.033523 201 0.00% : 0.000002s : 2: substitution.float_depend_g_call 0.02% : 0.000006s : 6: substitution.float_tuple_getitem_switch 98.20% : 0.032921s : 25: substitution.getattr_setattr_resolve 0.01% : 0.000005s : 3: substitution.graph_param_transform 0.00% : 0.000002s : 1: substitution.incorporate_call 0.03% : 0.000010s : 1: substitution.incorporate_call_switch 1.13% : 0.000379s : 32: substitution.inline 0.01% : 0.000004s : 4: substitution.less_batch_normalization 0.12% : 0.000041s : 18: substitution.meta_unpack_prepare 0.01% : 0.000005s : 4: substitution.minmaximum_grad 0.01% : 0.000002s : 2: substitution.partial_eliminate 0.00% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.01% : 0.000004s : 21: substitution.remove_not_recompute_node 0.10% : 0.000034s : 18: substitution.replace_applicator 0.02% : 0.000006s : 14: substitution.replace_old_param 0.01% : 0.000003s : 1: substitution.reset_defer_inline 0.01% : 0.000004s : 3: substitution.set_cell_output_no_recompute 0.01% : 0.000004s : 2: substitution.specialize_transform 0.02% : 0.000006s : 3: substitution.switch_simplify 0.02% : 0.000007s : 1: substitution.transpose_eliminate 0.05% : 0.000017s : 6: substitution.tuple_list_convert_item_index_to_positive 0.02% : 0.000007s : 6: substitution.tuple_list_get_item_const_eliminator 0.03% : 0.000010s : 6: substitution.tuple_list_get_item_depend_reorder 0.11% : 0.000037s : 13: substitution.tuple_list_get_item_eliminator 0.03% : 0.000010s : 6: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.061480 4 96.62% : 0.059401s : 2: renormalize.infer 3.38% : 0.002080s : 2: renormalize.specialize ------[replace.] 0.000568 48 62.39% : 0.000354s : 23: replace.getattr_setattr_resolve 19.74% : 0.000112s : 17: replace.inline 5.98% : 0.000034s : 1: replace.meta_unpack_prepare 6.87% : 0.000039s : 3: replace.switch_simplify 0.85% : 0.000005s : 1: replace.transpose_eliminate 4.17% : 0.000024s : 3: replace.tuple_list_get_item_eliminator ------[match.] 0.033140 48 98.89% : 0.032771s : 23: match.getattr_setattr_resolve 0.95% : 0.000315s : 17: match.inline 0.09% : 0.000029s : 1: match.meta_unpack_prepare 0.02% : 0.000006s : 3: match.switch_simplify 0.02% : 0.000007s : 1: match.transpose_eliminate 0.04% : 0.000013s : 3: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.002623 43 68.52% : 0.001797s : 21: func_graph_cloner_run.FuncGraphClonerGraph 31.48% : 0.000826s : 22: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.035692 213 1.49% : 0.000532s : 78: opt.transform.opt_a 0.84% : 0.000300s : 92: opt.transform.opt_b 93.95% : 0.033533s : 4: opt.transform.opt_resolve 0.29% : 0.000105s : 1: opt.transforms.meta_unpack_prepare 3.32% : 0.001184s : 30: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000005s : 2: opt.transforms.opt_b 0.05% : 0.000019s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000011s : 3: opt.transforms.special_op_eliminate [INFO] GE(14258,python3.7):2024-01-11-05:43:07.853.558 [scalable_config.cc:55][EVENT]18796 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(14258,python3.7):2024-01-11-05:43:07.938.770 [graph_var_manager.cc:1424][EVENT]18796 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(14258,python3.7):2024-01-11-05:43:07.938.880 [graph_manager.cc:1248][EVENT]18796 PreRun:PreRun start: graph node size 4, session id 1, graph id 0, graph name online. [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:07.939.845 [atrace_api.c:28](tid:18796) AtraceCreate start [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:07.939.923 [trace_rb_log.c:84](tid:18796) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:07.939.937 [atrace_api.c:32](tid:18796) AtraceCreate end [INFO] TDT(14258,python3.7):2024-01-11-05:43:07.939.966 [client_manager.cpp:157][SetProfilingCallback][tid:18796] [TsdClient] set profiling callback success [INFO] GE(14258,python3.7):2024-01-11-05:43:07.941.008 [parallel_partitioner.cc:165][EVENT]18796 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [27] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.941.055 [parallel_partitioner.cc:178][EVENT]18796 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [17] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.941.119 [graph_prepare.cc:1378][EVENT]18796 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.941.841 [graph_manager.cc:1050][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [750] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.941.871 [graph_manager.cc:1052][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.044 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.077 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.147 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [58] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.160 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.257 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [21] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.272 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.295 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [12] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.415 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.942.436 [graph_manager.cc:1054][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [553] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.950.097 [graph_manager.cc:1055][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7647] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.326 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of AssertPass is [4] micro second, call num is [8] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.354 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.366 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of MergePass is [5] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.376 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of InferShapePass is [367] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.385 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [16] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.393 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [4] micro second, call num is [8] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.402 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [20] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.421 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [25] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.951.430 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of InferValuePass is [5] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.952.942 [graph_manager.cc:1056][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [2807] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.010 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of CondRemovePass is [4] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.029 [graph_prepare.cc:1982][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [53] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.465 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [8] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.489 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.501 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.510 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of InferShapePass is [242] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.519 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [10] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.527 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [1] micro second, call num is [8] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.536 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.544 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.552 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.577 [graph_prepare.cc:1983][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [534] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.602 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.615 [graph_prepare.cc:1984][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [23] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.630 [graph_prepare.cc:1985][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.651 [graph_prepare.cc:1986][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [8] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.663 [graph_prepare.cc:1987][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.678 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.691 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.716 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.815 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.828 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of CondPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.837 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrintOpPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.846 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.854 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.863 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.871 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.879 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.887 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of StopGradientPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.896 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.904 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.912 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SnapshotPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.920 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.928 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.936 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.944 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.968 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [11] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.953.982 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.954.014 [graph_prepare.cc:1988][EVENT]18796 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [341] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.954.027 [graph_manager.cc:1065][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [1051] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.967.291 [graph_manager.cc:1077][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13245] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.967.360 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.967.421 [graph_manager.cc:1080][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [94] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.378 [graph_manager.cc:1081][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [2938] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.419 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.435 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.448 [graph_manager.cc:1082][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [38] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.480 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.496 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.510 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.546 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [26] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.561 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.577 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.592 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.632 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [30] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.658 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [15] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.693 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [23] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.743 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [39] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.764 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [8] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.778 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.788 [graph_manager.cc:2700][EVENT]18796 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [314] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.921 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.936 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.954 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [4] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.963 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.972 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.980 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of CastRemovePass is [12] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.988 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.970.997 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [4] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.005 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [4] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.013 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [1] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.021 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.029 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [4] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.038 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [10] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.046 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.054 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.064 [graph_manager.cc:2741][EVENT]18796 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [256] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.072 [graph_manager.cc:2752][EVENT]18796 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.095 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.108 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.126 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [8] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.142 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [6] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.155 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.169 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.194 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [16] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.210 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.234 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.245 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.260 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.272 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.291 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [10] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.304 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.314 [graph_manager.cc:2810][EVENT]18796 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [224] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.345 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.358 [graph_manager.cc:2821][EVENT]18796 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [35] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.385 [graph_manager.cc:1087][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [918] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.527 [graph_manager.cc:1088][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [129] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.569 [graph_manager.cc:1089][EVENT]18796 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [23] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.587 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.603 [graph_manager.cc:1097][EVENT]18796 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.623 [graph_manager.cc:3325][EVENT]18796 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.892 [engine_place.cc:144][EVENT]18796 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [13] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.911 [engine_place.cc:144][EVENT]18796 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [11] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.971.920 [engine_place.cc:144][EVENT]18796 Run:The time cost of aicpu_ascend_kernel::CheckSupported is [114] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.006 [graph_manager.cc:3351][EVENT]18796 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [370] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.024 [graph_manager.cc:3364][EVENT]18796 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.099 [engine_partitioner.cc:1139][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [23] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.182 [engine_partitioner.cc:1142][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.360 [engine_partitioner.cc:1148][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [159] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.406 [engine_partitioner.cc:1155][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [32] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.458 [engine_partitioner.cc:1164][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [39] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.499 [graph_manager.cc:3405][EVENT]18796 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [461] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.972.517 [graph_manager.cc:3412][EVENT]18796 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.375 [graph_manager.cc:3422][EVENT]18796 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [26841] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.467 [graph_manager.cc:3428][EVENT]18796 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [17] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.688 [graph_manager.cc:3467][EVENT]18796 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [196] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.708 [graph_manager.cc:3377][EVENT]18796 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [27671] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.726 [graph_manager.cc:1106][EVENT]18796 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [28109] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.739 [graph_manager.cc:1115][EVENT]18796 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.784 [graph_manager.cc:1130][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [10] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.825 [graph_manager.cc:1131][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [27] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.861 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [13] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.878 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:07.999.888 [graph_manager.cc:2837][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [46] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.020 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [29] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.036 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.045 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.054 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of BitcastPass is [2] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.083 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [10] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.092 [base_pass.cc:339][EVENT]18796 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [7] micro second, call num is [4] [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.153 [graph_manager.cc:2864][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [246] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.182 [graph_manager.cc:2872][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [10] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.206 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.222 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.239 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [7] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.257 [compile_nodes_pass.cc:88][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.268 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [17] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.279 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.386 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [98] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.423 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [24] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.437 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.451 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.467 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [5] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.478 [graph_manager.cc:2927][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [277] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.498 [graph_manager.cc:2937][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [11] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.517 [graph_manager.cc:2943][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [8] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.000.530 [graph_manager.cc:2950][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.419 [graph_manager.cc:2958][EVENT]18796 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [50] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.474 [graph_manager.cc:1132][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11634] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.617 [graph_manager.cc:1135][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [115] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.670 [graph_manager.cc:2975][EVENT]18796 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [33] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.712 [graph_manager.cc:2981][EVENT]18796 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [29] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.729 [pass_manager.cc:82][EVENT]18796 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.740 [graph_manager.cc:2986][EVENT]18796 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.750 [graph_manager.cc:1136][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [114] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.011.919 [graph_manager.cc:3555][EVENT]18796 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [130] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.040 [engine_partitioner.cc:1139][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [22] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.059 [engine_partitioner.cc:1142][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [6] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.259 [engine_partitioner.cc:1148][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [190] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.305 [engine_partitioner.cc:1155][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [30] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.355 [engine_partitioner.cc:1164][EVENT]18796 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [37] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.012.382 [graph_builder.cc:865][EVENT]18796 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [386] micro second. [INFO] RUNTIME(14258,python3.7):2024-01-11-05:43:08.013.051 [logger.cc:1071] 18796 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.013.093 [task_generator.cc:804][EVENT]18796 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [210] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.013.171 [task_generator.cc:805][EVENT]18796 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [64] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.013.783 [task_generator.cc:814][EVENT]18796 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [596] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.013.799 [task_generator.cc:954][EVENT]18796 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [916] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.013.866 [task_generator.cc:967][EVENT]18796 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [37] micro second. [INFO] RUNTIME(14258,python3.7):2024-01-11-05:43:08.013.884 [logger.cc:1084] 18796 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(14258,python3.7):2024-01-11-05:43:08.052.193 [graph_manager.cc:1152][EVENT]18796 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [40412] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.052.296 [graph_manager.cc:1164][EVENT]18796 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.052.358 [graph_manager.cc:1271][EVENT]18796 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [111480] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.052.370 [graph_manager.cc:1272][EVENT]18796 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:08.052.728 [atrace_api.c:93](tid:18796) AtraceDestroy start [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:08.052.757 [atrace_api.c:95](tid:18796) AtraceDestroy end [INFO] GE(14258,python3.7):2024-01-11-05:43:08.154.923 [graph_converter.cc:838][EVENT]18796 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [2085] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.155.142 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of ZeroCopy is [133] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.155.723 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CEM is [555] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.155.825 [copy_flow_launch_fuse.cc:395][EVENT]18796 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [77] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.155.842 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [96] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.121 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [267] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.260 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [118] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.305 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of ZeroCopy is [26] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.496 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CEM is [176] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.592 [copy_flow_launch_fuse.cc:395][EVENT]18796 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [77] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.608 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [93] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.640 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [23] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.672 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [21] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.704 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of ZeroCopy is [21] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.790 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CEM is [74] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.865 [copy_flow_launch_fuse.cc:395][EVENT]18796 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [62] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.876 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [74] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.907 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [21] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.938 [base_optimizer.cc:70][EVENT]18796 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [19] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.156.964 [graph_converter.cc:849][EVENT]18796 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1961] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.157.259 [graph_converter.cc:853][EVENT]18796 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [264] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.158.104 [graph_converter.cc:857][EVENT]18796 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [828] micro second. [INFO] GE(14258,python3.7):2024-01-11-05:43:08.158.281 [graph_converter.cc:862][EVENT]18796 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [150] micro second. . TotalTime = 0.0670196, [20] [parse]: 0.00148197 [symbol_resolve]: 0.0112135, [1] [Cycle 1]: 0.0111451, [1] [resolve]: 0.0111242 [combine_like_graphs]: 8.29998e-07 [graph_reusing]: 3.8e-06 [meta_unpack_prepare]: 0.00013566 [pre_cconv]: 6.00005e-07 [abstract_specialize]: 0.00497737 [pack_expand]: 1.312e-05 [auto_monad]: 6.939e-05 [inline]: 1.58e-06 [pre_auto_parallel]: 9.94e-06 [pipeline_split]: 2.98e-06 [optimize]: 0.0456722, [35] [py_interpret_to_execute]: 4.65e-06 [rewriter_before_opt_a]: 0.00011463 [opt_a]: 0.0446774, [3] [Cycle 1]: 0.0390407, [30] [expand_dump_flag]: 3.7e-06 [switch_simplify]: 1.966e-05 [a_1]: 0.00049973 [recompute_prepare]: 7.23e-06 [updatestate_depend_eliminate]: 8.56e-06 [updatestate_assign_eliminate]: 5.84999e-06 [updatestate_loads_eliminate]: 5.54e-06 [parameter_eliminate]: 4.73e-06 [a_2]: 5.628e-05 [accelerated_algorithm]: 4.47e-06 [pynative_shard]: 1.71e-06 [auto_parallel]: 3.17e-06 [parallel]: 9.94e-06 [merge_comm]: 4.06e-06 [allreduce_fusion]: 2.11e-06 [virtual_dataset]: 4.03e-06 [get_grad_eliminate_]: 3.51e-06 [virtual_output]: 3.26e-06 [merge_forward]: 6.75e-06 [cell_reuse_recompute_pass]: 8.10003e-07 [cell_reuse_handle_not_recompute_node_pass]: 9.17e-06 [meta_fg_expand]: 0.00687174, [1] [Cycle 1]: 0.00331179, [1] [resolve]: 0.00329198 [after_resolve]: 3.746e-05 [a_after_grad]: 0.00010018 [renormalize]: 0.0307057 [real_op_eliminate]: 2.767e-05 [auto_monad_grad]: 4.162e-05 [auto_monad_eliminator]: 5.263e-05 [cse]: 0.00017884 [a_3]: 0.00017931 [Cycle 2]: 0.00256526, [30] [expand_dump_flag]: 2.78e-06 [switch_simplify]: 8.896e-05 [a_1]: 0.00082088 [recompute_prepare]: 8.27e-06 [updatestate_depend_eliminate]: 9.58e-06 [updatestate_assign_eliminate]: 6.5e-06 [updatestate_loads_eliminate]: 6.48e-06 [parameter_eliminate]: 3.14e-06 [a_2]: 9.516e-05 [accelerated_algorithm]: 1.053e-05 [pynative_shard]: 1.27e-06 [auto_parallel]: 4.78e-06 [parallel]: 4.29e-06 [merge_comm]: 3.25e-06 [allreduce_fusion]: 1.85e-06 [virtual_dataset]: 5.58e-06 [get_grad_eliminate_]: 5.29e-06 [virtual_output]: 4.74e-06 [merge_forward]: 8.59e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.176e-05 [meta_fg_expand]: 2.077e-05 [after_resolve]: 8.45001e-06 [a_after_grad]: 1.156e-05 [renormalize]: 0.00106752 [real_op_eliminate]: 1.044e-05 [auto_monad_grad]: 4.94e-06 [auto_monad_eliminator]: 1.633e-05 [cse]: 5.415e-05 [a_3]: 4.485e-05 [Cycle 3]: 0.00065504, [30] [expand_dump_flag]: 1.11e-06 [switch_simplify]: 5.09e-06 [a_1]: 0.00023743 [recompute_prepare]: 6.44e-06 [updatestate_depend_eliminate]: 8.62e-06 [updatestate_assign_eliminate]: 6.82e-06 [updatestate_loads_eliminate]: 6.39e-06 [parameter_eliminate]: 1.62e-06 [a_2]: 8.937e-05 [accelerated_algorithm]: 8.88e-06 [pynative_shard]: 1.53e-06 [auto_parallel]: 3.28e-06 [parallel]: 3.55e-06 [merge_comm]: 2.45e-06 [allreduce_fusion]: 1.54e-06 [virtual_dataset]: 5.69e-06 [get_grad_eliminate_]: 5.08e-06 [virtual_output]: 4.78e-06 [merge_forward]: 7.29e-06 [cell_reuse_recompute_pass]: 4.50003e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.162e-05 [meta_fg_expand]: 5.56e-06 [after_resolve]: 7.35e-06 [a_after_grad]: 1.176e-05 [renormalize]: 7.0002e-08 [real_op_eliminate]: 4.59e-06 [auto_monad_grad]: 1.72e-06 [auto_monad_eliminator]: 1.376e-05 [cse]: 3.304e-05 [a_3]: 3.911e-05 [py_interpret_to_execute_after_opt_a]: 4.16e-06 [slice_cell_reuse_recomputed_activation]: 2.53e-06 [rewriter_after_opt_a]: 4.65e-05 [convert_after_rewriter]: 1.119e-05 [order_py_execute_after_rewriter]: 9.07e-06 [opt_b]: 0.00044183, [2] [Cycle 1]: 0.00034807, [7] [b_1]: 0.0002883 [b_2]: 3.02e-06 [updatestate_depend_eliminate]: 3.47001e-06 [updatestate_assign_eliminate]: 3.25e-06 [updatestate_loads_eliminate]: 2.45e-06 [renormalize]: 3.99996e-07 [cse]: 1.314e-05 [Cycle 2]: 8.468e-05, [7] [b_1]: 3.996e-05 [b_2]: 2.26e-06 [updatestate_depend_eliminate]: 2.32e-06 [updatestate_assign_eliminate]: 2.09e-06 [updatestate_loads_eliminate]: 2.16e-06 [renormalize]: 6.00048e-08 [cse]: 9.36e-06 [cconv]: 2.123e-05 [opt_after_cconv]: 6.312e-05, [1] [Cycle 1]: 5.873e-05, [7] [c_1]: 1.401e-05 [parameter_eliminate]: 1.9e-06 [updatestate_depend_eliminate]: 2.47e-06 [updatestate_assign_eliminate]: 2.04999e-06 [updatestate_loads_eliminate]: 2.1e-06 [cse]: 8.8e-06 [renormalize]: 2.40005e-07 [remove_dup_value]: 1.139e-05 [tuple_transform]: 4.608e-05, [1] [Cycle 1]: 4.231e-05, [3] [d_1]: 2.302e-05 [d_2]: 7.16e-06 [renormalize]: 1.50001e-07 [add_cache_embedding]: 1.088e-05 [add_recomputation]: 4.229e-05 [cse_after_recomputation]: 1.831e-05, [1] [Cycle 1]: 1.39e-05, [1] [cse]: 9.25e-06 [environ_conv]: 6.22001e-06 [label_micro_interleaved_index]: 2.05e-06 [label_fine_grained_interleaved_index]: 2.9e-06 [assign_add_opt]: 1.52e-06 [slice_recompute_activation]: 1.99e-06 [micro_interleaved_order_control]: 1.67e-06 [full_micro_interleaved_order_control]: 2.02e-06 [comp_comm_scheduling]: 2.47e-06 [reorder_send_recv_between_fp_bp]: 2.08e-06 [comm_op_add_attrs]: 1.03e-06 [add_comm_op_reuse_tag]: 9.20001e-07 [overlap_opt_shard_in_pipeline]: 1.23e-06 [grouped_pairwise_exchange_alltoall]: 1.24e-06 [overlap_recompute_and_grad_model_parallel]: 1.68e-06 [overlap_grad_matmul_and_grad_allreduce]: 1e-06 [split_matmul_comm_elemetwise]: 2.51e-06 [split_layernorm_comm]: 1.88001e-06 [process_send_recv_for_ge]: 8.2e-07 [handle_group_info]: 9.69994e-07 [auto_monad_reorder]: 1.771e-05 [get_jit_bprop_graph]: 3.43e-06 [eliminate_special_op_node]: 0.0004936 [validate]: 2.807e-05 [distribtued_split]: 1.29e-06 [task_emit]: 0.00268915 [execute]: 6.15999e-06 Sums parse : 0.001482s : 2.47% symbol_resolve.resolve : 0.011124s : 18.55% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.01% meta_unpack_prepare : 0.000136s : 0.23% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.004977s : 8.30% pack_expand : 0.000013s : 0.02% auto_monad : 0.000069s : 0.12% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000010s : 0.02% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000005s : 0.01% optimize.rewriter_before_opt_a : 0.000115s : 0.19% optimize.opt_a.expand_dump_flag : 0.000008s : 0.01% optimize.opt_a.switch_simplify : 0.000114s : 0.19% optimize.opt_a.a_1 : 0.001558s : 2.60% optimize.opt_a.recompute_prepare : 0.000022s : 0.04% optimize.opt_a.updatestate_depend_eliminate : 0.000027s : 0.04% optimize.opt_a.updatestate_assign_eliminate : 0.000019s : 0.03% optimize.opt_a.updatestate_loads_eliminate : 0.000018s : 0.03% optimize.opt_a.parameter_eliminate : 0.000009s : 0.02% optimize.opt_a.a_2 : 0.000241s : 0.40% optimize.opt_a.accelerated_algorithm : 0.000024s : 0.04% optimize.opt_a.pynative_shard : 0.000005s : 0.01% optimize.opt_a.auto_parallel : 0.000011s : 0.02% optimize.opt_a.parallel : 0.000018s : 0.03% optimize.opt_a.merge_comm : 0.000010s : 0.02% optimize.opt_a.allreduce_fusion : 0.000005s : 0.01% optimize.opt_a.virtual_dataset : 0.000015s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000014s : 0.02% optimize.opt_a.virtual_output : 0.000013s : 0.02% optimize.opt_a.merge_forward : 0.000023s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000033s : 0.05% optimize.opt_a.meta_fg_expand : 0.000026s : 0.04% optimize.opt_a.meta_fg_expand.resolve : 0.003292s : 5.49% optimize.opt_a.after_resolve : 0.000053s : 0.09% optimize.opt_a.a_after_grad : 0.000124s : 0.21% optimize.opt_a.renormalize : 0.031773s : 52.98% optimize.opt_a.real_op_eliminate : 0.000043s : 0.07% optimize.opt_a.auto_monad_grad : 0.000048s : 0.08% optimize.opt_a.auto_monad_eliminator : 0.000083s : 0.14% optimize.opt_a.cse : 0.000266s : 0.44% optimize.opt_a.a_3 : 0.000263s : 0.44% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000046s : 0.08% optimize.convert_after_rewriter : 0.000011s : 0.02% optimize.order_py_execute_after_rewriter : 0.000009s : 0.02% optimize.opt_b.b_1 : 0.000328s : 0.55% optimize.opt_b.b_2 : 0.000005s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000023s : 0.04% optimize.cconv : 0.000021s : 0.04% optimize.opt_after_cconv.c_1 : 0.000014s : 0.02% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000009s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.02% optimize.tuple_transform.d_1 : 0.000023s : 0.04% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.02% optimize.add_recomputation : 0.000042s : 0.07% optimize.cse_after_recomputation.cse : 0.000009s : 0.02% optimize.environ_conv : 0.000006s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000018s : 0.03% get_jit_bprop_graph : 0.000003s : 0.01% eliminate_special_op_node : 0.000494s : 0.82% validate : 0.000028s : 0.05% distribtued_split : 0.000001s : 0.00% task_emit : 0.002689s : 4.48% execute : 0.000006s : 0.01% Time group info: ------[substitution.] 0.014328 233 0.01% : 0.000002s : 2: substitution.float_depend_g_call 0.04% : 0.000006s : 6: substitution.float_tuple_getitem_switch 96.02% : 0.013758s : 25: substitution.getattr_setattr_resolve 0.03% : 0.000004s : 3: substitution.graph_param_transform 0.01% : 0.000002s : 1: substitution.incorporate_call 0.01% : 0.000001s : 1: substitution.incorporate_call_switch 2.50% : 0.000358s : 34: substitution.inline 0.03% : 0.000004s : 4: substitution.less_batch_normalization 0.22% : 0.000031s : 34: substitution.meta_unpack_prepare 0.04% : 0.000006s : 6: substitution.minmaximum_grad 0.02% : 0.000002s : 2: substitution.partial_eliminate 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.02% : 0.000003s : 21: substitution.remove_not_recompute_node 0.23% : 0.000033s : 20: substitution.replace_applicator 0.04% : 0.000005s : 14: substitution.replace_old_param 0.02% : 0.000003s : 1: substitution.reset_defer_inline 0.02% : 0.000004s : 3: substitution.set_cell_output_no_recompute 0.03% : 0.000004s : 2: substitution.specialize_transform 0.04% : 0.000005s : 3: substitution.switch_simplify 0.03% : 0.000005s : 1: substitution.transpose_eliminate 0.14% : 0.000020s : 8: substitution.tuple_list_convert_item_index_to_positive 0.06% : 0.000009s : 8: substitution.tuple_list_get_item_const_eliminator 0.08% : 0.000012s : 8: substitution.tuple_list_get_item_depend_reorder 0.27% : 0.000038s : 15: substitution.tuple_list_get_item_eliminator 0.08% : 0.000012s : 8: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.031763 4 93.94% : 0.029838s : 2: renormalize.infer 6.06% : 0.001925s : 2: renormalize.specialize ------[replace.] 0.000540 48 61.42% : 0.000332s : 23: replace.getattr_setattr_resolve 20.22% : 0.000109s : 17: replace.inline 6.17% : 0.000033s : 1: replace.meta_unpack_prepare 6.63% : 0.000036s : 3: replace.switch_simplify 1.12% : 0.000006s : 1: replace.transpose_eliminate 4.44% : 0.000024s : 3: replace.tuple_list_get_item_eliminator ------[match.] 0.013983 48 97.62% : 0.013650s : 23: match.getattr_setattr_resolve 2.12% : 0.000296s : 17: match.inline 0.10% : 0.000014s : 1: match.meta_unpack_prepare 0.04% : 0.000005s : 3: match.switch_simplify 0.03% : 0.000005s : 1: match.transpose_eliminate 0.09% : 0.000013s : 3: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.002408 43 69.49% : 0.001673s : 21: func_graph_cloner_run.FuncGraphClonerGraph 30.51% : 0.000735s : 22: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.017177 452 0.65% : 0.000112s : 2: opt.transform.meta_unpack_prepare 13.87% : 0.002382s : 334: opt.transform.opt_a 0.06% : 0.000011s : 7: opt.transform.opt_after_cconv 1.75% : 0.000300s : 94: opt.transform.opt_b 83.46% : 0.014335s : 4: opt.transform.opt_resolve 0.15% : 0.000026s : 8: opt.transform.opt_trans_graph 0.06% : 0.000011s : 3: opt.transform.special_op_eliminate . ============================== 2 passed in 21.12s ============================== [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.290.095 [status:INIT] [ge_api.cc:463]14258 ~Session:Start to destruct session. [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.290.174 [status:RUNNING] [ge_api.cc:475]14258 ~Session:Session id is 0 [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.290.186 [status:RUNNING] [ge_api.cc:476]14258 ~Session:Destroying session [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.292.518 [status:STOP] [ge_api.cc:491]14258 ~Session:Session Destructor finished [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.292.565 [status:INIT] [ge_api.cc:301]14258 GEFinalize:GEFinalize start [INFO] GE(14258,python3.7):2024-01-11-05:43:10.292.720 [execution_runtime.cc:80][EVENT]14258 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(14258,python3.7):2024-01-11-05:43:10.292.750 [execution_runtime.cc:92][EVENT]14258 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.292.763 [status:RUNNING] [ge_api.cc:313]14258 GEFinalize:Finalizing environment [INFO] TUNE(14258,python3.7):2024-01-11-05:43:10.717.718 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:14258]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(14258,python3.7):2024-01-11-05:43:10.717.777 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:14258]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(14258,python3.7):2024-01-11-05:43:10.719.580 [gelib.cc:324][EVENT]14258 SystemFinalize:Online infer finalize GELib success. [TRACE] GE(14258,python3.7):2024-01-11-05:43:10.788.533 [status:STOP] [ge_api.cc:341]14258 GEFinalize:GEFinalize finished [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.106.871 [process_mode_manager.cpp:184][Close][tid:14258] [TsdClient] Close [deviceId=2][sessionId=1] hccp and computer enter [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.106.950 [version_verify.cpp:112][SpecialFeatureCheck][tid:14258] VersionVerify: previous type[7], supported [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.107.001 [process_mode_manager.cpp:192][Close][tid:14258] [TsdClient][deviceId=2] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.138.087 [process_mode_manager.cpp:197][Close][tid:14258] [TsdClient][logicDeviceId_=2]has recv close hccp and computer process respond [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.138.146 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:14258] enter into CloseInHost deviceid[2] [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.138.159 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:14258] host cpu not support [INFO] TDT(14258,python3.7):2024-01-11-05:43:11.138.227 [process_mode_manager.cpp:208][Close][tid:14258] [TsdClient][deviceId=2] [sessionId=1] close hccp and computer process success [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:11.138.241 [atrace_api.c:93](tid:14258) AtraceDestroy start [INFO] ATRACE(14258,python3.7):2024-01-11-05:43:11.138.258 [atrace_api.c:95](tid:14258) AtraceDestroy end [INFO] PROFILING(14258,python3.7):2024-01-11-05:43:11.138.283 [msprofiler_impl.cpp:156] >>> (tid:14258) ProfNotifySetDevice called, is open: 0, devId: 2 [INFO] RUNTIME(14258,python3.7):2024-01-11-05:43:12.934.419 [runtime.cc:1737] 14258 ~Runtime: deconstruct runtime.