============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_003/sault/config/pytest.ini plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3 [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:25.960.883 [trace_attr.c:105](tid:170035) platform is 1. [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:25.961.050 [trace_recorder.c:114](tid:170035) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:25.961.080 [trace_signal.c:133](tid:170035) register signal handler for signo 2 succeed. [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:25.961.091 [trace_signal.c:133](tid:170035) register signal handler for signo 15 succeed. [INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:26.386.874 [runtime.cc:1159] 170035 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:26.386.952 [runtime.cc:4719] 170035 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_gelu.py [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.799.516 [process_mode_manager.cpp:109][OpenProcess][tid:170035] [ProcessModeManager] enter into open process deviceId[2] rankSize[0] [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.801.966 [process_mode_manager.cpp:379][InitTsdClient][tid:170035] [TsdClient] deviceId[2] begin to init hdc client [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.120 [version_verify.cpp:34][SetVersionInfo][tid:170035] VersionVerify: send client version to server [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.170 [version_verify.cpp:50][SetVersionInfo][tid:170035] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.183 [version_verify.cpp:50][SetVersionInfo][tid:170035] send 
feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.451 [version_verify.cpp:66][PeerVersionCheck][tid:170035] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.468 [version_verify.cpp:87][ParseVersionInfo][tid:170035] VersionVerify: pass client version info success [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.477 [hdc_client.cpp:276][CheckHdcConnection][tid:170035] Service[2] create hdc success [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.493 [version_verify.cpp:120][SpecialFeatureCheck][tid:170035] VersionVerify: new type[35], supported [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.540 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:170035] [TsdClient][deviceId=2] [sessionId=1] wait package info respond [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.655 [process_mode_manager.cpp:379][InitTsdClient][tid:170035] [TsdClient] deviceId[2] begin to init hdc client [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.830 [version_verify.cpp:34][SetVersionInfo][tid:170035] VersionVerify: send client version to server [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.843 [version_verify.cpp:50][SetVersionInfo][tid:170035] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.853 [version_verify.cpp:50][SetVersionInfo][tid:170035] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.802.997 [version_verify.cpp:66][PeerVersionCheck][tid:170035] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.803.010 [version_verify.cpp:87][ParseVersionInfo][tid:170035] VersionVerify: pass client version info success [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.803.018 
[hdc_client.cpp:276][CheckHdcConnection][tid:170035] Service[2] create hdc success [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.803.030 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:170035] [TsdClient] tsd get process sign successfully, procpid[170035] signSize[48] [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.803.062 [version_verify.cpp:112][SpecialFeatureCheck][tid:170035] VersionVerify: previous type[6], supported [INFO] TDT(170035,python3.7):2024-01-11-05:39:30.803.084 [process_mode_manager.cpp:126][OpenProcess][tid:170035] [ProcessModeManager] deviceId[2] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(170035,python3.7):2024-01-11-05:39:31.023.342 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:170035] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(170035,python3.7):2024-01-11-05:39:31.023.372 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:170035] enter into OpenInHost deviceid[2] [INFO] TDT(170035,python3.7):2024-01-11-05:39:31.023.382 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:170035] host cpu not support [INFO] TDT(170035,python3.7):2024-01-11-05:39:31.023.390 [process_mode_manager.cpp:156][OpenProcess][tid:170035] [TsdClient][deviceId=2] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:31.028.694 [device.cc:340] 170035 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:31.043.334 [npu_driver.cc:5428] 170924 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:31.044.153 [atrace_api.c:28](tid:170035) AtraceCreate start [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:31.044.272 [trace_rb_log.c:84](tid:170035) [RUNTIME_ATRACE_DEV2_TS0] create ring buffer success, buffer size : 131152. 
[INFO] ATRACE(170035,python3.7):2024-01-11-05:39:31.044.287 [atrace_api.c:32](tid:170035) AtraceCreate end [INFO] TDT(170035,python3.7):2024-01-11-05:39:31.044.321 [client_manager.cpp:157][SetProfilingCallback][tid:170035] [TsdClient] set profiling callback success [TRACE] GE(170035,python3.7):2024-01-11-05:39:31.195.665 [status:INIT] [ge_api.cc:144]170035 GEInitializeImpl:GEInitialize start [INFO] PROFILING(170035,python3.7):2024-01-11-05:39:31.416.952 [msprofiler_impl.cpp:156] >>> (tid:170035) ProfNotifySetDevice called, is open: 1, devId: 2 [INFO] PROFILING(170035,python3.7):2024-01-11-05:39:31.417.132 [platform.cpp:38] >>> (tid:170035) Profiling platform version: 1.0. [INFO] PROFILING(170035,python3.7):2024-01-11-05:39:31.417.150 [ai_drv_dev_api.cpp:384] >>> (tid:170035) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(170035,python3.7):2024-01-11-05:39:31.468.310 [status:RUNNING] [ge_api.cc:211]170035 GEInitializeImpl:Initializing environment [INFO] GE(170035,python3.7):2024-01-11-05:39:31.468.379 [gelib.cc:98][EVENT]170035 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(170035,python3.7):2024-01-11-05:39:31.468.697 [gelib.cc:307][EVENT]170035 SystemInitialize:Online infer init GELib success, device id :2 [INFO] DVPP(170035,python3.7):2024-01-11-05:39:31.839.957 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:170035]dvpp engine do not support [INFO] TUNE(170035,python3.7):2024-01-11-05:39:31.844.643 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:170035]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(170035,python3.7):2024-01-11-05:39:31.844.682 [handle_manager.cpp:115][CANNKB][Tid:170035]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(170035,python3.7):2024-01-11-05:39:31.844.747 [handle_manager.cpp:407][CANNKB][Tid:170035]"Init functions of loading dynamic python lib end!" 
[INFO] TUNE(170035,python3.7):2024-01-11-05:39:31.844.757 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:170035]"CANN_KB_Py has already been initialized." [INFO] TUNE(170035,python3.7):2024-01-11-05:39:31.844.858 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:170035]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(170035,python3.7):2024-01-11-05:39:44.077.271 [plugin_manager.cc:42][170035]hcom running normal mode. [INFO] DVPP(170035,python3.7):2024-01-11-05:39:44.078.056 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:170035]dvpp ops kernel info store do not support [INFO] DVPP(170035,python3.7):2024-01-11-05:39:44.078.253 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:170035]dvpp graph optimizer do not support [INFO] DVPP(170035,python3.7):2024-01-11-05:39:44.644.340 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:170035]dvpp ops kernel builder do not support [INFO] GE(170035,python3.7):2024-01-11-05:39:44.653.654 [gelib.cc:169][EVENT]170035 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [13185187] micro second. [TRACE] GE(170035,python3.7):2024-01-11-05:39:44.736.711 [status:STOP] [ge_api.cc:255]170035 GEInitializeImpl:GEInitialize finished [TRACE] GE(170035,python3.7):2024-01-11-05:39:44.736.877 [status:INIT] [ge_api.cc:398]170035 Session:Start to construct session. 
[TRACE] GE(170035,python3.7):2024-01-11-05:39:44.736.899 [status:RUNNING] [ge_api.cc:408]170035 Session:Creating session [INFO] GE(170035,python3.7):2024-01-11-05:39:44.737.380 [graph_var_manager.cc:1445][EVENT]170035 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(170035,python3.7):2024-01-11-05:39:44.737.402 [graph_var_manager.cc:1424][EVENT]170035 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(170035,python3.7):2024-01-11-05:39:44.737.769 [msprofiler_impl.cpp:156] >>> (tid:170035) ProfNotifySetDevice called, is open: 1, devId: 2 [TRACE] GE(170035,python3.7):2024-01-11-05:39:44.738.619 [status:RUNNING] [ge_api.cc:411]170035 Session:Session id is 0 [TRACE] GE(170035,python3.7):2024-01-11-05:39:44.738.644 [status:STOP] [ge_api.cc:420]170035 Session:Session Constructor finished [INFO] PROFILING(170035,python3.7):2024-01-11-05:39:44.748.386 [platform.cpp:38] >>> (tid:170035) Profiling platform version: 1.0. 
[INFO] PROFILING(170035,python3.7):2024-01-11-05:39:44.748.421 [ai_drv_dev_api.cpp:384] >>> (tid:170035) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(170035,python3.7):2024-01-11-05:39:44.748.659 [status:INIT] [ge_api.cc:144]170035 GEInitializeImpl:GEInitialize start TotalTime = 0.399099, [20] [parse]: 0.232785 [symbol_resolve]: 0.0295848, [1] [Cycle 1]: 0.0295046, [1] [resolve]: 0.0294592 [combine_like_graphs]: 1.46e-06 [graph_reusing]: 3.78e-06 [meta_unpack_prepare]: 0.00018639 [pre_cconv]: 4.98e-06 [abstract_specialize]: 0.00504094 [pack_expand]: 1.62e-05 [auto_monad]: 0.00012222 [inline]: 1.67e-06 [pre_auto_parallel]: 2.585e-05 [pipeline_split]: 3.43e-06 [optimize]: 0.12417, [35] [py_interpret_to_execute]: 5.21e-06 [rewriter_before_opt_a]: 0.00017981 [opt_a]: 0.12279, [4] [Cycle 1]: 0.0851801, [30] [expand_dump_flag]: 4.53e-06 [switch_simplify]: 2.772e-05 [a_1]: 0.00040614 [recompute_prepare]: 9.6e-06 [updatestate_depend_eliminate]: 1.102e-05 [updatestate_assign_eliminate]: 6.79e-06 [updatestate_loads_eliminate]: 6.02e-06 [parameter_eliminate]: 5.4e-06 [a_2]: 8.776e-05 [accelerated_algorithm]: 5.35e-06 [pynative_shard]: 1.97e-06 [auto_parallel]: 4.48e-06 [parallel]: 1.624e-05 [merge_comm]: 1.765e-05 [allreduce_fusion]: 2.36e-06 [virtual_dataset]: 5.18e-06 [get_grad_eliminate_]: 4.15e-06 [virtual_output]: 3.55e-06 [merge_forward]: 9.03e-06 [cell_reuse_recompute_pass]: 8.70001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.23e-05 [meta_fg_expand]: 0.0264653, [1] [Cycle 1]: 0.00059747, [1] [resolve]: 0.00056585 [after_resolve]: 2.46e-05 [a_after_grad]: 4.524e-05 [renormalize]: 0.0573863 [real_op_eliminate]: 3.014e-05 [auto_monad_grad]: 3.369e-05 [auto_monad_eliminator]: 4.822e-05 [cse]: 0.00014037 [a_3]: 0.00016671 [Cycle 2]: 0.0279517, [30] [expand_dump_flag]: 4.84e-06 [switch_simplify]: 6.538e-05 [a_1]: 0.00051461 [recompute_prepare]: 1.093e-05 [updatestate_depend_eliminate]: 1.293e-05 [updatestate_assign_eliminate]: 9.16e-06 
[updatestate_loads_eliminate]: 8.37e-06 [parameter_eliminate]: 4.4e-06 [a_2]: 0.00011795 [accelerated_algorithm]: 1.266e-05 [pynative_shard]: 1.84e-06 [auto_parallel]: 1.031e-05 [parallel]: 7.47e-06 [merge_comm]: 4.58e-06 [allreduce_fusion]: 1.94e-06 [virtual_dataset]: 6.86e-06 [get_grad_eliminate_]: 5.93e-06 [virtual_output]: 5.37001e-06 [merge_forward]: 1.1e-05 [cell_reuse_recompute_pass]: 1.12e-06 [cell_reuse_handle_not_recompute_node_pass]: 1.566e-05 [meta_fg_expand]: 0.0065126, [3] [Cycle 1]: 0.00037256, [1] [resolve]: 0.000353 [Cycle 1]: 0.00042839, [1] [resolve]: 0.00040895 [Cycle 1]: 0.0003239, [1] [resolve]: 0.00030614 [after_resolve]: 3.121e-05 [a_after_grad]: 5.261e-05 [renormalize]: 0.019928 [real_op_eliminate]: 2.78e-05 [auto_monad_grad]: 3.459e-05 [auto_monad_eliminator]: 5.138e-05 [cse]: 0.00012198 [a_3]: 0.00019669 [Cycle 3]: 0.00251719, [30] [expand_dump_flag]: 3.32999e-06 [switch_simplify]: 6.42e-05 [a_1]: 0.00053892 [recompute_prepare]: 1.252e-05 [updatestate_depend_eliminate]: 1.366e-05 [updatestate_assign_eliminate]: 1.003e-05 [updatestate_loads_eliminate]: 9.25999e-06 [parameter_eliminate]: 4.12e-06 [a_2]: 0.00014501 [accelerated_algorithm]: 1.452e-05 [pynative_shard]: 1.71e-06 [auto_parallel]: 6.25e-06 [parallel]: 4.43e-06 [merge_comm]: 3.37e-06 [allreduce_fusion]: 1.96999e-06 [virtual_dataset]: 8.24e-06 [get_grad_eliminate_]: 7.33e-06 [virtual_output]: 6.66e-06 [merge_forward]: 1.146e-05 [cell_reuse_recompute_pass]: 7.30004e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.932e-05 [meta_fg_expand]: 2.825e-05 [after_resolve]: 1.087e-05 [a_after_grad]: 1.357e-05 [renormalize]: 0.00123505 [real_op_eliminate]: 1.264e-05 [auto_monad_grad]: 5.38e-06 [auto_monad_eliminator]: 2.231e-05 [cse]: 8.608e-05 [a_3]: 6.847e-05 [Cycle 4]: 0.00070322, [30] [expand_dump_flag]: 1.54e-06 [switch_simplify]: 8.2e-06 [a_1]: 0.00014475 [recompute_prepare]: 9.71e-06 [updatestate_depend_eliminate]: 1.329e-05 [updatestate_assign_eliminate]: 9.76001e-06 
[updatestate_loads_eliminate]: 9.34e-06 [parameter_eliminate]: 1.91999e-06 [a_2]: 0.00014315 [accelerated_algorithm]: 1.421e-05 [pynative_shard]: 1.32e-06 [auto_parallel]: 3.59e-06 [parallel]: 3.81e-06 [merge_comm]: 2.50999e-06 [allreduce_fusion]: 1.76e-06 [virtual_dataset]: 7.79e-06 [get_grad_eliminate_]: 6.89e-06 [virtual_output]: 6.52e-06 [merge_forward]: 1.147e-05 [cell_reuse_recompute_pass]: 4.30002e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.84e-05 [meta_fg_expand]: 7.77e-06 [after_resolve]: 1.055e-05 [a_after_grad]: 1.312e-05 [renormalize]: 1.00001e-07 [real_op_eliminate]: 6.88e-06 [auto_monad_grad]: 2.34e-06 [auto_monad_eliminator]: 2.004e-05 [cse]: 4.784e-05 [a_3]: 6.077e-05 [py_interpret_to_execute_after_opt_a]: 3.76e-06 [slice_cell_reuse_recomputed_activation]: 2.52e-06 [rewriter_after_opt_a]: 0.0001121 [convert_after_rewriter]: 1.708e-05 [order_py_execute_after_rewriter]: 1.183e-05 [opt_b]: 0.00054923, [2] [Cycle 1]: 0.00046513, [7] [b_1]: 0.00041041 [b_2]: 3.32e-06 [updatestate_depend_eliminate]: 3.5e-06 [updatestate_assign_eliminate]: 2.36e-06 [updatestate_loads_eliminate]: 2.04e-06 [renormalize]: 4.50003e-07 [cse]: 1.032e-05 [Cycle 2]: 7.323e-05, [7] [b_1]: 3.414e-05 [b_2]: 2.28e-06 [updatestate_depend_eliminate]: 2.18e-06 [updatestate_assign_eliminate]: 1.91e-06 [updatestate_loads_eliminate]: 1.67e-06 [renormalize]: 7.0002e-08 [cse]: 5.83e-06 [cconv]: 2.195e-05 [opt_after_cconv]: 4.924e-05, [1] [Cycle 1]: 4.481e-05, [7] [c_1]: 4.99999e-06 [parameter_eliminate]: 1.96e-06 [updatestate_depend_eliminate]: 2.18e-06 [updatestate_assign_eliminate]: 1.7e-06 [updatestate_loads_eliminate]: 1.66e-06 [cse]: 5.91001e-06 [renormalize]: 3.39998e-07 [remove_dup_value]: 1.016e-05 [tuple_transform]: 3.79e-05, [1] [Cycle 1]: 3.328e-05, [3] [d_1]: 1.445e-05 [d_2]: 5.82e-06 [renormalize]: 2.00002e-07 [add_cache_embedding]: 1.086e-05 [add_recomputation]: 4.613e-05 [cse_after_recomputation]: 5.581e-05, [1] [Cycle 1]: 5.09e-05, [1] [cse]: 4.575e-05 [environ_conv]: 
2.214e-05 [label_micro_interleaved_index]: 2.75e-06 [label_fine_grained_interleaved_index]: 2.59e-06 [assign_add_opt]: 3.63e-06 [slice_recompute_activation]: 2.32e-06 [micro_interleaved_order_control]: 1.82e-06 [full_micro_interleaved_order_control]: 2.14999e-06 [comp_comm_scheduling]: 2.37e-06 [reorder_send_recv_between_fp_bp]: 2.2e-06 [comm_op_add_attrs]: 1.11001e-06 [add_comm_op_reuse_tag]: 1.34e-06 [overlap_opt_shard_in_pipeline]: 1.35e-06 [grouped_pairwise_exchange_alltoall]: 2.14e-06 [overlap_recompute_and_grad_model_parallel]: 1.76e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.89994e-07 [split_matmul_comm_elemetwise]: 2.55e-06 [split_layernorm_comm]: 1.75e-06 [process_send_recv_for_ge]: 2.52e-06 [handle_group_info]: 1.29e-06 [auto_monad_reorder]: 2.105e-05 [get_jit_bprop_graph]: 4.79995e-07 [eliminate_special_op_node]: 0.00054715 [validate]: 4.36e-05 [distribtued_split]: 1.38e-06 [task_emit]: 0.00623865 [execute]: 8.37001e-06 Sums parse : 0.232785s : 64.67% symbol_resolve.resolve : 0.029459s : 8.18% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.00% meta_unpack_prepare : 0.000186s : 0.05% pre_cconv : 0.000005s : 0.00% abstract_specialize : 0.005041s : 1.40% pack_expand : 0.000016s : 0.00% auto_monad : 0.000122s : 0.03% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000026s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000005s : 0.00% optimize.rewriter_before_opt_a : 0.000180s : 0.05% optimize.opt_a.expand_dump_flag : 0.000014s : 0.00% optimize.opt_a.switch_simplify : 0.000166s : 0.05% optimize.opt_a.a_1 : 0.001604s : 0.45% optimize.opt_a.recompute_prepare : 0.000043s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000051s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000036s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000033s : 0.01% optimize.opt_a.parameter_eliminate : 0.000016s : 0.00% optimize.opt_a.a_2 : 0.000494s : 0.14% optimize.opt_a.accelerated_algorithm : 
0.000047s : 0.01% optimize.opt_a.pynative_shard : 0.000007s : 0.00% optimize.opt_a.auto_parallel : 0.000025s : 0.01% optimize.opt_a.parallel : 0.000032s : 0.01% optimize.opt_a.merge_comm : 0.000028s : 0.01% optimize.opt_a.allreduce_fusion : 0.000008s : 0.00% optimize.opt_a.virtual_dataset : 0.000028s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000024s : 0.01% optimize.opt_a.virtual_output : 0.000022s : 0.01% optimize.opt_a.merge_forward : 0.000043s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000003s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000066s : 0.02% optimize.opt_a.meta_fg_expand : 0.000036s : 0.01% optimize.opt_a.meta_fg_expand.resolve : 0.001634s : 0.45% optimize.opt_a.after_resolve : 0.000077s : 0.02% optimize.opt_a.a_after_grad : 0.000125s : 0.03% optimize.opt_a.renormalize : 0.078549s : 21.82% optimize.opt_a.real_op_eliminate : 0.000077s : 0.02% optimize.opt_a.auto_monad_grad : 0.000076s : 0.02% optimize.opt_a.auto_monad_eliminator : 0.000142s : 0.04% optimize.opt_a.cse : 0.000396s : 0.11% optimize.opt_a.a_3 : 0.000493s : 0.14% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000112s : 0.03% optimize.convert_after_rewriter : 0.000017s : 0.00% optimize.order_py_execute_after_rewriter : 0.000012s : 0.00% optimize.opt_b.b_1 : 0.000445s : 0.12% optimize.opt_b.b_2 : 0.000006s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000004s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.00% optimize.opt_b.renormalize : 0.000001s : 0.00% optimize.opt_b.cse : 0.000016s : 0.00% optimize.cconv : 0.000022s : 0.01% optimize.opt_after_cconv.c_1 : 0.000005s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% 
optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000006s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.00% optimize.tuple_transform.d_1 : 0.000014s : 0.00% optimize.tuple_transform.d_2 : 0.000006s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.00% optimize.add_recomputation : 0.000046s : 0.01% optimize.cse_after_recomputation.cse : 0.000046s : 0.01% optimize.environ_conv : 0.000022s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000004s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000002s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000003s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000021s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000547s : 0.15% validate : 0.000044s : 0.01% distribtued_split : 0.000001s : 0.00% task_emit : 0.006239s : 1.73% execute : 0.000008s : 0.00% Time group info: ------[substitution.] 
0.031335 383 0.01% : 0.000004s : 5: substitution.float_depend_g_call 0.03% : 0.000010s : 14: substitution.float_tuple_getitem_switch 96.68% : 0.030295s : 25: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 3: substitution.graph_param_transform 0.01% : 0.000005s : 3: substitution.incorporate_call 0.04% : 0.000014s : 3: substitution.incorporate_call_switch 1.98% : 0.000621s : 59: substitution.inline 0.02% : 0.000007s : 10: substitution.less_batch_normalization 0.22% : 0.000067s : 23: substitution.meta_unpack_prepare 0.04% : 0.000012s : 11: substitution.minmaximum_grad 0.02% : 0.000005s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.02% : 0.000007s : 47: substitution.remove_not_recompute_node 0.18% : 0.000057s : 38: substitution.replace_applicator 0.03% : 0.000009s : 20: substitution.replace_old_param 0.01% : 0.000004s : 2: substitution.reset_defer_inline 0.03% : 0.000008s : 8: substitution.set_cell_output_no_recompute 0.03% : 0.000009s : 5: substitution.specialize_transform 0.03% : 0.000011s : 4: substitution.switch_simplify 0.05% : 0.000015s : 2: substitution.transpose_eliminate 0.13% : 0.000040s : 15: substitution.tuple_list_convert_item_index_to_positive 0.05% : 0.000016s : 15: substitution.tuple_list_get_item_const_eliminator 0.07% : 0.000022s : 15: substitution.tuple_list_get_item_depend_reorder 0.23% : 0.000072s : 33: substitution.tuple_list_get_item_eliminator 0.07% : 0.000021s : 15: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.078533 6 95.55% : 0.075040s : 3: renormalize.infer 4.45% : 0.003493s : 3: renormalize.specialize ------[replace.] 0.000729 68 51.17% : 0.000373s : 23: replace.getattr_setattr_resolve 26.95% : 0.000197s : 31: replace.inline 6.69% : 0.000049s : 2: replace.meta_unpack_prepare 8.00% : 0.000058s : 4: replace.switch_simplify 1.30% : 0.000010s : 2: replace.transpose_eliminate 5.89% : 0.000043s : 6: replace.tuple_list_get_item_eliminator ------[match.] 
0.030861 68 97.89% : 0.030210s : 23: match.getattr_setattr_resolve 1.78% : 0.000550s : 31: match.inline 0.18% : 0.000055s : 2: match.meta_unpack_prepare 0.03% : 0.000011s : 4: match.switch_simplify 0.05% : 0.000015s : 2: match.transpose_eliminate 0.07% : 0.000021s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.003979 69 68.87% : 0.002741s : 28: func_graph_cloner_run.FuncGraphClonerGraph 31.13% : 0.001239s : 41: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.034736 255 2.88% : 0.001001s : 104: opt.transform.opt_a 1.19% : 0.000413s : 92: opt.transform.opt_b 89.13% : 0.030961s : 10: opt.transform.opt_resolve 0.44% : 0.000154s : 1: opt.transforms.meta_unpack_prepare 6.26% : 0.002173s : 40: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000004s : 2: opt.transforms.opt_b 0.05% : 0.000018s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000009s : 3: opt.transforms.special_op_eliminate [INFO] GE(170035,python3.7):2024-01-11-05:39:45.287.558 [scalable_config.cc:55][EVENT]174525 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(170035,python3.7):2024-01-11-05:39:45.366.030 [graph_var_manager.cc:1424][EVENT]174525 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(170035,python3.7):2024-01-11-05:39:45.366.161 [graph_manager.cc:1248][EVENT]174525 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online. [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:45.367.060 [atrace_api.c:28](tid:174525) AtraceCreate start [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:45.367.135 [trace_rb_log.c:84](tid:174525) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. 
[INFO] ATRACE(170035,python3.7):2024-01-11-05:39:45.367.150 [atrace_api.c:32](tid:174525) AtraceCreate end [INFO] TDT(170035,python3.7):2024-01-11-05:39:45.367.179 [client_manager.cpp:157][SetProfilingCallback][tid:174525] [TsdClient] set profiling callback success [INFO] GE(170035,python3.7):2024-01-11-05:39:45.368.258 [parallel_partitioner.cc:165][EVENT]174525 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [27] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.368.308 [parallel_partitioner.cc:178][EVENT]174525 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [18] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.368.371 [graph_prepare.cc:1378][EVENT]174525 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [10] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.023 [graph_manager.cc:1050][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [677] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.052 [graph_manager.cc:1052][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [9] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.223 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.257 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.372 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [101] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.389 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.496 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [34] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.510 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.534 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [14] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.649 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.369.690 [graph_manager.cc:1054][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [625] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.376.979 [graph_manager.cc:1055][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7274] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.254 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.284 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.296 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of MergePass is [12] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.305 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of InferShapePass is [322] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.314 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [32] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.322 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.331 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [105] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.339 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [29] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.378.348 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of InferValuePass is [5] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.421 [graph_manager.cc:1056][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3403] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.487 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.506 [graph_prepare.cc:1982][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [50] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.843 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of AssertPass is [4] micro second, call num is [6] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.867 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.878 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.886 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of InferShapePass is [170] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.895 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.904 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [4] micro second, call num is [6] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.912 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.920 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.942 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [3] [INFO] 
GE(170035,python3.7):2024-01-11-05:39:45.380.969 [graph_prepare.cc:1983][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [449] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.380.993 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [6] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.006 [graph_prepare.cc:1984][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [21] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.020 [graph_prepare.cc:1985][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.041 [graph_prepare.cc:1986][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [9] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.053 [graph_prepare.cc:1987][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.068 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.079 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.094 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.175 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.187 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of CondPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.195 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrintOpPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.204 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.212 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DropOutPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.221 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.229 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.237 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [3] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.245 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.253 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [3] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.261 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.270 
[base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.287 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.296 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [5] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.304 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.312 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.336 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [11] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.349 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.380 [graph_prepare.cc:1988][EVENT]174525 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [318] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.381.392 [graph_manager.cc:1065][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [936] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.394.373 [graph_manager.cc:1077][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [12960] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.394.442 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.394.490 [graph_manager.cc:1080][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [80] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.229 [graph_manager.cc:1081][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3722] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.272 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.287 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.299 [graph_manager.cc:1082][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [36] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.330 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.346 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.360 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.393 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [22] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.407 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.422 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.445 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.494 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [38] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.513 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [7] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.531 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [8] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.593 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [52] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.630 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [25] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.645 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.654 [graph_manager.cc:2700][EVENT]174525 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [329] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.797 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.812 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.822 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.831 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.840 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.848 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of CastRemovePass is [10] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.857 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.865 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [24] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.873 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [13] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.881 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.889 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [3] [INFO] 
GE(170035,python3.7):2024-01-11-05:39:45.398.898 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [4] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.906 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.914 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.941 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.952 [graph_manager.cc:2741][EVENT]174525 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [279] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.961 [graph_manager.cc:2752][EVENT]174525 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.984 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.398.998 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.015 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.031 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.045 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.057 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.078 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [10] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.093 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.107 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.117 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.130 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.141 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.160 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.173 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.183 [graph_manager.cc:2810][EVENT]174525 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [204] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.212 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of IdentityPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.224 [graph_manager.cc:2821][EVENT]174525 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [31] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.252 [graph_manager.cc:1087][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [933] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.389 [graph_manager.cc:1088][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [124] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.451 [graph_manager.cc:1089][EVENT]174525 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [33] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.472 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.486 [graph_manager.cc:1097][EVENT]174525 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.508 [graph_manager.cc:3325][EVENT]174525 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.906 [engine_place.cc:144][EVENT]174525 Run:The time cost of AIcoreEngine::CheckSupported is [261] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.934 [engine_place.cc:144][EVENT]174525 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [9] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.399.943 [engine_place.cc:144][EVENT]174525 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [7] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.071 [graph_manager.cc:3351][EVENT]174525 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [549] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.092 [graph_manager.cc:3364][EVENT]174525 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.198 [engine_partitioner.cc:1139][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [20] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.216 [engine_partitioner.cc:1142][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.355 [engine_partitioner.cc:1148][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [129] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.398 [engine_partitioner.cc:1155][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [30] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.445 [engine_partitioner.cc:1164][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [35] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.481 [graph_manager.cc:3405][EVENT]174525 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [346] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.400.501 [graph_manager.cc:3412][EVENT]174525 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.706 [graph_manager.cc:3422][EVENT]174525 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [12190] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.746 [graph_manager.cc:3428][EVENT]174525 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.869 [graph_manager.cc:3467][EVENT]174525 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [103] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.888 [graph_manager.cc:3377][EVENT]174525 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [12754] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.905 [graph_manager.cc:1106][EVENT]174525 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [13404] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.927 [graph_manager.cc:1115][EVENT]174525 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.950 [graph_manager.cc:1130][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [5] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.412.981 [graph_manager.cc:1131][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [18] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.029 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [29] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.047 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [5] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.057 [graph_manager.cc:2837][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [60] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.139 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.152 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [0] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.161 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of CondRemovePass is [3] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.170 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.179 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [14] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.187 [base_pass.cc:339][EVENT]174525 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [6] micro second, call num is [3] [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.197 [graph_manager.cc:2864][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [124] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.210 [graph_manager.cc:2872][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.231 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.247 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.263 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [5] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.277 [compile_nodes_pass.cc:88][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.288 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.298 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.389 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [82] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.443 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [33] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.458 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.471 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.485 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.494 [graph_manager.cc:2927][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [266] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.507 [graph_manager.cc:2937][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [4] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.541 [graph_manager.cc:2943][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [25] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.413.554 [graph_manager.cc:2950][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.089 [graph_manager.cc:2958][EVENT]174525 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [43] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.216 [graph_manager.cc:1132][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11220] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.294 [graph_manager.cc:1135][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [60] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.337 [graph_manager.cc:2975][EVENT]174525 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [25] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.407 [graph_manager.cc:2981][EVENT]174525 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [57] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.425 [pass_manager.cc:82][EVENT]174525 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.436 [graph_manager.cc:2986][EVENT]174525 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [14] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.445 [graph_manager.cc:1136][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [134] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.585 [graph_manager.cc:3555][EVENT]174525 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [92] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.704 [engine_partitioner.cc:1139][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [15] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.723 [engine_partitioner.cc:1142][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.832 [engine_partitioner.cc:1148][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [100] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.874 [engine_partitioner.cc:1155][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [19] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.914 [engine_partitioner.cc:1164][EVENT]174525 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [28] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.424.939 [graph_builder.cc:865][EVENT]174525 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [270] micro second. [INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:45.425.449 [logger.cc:1071] 174525 ModelBindStream: model_id=832, stream_id=1089, flag=0. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.425.501 [task_generator.cc:804][EVENT]174525 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [190] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.425.590 [task_generator.cc:805][EVENT]174525 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [75] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.371 [task_generator.cc:814][EVENT]174525 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [758] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.387 [task_generator.cc:954][EVENT]174525 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1077] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.462 [task_generator.cc:967][EVENT]174525 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [41] micro second. 
[INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:45.426.482 [logger.cc:1084] 174525 ModelUnbindStream: model_id=832, stream_id=1089, [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.689 [graph_manager.cc:1152][EVENT]174525 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2218] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.709 [graph_manager.cc:1164][EVENT]174525 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.750 [graph_manager.cc:1271][EVENT]174525 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [58692] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.426.762 [graph_manager.cc:1272][EVENT]174525 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:45.427.074 [atrace_api.c:93](tid:174525) AtraceDestroy start [INFO] ATRACE(170035,python3.7):2024-01-11-05:39:45.427.101 [atrace_api.c:95](tid:174525) AtraceDestroy end [INFO] GE(170035,python3.7):2024-01-11-05:39:45.432.451 [graph_converter.cc:838][EVENT]174525 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1622] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.432.643 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of ZeroCopy is [145] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.135 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CEM is [466] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.345 [copy_flow_launch_fuse.cc:395][EVENT]174525 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [184] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.369 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [208] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.617 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [237] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.652 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [15] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.689 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of ZeroCopy is [23] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.889 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CEM is [176] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.971 [copy_flow_launch_fuse.cc:395][EVENT]174525 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [63] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.433.986 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [79] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.016 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [20] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.027 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.054 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of ZeroCopy is [17] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.124 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CEM is [60] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.188 [copy_flow_launch_fuse.cc:395][EVENT]174525 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [52] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.199 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [65] micro second. 
[INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.225 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [16] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.235 [base_optimizer.cc:70][EVENT]174525 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.248 [graph_converter.cc:849][EVENT]174525 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1757] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.434.452 [graph_converter.cc:853][EVENT]174525 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [194] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.435.167 [graph_converter.cc:857][EVENT]174525 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [701] micro second. [INFO] GE(170035,python3.7):2024-01-11-05:39:45.435.324 [graph_converter.cc:862][EVENT]174525 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [130] micro second. 
TotalTime = 0.0853156, [20] [parse]: 0.00152073 [symbol_resolve]: 0.0125267, [1] [Cycle 1]: 0.0124528, [1] [resolve]: 0.0124331 [combine_like_graphs]: 1.18e-06 [graph_reusing]: 3.44e-06 [meta_unpack_prepare]: 0.00013755 [pre_cconv]: 6.60002e-07 [abstract_specialize]: 0.00398092 [pack_expand]: 1.48e-05 [auto_monad]: 9.008e-05 [inline]: 1.65e-06 [pre_auto_parallel]: 1.091e-05 [pipeline_split]: 3.19e-06 [optimize]: 0.0614651, [35] [py_interpret_to_execute]: 3.93e-06 [rewriter_before_opt_a]: 0.00016236 [opt_a]: 0.0603, [4] [Cycle 1]: 0.0294665, [30] [expand_dump_flag]: 5.11e-06 [switch_simplify]: 2.33e-05 [a_1]: 0.00040386 [recompute_prepare]: 9.94001e-06 [updatestate_depend_eliminate]: 1.073e-05 [updatestate_assign_eliminate]: 6.71999e-06 [updatestate_loads_eliminate]: 6.17999e-06 [parameter_eliminate]: 5.28e-06 [a_2]: 7.643e-05 [accelerated_algorithm]: 5.37e-06 [pynative_shard]: 1.88e-06 [auto_parallel]: 3.52e-06 [parallel]: 9.63e-06 [merge_comm]: 4.62e-06 [allreduce_fusion]: 2.27e-06 [virtual_dataset]: 4.88e-06 [get_grad_eliminate_]: 4.14e-06 [virtual_output]: 3.91e-06 [merge_forward]: 8.86e-06 [cell_reuse_recompute_pass]: 1.18e-06 [cell_reuse_handle_not_recompute_node_pass]: 1.179e-05 [meta_fg_expand]: 0.00178041, [1] [Cycle 1]: 0.00046144, [1] [resolve]: 0.00044229 [after_resolve]: 2.03e-05 [a_after_grad]: 3.423e-05 [renormalize]: 0.0264764 [real_op_eliminate]: 2.389e-05 [auto_monad_grad]: 3.027e-05 [auto_monad_eliminator]: 4.746e-05 [cse]: 0.00011552 [a_3]: 0.00015858 [Cycle 2]: 0.0252434, [30] [expand_dump_flag]: 2.53e-06 [switch_simplify]: 5.936e-05 [a_1]: 0.00040096 [recompute_prepare]: 1.022e-05 [updatestate_depend_eliminate]: 1.148e-05 [updatestate_assign_eliminate]: 8.02e-06 [updatestate_loads_eliminate]: 7.58e-06 [parameter_eliminate]: 3.75e-06 [a_2]: 0.00011432 [accelerated_algorithm]: 1.234e-05 [pynative_shard]: 1.47e-06 [auto_parallel]: 4.15e-06 [parallel]: 3.97e-06 [merge_comm]: 2.83e-06 [allreduce_fusion]: 1.58e-06 [virtual_dataset]: 7.47e-06 
[get_grad_eliminate_]: 6.02999e-06 [virtual_output]: 5.39e-06 [merge_forward]: 9.69e-06 [cell_reuse_recompute_pass]: 4.99997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.507e-05 [meta_fg_expand]: 0.00469562, [3] [Cycle 1]: 0.00032721, [1] [resolve]: 0.00030912 [Cycle 1]: 0.00040983, [1] [resolve]: 0.00039188 [Cycle 1]: 0.00031828, [1] [resolve]: 0.00030043 [after_resolve]: 3.053e-05 [a_after_grad]: 5.056e-05 [renormalize]: 0.0191288 [real_op_eliminate]: 2.96e-05 [auto_monad_grad]: 3.429e-05 [auto_monad_eliminator]: 5.245e-05 [cse]: 0.00012139 [a_3]: 0.00019782 [Cycle 3]: 0.00248003, [30] [expand_dump_flag]: 2.71999e-06 [switch_simplify]: 6.269e-05 [a_1]: 0.00051968 [recompute_prepare]: 1.16e-05 [updatestate_depend_eliminate]: 1.286e-05 [updatestate_assign_eliminate]: 9.4e-06 [updatestate_loads_eliminate]: 9.35001e-06 [parameter_eliminate]: 3.43e-06 [a_2]: 0.00014801 [accelerated_algorithm]: 1.499e-05 [pynative_shard]: 1.6e-06 [auto_parallel]: 5.51e-06 [parallel]: 4.52e-06 [merge_comm]: 3.36e-06 [allreduce_fusion]: 1.75001e-06 [virtual_dataset]: 8.15e-06 [get_grad_eliminate_]: 7.44e-06 [virtual_output]: 6.62e-06 [merge_forward]: 1.148e-05 [cell_reuse_recompute_pass]: 6.69999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.871e-05 [meta_fg_expand]: 2.736e-05 [after_resolve]: 1.121e-05 [a_after_grad]: 1.333e-05 [renormalize]: 0.00121625 [real_op_eliminate]: 1.285e-05 [auto_monad_grad]: 5.53999e-06 [auto_monad_eliminator]: 2.328e-05 [cse]: 8.811e-05 [a_3]: 6.891e-05 [Cycle 4]: 0.00071075, [30] [expand_dump_flag]: 1.29e-06 [switch_simplify]: 8.44e-06 [a_1]: 0.00014645 [recompute_prepare]: 9.9e-06 [updatestate_depend_eliminate]: 1.301e-05 [updatestate_assign_eliminate]: 9.71001e-06 [updatestate_loads_eliminate]: 9.14e-06 [parameter_eliminate]: 1.92e-06 [a_2]: 0.00014547 [accelerated_algorithm]: 1.444e-05 [pynative_shard]: 1.48e-06 [auto_parallel]: 3.82e-06 [parallel]: 3.94e-06 [merge_comm]: 2.59e-06 [allreduce_fusion]: 1.71e-06 [virtual_dataset]: 7.96e-06 
[get_grad_eliminate_]: 6.93e-06 [virtual_output]: 6.8e-06 [merge_forward]: 1.142e-05 [cell_reuse_recompute_pass]: 5.00004e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.842e-05 [meta_fg_expand]: 7.55e-06 [after_resolve]: 1.071e-05 [a_after_grad]: 1.351e-05 [renormalize]: 8.99963e-08 [real_op_eliminate]: 7.03e-06 [auto_monad_grad]: 2.24e-06 [auto_monad_eliminator]: 2.04e-05 [cse]: 4.904e-05 [a_3]: 6.095e-05 [py_interpret_to_execute_after_opt_a]: 4.16e-06 [slice_cell_reuse_recomputed_activation]: 2.91e-06 [rewriter_after_opt_a]: 6.475e-05 [convert_after_rewriter]: 1.646e-05 [order_py_execute_after_rewriter]: 1.159e-05 [opt_b]: 0.00056342, [2] [Cycle 1]: 0.00048172, [7] [b_1]: 0.00042553 [b_2]: 3.88e-06 [updatestate_depend_eliminate]: 3.66e-06 [updatestate_assign_eliminate]: 2.22e-06 [updatestate_loads_eliminate]: 2.15e-06 [renormalize]: 2.69996e-07 [cse]: 1.124e-05 [Cycle 2]: 7.269e-05, [7] [b_1]: 3.379e-05 [b_2]: 2.1e-06 [updatestate_depend_eliminate]: 2.23e-06 [updatestate_assign_eliminate]: 1.95e-06 [updatestate_loads_eliminate]: 1.79e-06 [renormalize]: 7.0002e-08 [cse]: 6.24e-06 [cconv]: 2.292e-05 [opt_after_cconv]: 5.134e-05, [1] [Cycle 1]: 4.694e-05, [7] [c_1]: 5.63999e-06 [parameter_eliminate]: 2.04e-06 [updatestate_depend_eliminate]: 2.38e-06 [updatestate_assign_eliminate]: 1.9e-06 [updatestate_loads_eliminate]: 1.71e-06 [cse]: 6.38e-06 [renormalize]: 2.80001e-07 [remove_dup_value]: 1.03e-05 [tuple_transform]: 3.554e-05, [1] [Cycle 1]: 3.213e-05, [3] [d_1]: 1.473e-05 [d_2]: 5.99999e-06 [renormalize]: 1.59998e-07 [add_cache_embedding]: 1.132e-05 [add_recomputation]: 4.124e-05 [cse_after_recomputation]: 1.604e-05, [1] [Cycle 1]: 1.152e-05, [1] [cse]: 7.37e-06 [environ_conv]: 5.67e-06 [label_micro_interleaved_index]: 2.48e-06 [label_fine_grained_interleaved_index]: 3.13e-06 [assign_add_opt]: 2.13e-06 [slice_recompute_activation]: 2.32e-06 [micro_interleaved_order_control]: 1.95e-06 [full_micro_interleaved_order_control]: 1.89e-06 [comp_comm_scheduling]: 
2.62e-06 [reorder_send_recv_between_fp_bp]: 2.39e-06 [comm_op_add_attrs]: 1.17e-06 [add_comm_op_reuse_tag]: 1.44e-06 [overlap_opt_shard_in_pipeline]: 1.42e-06 [grouped_pairwise_exchange_alltoall]: 1.66e-06 [overlap_recompute_and_grad_model_parallel]: 1.97e-06 [overlap_grad_matmul_and_grad_allreduce]: 8.80005e-07 [split_matmul_comm_elemetwise]: 3.16e-06 [split_layernorm_comm]: 1.82e-06 [process_send_recv_for_ge]: 9.5e-07 [handle_group_info]: 1.35e-06 [auto_monad_reorder]: 1.661e-05 [get_jit_bprop_graph]: 6.60002e-07 [eliminate_special_op_node]: 0.00050786 [validate]: 2.81e-05 [distribtued_split]: 1.24e-06 [task_emit]: 0.00480452 [execute]: 8.36e-06 Sums parse : 0.001521s : 1.98% symbol_resolve.resolve : 0.012433s : 16.20% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000138s : 0.18% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.003981s : 5.19% pack_expand : 0.000015s : 0.02% auto_monad : 0.000090s : 0.12% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000011s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000162s : 0.21% optimize.opt_a.expand_dump_flag : 0.000012s : 0.02% optimize.opt_a.switch_simplify : 0.000154s : 0.20% optimize.opt_a.a_1 : 0.001471s : 1.92% optimize.opt_a.recompute_prepare : 0.000042s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000048s : 0.06% optimize.opt_a.updatestate_assign_eliminate : 0.000034s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000032s : 0.04% optimize.opt_a.parameter_eliminate : 0.000014s : 0.02% optimize.opt_a.a_2 : 0.000484s : 0.63% optimize.opt_a.accelerated_algorithm : 0.000047s : 0.06% optimize.opt_a.pynative_shard : 0.000006s : 0.01% optimize.opt_a.auto_parallel : 0.000017s : 0.02% optimize.opt_a.parallel : 0.000022s : 0.03% optimize.opt_a.merge_comm : 0.000013s : 0.02% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 
0.000028s : 0.04% optimize.opt_a.get_grad_eliminate_ : 0.000025s : 0.03% optimize.opt_a.virtual_output : 0.000023s : 0.03% optimize.opt_a.merge_forward : 0.000041s : 0.05% optimize.opt_a.cell_reuse_recompute_pass : 0.000003s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000064s : 0.08% optimize.opt_a.meta_fg_expand : 0.000035s : 0.05% optimize.opt_a.meta_fg_expand.resolve : 0.001444s : 1.88% optimize.opt_a.after_resolve : 0.000073s : 0.09% optimize.opt_a.a_after_grad : 0.000112s : 0.15% optimize.opt_a.renormalize : 0.046821s : 61.02% optimize.opt_a.real_op_eliminate : 0.000073s : 0.10% optimize.opt_a.auto_monad_grad : 0.000072s : 0.09% optimize.opt_a.auto_monad_eliminator : 0.000144s : 0.19% optimize.opt_a.cse : 0.000374s : 0.49% optimize.opt_a.a_3 : 0.000486s : 0.63% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000065s : 0.08% optimize.convert_after_rewriter : 0.000016s : 0.02% optimize.order_py_execute_after_rewriter : 0.000012s : 0.02% optimize.opt_b.b_1 : 0.000459s : 0.60% optimize.opt_b.b_2 : 0.000006s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000004s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000017s : 0.02% optimize.cconv : 0.000023s : 0.03% optimize.opt_after_cconv.c_1 : 0.000006s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000006s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.01% 
optimize.tuple_transform.d_1 : 0.000015s : 0.02% optimize.tuple_transform.d_2 : 0.000006s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000041s : 0.05% optimize.cse_after_recomputation.cse : 0.000007s : 0.01% optimize.environ_conv : 0.000006s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000003s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000002s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000017s : 0.02% get_jit_bprop_graph : 0.000001s : 0.00% eliminate_special_op_node : 0.000508s : 0.66% validate : 0.000028s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.004805s : 6.26% execute : 0.000008s : 0.01% Time group info: ------[substitution.] 
0.014079 383 0.02% : 0.000003s : 5: substitution.float_depend_g_call 0.08% : 0.000011s : 14: substitution.float_tuple_getitem_switch 93.39% : 0.013148s : 25: substitution.getattr_setattr_resolve 0.04% : 0.000005s : 3: substitution.graph_param_transform 0.02% : 0.000003s : 3: substitution.incorporate_call 0.01% : 0.000002s : 3: substitution.incorporate_call_switch 4.07% : 0.000573s : 59: substitution.inline 0.05% : 0.000007s : 10: substitution.less_batch_normalization 0.23% : 0.000032s : 23: substitution.meta_unpack_prepare 0.08% : 0.000012s : 11: substitution.minmaximum_grad 0.03% : 0.000004s : 5: substitution.partial_eliminate 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.04% : 0.000006s : 47: substitution.remove_not_recompute_node 0.40% : 0.000056s : 38: substitution.replace_applicator 0.06% : 0.000008s : 20: substitution.replace_old_param 0.03% : 0.000004s : 2: substitution.reset_defer_inline 0.05% : 0.000007s : 8: substitution.set_cell_output_no_recompute 0.05% : 0.000008s : 5: substitution.specialize_transform 0.06% : 0.000009s : 4: substitution.switch_simplify 0.08% : 0.000012s : 2: substitution.transpose_eliminate 0.28% : 0.000040s : 15: substitution.tuple_list_convert_item_index_to_positive 0.11% : 0.000015s : 15: substitution.tuple_list_get_item_const_eliminator 0.16% : 0.000022s : 15: substitution.tuple_list_get_item_depend_reorder 0.49% : 0.000069s : 33: substitution.tuple_list_get_item_eliminator 0.15% : 0.000022s : 15: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.046808 6 92.93% : 0.043498s : 3: renormalize.infer 7.07% : 0.003310s : 3: renormalize.specialize ------[replace.] 0.000680 68 49.63% : 0.000338s : 23: replace.getattr_setattr_resolve 28.08% : 0.000191s : 31: replace.inline 6.63% : 0.000045s : 2: replace.meta_unpack_prepare 8.09% : 0.000055s : 4: replace.switch_simplify 1.41% : 0.000010s : 2: replace.transpose_eliminate 6.17% : 0.000042s : 6: replace.tuple_list_get_item_eliminator ------[match.] 
0.013668 68 95.80% : 0.013094s : 23: match.getattr_setattr_resolve 3.76% : 0.000514s : 31: match.inline 0.14% : 0.000019s : 2: match.meta_unpack_prepare 0.06% : 0.000009s : 4: match.switch_simplify 0.09% : 0.000012s : 2: match.transpose_eliminate 0.15% : 0.000020s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.003677 69 68.28% : 0.002511s : 28: func_graph_cloner_run.FuncGraphClonerGraph 31.72% : 0.001167s : 41: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.017355 255 5.67% : 0.000985s : 104: opt.transform.opt_a 2.47% : 0.000428s : 92: opt.transform.opt_b 79.44% : 0.013786s : 10: opt.transform.opt_resolve 0.65% : 0.000113s : 1: opt.transforms.meta_unpack_prepare 11.57% : 0.002008s : 40: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.02% : 0.000004s : 2: opt.transforms.opt_b 0.11% : 0.000019s : 2: opt.transforms.opt_trans_graph 0.05% : 0.000009s : 3: opt.transforms.special_op_eliminate . 
TotalTime = 0.0850508, [20] [parse]: 0.00131358 [symbol_resolve]: 0.0125751, [1] [Cycle 1]: 0.0125126, [1] [resolve]: 0.0124901 [combine_like_graphs]: 1.26001e-06 [graph_reusing]: 3.13e-06 [meta_unpack_prepare]: 0.00016529 [pre_cconv]: 5.00004e-07 [abstract_specialize]: 0.00385726 [pack_expand]: 1.783e-05 [auto_monad]: 8.587e-05 [inline]: 1.67e-06 [pre_auto_parallel]: 9.95999e-06 [pipeline_split]: 3.03e-06 [optimize]: 0.0637314, [35] [py_interpret_to_execute]: 4.14e-06 [rewriter_before_opt_a]: 0.0001595 [opt_a]: 0.0625876, [4] [Cycle 1]: 0.030398, [30] [expand_dump_flag]: 3.61e-06 [switch_simplify]: 2.491e-05 [a_1]: 0.00068474 [recompute_prepare]: 8.07e-06 [updatestate_depend_eliminate]: 1.021e-05 [updatestate_assign_eliminate]: 6.66e-06 [updatestate_loads_eliminate]: 6.43e-06 [parameter_eliminate]: 4.82e-06 [a_2]: 7.149e-05 [accelerated_algorithm]: 5.08e-06 [pynative_shard]: 1.58e-06 [auto_parallel]: 4.18e-06 [parallel]: 6.58e-06 [merge_comm]: 3.74e-06 [allreduce_fusion]: 1.73e-06 [virtual_dataset]: 4.58e-06 [get_grad_eliminate_]: 4.19e-06 [virtual_output]: 4.11001e-06 [merge_forward]: 7.81e-06 [cell_reuse_recompute_pass]: 8.89995e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.095e-05 [meta_fg_expand]: 0.00179844, [1] [Cycle 1]: 0.00042429, [1] [resolve]: 0.00040614 [after_resolve]: 2.143e-05 [a_after_grad]: 3.845e-05 [renormalize]: 0.0271247 [real_op_eliminate]: 2.542e-05 [auto_monad_grad]: 2.999e-05 [auto_monad_eliminator]: 4.777e-05 [cse]: 0.00010684 [a_3]: 0.00015922 [Cycle 2]: 0.0257928, [30] [expand_dump_flag]: 2.37e-06 [switch_simplify]: 7.399e-05 [a_1]: 0.00084886 [recompute_prepare]: 8.87e-06 [updatestate_depend_eliminate]: 1.16e-05 [updatestate_assign_eliminate]: 8.11e-06 [updatestate_loads_eliminate]: 7.91e-06 [parameter_eliminate]: 3.62e-06 [a_2]: 0.00010962 [accelerated_algorithm]: 1.143e-05 [pynative_shard]: 1.3e-06 [auto_parallel]: 4.5e-06 [parallel]: 4.34e-06 [merge_comm]: 2.95e-06 [allreduce_fusion]: 1.52e-06 [virtual_dataset]: 7.09e-06 
[get_grad_eliminate_]: 6.06e-06 [virtual_output]: 5.79e-06 [merge_forward]: 9.57e-06 [cell_reuse_recompute_pass]: 4.69998e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.428e-05 [meta_fg_expand]: 0.00474022, [3] [Cycle 1]: 0.00033418, [1] [resolve]: 0.00031585 [Cycle 1]: 0.00042122, [1] [resolve]: 0.00040318 [Cycle 1]: 0.00032362, [1] [resolve]: 0.00030529 [after_resolve]: 3.115e-05 [a_after_grad]: 6.083e-05 [renormalize]: 0.0192321 [real_op_eliminate]: 2.976e-05 [auto_monad_grad]: 3.285e-05 [auto_monad_eliminator]: 5.189e-05 [cse]: 0.00011806 [a_3]: 0.00019247 [Cycle 3]: 0.00308155, [30] [expand_dump_flag]: 2.27e-06 [switch_simplify]: 7.948e-05 [a_1]: 0.00108309 [recompute_prepare]: 1.053e-05 [updatestate_depend_eliminate]: 1.284e-05 [updatestate_assign_eliminate]: 9.61e-06 [updatestate_loads_eliminate]: 9.32e-06 [parameter_eliminate]: 3.57e-06 [a_2]: 0.0001413 [accelerated_algorithm]: 1.384e-05 [pynative_shard]: 1.36e-06 [auto_parallel]: 4.5e-06 [parallel]: 4.39e-06 [merge_comm]: 3.04e-06 [allreduce_fusion]: 1.72e-06 [virtual_dataset]: 8.42e-06 [get_grad_eliminate_]: 7.76e-06 [virtual_output]: 7.15e-06 [merge_forward]: 1.082e-05 [cell_reuse_recompute_pass]: 4.60001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.783e-05 [meta_fg_expand]: 2.749e-05 [after_resolve]: 1.298e-05 [a_after_grad]: 1.887e-05 [renormalize]: 0.00122115 [real_op_eliminate]: 1.376e-05 [auto_monad_grad]: 5.69e-06 [auto_monad_eliminator]: 2.307e-05 [cse]: 8.916e-05 [a_3]: 6.707e-05 [Cycle 4]: 0.000932, [30] [expand_dump_flag]: 1.28e-06 [switch_simplify]: 7.92e-06 [a_1]: 0.00037057 [recompute_prepare]: 1.007e-05 [updatestate_depend_eliminate]: 1.315e-05 [updatestate_assign_eliminate]: 9.91e-06 [updatestate_loads_eliminate]: 9.64e-06 [parameter_eliminate]: 1.92e-06 [a_2]: 0.00014123 [accelerated_algorithm]: 1.358e-05 [pynative_shard]: 1.4e-06 [auto_parallel]: 3.85e-06 [parallel]: 3.75e-06 [merge_comm]: 2.54e-06 [allreduce_fusion]: 1.68e-06 [virtual_dataset]: 8.34e-06 [get_grad_eliminate_]: 
7.53e-06 [virtual_output]: 7.37001e-06 [merge_forward]: 1.135e-05 [cell_reuse_recompute_pass]: 4.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.81e-05 [meta_fg_expand]: 7.6e-06 [after_resolve]: 1.064e-05 [a_after_grad]: 1.831e-05 [renormalize]: 7.99992e-08 [real_op_eliminate]: 7.32e-06 [auto_monad_grad]: 2.09e-06 [auto_monad_eliminator]: 2.05e-05 [cse]: 4.878e-05 [a_3]: 5.948e-05 [py_interpret_to_execute_after_opt_a]: 4.22e-06 [slice_cell_reuse_recomputed_activation]: 2.75e-06 [rewriter_after_opt_a]: 6.346e-05 [convert_after_rewriter]: 1.654e-05 [order_py_execute_after_rewriter]: 1.151e-05 [opt_b]: 0.00054425, [2] [Cycle 1]: 0.00046123, [7] [b_1]: 0.00040669 [b_2]: 3.37e-06 [updatestate_depend_eliminate]: 3.52e-06 [updatestate_assign_eliminate]: 2.44e-06 [updatestate_loads_eliminate]: 2.02e-06 [renormalize]: 3.6e-07 [cse]: 1.038e-05 [Cycle 2]: 7.375e-05, [7] [b_1]: 3.525e-05 [b_2]: 2.19e-06 [updatestate_depend_eliminate]: 2.22e-06 [updatestate_assign_eliminate]: 1.84e-06 [updatestate_loads_eliminate]: 1.71e-06 [renormalize]: 7.0002e-08 [cse]: 6.41e-06 [cconv]: 2.117e-05 [opt_after_cconv]: 5.669e-05, [1] [Cycle 1]: 5.219e-05, [7] [c_1]: 1.253e-05 [parameter_eliminate]: 1.85e-06 [updatestate_depend_eliminate]: 2.13e-06 [updatestate_assign_eliminate]: 1.72e-06 [updatestate_loads_eliminate]: 1.72e-06 [cse]: 6.35999e-06 [renormalize]: 2.19996e-07 [remove_dup_value]: 1.015e-05 [tuple_transform]: 4.333e-05, [1] [Cycle 1]: 3.979e-05, [3] [d_1]: 2.248e-05 [d_2]: 5.86e-06 [renormalize]: 1.60006e-07 [add_cache_embedding]: 1.078e-05 [add_recomputation]: 3.827e-05 [cse_after_recomputation]: 1.708e-05, [1] [Cycle 1]: 1.205e-05, [1] [cse]: 7.31e-06 [environ_conv]: 6.29e-06 [label_micro_interleaved_index]: 2.45e-06 [label_fine_grained_interleaved_index]: 2.13e-06 [assign_add_opt]: 1.82e-06 [slice_recompute_activation]: 2.71e-06 [micro_interleaved_order_control]: 1.68999e-06 [full_micro_interleaved_order_control]: 2.06e-06 [comp_comm_scheduling]: 2.14e-06 
[reorder_send_recv_between_fp_bp]: 2.87e-06 [comm_op_add_attrs]: 9.20001e-07 [add_comm_op_reuse_tag]: 9.99993e-07 [overlap_opt_shard_in_pipeline]: 1.37999e-06 [grouped_pairwise_exchange_alltoall]: 1.39e-06 [overlap_recompute_and_grad_model_parallel]: 1.57e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.89994e-07 [split_matmul_comm_elemetwise]: 1.9e-06 [split_layernorm_comm]: 1.53e-06 [process_send_recv_for_ge]: 8.69994e-07 [handle_group_info]: 8.49999e-07 [auto_monad_reorder]: 1.517e-05 [get_jit_bprop_graph]: 4.01e-06 [eliminate_special_op_node]: 0.0005238 [validate]: 2.671e-05 [distribtued_split]: 1.31e-06 [task_emit]: 0.0024939 [execute]: 6.21e-06 Sums parse : 0.001314s : 1.72% symbol_resolve.resolve : 0.012490s : 16.34% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000165s : 0.22% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.003857s : 5.05% pack_expand : 0.000018s : 0.02% auto_monad : 0.000086s : 0.11% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000010s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000159s : 0.21% optimize.opt_a.expand_dump_flag : 0.000010s : 0.01% optimize.opt_a.switch_simplify : 0.000186s : 0.24% optimize.opt_a.a_1 : 0.002987s : 3.91% optimize.opt_a.recompute_prepare : 0.000038s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000048s : 0.06% optimize.opt_a.updatestate_assign_eliminate : 0.000034s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000033s : 0.04% optimize.opt_a.parameter_eliminate : 0.000014s : 0.02% optimize.opt_a.a_2 : 0.000464s : 0.61% optimize.opt_a.accelerated_algorithm : 0.000044s : 0.06% optimize.opt_a.pynative_shard : 0.000006s : 0.01% optimize.opt_a.auto_parallel : 0.000017s : 0.02% optimize.opt_a.parallel : 0.000019s : 0.02% optimize.opt_a.merge_comm : 0.000012s : 0.02% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 
0.000028s : 0.04% optimize.opt_a.get_grad_eliminate_ : 0.000026s : 0.03% optimize.opt_a.virtual_output : 0.000024s : 0.03% optimize.opt_a.merge_forward : 0.000040s : 0.05% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000061s : 0.08% optimize.opt_a.meta_fg_expand : 0.000035s : 0.05% optimize.opt_a.meta_fg_expand.resolve : 0.001430s : 1.87% optimize.opt_a.after_resolve : 0.000076s : 0.10% optimize.opt_a.a_after_grad : 0.000136s : 0.18% optimize.opt_a.renormalize : 0.047578s : 62.25% optimize.opt_a.real_op_eliminate : 0.000076s : 0.10% optimize.opt_a.auto_monad_grad : 0.000071s : 0.09% optimize.opt_a.auto_monad_eliminator : 0.000143s : 0.19% optimize.opt_a.cse : 0.000363s : 0.47% optimize.opt_a.a_3 : 0.000478s : 0.63% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000063s : 0.08% optimize.convert_after_rewriter : 0.000017s : 0.02% optimize.order_py_execute_after_rewriter : 0.000012s : 0.02% optimize.opt_b.b_1 : 0.000442s : 0.58% optimize.opt_b.b_2 : 0.000006s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000006s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000004s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000004s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000017s : 0.02% optimize.cconv : 0.000021s : 0.03% optimize.opt_after_cconv.c_1 : 0.000013s : 0.02% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000006s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.01% 
optimize.tuple_transform.d_1 : 0.000022s : 0.03% optimize.tuple_transform.d_2 : 0.000006s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000038s : 0.05% optimize.cse_after_recomputation.cse : 0.000007s : 0.01% optimize.environ_conv : 0.000006s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000003s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000003s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000015s : 0.02% get_jit_bprop_graph : 0.000004s : 0.01% eliminate_special_op_node : 0.000524s : 0.69% validate : 0.000027s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.002494s : 3.26% execute : 0.000006s : 0.01% Time group info: ------[substitution.] 
0.014169 446 0.03% : 0.000004s : 6: substitution.float_depend_g_call 0.07% : 0.000010s : 14: substitution.float_tuple_getitem_switch 93.47% : 0.013243s : 25: substitution.getattr_setattr_resolve 0.04% : 0.000005s : 3: substitution.graph_param_transform 0.02% : 0.000003s : 3: substitution.incorporate_call 0.01% : 0.000001s : 3: substitution.incorporate_call_switch 3.91% : 0.000554s : 65: substitution.inline 0.04% : 0.000006s : 10: substitution.less_batch_normalization 0.28% : 0.000039s : 42: substitution.meta_unpack_prepare 0.09% : 0.000013s : 16: substitution.minmaximum_grad 0.03% : 0.000004s : 6: substitution.partial_eliminate 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.04% : 0.000006s : 47: substitution.remove_not_recompute_node 0.41% : 0.000058s : 44: substitution.replace_applicator 0.05% : 0.000007s : 20: substitution.replace_old_param 0.02% : 0.000003s : 2: substitution.reset_defer_inline 0.04% : 0.000006s : 8: substitution.set_cell_output_no_recompute 0.05% : 0.000008s : 5: substitution.specialize_transform 0.05% : 0.000007s : 4: substitution.switch_simplify 0.06% : 0.000009s : 2: substitution.transpose_eliminate 0.30% : 0.000043s : 20: substitution.tuple_list_convert_item_index_to_positive 0.13% : 0.000018s : 20: substitution.tuple_list_get_item_const_eliminator 0.18% : 0.000026s : 20: substitution.tuple_list_get_item_depend_reorder 0.50% : 0.000071s : 38: substitution.tuple_list_get_item_eliminator 0.18% : 0.000025s : 20: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.047564 6 93.06% : 0.044265s : 3: renormalize.infer 6.94% : 0.003300s : 3: renormalize.specialize ------[replace.] 0.000668 68 49.07% : 0.000328s : 23: replace.getattr_setattr_resolve 27.99% : 0.000187s : 31: replace.inline 6.91% : 0.000046s : 2: replace.meta_unpack_prepare 8.11% : 0.000054s : 4: replace.switch_simplify 1.82% : 0.000012s : 2: replace.transpose_eliminate 6.10% : 0.000041s : 6: replace.tuple_list_get_item_eliminator ------[match.] 
0.013740 68 96.04% : 0.013196s : 23: match.getattr_setattr_resolve 3.57% : 0.000491s : 31: match.inline 0.14% : 0.000020s : 2: match.meta_unpack_prepare 0.05% : 0.000007s : 4: match.switch_simplify 0.06% : 0.000009s : 2: match.transpose_eliminate 0.13% : 0.000017s : 6: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.003645 69 68.40% : 0.002493s : 28: func_graph_cloner_run.FuncGraphClonerGraph 31.60% : 0.001152s : 41: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.018871 585 0.76% : 0.000143s : 2: opt.transform.meta_unpack_prepare 23.53% : 0.004441s : 461: opt.transform.opt_a 0.05% : 0.000009s : 7: opt.transform.opt_after_cconv 2.20% : 0.000415s : 94: opt.transform.opt_b 73.29% : 0.013830s : 10: opt.transform.opt_resolve 0.13% : 0.000024s : 8: opt.transform.opt_trans_graph 0.05% : 0.000009s : 3: opt.transform.special_op_eliminate . ============================== 2 passed in 21.08s ============================== [TRACE] GE(170035,python3.7):2024-01-11-05:39:48.455.574 [status:INIT] [ge_api.cc:463]170035 ~Session:Start to destruct session. [TRACE] GE(170035,python3.7):2024-01-11-05:39:48.455.684 [status:RUNNING] [ge_api.cc:475]170035 ~Session:Session id is 0 [TRACE] GE(170035,python3.7):2024-01-11-05:39:48.455.696 [status:RUNNING] [ge_api.cc:476]170035 ~Session:Destroying session [TRACE] GE(170035,python3.7):2024-01-11-05:39:48.456.819 [status:STOP] [ge_api.cc:491]170035 ~Session:Session Destructor finished [TRACE] GE(170035,python3.7):2024-01-11-05:39:48.456.857 [status:INIT] [ge_api.cc:301]170035 GEFinalize:GEFinalize start [INFO] GE(170035,python3.7):2024-01-11-05:39:48.456.949 [execution_runtime.cc:80][EVENT]170035 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(170035,python3.7):2024-01-11-05:39:48.456.971 [execution_runtime.cc:92][EVENT]170035 FinalizeExecutionRuntime:Execution runtime finalized. 
[TRACE] GE(170035,python3.7):2024-01-11-05:39:48.456.982 [status:RUNNING] [ge_api.cc:313]170035 GEFinalize:Finalizing environment
[INFO] TUNE(170035,python3.7):2024-01-11-05:39:48.814.894 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:170035]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]"
[INFO] TUNE(170035,python3.7):2024-01-11-05:39:48.814.953 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:170035]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!"
[INFO] GE(170035,python3.7):2024-01-11-05:39:48.816.581 [gelib.cc:324][EVENT]170035 SystemFinalize:Online infer finalize GELib success.
[TRACE] GE(170035,python3.7):2024-01-11-05:39:49.755.711 [status:STOP] [ge_api.cc:341]170035 GEFinalize:GEFinalize finished
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.262.214 [process_mode_manager.cpp:184][Close][tid:170035] [TsdClient] Close [deviceId=2][sessionId=1] hccp and computer enter
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.262.291 [version_verify.cpp:112][SpecialFeatureCheck][tid:170035] VersionVerify: previous type[7], supported
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.262.347 [process_mode_manager.cpp:192][Close][tid:170035] [TsdClient][deviceId=2] [sessionId=1] wait hccp and computer process close respond
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.293.585 [process_mode_manager.cpp:197][Close][tid:170035] [TsdClient][logicDeviceId_=2]has recv close hccp and computer process respond
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.293.611 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:170035] enter into CloseInHost deviceid[2]
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.293.622 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:170035] host cpu not support
[INFO] TDT(170035,python3.7):2024-01-11-05:39:50.293.671 [process_mode_manager.cpp:208][Close][tid:170035] [TsdClient][deviceId=2] [sessionId=1] close hccp and computer process success
[INFO] ATRACE(170035,python3.7):2024-01-11-05:39:50.293.685 [atrace_api.c:93](tid:170035) AtraceDestroy start
[INFO] ATRACE(170035,python3.7):2024-01-11-05:39:50.293.702 [atrace_api.c:95](tid:170035) AtraceDestroy end
[INFO] PROFILING(170035,python3.7):2024-01-11-05:39:50.293.728 [msprofiler_impl.cpp:156] >>> (tid:170035) ProfNotifySetDevice called, is open: 0, devId: 2
[INFO] RUNTIME(170035,python3.7):2024-01-11-05:39:52.155.868 [runtime.cc:1737] 170035 ~Runtime: deconstruct runtime.