============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_004/sault/config/pytest.ini plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3 [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:11.192.467 [trace_attr.c:105](tid:171751) platform is 1. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:11.192.666 [trace_recorder.c:114](tid:171751) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:11.192.696 [trace_signal.c:133](tid:171751) register signal handler for signo 2 succeed. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:11.192.708 [trace_signal.c:133](tid:171751) register signal handler for signo 15 succeed. [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:11.611.604 [runtime.cc:1159] 171751 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:11.611.687 [runtime.cc:4719] 171751 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_reshape.py [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.013.745 [process_mode_manager.cpp:109][OpenProcess][tid:171751] [ProcessModeManager] enter into open process deviceId[3] rankSize[0] [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.272 [process_mode_manager.cpp:379][InitTsdClient][tid:171751] [TsdClient] deviceId[3] begin to init hdc client [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.401 [version_verify.cpp:34][SetVersionInfo][tid:171751] VersionVerify: send client version to server [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.430 [version_verify.cpp:50][SetVersionInfo][tid:171751] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.443 [version_verify.cpp:50][SetVersionInfo][tid:171751] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.753 [version_verify.cpp:66][PeerVersionCheck][tid:171751] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.767 [version_verify.cpp:87][ParseVersionInfo][tid:171751] VersionVerify: pass client version info success [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.776 [hdc_client.cpp:276][CheckHdcConnection][tid:171751] Service[2] create hdc success [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.790 [version_verify.cpp:120][SpecialFeatureCheck][tid:171751] VersionVerify: new type[35], supported [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.836 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:171751] [TsdClient][deviceId=3] [sessionId=1] wait package info respond [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.016.947 [process_mode_manager.cpp:379][InitTsdClient][tid:171751] [TsdClient] deviceId[3] begin to init hdc client [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.051 [version_verify.cpp:34][SetVersionInfo][tid:171751] VersionVerify: send client version to server [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.064 [version_verify.cpp:50][SetVersionInfo][tid:171751] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.074 [version_verify.cpp:50][SetVersionInfo][tid:171751] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.201 [version_verify.cpp:66][PeerVersionCheck][tid:171751] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.213 [version_verify.cpp:87][ParseVersionInfo][tid:171751] VersionVerify: pass client version info success [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.221 [hdc_client.cpp:276][CheckHdcConnection][tid:171751] Service[2] create hdc success [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.232 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:171751] [TsdClient] tsd get process sign successfully, procpid[171751] signSize[48] [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.263 [version_verify.cpp:112][SpecialFeatureCheck][tid:171751] VersionVerify: previous type[6], supported [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.017.284 [process_mode_manager.cpp:126][OpenProcess][tid:171751] [ProcessModeManager] deviceId[3] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.254.670 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:171751] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.254.704 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:171751] enter into OpenInHost deviceid[3] [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.254.714 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:171751] host cpu not support [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.254.723 [process_mode_manager.cpp:156][OpenProcess][tid:171751] [TsdClient][deviceId=3] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:16.257.388 [device.cc:340] 171751 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:16.271.904 [npu_driver.cc:5428] 172975 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:16.271.944 [atrace_api.c:28](tid:171751) AtraceCreate start [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:16.272.030 [trace_rb_log.c:84](tid:171751) [RUNTIME_ATRACE_DEV3_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:16.272.044 [atrace_api.c:32](tid:171751) AtraceCreate end [INFO] TDT(171751,python3.7):2024-01-11-05:24:16.272.058 [client_manager.cpp:157][SetProfilingCallback][tid:171751] [TsdClient] set profiling callback success [TRACE] GE(171751,python3.7):2024-01-11-05:24:16.424.926 [status:INIT] [ge_api.cc:144]171751 GEInitializeImpl:GEInitialize start [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:16.633.412 [msprofiler_impl.cpp:156] >>> (tid:171751) ProfNotifySetDevice called, is open: 1, devId: 3 [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:16.633.558 [platform.cpp:38] >>> (tid:171751) Profiling platform version: 1.0. [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:16.633.573 [ai_drv_dev_api.cpp:384] >>> (tid:171751) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(171751,python3.7):2024-01-11-05:24:16.683.373 [status:RUNNING] [ge_api.cc:211]171751 GEInitializeImpl:Initializing environment [INFO] GE(171751,python3.7):2024-01-11-05:24:16.683.439 [gelib.cc:98][EVENT]171751 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(171751,python3.7):2024-01-11-05:24:16.683.711 [gelib.cc:307][EVENT]171751 SystemInitialize:Online infer init GELib success, device id :3 [INFO] DVPP(171751,python3.7):2024-01-11-05:24:17.040.079 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:171751]dvpp engine do not support [INFO] TUNE(171751,python3.7):2024-01-11-05:24:17.043.508 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:171751]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(171751,python3.7):2024-01-11-05:24:17.043.548 [handle_manager.cpp:115][CANNKB][Tid:171751]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(171751,python3.7):2024-01-11-05:24:17.043.609 [handle_manager.cpp:407][CANNKB][Tid:171751]"Init functions of loading dynamic python lib end!" [INFO] TUNE(171751,python3.7):2024-01-11-05:24:17.043.619 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:171751]"CANN_KB_Py has already been initialized." [INFO] TUNE(171751,python3.7):2024-01-11-05:24:17.043.714 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:171751]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(171751,python3.7):2024-01-11-05:24:29.024.239 [plugin_manager.cc:42][171751]hcom running normal mode. [INFO] DVPP(171751,python3.7):2024-01-11-05:24:29.024.866 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:171751]dvpp ops kernel info store do not support [INFO] DVPP(171751,python3.7):2024-01-11-05:24:29.025.041 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:171751]dvpp graph optimizer do not support [INFO] DVPP(171751,python3.7):2024-01-11-05:24:29.567.232 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:171751]dvpp ops kernel builder do not support [INFO] GE(171751,python3.7):2024-01-11-05:24:29.575.926 [gelib.cc:169][EVENT]171751 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12892436] micro second. [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.661.382 [status:STOP] [ge_api.cc:255]171751 GEInitializeImpl:GEInitialize finished [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.661.526 [status:INIT] [ge_api.cc:398]171751 Session:Start to construct session. [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.661.544 [status:RUNNING] [ge_api.cc:408]171751 Session:Creating session [INFO] GE(171751,python3.7):2024-01-11-05:24:29.661.948 [graph_var_manager.cc:1445][EVENT]171751 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(171751,python3.7):2024-01-11-05:24:29.661.965 [graph_var_manager.cc:1424][EVENT]171751 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:29.662.287 [msprofiler_impl.cpp:156] >>> (tid:171751) ProfNotifySetDevice called, is open: 1, devId: 3 [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.663.130 [status:RUNNING] [ge_api.cc:411]171751 Session:Session id is 0 [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.663.151 [status:STOP] [ge_api.cc:420]171751 Session:Session Constructor finished [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:29.673.108 [platform.cpp:38] >>> (tid:171751) Profiling platform version: 1.0. [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:29.673.138 [ai_drv_dev_api.cpp:384] >>> (tid:171751) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(171751,python3.7):2024-01-11-05:24:29.673.303 [status:INIT] [ge_api.cc:144]171751 GEInitializeImpl:GEInitialize start TotalTime = 0.466823, [20] [parse]: 0.235591 [symbol_resolve]: 0.0296269, [1] [Cycle 1]: 0.0295495, [1] [resolve]: 0.0295231 [combine_like_graphs]: 9.04e-06 [graph_reusing]: 3.24e-06 [meta_unpack_prepare]: 0.00014853 [pre_cconv]: 3.45999e-06 [abstract_specialize]: 0.00508895 [pack_expand]: 1.854e-05 [auto_monad]: 0.00011346 [inline]: 1.4e-06 [pre_auto_parallel]: 1.246e-05 [pipeline_split]: 2.84e-06 [optimize]: 0.188521, [35] [py_interpret_to_execute]: 3.9e-06 [rewriter_before_opt_a]: 0.00021326 [opt_a]: 0.187031, [4] [Cycle 1]: 0.116167, [30] [expand_dump_flag]: 4.46e-06 [switch_simplify]: 2.576e-05 [a_1]: 0.00043908 [recompute_prepare]: 1.022e-05 [updatestate_depend_eliminate]: 1.079e-05 [updatestate_assign_eliminate]: 7.37001e-06 [updatestate_loads_eliminate]: 6.98e-06 [parameter_eliminate]: 4.89999e-06 [a_2]: 8.507e-05 [accelerated_algorithm]: 8.84999e-06 [pynative_shard]: 1.98e-06 [auto_parallel]: 3.03e-06 [parallel]: 1.583e-05 [merge_comm]: 9.2e-06 [allreduce_fusion]: 2.14e-06 [virtual_dataset]: 6.22001e-06 [get_grad_eliminate_]: 4.85e-06 [virtual_output]: 4.44e-06 [merge_forward]: 9.25e-06 [cell_reuse_recompute_pass]: 8.49999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.262e-05 [meta_fg_expand]: 0.0701696, [1] [Cycle 1]: 0.0329789, [1] [resolve]: 0.0329562 [after_resolve]: 8.112e-05 [a_after_grad]: 0.00026814 [renormalize]: 0.0442643 [real_op_eliminate]: 3.148e-05 [auto_monad_grad]: 4.649e-05 [auto_monad_eliminator]: 6.2e-05 [cse]: 0.00015222 [a_3]: 0.00023356 [Cycle 2]: 0.0600295, [30] [expand_dump_flag]: 2.91e-06 [switch_simplify]: 0.00010191 [a_1]: 0.00055637 [recompute_prepare]: 1.251e-05 [updatestate_depend_eliminate]: 1.253e-05 [updatestate_assign_eliminate]: 9.68e-06 [updatestate_loads_eliminate]: 8.88e-06 [parameter_eliminate]: 3.85e-06 [a_2]: 0.00013954 [accelerated_algorithm]: 1.367e-05 [pynative_shard]: 1.49e-06 [auto_parallel]: 4.88e-06 [parallel]: 4.05e-06 [merge_comm]: 2.68999e-06 [allreduce_fusion]: 1.42e-06 [virtual_dataset]: 8.82e-06 [get_grad_eliminate_]: 7.41e-06 [virtual_output]: 6.98e-06 [merge_forward]: 1.075e-05 [cell_reuse_recompute_pass]: 4.30002e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.758e-05 [meta_fg_expand]: 0.0244739, [4] [Cycle 1]: 0.001634, [1] [resolve]: 0.00161305 [Cycle 1]: 0.00034346, [1] [resolve]: 0.00032398 [Cycle 1]: 0.00290611, [1] [resolve]: 0.00288536 [Cycle 1]: 0.00033497, [1] [resolve]: 0.00031538 [after_resolve]: 0.0001296 [a_after_grad]: 0.00034614 [renormalize]: 0.0331016 [real_op_eliminate]: 4.22e-05 [auto_monad_grad]: 0.00016857 [auto_monad_eliminator]: 8.818e-05 [cse]: 0.00021605 [a_3]: 0.00034463 [Cycle 3]: 0.00367204, [30] [expand_dump_flag]: 3.73001e-06 [switch_simplify]: 0.00013067 [a_1]: 0.00084873 [recompute_prepare]: 1.501e-05 [updatestate_depend_eliminate]: 1.683e-05 [updatestate_assign_eliminate]: 1.226e-05 [updatestate_loads_eliminate]: 1.146e-05 [parameter_eliminate]: 3.63999e-06 [a_2]: 0.00019028 [accelerated_algorithm]: 1.821e-05 [pynative_shard]: 1.09e-06 [auto_parallel]: 4.19e-06 [parallel]: 3.75001e-06 [merge_comm]: 3.32e-06 [allreduce_fusion]: 1.73e-06 [virtual_dataset]: 1.07e-05 [get_grad_eliminate_]: 9.53e-06 [virtual_output]: 9.51e-06 [merge_forward]: 1.407e-05 [cell_reuse_recompute_pass]: 4.20005e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.338e-05 [meta_fg_expand]: 3.713e-05 [after_resolve]: 1.357e-05 [a_after_grad]: 1.615e-05 [renormalize]: 0.00185067 [real_op_eliminate]: 1.629e-05 [auto_monad_grad]: 6.07e-06 [auto_monad_eliminator]: 2.675e-05 [cse]: 0.00012463 [a_3]: 8.999e-05 [Cycle 4]: 0.00088038, [30] [expand_dump_flag]: 1.45e-06 [switch_simplify]: 1.104e-05 [a_1]: 0.00018566 [recompute_prepare]: 1.242e-05 [updatestate_depend_eliminate]: 1.667e-05 [updatestate_assign_eliminate]: 1.251e-05 [updatestate_loads_eliminate]: 1.191e-05 [parameter_eliminate]: 2.22e-06 [a_2]: 0.00018711 [accelerated_algorithm]: 1.791e-05 [pynative_shard]: 1.46e-06 [auto_parallel]: 3.65e-06 [parallel]: 3.62e-06 [merge_comm]: 2.68e-06 [allreduce_fusion]: 1.7e-06 [virtual_dataset]: 1.086e-05 [get_grad_eliminate_]: 9.69e-06 [virtual_output]: 8.95e-06 [merge_forward]: 1.452e-05 [cell_reuse_recompute_pass]: 4.1e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.318e-05 [meta_fg_expand]: 9.79e-06 [after_resolve]: 1.272e-05 [a_after_grad]: 1.705e-05 [renormalize]: 7.99992e-08 [real_op_eliminate]: 1.031e-05 [auto_monad_grad]: 2.26001e-06 [auto_monad_eliminator]: 2.397e-05 [cse]: 6.597e-05 [a_3]: 8.101e-05 [py_interpret_to_execute_after_opt_a]: 4.51e-06 [slice_cell_reuse_recomputed_activation]: 2.58e-06 [rewriter_after_opt_a]: 8.367e-05 [convert_after_rewriter]: 1.874e-05 [order_py_execute_after_rewriter]: 1.357e-05 [opt_b]: 0.00070653, [2] [Cycle 1]: 0.00059807, [7] [b_1]: 0.00052978 [b_2]: 4.94e-06 [updatestate_depend_eliminate]: 4.66e-06 [updatestate_assign_eliminate]: 3.32e-06 [updatestate_loads_eliminate]: 3.14999e-06 [renormalize]: 4.19997e-07 [cse]: 1.762e-05 [Cycle 2]: 9.859e-05, [7] [b_1]: 5.098e-05 [b_2]: 3.31e-06 [updatestate_depend_eliminate]: 3.04e-06 [updatestate_assign_eliminate]: 2.53e-06 [updatestate_loads_eliminate]: 3.06e-06 [renormalize]: 9.00036e-08 [cse]: 1.139e-05 [cconv]: 2.234e-05 [opt_after_cconv]: 5.856e-05, [1] [Cycle 1]: 5.438e-05, [7] [c_1]: 7.29e-06 [parameter_eliminate]: 1.89e-06 [updatestate_depend_eliminate]: 2.94e-06 [updatestate_assign_eliminate]: 2.51e-06 [updatestate_loads_eliminate]: 2.41e-06 [cse]: 1.144e-05 [renormalize]: 2.59999e-07 [remove_dup_value]: 1.317e-05 [tuple_transform]: 4.436e-05, [1] [Cycle 1]: 4.081e-05, [3] [d_1]: 1.959e-05 [d_2]: 9e-06 [renormalize]: 1.70003e-07 [add_cache_embedding]: 1.094e-05 [add_recomputation]: 6.014e-05 [cse_after_recomputation]: 2.213e-05, [1] [Cycle 1]: 1.771e-05, [1] [cse]: 1.286e-05 [environ_conv]: 2.192e-05 [label_micro_interleaved_index]: 2.79e-06 [label_fine_grained_interleaved_index]: 2.12e-06 [assign_add_opt]: 1.50999e-06 [slice_recompute_activation]: 2.42001e-06 [micro_interleaved_order_control]: 1.8e-06 [full_micro_interleaved_order_control]: 1.79e-06 [comp_comm_scheduling]: 2.12e-06 [reorder_send_recv_between_fp_bp]: 2.1e-06 [comm_op_add_attrs]: 1.12e-06 [add_comm_op_reuse_tag]: 9.59997e-07 [overlap_opt_shard_in_pipeline]: 1.09e-06 [grouped_pairwise_exchange_alltoall]: 1.4e-06 [overlap_recompute_and_grad_model_parallel]: 1.78e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.7e-07 [split_matmul_comm_elemetwise]: 2.69e-06 [split_layernorm_comm]: 1.81e-06 [process_send_recv_for_ge]: 2.68e-06 [handle_group_info]: 9.79999e-07 [auto_monad_reorder]: 2.531e-05 [get_jit_bprop_graph]: 4.60001e-07 [eliminate_special_op_node]: 0.00050742 [validate]: 5.703e-05 [distribtued_split]: 1.27e-06 [task_emit]: 0.00684166 [execute]: 7.73001e-06 Sums parse : 0.235591s : 58.50% symbol_resolve.resolve : 0.029523s : 7.33% combine_like_graphs : 0.000009s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000149s : 0.04% pre_cconv : 0.000003s : 0.00% abstract_specialize : 0.005089s : 1.26% pack_expand : 0.000019s : 0.00% auto_monad : 0.000113s : 0.03% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000012s : 0.00% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000213s : 0.05% optimize.opt_a.expand_dump_flag : 0.000013s : 0.00% optimize.opt_a.switch_simplify : 0.000269s : 0.07% optimize.opt_a.a_1 : 0.002030s : 0.50% optimize.opt_a.recompute_prepare : 0.000050s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000057s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000042s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000039s : 0.01% optimize.opt_a.parameter_eliminate : 0.000015s : 0.00% optimize.opt_a.a_2 : 0.000602s : 0.15% optimize.opt_a.accelerated_algorithm : 0.000059s : 0.01% optimize.opt_a.pynative_shard : 0.000006s : 0.00% optimize.opt_a.auto_parallel : 0.000016s : 0.00% optimize.opt_a.parallel : 0.000027s : 0.01% optimize.opt_a.merge_comm : 0.000018s : 0.00% optimize.opt_a.allreduce_fusion : 0.000007s : 0.00% optimize.opt_a.virtual_dataset : 0.000037s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000031s : 0.01% optimize.opt_a.virtual_output : 0.000030s : 0.01% optimize.opt_a.merge_forward : 0.000049s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000077s : 0.02% optimize.opt_a.meta_fg_expand : 0.000047s : 0.01% optimize.opt_a.meta_fg_expand.resolve : 0.038094s : 9.46% optimize.opt_a.after_resolve : 0.000237s : 0.06% optimize.opt_a.a_after_grad : 0.000647s : 0.16% optimize.opt_a.renormalize : 0.079217s : 19.67% optimize.opt_a.real_op_eliminate : 0.000100s : 0.02% optimize.opt_a.auto_monad_grad : 0.000223s : 0.06% optimize.opt_a.auto_monad_eliminator : 0.000201s : 0.05% optimize.opt_a.cse : 0.000559s : 0.14% optimize.opt_a.a_3 : 0.000749s : 0.19% optimize.py_interpret_to_execute_after_opt_a : 0.000005s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000084s : 0.02% optimize.convert_after_rewriter : 0.000019s : 0.00% optimize.order_py_execute_after_rewriter : 0.000014s : 0.00% optimize.opt_b.b_1 : 0.000581s : 0.14% optimize.opt_b.b_2 : 0.000008s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000008s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000006s : 0.00% optimize.opt_b.renormalize : 0.000001s : 0.00% optimize.opt_b.cse : 0.000029s : 0.01% optimize.cconv : 0.000022s : 0.01% optimize.opt_after_cconv.c_1 : 0.000007s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000011s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000013s : 0.00% optimize.tuple_transform.d_1 : 0.000020s : 0.00% optimize.tuple_transform.d_2 : 0.000009s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.00% optimize.add_recomputation : 0.000060s : 0.01% optimize.cse_after_recomputation.cse : 0.000013s : 0.00% optimize.environ_conv : 0.000022s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000003s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000003s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000025s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000507s : 0.13% validate : 0.000057s : 0.01% distribtued_split : 0.000001s : 0.00% task_emit : 0.006842s : 1.70% execute : 0.000008s : 0.00% Time group info: ------[substitution.] 0.066529 619 0.00% : 0.000003s : 5: substitution.float_depend_g_call 0.02% : 0.000012s : 17: substitution.float_tuple_getitem_switch 97.59% : 0.064924s : 99: substitution.getattr_setattr_resolve 0.01% : 0.000006s : 6: substitution.graph_param_transform 0.00% : 0.000003s : 3: substitution.incorporate_call 0.00% : 0.000002s : 3: substitution.incorporate_call_switch 1.66% : 0.001107s : 114: substitution.inline 0.01% : 0.000006s : 12: substitution.less_batch_normalization 0.07% : 0.000044s : 23: substitution.meta_unpack_prepare 0.02% : 0.000013s : 14: substitution.minmaximum_grad 0.02% : 0.000012s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 6: substitution.partial_unused_args_eliminate 0.01% : 0.000007s : 54: substitution.remove_not_recompute_node 0.14% : 0.000093s : 56: substitution.replace_applicator 0.02% : 0.000012s : 48: substitution.replace_old_param 0.01% : 0.000003s : 2: substitution.reset_defer_inline 0.05% : 0.000030s : 6: substitution.reshape_eliminate 0.01% : 0.000006s : 8: substitution.set_cell_output_no_recompute 0.01% : 0.000007s : 5: substitution.specialize_transform 0.02% : 0.000013s : 12: substitution.switch_simplify 0.03% : 0.000017s : 5: substitution.transpose_eliminate 0.08% : 0.000050s : 19: substitution.tuple_list_convert_item_index_to_positive 0.03% : 0.000019s : 19: substitution.tuple_list_get_item_const_eliminator 0.04% : 0.000027s : 19: substitution.tuple_list_get_item_depend_reorder 0.13% : 0.000084s : 40: substitution.tuple_list_get_item_eliminator 0.04% : 0.000026s : 19: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.079200 6 92.31% : 0.073113s : 3: renormalize.infer 7.69% : 0.006087s : 3: renormalize.specialize ------[replace.] 0.001666 152 70.02% : 0.001167s : 81: replace.getattr_setattr_resolve 17.10% : 0.000285s : 49: replace.inline 2.75% : 0.000046s : 2: replace.meta_unpack_prepare 6.81% : 0.000113s : 12: replace.switch_simplify 0.28% : 0.000005s : 1: replace.transpose_eliminate 3.04% : 0.000051s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.065265 152 98.81% : 0.064487s : 81: match.getattr_setattr_resolve 1.08% : 0.000705s : 49: match.inline 0.05% : 0.000031s : 2: match.meta_unpack_prepare 0.02% : 0.000013s : 12: match.switch_simplify 0.01% : 0.000004s : 1: match.transpose_eliminate 0.04% : 0.000024s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005717 107 67.50% : 0.003859s : 48: func_graph_cloner_run.FuncGraphClonerGraph 32.50% : 0.001858s : 59: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.072668 257 1.89% : 0.001377s : 104: opt.transform.opt_a 0.76% : 0.000549s : 92: opt.transform.opt_b 92.36% : 0.067114s : 12: opt.transform.opt_resolve 0.17% : 0.000126s : 1: opt.transforms.meta_unpack_prepare 4.75% : 0.003451s : 40: opt.transforms.opt_a 0.01% : 0.000006s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000006s : 2: opt.transforms.opt_b 0.04% : 0.000027s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000012s : 3: opt.transforms.special_op_eliminate [INFO] GE(171751,python3.7):2024-01-11-05:24:30.253.280 [scalable_config.cc:55][EVENT]176119 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(171751,python3.7):2024-01-11-05:24:30.333.699 [graph_var_manager.cc:1424][EVENT]176119 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(171751,python3.7):2024-01-11-05:24:30.333.814 [graph_manager.cc:1248][EVENT]176119 PreRun:PreRun start: graph node size 4, session id 1, graph id 0, graph name online. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:30.334.733 [atrace_api.c:28](tid:176119) AtraceCreate start [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:30.334.810 [trace_rb_log.c:84](tid:176119) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:30.334.824 [atrace_api.c:32](tid:176119) AtraceCreate end [INFO] TDT(171751,python3.7):2024-01-11-05:24:30.334.849 [client_manager.cpp:157][SetProfilingCallback][tid:176119] [TsdClient] set profiling callback success [INFO] GE(171751,python3.7):2024-01-11-05:24:30.335.766 [parallel_partitioner.cc:165][EVENT]176119 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [22] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.335.811 [parallel_partitioner.cc:178][EVENT]176119 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [16] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.335.867 [graph_prepare.cc:1378][EVENT]176119 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.588 [graph_manager.cc:1050][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [743] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.622 [graph_manager.cc:1052][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.796 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.829 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.895 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [54] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.908 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.984 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [18] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.336.998 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.337.015 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [6] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.337.135 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.337.157 [graph_manager.cc:1054][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [522] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.344.304 [graph_manager.cc:1055][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7134] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.435 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [8] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.464 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.476 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of MergePass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.485 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of InferShapePass is [355] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.494 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [16] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.503 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [1] micro second, call num is [8] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.512 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [26] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.520 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [24] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.345.528 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of InferValuePass is [7] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.347.696 [graph_manager.cc:1056][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3354] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.347.764 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of CondRemovePass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.347.781 [graph_prepare.cc:1982][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [50] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.268 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [8] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.296 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.307 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.316 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of InferShapePass is [297] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.324 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [10] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.332 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [8] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.341 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [8] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.359 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [11] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.368 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.395 [graph_prepare.cc:1983][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [601] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.419 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.432 [graph_prepare.cc:1984][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [21] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.445 [graph_prepare.cc:1985][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.475 [graph_prepare.cc:1986][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [18] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.488 [graph_prepare.cc:1987][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [0] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.503 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.515 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.530 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.628 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.641 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of CondPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.650 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrintOpPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.658 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.667 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DropOutPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.675 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.684 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [8] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.692 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.700 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of StopGradientPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.708 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.716 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.735 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.744 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.752 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.760 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.769 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.792 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [11] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.807 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.840 [graph_prepare.cc:1988][EVENT]176119 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [342] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.348.853 [graph_manager.cc:1065][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [1123] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.365.891 [graph_manager.cc:1077][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [17017] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.365.962 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.366.014 [graph_manager.cc:1080][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [86] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.142 [graph_manager.cc:1081][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [4112] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.184 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.200 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.212 [graph_manager.cc:1082][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [36] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.243 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.259 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.274 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.382 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [97] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.399 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.464 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [44] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.482 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.528 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [36] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.547 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [7] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.596 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [40] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.652 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [43] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.673 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [8] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.686 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.695 [graph_manager.cc:2700][EVENT]176119 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [458] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.834 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.848 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.858 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.867 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.875 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.884 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of CastRemovePass is [22] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.892 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.900 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [5] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.908 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [3] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.917 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.925 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.933 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [5] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.941 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [11] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.958 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.967 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.976 [graph_manager.cc:2741][EVENT]176119 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [263] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.370.985 [graph_manager.cc:2752][EVENT]176119 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.007 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.020 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.038 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [8] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.054 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.065 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.077 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.102 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [17] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.116 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.129 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.139 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.151 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.162 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.180 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [10] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.192 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.201 [graph_manager.cc:2810][EVENT]176119 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [198] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.230 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.242 [graph_manager.cc:2821][EVENT]176119 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [33] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.272 [graph_manager.cc:1087][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [1040] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.409 [graph_manager.cc:1088][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [125] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.461 [graph_manager.cc:1089][EVENT]176119 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [23] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.479 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.495 [graph_manager.cc:1097][EVENT]176119 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.371.516 [graph_manager.cc:3325][EVENT]176119 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.562 [engine_place.cc:144][EVENT]176119 Run:The time cost of AIcoreEngine::CheckSupported is [912] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.592 [engine_place.cc:144][EVENT]176119 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [9] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.601 [engine_place.cc:144][EVENT]176119 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [9] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.690 [graph_manager.cc:3351][EVENT]176119 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [1160] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.709 [graph_manager.cc:3364][EVENT]176119 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.778 [engine_partitioner.cc:1139][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [21] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.798 [engine_partitioner.cc:1142][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [5] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.372.964 [engine_partitioner.cc:1148][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [156] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.373.010 [engine_partitioner.cc:1155][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [31] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.373.060 [engine_partitioner.cc:1164][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [39] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.373.094 [graph_manager.cc:3405][EVENT]176119 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [372] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.373.112 [graph_manager.cc:3412][EVENT]176119 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.185 [graph_manager.cc:3422][EVENT]176119 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [10059] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.227 [graph_manager.cc:3428][EVENT]176119 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.357 [graph_manager.cc:3467][EVENT]176119 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [108] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.377 [graph_manager.cc:3377][EVENT]176119 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [10656] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.404 [graph_manager.cc:1106][EVENT]176119 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [11894] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.417 [graph_manager.cc:1115][EVENT]176119 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.440 [graph_manager.cc:1130][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.473 [graph_manager.cc:1131][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [18] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.502 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [10] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.520 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.531 [graph_manager.cc:2837][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [40] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.611 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.624 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [1] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.634 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.642 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of BitcastPass is [2] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.652 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [4] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.660 [base_pass.cc:339][EVENT]176119 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [8] micro second, call num is [4] [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.670 [graph_manager.cc:2864][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [122] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.682 [graph_manager.cc:2872][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.702 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.718 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.733 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [5] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.746 [compile_nodes_pass.cc:88][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.757 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [14] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.767 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.864 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [81] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.896 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [19] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.909 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.922 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.935 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.944 [graph_manager.cc:2927][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [244] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.956 [graph_manager.cc:2937][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.972 [graph_manager.cc:2943][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [7] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.383.984 [graph_manager.cc:2950][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.522 [graph_manager.cc:2958][EVENT]176119 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [45] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.570 [graph_manager.cc:1132][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [11082] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.644 [graph_manager.cc:1135][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [58] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.694 [graph_manager.cc:2975][EVENT]176119 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [31] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.737 [graph_manager.cc:2981][EVENT]176119 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [28] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.753 [pass_manager.cc:82][EVENT]176119 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.763 [graph_manager.cc:2986][EVENT]176119 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [14] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.773 [graph_manager.cc:1136][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [111] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.394.896 [graph_manager.cc:3555][EVENT]176119 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [88] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.003 [engine_partitioner.cc:1139][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.021 [engine_partitioner.cc:1142][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.159 [engine_partitioner.cc:1148][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [120] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.191 [engine_partitioner.cc:1155][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [19] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.235 [engine_partitioner.cc:1164][EVENT]176119 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [32] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.257 [graph_builder.cc:865][EVENT]176119 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [289] micro second. [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:30.395.742 [logger.cc:1071] 176119 ModelBindStream: model_id=576, stream_id=833, flag=0. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.785 [task_generator.cc:804][EVENT]176119 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [178] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.395.863 [task_generator.cc:805][EVENT]176119 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [64] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.396.646 [task_generator.cc:814][EVENT]176119 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [767] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.396.665 [task_generator.cc:954][EVENT]176119 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [1058] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.396.723 [task_generator.cc:967][EVENT]176119 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [33] micro second. [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:30.396.742 [logger.cc:1084] 176119 ModelUnbindStream: model_id=576, stream_id=833, [INFO] GE(171751,python3.7):2024-01-11-05:24:30.397.098 [graph_manager.cc:1152][EVENT]176119 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [2297] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.397.123 [graph_manager.cc:1164][EVENT]176119 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.397.156 [graph_manager.cc:1271][EVENT]176119 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [61516] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.397.168 [graph_manager.cc:1272][EVENT]176119 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:30.397.479 [atrace_api.c:93](tid:176119) AtraceDestroy start [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:30.397.502 [atrace_api.c:95](tid:176119) AtraceDestroy end [INFO] GE(171751,python3.7):2024-01-11-05:24:30.404.236 [graph_converter.cc:838][EVENT]176119 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [2104] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.404.406 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of ZeroCopy is [124] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.404.959 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CEM is [528] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.186 [copy_flow_launch_fuse.cc:395][EVENT]176119 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [200] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.209 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [226] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.456 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [234] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.487 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [10] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.536 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of ZeroCopy is [27] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.765 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CEM is [215] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.858 [copy_flow_launch_fuse.cc:395][EVENT]176119 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [75] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.873 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [91] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.905 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [22] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.916 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.405.944 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of ZeroCopy is [19] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.034 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CEM is [81] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.109 [copy_flow_launch_fuse.cc:395][EVENT]176119 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [64] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.120 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [76] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.149 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [20] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.159 [base_optimizer.cc:70][EVENT]176119 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.172 [graph_converter.cc:849][EVENT]176119 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1895] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.406.429 [graph_converter.cc:853][EVENT]176119 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [248] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.407.271 [graph_converter.cc:857][EVENT]176119 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [827] micro second. [INFO] GE(171751,python3.7):2024-01-11-05:24:30.407.457 [graph_converter.cc:862][EVENT]176119 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [157] micro second. TotalTime = 0.141835, [20] [parse]: 0.00154001 [symbol_resolve]: 0.0130229, [1] [Cycle 1]: 0.0129535, [1] [resolve]: 0.0129331 [combine_like_graphs]: 1.01999e-06 [graph_reusing]: 3.43e-06 [meta_unpack_prepare]: 0.00013777 [pre_cconv]: 6.60002e-07 [abstract_specialize]: 0.00446379 [pack_expand]: 1.652e-05 [auto_monad]: 8.535e-05 [inline]: 1.76e-06 [pre_auto_parallel]: 1.071e-05 [pipeline_split]: 2.92e-06 [optimize]: 0.116203, [35] [py_interpret_to_execute]: 4.63999e-06 [rewriter_before_opt_a]: 0.0002051 [opt_a]: 0.114766, [4] [Cycle 1]: 0.05514, [30] [expand_dump_flag]: 4.17e-06 [switch_simplify]: 2.514e-05 [a_1]: 0.00048309 [recompute_prepare]: 1.018e-05 [updatestate_depend_eliminate]: 1.069e-05 [updatestate_assign_eliminate]: 7.65e-06 [updatestate_loads_eliminate]: 6.9e-06 [parameter_eliminate]: 4.86999e-06 [a_2]: 8.518e-05 [accelerated_algorithm]: 6.34e-06 [pynative_shard]: 1.89e-06 [auto_parallel]: 3.83e-06 [parallel]: 8.46999e-06 [merge_comm]: 4.15e-06 [allreduce_fusion]: 2.14e-06 [virtual_dataset]: 5.89e-06 [get_grad_eliminate_]: 4.74e-06 [virtual_output]: 4.22e-06 [merge_forward]: 9.19e-06 [cell_reuse_recompute_pass]: 8.30005e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.282e-05 [meta_fg_expand]: 0.0217227, [1] [Cycle 1]: 0.0157981, [1] [resolve]: 0.0157776 [after_resolve]: 7.889e-05 [a_after_grad]: 0.00026405 [renormalize]: 0.0316631 [real_op_eliminate]: 3.124e-05 [auto_monad_grad]: 4.573e-05 [auto_monad_eliminator]: 6.302e-05 [cse]: 0.0001535 [a_3]: 0.0002343 [Cycle 2]: 0.0524409, [30] [expand_dump_flag]: 2.88e-06 [switch_simplify]: 0.00010003 [a_1]: 0.00055038 [recompute_prepare]: 1.214e-05 [updatestate_depend_eliminate]: 1.341e-05 [updatestate_assign_eliminate]: 9.22e-06 [updatestate_loads_eliminate]: 8.27e-06 [parameter_eliminate]: 3.59e-06 [a_2]: 0.00013732 [accelerated_algorithm]: 1.357e-05 [pynative_shard]: 1.11001e-06 [auto_parallel]: 4.28e-06 [parallel]: 4.5e-06 [merge_comm]: 2.48e-06 [allreduce_fusion]: 1.52e-06 [virtual_dataset]: 8.35e-06 [get_grad_eliminate_]: 6.83e-06 [virtual_output]: 6.97e-06 [merge_forward]: 1.043e-05 [cell_reuse_recompute_pass]: 6.39993e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.724e-05 [meta_fg_expand]: 0.0176867, [4] [Cycle 1]: 0.0016059, [1] [resolve]: 0.00158532 [Cycle 1]: 0.00033131, [1] [resolve]: 0.00031188 [Cycle 1]: 0.0028643, [1] [resolve]: 0.002844 [Cycle 1]: 0.00032798, [1] [resolve]: 0.00030867 [after_resolve]: 0.00011066 [a_after_grad]: 0.00034464 [renormalize]: 0.0323348 [real_op_eliminate]: 4.146e-05 [auto_monad_grad]: 0.00017125 [auto_monad_eliminator]: 8.919e-05 [cse]: 0.00021411 [a_3]: 0.00034606 [Cycle 3]: 0.00372458, [30] [expand_dump_flag]: 3.54e-06 [switch_simplify]: 0.00013379 [a_1]: 0.00090199 [recompute_prepare]: 1.48e-05 [updatestate_depend_eliminate]: 1.622e-05 [updatestate_assign_eliminate]: 1.258e-05 [updatestate_loads_eliminate]: 1.165e-05 [parameter_eliminate]: 3.72e-06 [a_2]: 0.00019307 [accelerated_algorithm]: 1.778e-05 [pynative_shard]: 1.19e-06 [auto_parallel]: 4.08e-06 [parallel]: 4.47e-06 [merge_comm]: 3.27e-06 [allreduce_fusion]: 1.76e-06 [virtual_dataset]: 1.156e-05 [get_grad_eliminate_]: 9.73e-06 [virtual_output]: 9.78e-06 [merge_forward]: 1.347e-05 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.352e-05 [meta_fg_expand]: 3.809e-05 [after_resolve]: 1.356e-05 [a_after_grad]: 1.742e-05 [renormalize]: 0.00183928 [real_op_eliminate]: 1.606e-05 [auto_monad_grad]: 5.60001e-06 [auto_monad_eliminator]: 2.614e-05 [cse]: 0.00012577 [a_3]: 9.144e-05 [Cycle 4]: 0.00088798, [30] [expand_dump_flag]: 1.28e-06 [switch_simplify]: 1.127e-05 [a_1]: 0.0001916 [recompute_prepare]: 1.278e-05 [updatestate_depend_eliminate]: 1.591e-05 [updatestate_assign_eliminate]: 1.241e-05 [updatestate_loads_eliminate]: 1.141e-05 [parameter_eliminate]: 2.25e-06 [a_2]: 0.00019 [accelerated_algorithm]: 1.813e-05 [pynative_shard]: 1.3e-06 [auto_parallel]: 3.37e-06 [parallel]: 3.69e-06 [merge_comm]: 2.56e-06 [allreduce_fusion]: 1.57e-06 [virtual_dataset]: 1.113e-05 [get_grad_eliminate_]: 9.45e-06 [virtual_output]: 9.18e-06 [merge_forward]: 1.414e-05 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.357e-05 [meta_fg_expand]: 9.35e-06 [after_resolve]: 1.337e-05 [a_after_grad]: 1.657e-05 [renormalize]: 7.99992e-08 [real_op_eliminate]: 9.54e-06 [auto_monad_grad]: 2.17e-06 [auto_monad_eliminator]: 2.357e-05 [cse]: 6.656e-05 [a_3]: 8.287e-05 [py_interpret_to_execute_after_opt_a]: 4.74e-06 [slice_cell_reuse_recomputed_activation]: 2.34e-06 [rewriter_after_opt_a]: 0.0001014 [convert_after_rewriter]: 1.921e-05 [order_py_execute_after_rewriter]: 1.397e-05 [opt_b]: 0.00071725, [2] [Cycle 1]: 0.00060504, [7] [b_1]: 0.00053499 [b_2]: 4.98e-06 [updatestate_depend_eliminate]: 4.53999e-06 [updatestate_assign_eliminate]: 3.2e-06 [updatestate_loads_eliminate]: 3.22e-06 [renormalize]: 3.89999e-07 [cse]: 1.867e-05 [Cycle 2]: 0.00010211, [7] [b_1]: 5.24e-05 [b_2]: 3.38e-06 [updatestate_depend_eliminate]: 3.08e-06 [updatestate_assign_eliminate]: 2.66e-06 [updatestate_loads_eliminate]: 2.97e-06 [renormalize]: 8.99963e-08 [cse]: 1.238e-05 [cconv]: 2.224e-05 [opt_after_cconv]: 6.001e-05, [1] [Cycle 1]: 5.58e-05, [7] [c_1]: 7.7e-06 [parameter_eliminate]: 2.13e-06 [updatestate_depend_eliminate]: 3.1e-06 [updatestate_assign_eliminate]: 2.59e-06 [updatestate_loads_eliminate]: 2.36e-06 [cse]: 1.189e-05 [renormalize]: 3.39998e-07 [remove_dup_value]: 1.332e-05 [tuple_transform]: 4.518e-05, [1] [Cycle 1]: 4.133e-05, [3] [d_1]: 2.015e-05 [d_2]: 9.15e-06 [renormalize]: 2.20003e-07 [add_cache_embedding]: 1.104e-05 [add_recomputation]: 4.972e-05 [cse_after_recomputation]: 2.084e-05, [1] [Cycle 1]: 1.663e-05, [1] [cse]: 1.224e-05 [environ_conv]: 9.31e-06 [label_micro_interleaved_index]: 2.48e-06 [label_fine_grained_interleaved_index]: 2.27e-06 [assign_add_opt]: 1.44001e-06 [slice_recompute_activation]: 2.11e-06 [micro_interleaved_order_control]: 2.04e-06 [full_micro_interleaved_order_control]: 2.24e-06 [comp_comm_scheduling]: 2.06e-06 [reorder_send_recv_between_fp_bp]: 2.1e-06 [comm_op_add_attrs]: 1.08e-06 [add_comm_op_reuse_tag]: 9e-07 [overlap_opt_shard_in_pipeline]: 1.12e-06 [grouped_pairwise_exchange_alltoall]: 1.36e-06 [overlap_recompute_and_grad_model_parallel]: 1.75e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.29997e-07 [split_matmul_comm_elemetwise]: 2.17e-06 [split_layernorm_comm]: 1.79e-06 [process_send_recv_for_ge]: 8.00006e-07 [handle_group_info]: 9.40003e-07 [auto_monad_reorder]: 1.912e-05 [get_jit_bprop_graph]: 3.89999e-07 [eliminate_special_op_node]: 0.00050232 [validate]: 3.994e-05 [distribtued_split]: 1.16e-06 [task_emit]: 0.00558236 [execute]: 7.5e-06 Sums parse : 0.001540s : 1.29% symbol_resolve.resolve : 0.012933s : 10.82% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000138s : 0.12% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.004464s : 3.73% pack_expand : 0.000017s : 0.01% auto_monad : 0.000085s : 0.07% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000011s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000005s : 0.00% optimize.rewriter_before_opt_a : 0.000205s : 0.17% optimize.opt_a.expand_dump_flag : 0.000012s : 0.01% optimize.opt_a.switch_simplify : 0.000270s : 0.23% optimize.opt_a.a_1 : 0.002127s : 1.78% optimize.opt_a.recompute_prepare : 0.000050s : 0.04% optimize.opt_a.updatestate_depend_eliminate : 0.000056s : 0.05% optimize.opt_a.updatestate_assign_eliminate : 0.000042s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000038s : 0.03% optimize.opt_a.parameter_eliminate : 0.000014s : 0.01% optimize.opt_a.a_2 : 0.000606s : 0.51% optimize.opt_a.accelerated_algorithm : 0.000056s : 0.05% optimize.opt_a.pynative_shard : 0.000005s : 0.00% optimize.opt_a.auto_parallel : 0.000016s : 0.01% optimize.opt_a.parallel : 0.000021s : 0.02% optimize.opt_a.merge_comm : 0.000012s : 0.01% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 0.000037s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000031s : 0.03% optimize.opt_a.virtual_output : 0.000030s : 0.03% optimize.opt_a.merge_forward : 0.000047s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000077s : 0.06% optimize.opt_a.meta_fg_expand : 0.000047s : 0.04% optimize.opt_a.meta_fg_expand.resolve : 0.020827s : 17.43% optimize.opt_a.after_resolve : 0.000216s : 0.18% optimize.opt_a.a_after_grad : 0.000643s : 0.54% optimize.opt_a.renormalize : 0.065837s : 55.08% optimize.opt_a.real_op_eliminate : 0.000098s : 0.08% optimize.opt_a.auto_monad_grad : 0.000225s : 0.19% optimize.opt_a.auto_monad_eliminator : 0.000202s : 0.17% optimize.opt_a.cse : 0.000560s : 0.47% optimize.opt_a.a_3 : 0.000755s : 0.63% optimize.py_interpret_to_execute_after_opt_a : 0.000005s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000101s : 0.08% optimize.convert_after_rewriter : 0.000019s : 0.02% optimize.order_py_execute_after_rewriter : 0.000014s : 0.01% optimize.opt_b.b_1 : 0.000587s : 0.49% optimize.opt_b.b_2 : 0.000008s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000008s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000006s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000031s : 0.03% optimize.cconv : 0.000022s : 0.02% optimize.opt_after_cconv.c_1 : 0.000008s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000012s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000013s : 0.01% optimize.tuple_transform.d_1 : 0.000020s : 0.02% optimize.tuple_transform.d_2 : 0.000009s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000050s : 0.04% optimize.cse_after_recomputation.cse : 0.000012s : 0.01% optimize.environ_conv : 0.000009s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000019s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000502s : 0.42% validate : 0.000040s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.005582s : 4.67% execute : 0.000007s : 0.01% Time group info: ------[substitution.] 0.032749 619 0.01% : 0.000004s : 5: substitution.float_depend_g_call 0.03% : 0.000011s : 17: substitution.float_tuple_getitem_switch 95.04% : 0.031123s : 99: substitution.getattr_setattr_resolve 0.02% : 0.000006s : 6: substitution.graph_param_transform 0.01% : 0.000003s : 3: substitution.incorporate_call 0.01% : 0.000002s : 3: substitution.incorporate_call_switch 3.50% : 0.001148s : 114: substitution.inline 0.02% : 0.000006s : 12: substitution.less_batch_normalization 0.09% : 0.000031s : 23: substitution.meta_unpack_prepare 0.04% : 0.000013s : 14: substitution.minmaximum_grad 0.01% : 0.000003s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 6: substitution.partial_unused_args_eliminate 0.02% : 0.000007s : 54: substitution.remove_not_recompute_node 0.28% : 0.000093s : 56: substitution.replace_applicator 0.03% : 0.000011s : 48: substitution.replace_old_param 0.01% : 0.000004s : 2: substitution.reset_defer_inline 0.09% : 0.000030s : 6: substitution.reshape_eliminate 0.02% : 0.000006s : 8: substitution.set_cell_output_no_recompute 0.02% : 0.000008s : 5: substitution.specialize_transform 0.04% : 0.000013s : 12: substitution.switch_simplify 0.05% : 0.000018s : 5: substitution.transpose_eliminate 0.16% : 0.000052s : 19: substitution.tuple_list_convert_item_index_to_positive 0.06% : 0.000020s : 19: substitution.tuple_list_get_item_const_eliminator 0.08% : 0.000026s : 19: substitution.tuple_list_get_item_depend_reorder 0.26% : 0.000085s : 40: substitution.tuple_list_get_item_eliminator 0.08% : 0.000026s : 19: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.065821 6 91.05% : 0.059933s : 3: renormalize.infer 8.95% : 0.005888s : 3: renormalize.specialize ------[replace.] 0.001653 152 69.13% : 0.001143s : 81: replace.getattr_setattr_resolve 17.62% : 0.000291s : 49: replace.inline 2.86% : 0.000047s : 2: replace.meta_unpack_prepare 7.03% : 0.000116s : 12: replace.switch_simplify 0.29% : 0.000005s : 1: replace.transpose_eliminate 3.07% : 0.000051s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.031542 152 97.44% : 0.030734s : 81: match.getattr_setattr_resolve 2.38% : 0.000749s : 49: match.inline 0.06% : 0.000018s : 2: match.meta_unpack_prepare 0.04% : 0.000013s : 12: match.switch_simplify 0.01% : 0.000004s : 1: match.transpose_eliminate 0.08% : 0.000024s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005557 107 67.42% : 0.003747s : 48: func_graph_cloner_run.FuncGraphClonerGraph 32.58% : 0.001810s : 59: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.038900 257 3.56% : 0.001386s : 104: opt.transform.opt_a 1.43% : 0.000554s : 92: opt.transform.opt_b 85.53% : 0.033272s : 12: opt.transform.opt_resolve 0.30% : 0.000115s : 1: opt.transforms.meta_unpack_prepare 9.05% : 0.003520s : 40: opt.transforms.opt_a 0.02% : 0.000006s : 1: opt.transforms.opt_after_cconv 0.02% : 0.000006s : 2: opt.transforms.opt_b 0.07% : 0.000027s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000013s : 3: opt.transforms.special_op_eliminate . TotalTime = 0.143927, [20] [parse]: 0.00133489 [symbol_resolve]: 0.0132954, [1] [Cycle 1]: 0.0132357, [1] [resolve]: 0.0132175 [combine_like_graphs]: 6.99998e-07 [graph_reusing]: 3.3e-06 [meta_unpack_prepare]: 0.00015919 [pre_cconv]: 5.69999e-07 [abstract_specialize]: 0.00415947 [pack_expand]: 1.436e-05 [auto_monad]: 6.64e-05 [inline]: 1.26e-06 [pre_auto_parallel]: 6.98e-06 [pipeline_split]: 1.73e-06 [optimize]: 0.121331, [35] [py_interpret_to_execute]: 4.03e-06 [rewriter_before_opt_a]: 0.00020394 [opt_a]: 0.119919, [4] [Cycle 1]: 0.0584554, [30] [expand_dump_flag]: 2.89e-06 [switch_simplify]: 2.821e-05 [a_1]: 0.00075976 [recompute_prepare]: 9.2e-06 [updatestate_depend_eliminate]: 1.015e-05 [updatestate_assign_eliminate]: 6.81001e-06 [updatestate_loads_eliminate]: 6.35e-06 [parameter_eliminate]: 4.22e-06 [a_2]: 7.945e-05 [accelerated_algorithm]: 5.54e-06 [pynative_shard]: 1.06e-06 [auto_parallel]: 3.39e-06 [parallel]: 5.78e-06 [merge_comm]: 2.69e-06 [allreduce_fusion]: 1.66e-06 [virtual_dataset]: 5.99e-06 [get_grad_eliminate_]: 4.98e-06 [virtual_output]: 4.68e-06 [merge_forward]: 7.75e-06 [cell_reuse_recompute_pass]: 5.10001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.164e-05 [meta_fg_expand]: 0.0227852, [1] [Cycle 1]: 0.0167933, [1] [resolve]: 0.0167729 [after_resolve]: 8.318e-05 [a_after_grad]: 0.00030835 [renormalize]: 0.0335982 [real_op_eliminate]: 3.342e-05 [auto_monad_grad]: 4.727e-05 [auto_monad_eliminator]: 5.962e-05 [cse]: 0.00013958 [a_3]: 0.00023058 [Cycle 2]: 0.053236, [30] [expand_dump_flag]: 2.71e-06 [switch_simplify]: 0.00011905 [a_1]: 0.00106413 [recompute_prepare]: 1.028e-05 [updatestate_depend_eliminate]: 1.244e-05 [updatestate_assign_eliminate]: 9.02e-06 [updatestate_loads_eliminate]: 8.38e-06 [parameter_eliminate]: 3.47001e-06 [a_2]: 0.00013039 [accelerated_algorithm]: 1.133e-05 [pynative_shard]: 1.24e-06 [auto_parallel]: 3.57e-06 [parallel]: 4.02e-06 [merge_comm]: 2.29001e-06 [allreduce_fusion]: 1.39e-06 [virtual_dataset]: 8.22e-06 [get_grad_eliminate_]: 7.19e-06 [virtual_output]: 6.89e-06 [merge_forward]: 1.022e-05 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.651e-05 [meta_fg_expand]: 0.0180495, [4] [Cycle 1]: 0.00169381, [1] [resolve]: 0.00167286 [Cycle 1]: 0.000332, [1] [resolve]: 0.00031309 [Cycle 1]: 0.0030093, [1] [resolve]: 0.00298901 [Cycle 1]: 0.00033786, [1] [resolve]: 0.0003186 [after_resolve]: 0.00011536 [a_after_grad]: 0.00040793 [renormalize]: 0.0321852 [real_op_eliminate]: 4.489e-05 [auto_monad_grad]: 0.00017093 [auto_monad_eliminator]: 8.836e-05 [cse]: 0.0002147 [a_3]: 0.00034103 [Cycle 3]: 0.00448834, [30] [expand_dump_flag]: 3.61e-06 [switch_simplify]: 0.00015997 [a_1]: 0.00160533 [recompute_prepare]: 1.314e-05 [updatestate_depend_eliminate]: 1.686e-05 [updatestate_assign_eliminate]: 1.251e-05 [updatestate_loads_eliminate]: 1.138e-05 [parameter_eliminate]: 3.79e-06 [a_2]: 0.0001874 [accelerated_algorithm]: 1.685e-05 [pynative_shard]: 1.22e-06 [auto_parallel]: 3.95e-06 [parallel]: 4.28e-06 [merge_comm]: 3.12e-06 [allreduce_fusion]: 1.8e-06 [virtual_dataset]: 1.118e-05 [get_grad_eliminate_]: 1.012e-05 [virtual_output]: 1.007e-05 [merge_forward]: 1.355e-05 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.38e-05 [meta_fg_expand]: 3.844e-05 [after_resolve]: 1.442e-05 [a_after_grad]: 2.427e-05 [renormalize]: 0.0018502 [real_op_eliminate]: 1.735e-05 [auto_monad_grad]: 5.76e-06 [auto_monad_eliminator]: 2.667e-05 [cse]: 0.00012283 [a_3]: 8.896e-05 [Cycle 4]: 0.00120446, [30] [expand_dump_flag]: 1.42e-06 [switch_simplify]: 1.173e-05 [a_1]: 0.00050538 [recompute_prepare]: 1.213e-05 [updatestate_depend_eliminate]: 1.571e-05 [updatestate_assign_eliminate]: 1.239e-05 [updatestate_loads_eliminate]: 1.157e-05 [parameter_eliminate]: 2.3e-06 [a_2]: 0.0001884 [accelerated_algorithm]: 1.731e-05 [pynative_shard]: 1.45e-06 [auto_parallel]: 3.4e-06 [parallel]: 3.61e-06 [merge_comm]: 2.57e-06 [allreduce_fusion]: 1.79e-06 [virtual_dataset]: 1.119e-05 [get_grad_eliminate_]: 1.045e-05 [virtual_output]: 9.91e-06 [merge_forward]: 1.397e-05 [cell_reuse_recompute_pass]: 4.00003e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.283e-05 [meta_fg_expand]: 9.98e-06 [after_resolve]: 1.33e-05 [a_after_grad]: 2.388e-05 [renormalize]: 7.99992e-08 [real_op_eliminate]: 9.83e-06 [auto_monad_grad]: 2.29e-06 [auto_monad_eliminator]: 2.329e-05 [cse]: 6.426e-05 [a_3]: 8.204e-05 [py_interpret_to_execute_after_opt_a]: 3.82e-06 [slice_cell_reuse_recomputed_activation]: 1.27e-06 [rewriter_after_opt_a]: 7.46e-05 [convert_after_rewriter]: 1.776e-05 [order_py_execute_after_rewriter]: 1.206e-05 [opt_b]: 0.00074934, [2] [Cycle 1]: 0.00064245, [7] [b_1]: 0.00057244 [b_2]: 4.34e-06 [updatestate_depend_eliminate]: 4.81e-06 [updatestate_assign_eliminate]: 3.22e-06 [updatestate_loads_eliminate]: 2.92e-06 [renormalize]: 4.30002e-07 [cse]: 1.813e-05 [Cycle 2]: 9.771e-05, [7] [b_1]: 5.051e-05 [b_2]: 2.79e-06 [updatestate_depend_eliminate]: 3.22e-06 [updatestate_assign_eliminate]: 2.76e-06 [updatestate_loads_eliminate]: 2.57e-06 [renormalize]: 7.99992e-08 [cse]: 1.092e-05 [cconv]: 1.536e-05 [opt_after_cconv]: 6.926e-05, [1] [Cycle 1]: 6.483e-05, [7] [c_1]: 1.728e-05 [parameter_eliminate]: 2.04e-06 [updatestate_depend_eliminate]: 3e-06 [updatestate_assign_eliminate]: 2.56e-06 [updatestate_loads_eliminate]: 2.48e-06 [cse]: 1.187e-05 [renormalize]: 2.09999e-07 [remove_dup_value]: 9.84999e-06 [tuple_transform]: 5.297e-05, [1] [Cycle 1]: 4.943e-05, [3] [d_1]: 2.944e-05 [d_2]: 8.66999e-06 [renormalize]: 1.39997e-07 [add_cache_embedding]: 8.94e-06 [add_recomputation]: 3.681e-05 [cse_after_recomputation]: 2.068e-05, [1] [Cycle 1]: 1.674e-05, [1] [cse]: 1.237e-05 [environ_conv]: 8.2e-06 [label_micro_interleaved_index]: 1.73e-06 [label_fine_grained_interleaved_index]: 1.28e-06 [assign_add_opt]: 1e-06 [slice_recompute_activation]: 1.65e-06 [micro_interleaved_order_control]: 9.09997e-07 [full_micro_interleaved_order_control]: 1.35e-06 [comp_comm_scheduling]: 9.20001e-07 [reorder_send_recv_between_fp_bp]: 1.58e-06 [comm_op_add_attrs]: 4.69998e-07 [add_comm_op_reuse_tag]: 1.09e-06 [overlap_opt_shard_in_pipeline]: 6.49998e-07 [grouped_pairwise_exchange_alltoall]: 5.79996e-07 [overlap_recompute_and_grad_model_parallel]: 9.29998e-07 [overlap_grad_matmul_and_grad_allreduce]: 5.30003e-07 [split_matmul_comm_elemetwise]: 1.8e-06 [split_layernorm_comm]: 9.09997e-07 [process_send_recv_for_ge]: 7.40001e-07 [handle_group_info]: 6.00005e-07 [auto_monad_reorder]: 1.509e-05 [get_jit_bprop_graph]: 2.7e-06 [eliminate_special_op_node]: 0.00049195 [validate]: 3.535e-05 [distribtued_split]: 1.13e-06 [task_emit]: 0.00281937 [execute]: 4.5e-06 Sums parse : 0.001335s : 1.10% symbol_resolve.resolve : 0.013218s : 10.88% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000159s : 0.13% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.004159s : 3.43% pack_expand : 0.000014s : 0.01% auto_monad : 0.000066s : 0.05% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000007s : 0.01% pipeline_split : 0.000002s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000204s : 0.17% optimize.opt_a.expand_dump_flag : 0.000011s : 0.01% optimize.opt_a.switch_simplify : 0.000319s : 0.26% optimize.opt_a.a_1 : 0.003935s : 3.24% optimize.opt_a.recompute_prepare : 0.000045s : 0.04% optimize.opt_a.updatestate_depend_eliminate : 0.000055s : 0.05% optimize.opt_a.updatestate_assign_eliminate : 0.000041s : 0.03% optimize.opt_a.updatestate_loads_eliminate : 0.000038s : 0.03% optimize.opt_a.parameter_eliminate : 0.000014s : 0.01% optimize.opt_a.a_2 : 0.000586s : 0.48% optimize.opt_a.accelerated_algorithm : 0.000051s : 0.04% optimize.opt_a.pynative_shard : 0.000005s : 0.00% optimize.opt_a.auto_parallel : 0.000014s : 0.01% optimize.opt_a.parallel : 0.000018s : 0.01% optimize.opt_a.merge_comm : 0.000011s : 0.01% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 0.000037s : 0.03% optimize.opt_a.get_grad_eliminate_ : 0.000033s : 0.03% optimize.opt_a.virtual_output : 0.000032s : 0.03% optimize.opt_a.merge_forward : 0.000045s : 0.04% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000075s : 0.06% optimize.opt_a.meta_fg_expand : 0.000048s : 0.04% optimize.opt_a.meta_fg_expand.resolve : 0.022066s : 18.17% optimize.opt_a.after_resolve : 0.000226s : 0.19% optimize.opt_a.a_after_grad : 0.000764s : 0.63% optimize.opt_a.renormalize : 0.067634s : 55.69% optimize.opt_a.real_op_eliminate : 0.000105s : 0.09% optimize.opt_a.auto_monad_grad : 0.000226s : 0.19% optimize.opt_a.auto_monad_eliminator : 0.000198s : 0.16% optimize.opt_a.cse : 0.000541s : 0.45% optimize.opt_a.a_3 : 0.000743s : 0.61% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000001s : 0.00% optimize.rewriter_after_opt_a : 0.000075s : 0.06% optimize.convert_after_rewriter : 0.000018s : 0.01% optimize.order_py_execute_after_rewriter : 0.000012s : 0.01% optimize.opt_b.b_1 : 0.000623s : 0.51% optimize.opt_b.b_2 : 0.000007s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000008s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000006s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.00% optimize.opt_b.renormalize : 0.000001s : 0.00% optimize.opt_b.cse : 0.000029s : 0.02% optimize.cconv : 0.000015s : 0.01% optimize.opt_after_cconv.c_1 : 0.000017s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000012s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.01% optimize.tuple_transform.d_1 : 0.000029s : 0.02% optimize.tuple_transform.d_2 : 0.000009s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000009s : 0.01% optimize.add_recomputation : 0.000037s : 0.03% optimize.cse_after_recomputation.cse : 0.000012s : 0.01% optimize.environ_conv : 0.000008s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000001s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000001s : 0.00% optimize.full_micro_interleaved_order_control : 0.000001s : 0.00% optimize.comp_comm_scheduling : 0.000001s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000000s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000001s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000001s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000015s : 0.01% get_jit_bprop_graph : 0.000003s : 0.00% eliminate_special_op_node : 0.000492s : 0.41% validate : 0.000035s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.002819s : 2.32% execute : 0.000004s : 0.00% Time group info: ------[substitution.] 0.034171 684 0.01% : 0.000003s : 6: substitution.float_depend_g_call 0.03% : 0.000010s : 17: substitution.float_tuple_getitem_switch 95.53% : 0.032644s : 99: substitution.getattr_setattr_resolve 0.01% : 0.000004s : 6: substitution.graph_param_transform 0.01% : 0.000002s : 3: substitution.incorporate_call 0.00% : 0.000001s : 3: substitution.incorporate_call_switch 3.11% : 0.001063s : 120: substitution.inline 0.01% : 0.000005s : 12: substitution.less_batch_normalization 0.10% : 0.000036s : 42: substitution.meta_unpack_prepare 0.04% : 0.000015s : 19: substitution.minmaximum_grad 0.01% : 0.000003s : 6: substitution.partial_eliminate 0.00% : 0.000001s : 6: substitution.partial_unused_args_eliminate 0.02% : 0.000006s : 54: substitution.remove_not_recompute_node 0.27% : 0.000094s : 62: substitution.replace_applicator 0.03% : 0.000011s : 48: substitution.replace_old_param 0.01% : 0.000003s : 2: substitution.reset_defer_inline 0.07% : 0.000025s : 7: substitution.reshape_eliminate 0.02% : 0.000005s : 8: substitution.set_cell_output_no_recompute 0.02% : 0.000007s : 5: substitution.specialize_transform 0.03% : 0.000012s : 12: substitution.switch_simplify 0.04% : 0.000013s : 6: substitution.transpose_eliminate 0.14% : 0.000048s : 24: substitution.tuple_list_convert_item_index_to_positive 0.06% : 0.000021s : 24: substitution.tuple_list_get_item_const_eliminator 0.09% : 0.000029s : 24: substitution.tuple_list_get_item_depend_reorder 0.23% : 0.000079s : 45: substitution.tuple_list_get_item_eliminator 0.08% : 0.000028s : 24: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.067620 6 91.39% : 0.061801s : 3: renormalize.infer 8.61% : 0.005819s : 3: renormalize.specialize ------[replace.] 0.001640 152 70.00% : 0.001148s : 81: replace.getattr_setattr_resolve 16.75% : 0.000275s : 49: replace.inline 2.87% : 0.000047s : 2: replace.meta_unpack_prepare 6.84% : 0.000112s : 12: replace.switch_simplify 0.39% : 0.000006s : 1: replace.transpose_eliminate 3.15% : 0.000052s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.032884 152 97.85% : 0.032175s : 81: match.getattr_setattr_resolve 2.01% : 0.000660s : 49: match.inline 0.05% : 0.000017s : 2: match.meta_unpack_prepare 0.04% : 0.000012s : 12: match.switch_simplify 0.01% : 0.000003s : 1: match.transpose_eliminate 0.05% : 0.000017s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005445 107 67.83% : 0.003693s : 48: func_graph_cloner_run.FuncGraphClonerGraph 32.17% : 0.001752s : 59: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.042350 587 0.33% : 0.000141s : 2: opt.transform.meta_unpack_prepare 15.95% : 0.006754s : 461: opt.transform.opt_a 0.03% : 0.000013s : 7: opt.transform.opt_after_cconv 1.41% : 0.000596s : 94: opt.transform.opt_b 82.17% : 0.034799s : 12: opt.transform.opt_resolve 0.08% : 0.000034s : 8: opt.transform.opt_trans_graph 0.03% : 0.000013s : 3: opt.transform.special_op_eliminate . ============================== 2 passed in 20.98s ============================== [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.416.193 [status:INIT] [ge_api.cc:463]171751 ~Session:Start to destruct session. [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.416.262 [status:RUNNING] [ge_api.cc:475]171751 ~Session:Session id is 0 [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.416.273 [status:RUNNING] [ge_api.cc:476]171751 ~Session:Destroying session [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.417.156 [status:STOP] [ge_api.cc:491]171751 ~Session:Session Destructor finished [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.417.187 [status:INIT] [ge_api.cc:301]171751 GEFinalize:GEFinalize start [INFO] GE(171751,python3.7):2024-01-11-05:24:32.417.250 [execution_runtime.cc:80][EVENT]171751 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(171751,python3.7):2024-01-11-05:24:32.417.267 [execution_runtime.cc:92][EVENT]171751 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(171751,python3.7):2024-01-11-05:24:32.417.278 [status:RUNNING] [ge_api.cc:313]171751 GEFinalize:Finalizing environment [INFO] TUNE(171751,python3.7):2024-01-11-05:24:32.704.609 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:171751]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(171751,python3.7):2024-01-11-05:24:32.704.666 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:171751]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(171751,python3.7):2024-01-11-05:24:32.705.957 [gelib.cc:324][EVENT]171751 SystemFinalize:Online infer finalize GELib success. [TRACE] GE(171751,python3.7):2024-01-11-05:24:33.618.480 [status:STOP] [ge_api.cc:341]171751 GEFinalize:GEFinalize finished [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.047.657 [process_mode_manager.cpp:184][Close][tid:171751] [TsdClient] Close [deviceId=3][sessionId=1] hccp and computer enter [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.047.697 [version_verify.cpp:112][SpecialFeatureCheck][tid:171751] VersionVerify: previous type[7], supported [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.047.735 [process_mode_manager.cpp:192][Close][tid:171751] [TsdClient][deviceId=3] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.069.262 [process_mode_manager.cpp:197][Close][tid:171751] [TsdClient][logicDeviceId_=3]has recv close hccp and computer process respond [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.069.276 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:171751] enter into CloseInHost deviceid[3] [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.069.287 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:171751] host cpu not support [INFO] TDT(171751,python3.7):2024-01-11-05:24:34.069.320 [process_mode_manager.cpp:208][Close][tid:171751] [TsdClient][deviceId=3] [sessionId=1] close hccp and computer process success [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:34.069.332 [atrace_api.c:93](tid:171751) AtraceDestroy start [INFO] ATRACE(171751,python3.7):2024-01-11-05:24:34.069.349 [atrace_api.c:95](tid:171751) AtraceDestroy end [INFO] PROFILING(171751,python3.7):2024-01-11-05:24:34.069.373 [msprofiler_impl.cpp:156] >>> (tid:171751) ProfNotifySetDevice called, is open: 0, devId: 3 [INFO] RUNTIME(171751,python3.7):2024-01-11-05:24:35.699.656 [runtime.cc:1737] 171751 ~Runtime: deconstruct runtime.