============================= test session starts ============================== platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1 rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/dyn_shape_dev, inifile: /home/jenkins/sault/virtual_test/virtualenv_006/sault/config/pytest.ini plugins: anyio-3.7.1, forked-1.1.3, xdist-1.32.0 [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:22.750.088 [trace_attr.c:105](tid:187024) platform is 1. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:22.750.281 [trace_recorder.c:114](tid:187024) use root path: /home/jenkins/ascend/atrace [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:22.750.308 [trace_signal.c:133](tid:187024) register signal handler for signo 2 succeed. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:22.750.320 [trace_signal.c:133](tid:187024) register signal handler for signo 15 succeed. [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:23.161.744 [runtime.cc:1159] 187024 GetAicoreNumByLevel: workingDev_=0 [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:23.161.811 [runtime.cc:4719] 187024 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set collected 2 items test_sqrt.py [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.474.368 [process_mode_manager.cpp:109][OpenProcess][tid:187024] [ProcessModeManager] enter into open process deviceId[5] rankSize[0] [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.476.573 [process_mode_manager.cpp:379][InitTsdClient][tid:187024] [TsdClient] deviceId[5] begin to init hdc client [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.476.713 [version_verify.cpp:34][SetVersionInfo][tid:187024] VersionVerify: send client version to server [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.476.740 [version_verify.cpp:50][SetVersionInfo][tid:187024] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.476.752 [version_verify.cpp:50][SetVersionInfo][tid:187024] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.251 [version_verify.cpp:66][PeerVersionCheck][tid:187024] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.269 [version_verify.cpp:87][ParseVersionInfo][tid:187024] VersionVerify: pass client version info success [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.278 [hdc_client.cpp:276][CheckHdcConnection][tid:187024] Service[2] create hdc success [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.294 [version_verify.cpp:120][SpecialFeatureCheck][tid:187024] VersionVerify: new type[35], supported [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.345 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:187024] [TsdClient][deviceId=5] [sessionId=1] wait package info respond [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.478 [process_mode_manager.cpp:379][InitTsdClient][tid:187024] [TsdClient] deviceId[5] begin to init hdc client [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.594 [version_verify.cpp:34][SetVersionInfo][tid:187024] VersionVerify: send client version to server [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.606 [version_verify.cpp:50][SetVersionInfo][tid:187024] send feature_info:{msg_type:35, features:{check before send aicpu package,}} [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.617 [version_verify.cpp:50][SetVersionInfo][tid:187024] send feature_info:{msg_type:37, features:{check before send open qs message,}} [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.782 [version_verify.cpp:66][PeerVersionCheck][tid:187024] VersionVerify: Check client version info, server[1230], client[1230] [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.793 [version_verify.cpp:87][ParseVersionInfo][tid:187024] VersionVerify: pass client version info success [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.801 [hdc_client.cpp:276][CheckHdcConnection][tid:187024] Service[2] create hdc success [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.812 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:187024] [TsdClient] tsd get process sign successfully, procpid[187024] signSize[48] [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.837 [version_verify.cpp:112][SpecialFeatureCheck][tid:187024] VersionVerify: previous type[6], supported [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.477.858 [process_mode_manager.cpp:126][OpenProcess][tid:187024] [ProcessModeManager] deviceId[5] sessionId[1] rankSize[0], wait sub process start respond [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.680.662 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:187024] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.680.702 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:187024] enter into OpenInHost deviceid[5] [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.680.711 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:187024] host cpu not support [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.680.719 [process_mode_manager.cpp:156][OpenProcess][tid:187024] [TsdClient][deviceId=5] [sessionId=1] start hccp and computer process success [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:27.683.456 [device.cc:340] 187024 Init: isDoubledie:0, topologytype:0 [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:27.700.257 [npu_driver.cc:5428] 188666 GetDeviceStatus: GetDeviceStatus status=1. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:27.700.314 [atrace_api.c:28](tid:187024) AtraceCreate start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:27.700.421 [trace_rb_log.c:84](tid:187024) [RUNTIME_ATRACE_DEV5_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:27.700.436 [atrace_api.c:32](tid:187024) AtraceCreate end [INFO] TDT(187024,python3.7):2024-01-11-05:30:27.700.451 [client_manager.cpp:157][SetProfilingCallback][tid:187024] [TsdClient] set profiling callback success [TRACE] GE(187024,python3.7):2024-01-11-05:30:27.851.515 [status:INIT] [ge_api.cc:144]187024 GEInitializeImpl:GEInitialize start [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:28.078.953 [msprofiler_impl.cpp:156] >>> (tid:187024) ProfNotifySetDevice called, is open: 1, devId: 5 [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:28.079.076 [platform.cpp:38] >>> (tid:187024) Profiling platform version: 1.0. [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:28.079.090 [ai_drv_dev_api.cpp:384] >>> (tid:187024) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(187024,python3.7):2024-01-11-05:30:28.130.974 [status:RUNNING] [ge_api.cc:211]187024 GEInitializeImpl:Initializing environment [INFO] GE(187024,python3.7):2024-01-11-05:30:28.131.044 [gelib.cc:98][EVENT]187024 Initialize:[GEPERFTRACE] GE Init Start [INFO] GE(187024,python3.7):2024-01-11-05:30:28.131.324 [gelib.cc:307][EVENT]187024 SystemInitialize:Online infer init GELib success, device id :5 [INFO] DVPP(187024,python3.7):2024-01-11-05:30:28.501.355 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:187024]dvpp engine do not support [INFO] TUNE(187024,python3.7):2024-01-11-05:30:28.504.545 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:187024]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!" [INFO] TUNE(187024,python3.7):2024-01-11-05:30:28.504.577 [handle_manager.cpp:115][CANNKB][Tid:187024]"Start to run init functions to load dynamic python lib!" [INFO] TUNE(187024,python3.7):2024-01-11-05:30:28.504.636 [handle_manager.cpp:407][CANNKB][Tid:187024]"Init functions of loading dynamic python lib end!" [INFO] TUNE(187024,python3.7):2024-01-11-05:30:28.504.647 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:187024]"CANN_KB_Py has already been initialized." [INFO] TUNE(187024,python3.7):2024-01-11-05:30:28.504.711 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:187024]"CannKbPyfuncMgr: Run PyObjectInit successfully!" [INFO] HCCL(187024,python3.7):2024-01-11-05:30:40.321.415 [plugin_manager.cc:42][187024]hcom running normal mode. [INFO] DVPP(187024,python3.7):2024-01-11-05:30:40.321.965 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:187024]dvpp ops kernel info store do not support [INFO] DVPP(187024,python3.7):2024-01-11-05:30:40.322.114 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:187024]dvpp graph optimizer do not support [INFO] DVPP(187024,python3.7):2024-01-11-05:30:40.878.686 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:187024]dvpp ops kernel builder do not support [INFO] GE(187024,python3.7):2024-01-11-05:30:40.886.979 [gelib.cc:169][EVENT]187024 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [12755881] micro second. [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.973.447 [status:STOP] [ge_api.cc:255]187024 GEInitializeImpl:GEInitialize finished [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.973.568 [status:INIT] [ge_api.cc:398]187024 Session:Start to construct session. [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.973.585 [status:RUNNING] [ge_api.cc:408]187024 Session:Creating session [INFO] GE(187024,python3.7):2024-01-11-05:30:40.974.018 [graph_var_manager.cc:1445][EVENT]187024 SetMemoryMallocSize:Total memory size is 34359738368 [INFO] GE(187024,python3.7):2024-01-11-05:30:40.974.034 [graph_var_manager.cc:1424][EVENT]187024 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:40.974.299 [msprofiler_impl.cpp:156] >>> (tid:187024) ProfNotifySetDevice called, is open: 1, devId: 5 [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.975.078 [status:RUNNING] [ge_api.cc:411]187024 Session:Session id is 0 [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.975.097 [status:STOP] [ge_api.cc:420]187024 Session:Session Constructor finished [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:40.984.673 [platform.cpp:38] >>> (tid:187024) Profiling platform version: 1.0. [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:40.984.698 [ai_drv_dev_api.cpp:384] >>> (tid:187024) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(187024,python3.7):2024-01-11-05:30:40.984.867 [status:INIT] [ge_api.cc:144]187024 GEInitializeImpl:GEInitialize start TotalTime = 0.196763, [20] [parse]: 0.0122643 [symbol_resolve]: 0.0281986, [1] [Cycle 1]: 0.0281267, [1] [resolve]: 0.0281053 [combine_like_graphs]: 9.10004e-07 [graph_reusing]: 2.98e-06 [meta_unpack_prepare]: 0.00013908 [pre_cconv]: 3.7e-06 [abstract_specialize]: 0.00708582 [pack_expand]: 1.53e-05 [auto_monad]: 0.00011449 [inline]: 3.31e-06 [pre_auto_parallel]: 1.765e-05 [pipeline_split]: 2.98e-06 [optimize]: 0.141787, [35] [py_interpret_to_execute]: 4.59e-06 [rewriter_before_opt_a]: 0.00016281 [opt_a]: 0.140416, [4] [Cycle 1]: 0.084588, [30] [expand_dump_flag]: 4.11e-06 [switch_simplify]: 2.586e-05 [a_1]: 0.00038452 [recompute_prepare]: 8.68001e-06 [updatestate_depend_eliminate]: 9.92e-06 [updatestate_assign_eliminate]: 6.52e-06 [updatestate_loads_eliminate]: 6.67e-06 [parameter_eliminate]: 5.02e-06 [a_2]: 8.348e-05 [accelerated_algorithm]: 5.07e-06 [pynative_shard]: 1.79e-06 [auto_parallel]: 3.3e-06 [parallel]: 1.62e-05 [merge_comm]: 1.122e-05 [allreduce_fusion]: 2.28e-06 [virtual_dataset]: 4.85e-06 [get_grad_eliminate_]: 4.34e-06 [virtual_output]: 3.67e-06 [merge_forward]: 8e-06 [cell_reuse_recompute_pass]: 7.40001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.14e-05 [meta_fg_expand]: 0.0284952, [1] [Cycle 1]: 0.0005005, [1] [resolve]: 0.00047914 [after_resolve]: 2.163e-05 [a_after_grad]: 3.616e-05 [renormalize]: 0.0548701 [real_op_eliminate]: 2.352e-05 [auto_monad_grad]: 3.168e-05 [auto_monad_eliminator]: 4.482e-05 [cse]: 0.00013116 [a_3]: 0.00015777 [Cycle 2]: 0.0456461, [30] [expand_dump_flag]: 2.56e-06 [switch_simplify]: 5.961e-05 [a_1]: 0.00038513 [recompute_prepare]: 9.42e-06 [updatestate_depend_eliminate]: 1.107e-05 [updatestate_assign_eliminate]: 8.59e-06 [updatestate_loads_eliminate]: 7.98e-06 [parameter_eliminate]: 3.49e-06 [a_2]: 0.00012053 [accelerated_algorithm]: 1.205e-05 [pynative_shard]: 1.65e-06 [auto_parallel]: 4.37e-06 [parallel]: 4.14e-06 [merge_comm]: 2.21e-06 [allreduce_fusion]: 1.52e-06 [virtual_dataset]: 7.19e-06 [get_grad_eliminate_]: 6e-06 [virtual_output]: 5.76e-06 [merge_forward]: 9.87e-06 [cell_reuse_recompute_pass]: 5.60001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.548e-05 [meta_fg_expand]: 0.0177256, [4] [Cycle 1]: 0.00180562, [1] [resolve]: 0.00178639 [Cycle 1]: 0.00032906, [1] [resolve]: 0.00031068 [Cycle 1]: 0.00040563, [1] [resolve]: 0.00038763 [Cycle 1]: 0.00032166, [1] [resolve]: 0.00030359 [after_resolve]: 5.755e-05 [a_after_grad]: 0.00014745 [renormalize]: 0.0262631 [real_op_eliminate]: 3.513e-05 [auto_monad_grad]: 7.015e-05 [auto_monad_eliminator]: 6.7e-05 [cse]: 0.00017084 [a_3]: 0.00026633 [Cycle 3]: 0.00298478, [30] [expand_dump_flag]: 3.05e-06 [switch_simplify]: 0.00011761 [a_1]: 0.00066781 [recompute_prepare]: 1.238e-05 [updatestate_depend_eliminate]: 1.466e-05 [updatestate_assign_eliminate]: 1.183e-05 [updatestate_loads_eliminate]: 1.156e-05 [parameter_eliminate]: 3.48e-06 [a_2]: 0.00017372 [accelerated_algorithm]: 1.663e-05 [pynative_shard]: 1.15e-06 [auto_parallel]: 4.11e-06 [parallel]: 3.81e-06 [merge_comm]: 2.86999e-06 [allreduce_fusion]: 2.12e-06 [virtual_dataset]: 9.31e-06 [get_grad_eliminate_]: 8.34e-06 [virtual_output]: 8.29e-06 [merge_forward]: 1.265e-05 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.182e-05 [meta_fg_expand]: 3.071e-05 [after_resolve]: 1.223e-05 [a_after_grad]: 1.428e-05 [renormalize]: 0.00144303 [real_op_eliminate]: 1.447e-05 [auto_monad_grad]: 5.31e-06 [auto_monad_eliminator]: 2.415e-05 [cse]: 0.00010592 [a_3]: 8.034e-05 [Cycle 4]: 0.0008035, [30] [expand_dump_flag]: 1.41e-06 [switch_simplify]: 9.75e-06 [a_1]: 0.00016859 [recompute_prepare]: 1.082e-05 [updatestate_depend_eliminate]: 1.445e-05 [updatestate_assign_eliminate]: 1.186e-05 [updatestate_loads_eliminate]: 1.118e-05 [parameter_eliminate]: 2.04e-06 [a_2]: 0.00017272 [accelerated_algorithm]: 1.63e-05 [pynative_shard]: 1.28e-06 [auto_parallel]: 3.51e-06 [parallel]: 3.71e-06 [merge_comm]: 2.31e-06 [allreduce_fusion]: 1.64e-06 [virtual_dataset]: 9.54e-06 [get_grad_eliminate_]: 8.5e-06 [virtual_output]: 8.18e-06 [merge_forward]: 1.27e-05 [cell_reuse_recompute_pass]: 4.1e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.143e-05 [meta_fg_expand]: 8.88e-06 [after_resolve]: 1.156e-05 [a_after_grad]: 1.507e-05 [renormalize]: 6.00048e-08 [real_op_eliminate]: 8.42e-06 [auto_monad_grad]: 1.97e-06 [auto_monad_eliminator]: 2.161e-05 [cse]: 5.858e-05 [a_3]: 7.363e-05 [py_interpret_to_execute_after_opt_a]: 3.62e-06 [slice_cell_reuse_recomputed_activation]: 2.21e-06 [rewriter_after_opt_a]: 8.772e-05 [convert_after_rewriter]: 1.893e-05 [order_py_execute_after_rewriter]: 1.354e-05 [opt_b]: 0.00067781, [2] [Cycle 1]: 0.00057729, [7] [b_1]: 0.00051297 [b_2]: 3.95e-06 [updatestate_depend_eliminate]: 4.17e-06 [updatestate_assign_eliminate]: 2.87e-06 [updatestate_loads_eliminate]: 2.85e-06 [renormalize]: 3.69997e-07 [cse]: 1.67e-05 [Cycle 2]: 9.142e-05, [7] [b_1]: 4.573e-05 [b_2]: 2.65e-06 [updatestate_depend_eliminate]: 2.78001e-06 [updatestate_assign_eliminate]: 2.40999e-06 [updatestate_loads_eliminate]: 2.38e-06 [renormalize]: 6.00048e-08 [cse]: 1.005e-05 [cconv]: 2.13e-05 [opt_after_cconv]: 5.586e-05, [1] [Cycle 1]: 5.164e-05, [7] [c_1]: 6.4e-06 [parameter_eliminate]: 1.62e-06 [updatestate_depend_eliminate]: 3e-06 [updatestate_assign_eliminate]: 2.35e-06 [updatestate_loads_eliminate]: 2.28e-06 [cse]: 1.038e-05 [renormalize]: 2.99995e-07 [remove_dup_value]: 1.207e-05 [tuple_transform]: 4.06e-05, [1] [Cycle 1]: 3.712e-05, [3] [d_1]: 1.766e-05 [d_2]: 7.39e-06 [renormalize]: 1.8e-07 [add_cache_embedding]: 1.08e-05 [add_recomputation]: 5.254e-05 [cse_after_recomputation]: 2.029e-05, [1] [Cycle 1]: 1.621e-05, [1] [cse]: 1.169e-05 [environ_conv]: 2.699e-05 [label_micro_interleaved_index]: 2.33e-06 [label_fine_grained_interleaved_index]: 2.27e-06 [assign_add_opt]: 1.68e-06 [slice_recompute_activation]: 2.36e-06 [micro_interleaved_order_control]: 1.8e-06 [full_micro_interleaved_order_control]: 1.87e-06 [comp_comm_scheduling]: 2.02e-06 [reorder_send_recv_between_fp_bp]: 2.19e-06 [comm_op_add_attrs]: 9.09997e-07 [add_comm_op_reuse_tag]: 8.59996e-07 [overlap_opt_shard_in_pipeline]: 1.06e-06 [grouped_pairwise_exchange_alltoall]: 1.12e-06 [overlap_recompute_and_grad_model_parallel]: 1.55e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.50006e-07 [split_matmul_comm_elemetwise]: 2.26e-06 [split_layernorm_comm]: 1.59e-06 [process_send_recv_for_ge]: 2.25e-06 [handle_group_info]: 1.1e-06 [auto_monad_reorder]: 2.429e-05 [get_jit_bprop_graph]: 4.30002e-07 [eliminate_special_op_node]: 0.0004905 [validate]: 5.188e-05 [distribtued_split]: 1.29e-06 [task_emit]: 0.00633258 [execute]: 8.2e-06 Sums parse : 0.012264s : 8.39% symbol_resolve.resolve : 0.028105s : 19.22% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000139s : 0.10% pre_cconv : 0.000004s : 0.00% abstract_specialize : 0.007086s : 4.84% pack_expand : 0.000015s : 0.01% auto_monad : 0.000114s : 0.08% inline : 0.000003s : 0.00% pre_auto_parallel : 0.000018s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000005s : 0.00% optimize.rewriter_before_opt_a : 0.000163s : 0.11% optimize.opt_a.expand_dump_flag : 0.000011s : 0.01% optimize.opt_a.switch_simplify : 0.000213s : 0.15% optimize.opt_a.a_1 : 0.001606s : 1.10% optimize.opt_a.recompute_prepare : 0.000041s : 0.03% optimize.opt_a.updatestate_depend_eliminate : 0.000050s : 0.03% optimize.opt_a.updatestate_assign_eliminate : 0.000039s : 0.03% optimize.opt_a.updatestate_loads_eliminate : 0.000037s : 0.03% optimize.opt_a.parameter_eliminate : 0.000014s : 0.01% optimize.opt_a.a_2 : 0.000550s : 0.38% optimize.opt_a.accelerated_algorithm : 0.000050s : 0.03% optimize.opt_a.pynative_shard : 0.000006s : 0.00% optimize.opt_a.auto_parallel : 0.000015s : 0.01% optimize.opt_a.parallel : 0.000028s : 0.02% optimize.opt_a.merge_comm : 0.000019s : 0.01% optimize.opt_a.allreduce_fusion : 0.000008s : 0.01% optimize.opt_a.virtual_dataset : 0.000031s : 0.02% optimize.opt_a.get_grad_eliminate_ : 0.000027s : 0.02% optimize.opt_a.virtual_output : 0.000026s : 0.02% optimize.opt_a.merge_forward : 0.000043s : 0.03% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000070s : 0.05% optimize.opt_a.meta_fg_expand : 0.000040s : 0.03% optimize.opt_a.meta_fg_expand.resolve : 0.003267s : 2.23% optimize.opt_a.after_resolve : 0.000103s : 0.07% optimize.opt_a.a_after_grad : 0.000213s : 0.15% optimize.opt_a.renormalize : 0.082576s : 56.46% optimize.opt_a.real_op_eliminate : 0.000082s : 0.06% optimize.opt_a.auto_monad_grad : 0.000109s : 0.07% optimize.opt_a.auto_monad_eliminator : 0.000158s : 0.11% optimize.opt_a.cse : 0.000466s : 0.32% optimize.opt_a.a_3 : 0.000578s : 0.40% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000088s : 0.06% optimize.convert_after_rewriter : 0.000019s : 0.01% optimize.order_py_execute_after_rewriter : 0.000014s : 0.01% optimize.opt_b.b_1 : 0.000559s : 0.38% optimize.opt_b.b_2 : 0.000007s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000007s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000027s : 0.02% optimize.cconv : 0.000021s : 0.01% optimize.opt_after_cconv.c_1 : 0.000006s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000010s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.01% optimize.tuple_transform.d_1 : 0.000018s : 0.01% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000053s : 0.04% optimize.cse_after_recomputation.cse : 0.000012s : 0.01% optimize.environ_conv : 0.000027s : 0.02% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000002s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000024s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000491s : 0.34% validate : 0.000052s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.006333s : 4.33% execute : 0.000008s : 0.01% Time group info: ------[substitution.] 0.031384 471 0.01% : 0.000003s : 5: substitution.float_depend_g_call 0.04% : 0.000011s : 17: substitution.float_tuple_getitem_switch 96.47% : 0.030277s : 41: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 5: substitution.graph_param_transform 0.01% : 0.000003s : 3: substitution.incorporate_call 0.04% : 0.000012s : 3: substitution.incorporate_call_switch 2.18% : 0.000683s : 72: substitution.inline 0.02% : 0.000006s : 12: substitution.less_batch_normalization 0.12% : 0.000038s : 23: substitution.meta_unpack_prepare 0.04% : 0.000012s : 14: substitution.minmaximum_grad 0.01% : 0.000003s : 5: substitution.partial_eliminate 0.00% : 0.000001s : 5: substitution.partial_unused_args_eliminate 0.02% : 0.000007s : 54: substitution.remove_not_recompute_node 0.20% : 0.000064s : 44: substitution.replace_applicator 0.02% : 0.000008s : 26: substitution.replace_old_param 0.01% : 0.000003s : 2: substitution.reset_defer_inline 0.02% : 0.000005s : 8: substitution.set_cell_output_no_recompute 0.02% : 0.000007s : 5: substitution.specialize_transform 0.03% : 0.000009s : 6: substitution.switch_simplify 0.05% : 0.000017s : 5: substitution.transpose_eliminate 0.22% : 0.000070s : 19: substitution.tuple_list_convert_item_index_to_positive 0.06% : 0.000018s : 19: substitution.tuple_list_get_item_const_eliminator 0.08% : 0.000024s : 19: substitution.tuple_list_get_item_depend_reorder 0.23% : 0.000072s : 40: substitution.tuple_list_get_item_eliminator 0.08% : 0.000024s : 19: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.082562 6 95.04% : 0.078466s : 3: renormalize.infer 4.96% : 0.004095s : 3: renormalize.specialize ------[replace.] 0.000899 89 55.28% : 0.000497s : 36: replace.getattr_setattr_resolve 23.41% : 0.000210s : 37: replace.inline 4.82% : 0.000043s : 2: replace.meta_unpack_prepare 10.71% : 0.000096s : 6: replace.switch_simplify 0.51% : 0.000005s : 1: replace.transpose_eliminate 5.28% : 0.000047s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.030752 89 97.98% : 0.030130s : 36: match.getattr_setattr_resolve 1.83% : 0.000561s : 37: match.inline 0.09% : 0.000027s : 2: match.meta_unpack_prepare 0.03% : 0.000009s : 6: match.switch_simplify 0.01% : 0.000004s : 1: match.transpose_eliminate 0.07% : 0.000021s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004332 81 67.62% : 0.002929s : 34: func_graph_cloner_run.FuncGraphClonerGraph 32.38% : 0.001403s : 47: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.035363 257 3.26% : 0.001151s : 104: opt.transform.opt_a 1.49% : 0.000527s : 92: opt.transform.opt_b 88.14% : 0.031169s : 12: opt.transform.opt_resolve 0.33% : 0.000116s : 1: opt.transforms.meta_unpack_prepare 6.66% : 0.002356s : 40: opt.transforms.opt_a 0.01% : 0.000005s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000005s : 2: opt.transforms.opt_b 0.07% : 0.000023s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000011s : 3: opt.transforms.special_op_eliminate [INFO] GE(187024,python3.7):2024-01-11-05:30:41.289.644 [scalable_config.cc:55][EVENT]191228 ScalableConfig:device total max size: 34359738368, page_mem_size_total_thresold: 32641751449, uncacheable_size_threshold: 17179869184 [INFO] GE(187024,python3.7):2024-01-11-05:30:41.372.860 [graph_var_manager.cc:1424][EVENT]191228 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(187024,python3.7):2024-01-11-05:30:41.372.942 [graph_manager.cc:1248][EVENT]191228 PreRun:PreRun start: graph node size 3, session id 1, graph id 0, graph name online. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.373.901 [atrace_api.c:28](tid:191228) AtraceCreate start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.373.978 [trace_rb_log.c:84](tid:191228) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.373.992 [atrace_api.c:32](tid:191228) AtraceCreate end [INFO] TDT(187024,python3.7):2024-01-11-05:30:41.374.015 [client_manager.cpp:157][SetProfilingCallback][tid:191228] [TsdClient] set profiling callback success [INFO] GE(187024,python3.7):2024-01-11-05:30:41.374.982 [parallel_partitioner.cc:165][EVENT]191228 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [24] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.023 [parallel_partitioner.cc:178][EVENT]191228 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [16] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.086 [graph_prepare.cc:1378][EVENT]191228 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.769 [graph_manager.cc:1050][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [710] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.798 [graph_manager.cc:1052][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [9] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.941 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.375.972 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.036 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [51] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.051 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.151 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [19] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.164 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.186 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [12] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.299 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.376.320 [graph_manager.cc:1054][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [508] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.383.928 [graph_manager.cc:1055][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [7594] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.877 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.901 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.913 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [4] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.923 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferShapePass is [272] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.932 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [15] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.941 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.950 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [22] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.958 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [21] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.384.966 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferValuePass is [4] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.386.905 [graph_manager.cc:1056][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [2945] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.386.967 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondRemovePass is [4] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.386.984 [graph_prepare.cc:1982][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [47] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.328 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [6] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.349 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.359 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.368 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferShapePass is [178] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.377 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.385 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [6] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.394 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.410 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [9] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.419 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferValuePass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.443 [graph_prepare.cc:1983][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [445] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.467 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.479 [graph_prepare.cc:1984][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [21] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.494 [graph_prepare.cc:1985][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.514 [graph_prepare.cc:1986][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.525 [graph_prepare.cc:1987][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.541 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.552 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.565 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.645 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of EnterPass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.658 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.667 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrintOpPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.676 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.684 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DropOutPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.692 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.700 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.709 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.717 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of StopGradientPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.725 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [0] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.733 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.742 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SnapshotPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.756 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.765 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [4] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.773 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.781 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of IdentityPass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.804 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.818 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.849 [graph_prepare.cc:1988][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [313] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.387.861 [graph_manager.cc:1065][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [925] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.400.923 [graph_manager.cc:1077][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13042] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.400.986 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [6] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.401.034 [graph_manager.cc:1080][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [79] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.558 [graph_manager.cc:1081][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3506] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.595 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.611 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.623 [graph_manager.cc:1082][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [37] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.654 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.670 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.685 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.795 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [99] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.813 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.861 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [36] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.885 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.922 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [26] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.948 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.404.970 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [11] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.017 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [37] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.036 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.049 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.059 [graph_manager.cc:2700][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [409] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.211 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.226 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.236 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.245 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.253 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.262 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CastRemovePass is [10] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.270 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.278 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [5] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.286 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.294 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.302 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.311 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [41] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.319 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [11] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.327 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.341 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.351 [graph_manager.cc:2741][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [273] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.360 [graph_manager.cc:2752][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.384 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.397 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.413 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.429 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.442 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.455 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.481 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.495 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.508 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.518 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.532 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.543 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.562 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [9] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.577 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.587 [graph_manager.cc:2810][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [208] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.614 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of IdentityPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.627 [graph_manager.cc:2821][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [31] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.655 [graph_manager.cc:1087][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [1012] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.792 [graph_manager.cc:1088][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [123] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.838 [graph_manager.cc:1089][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [20] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.857 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.871 [graph_manager.cc:1097][EVENT]191228 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.405.892 [graph_manager.cc:3325][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.310 [engine_place.cc:144][EVENT]191228 Run:The time cost of AIcoreEngine::CheckSupported is [259] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.332 [engine_place.cc:144][EVENT]191228 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [13] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.342 [engine_place.cc:144][EVENT]191228 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [11] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.424 [graph_manager.cc:3351][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [518] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.444 [graph_manager.cc:3364][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.515 [engine_partitioner.cc:1139][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.533 [engine_partitioner.cc:1142][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.664 [engine_partitioner.cc:1148][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [122] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.703 [engine_partitioner.cc:1155][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [26] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.755 [engine_partitioner.cc:1164][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [41] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.790 [graph_manager.cc:3405][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [333] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.406.810 [graph_manager.cc:3412][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.003 [graph_manager.cc:3422][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [12178] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.040 [graph_manager.cc:3428][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.163 [graph_manager.cc:3467][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [101] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.182 [graph_manager.cc:3377][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [12725] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.209 [graph_manager.cc:1106][EVENT]191228 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [13323] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.222 [graph_manager.cc:1115][EVENT]191228 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.253 [graph_manager.cc:1130][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.286 [graph_manager.cc:1131][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [19] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.314 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.333 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.344 [graph_manager.cc:2837][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [41] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.416 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.430 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.439 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.448 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of BitcastPass is [1] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.457 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [5] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.465 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [3] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.474 [graph_manager.cc:2864][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [114] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.492 [graph_manager.cc:2872][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.512 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.527 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.543 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.556 [compile_nodes_pass.cc:88][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.567 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.578 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.681 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [79] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.711 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [17] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.726 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.740 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.753 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.762 [graph_manager.cc:2927][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [253] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.774 [graph_manager.cc:2937][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.791 [graph_manager.cc:2943][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [6] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.419.803 [graph_manager.cc:2950][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.429.976 [graph_manager.cc:2958][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [40] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.018 [graph_manager.cc:1132][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [10717] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.094 [graph_manager.cc:1135][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [60] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.144 [graph_manager.cc:2975][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [32] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.185 [graph_manager.cc:2981][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [27] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.200 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.211 [graph_manager.cc:2986][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.222 [graph_manager.cc:1136][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [110] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.337 [graph_manager.cc:3555][EVENT]191228 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [80] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.441 [engine_partitioner.cc:1139][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.457 [engine_partitioner.cc:1142][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.561 [engine_partitioner.cc:1148][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [93] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.603 [engine_partitioner.cc:1155][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [21] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.643 [engine_partitioner.cc:1164][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [29] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.430.670 [graph_builder.cc:865][EVENT]191228 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [264] micro second. [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:41.431.149 [logger.cc:1071] 191228 ModelBindStream: model_id=1600, stream_id=1857, flag=0. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.431.187 [task_generator.cc:804][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [182] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.431.258 [task_generator.cc:805][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [58] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.431.959 [task_generator.cc:814][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [684] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.431.973 [task_generator.cc:954][EVENT]191228 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [969] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.432.042 [task_generator.cc:967][EVENT]191228 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [42] micro second. [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:41.432.063 [logger.cc:1084] 191228 ModelUnbindStream: model_id=1600, stream_id=1857, [INFO] GE(187024,python3.7):2024-01-11-05:30:41.432.247 [graph_manager.cc:1152][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [1999] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.432.266 [graph_manager.cc:1164][EVENT]191228 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.432.304 [graph_manager.cc:1271][EVENT]191228 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [57441] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.432.316 [graph_manager.cc:1272][EVENT]191228 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.432.650 [atrace_api.c:93](tid:191228) AtraceDestroy start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.432.671 [atrace_api.c:95](tid:191228) AtraceDestroy end [INFO] GE(187024,python3.7):2024-01-11-05:30:41.437.627 [graph_converter.cc:838][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1444] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.437.794 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [124] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.270 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [455] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.465 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [172] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.483 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [192] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.707 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [212] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.732 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.775 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [23] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.438.965 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [176] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.044 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [63] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.057 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [77] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.086 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [19] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.096 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.121 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [16] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.192 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [61] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.258 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [53] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.268 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [65] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.293 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [17] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.304 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.317 [graph_converter.cc:849][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1653] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.439.521 [graph_converter.cc:853][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [194] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.440.179 [graph_converter.cc:857][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [644] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.440.319 [graph_converter.cc:862][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [118] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.522.146 [graph_var_manager.cc:1424][EVENT]191228 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120 [INFO] GE(187024,python3.7):2024-01-11-05:30:41.522.228 [graph_manager.cc:1248][EVENT]191228 PreRun:PreRun start: graph node size 4, session id 2, graph id 1, graph name online. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.522.616 [atrace_api.c:28](tid:191228) AtraceCreate start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.522.657 [trace_rb_log.c:84](tid:191228) [RUNTIME_ATRACE_DEV64_TS0] create ring buffer success, buffer size : 131152. [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.522.669 [atrace_api.c:32](tid:191228) AtraceCreate end [INFO] TDT(187024,python3.7):2024-01-11-05:30:41.522.682 [client_manager.cpp:157][SetProfilingCallback][tid:191228] [TsdClient] set profiling callback success [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.160 [parallel_partitioner.cc:165][EVENT]191228 DoPipelinePartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::PipelinePartition is [18] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.198 [parallel_partitioner.cc:178][EVENT]191228 DoFlowGraphPartition:[GEPERFTRACE] The time cost of OptimizeSubgraph::FlowGraphPartition is [13] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.260 [graph_prepare.cc:1378][EVENT]191228 Init:[GEPERFTRACE] The time cost of FileConstantUtils::ConvertFileConstToConst is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.500 [graph_manager.cc:1050][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareInit is [259] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.528 [graph_manager.cc:1052][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.HandleSummaryOp is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.680 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ForToWhilePass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.713 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::SavePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.763 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::NetOutputPass is [38] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.777 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of ProcessNetOutput::DataPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.823 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of CreateSubGraphWithScopePass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.837 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of SubgraphMultiDimsClonePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.856 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of MultiBatchClonePass is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.954 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitVariableIntoSubgraphPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.523.975 [graph_manager.cc:1054][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.NormalizeGraph is [434] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.524.193 [graph_manager.cc:1055][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeGraphInit is [203] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.306 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [2] micro second, call num is [8] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.332 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [5] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.343 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [4] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.352 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferShapePass is [383] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.361 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [12] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.370 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [2] micro second, call num is [8] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.378 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [19] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.386 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [18] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.525.402 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferValuePass is [7] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.527.460 [graph_manager.cc:1056][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphForQuantize is [3248] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.527.523 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.527.541 [graph_prepare.cc:1982][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::ProcessBeforeInfershape is [52] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.527.998 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [0] micro second, call num is [8] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.019 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [0] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.030 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.039 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferShapePass is [269] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.048 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [8] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.056 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SplitShapeNPass is [0] micro second, call num is [8] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.064 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [9] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.073 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [12] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.081 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of InferValuePass is [4] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.107 [graph_prepare.cc:1983][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::FormatAndShapeProcess is [552] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.131 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreRun::MarkForceUnknownForCondPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.143 [graph_prepare.cc:1984][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::CtrlFlowPreProcess is [19] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.157 [graph_prepare.cc:1985][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::multibatch::GetDynamicOutputShape is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.172 [graph_prepare.cc:1986][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::InsertAippOpUtil::Instance().UpdateDataNodeByAipp is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.183 [graph_prepare.cc:1987][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::SaveOriginalGraphToOmModel is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.198 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ShapeOperateOpRemovePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.212 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::ReplaceTransShapePass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.227 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::MarkAgnosticPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.335 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of EnterPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.348 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondPass is [4] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.357 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrintOpPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.366 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of NoUseReshapeRemovePass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.375 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DropOutPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.383 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssertPass is [3] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.391 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransposeRemovePass is [9] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.400 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of UnusedConstPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.408 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of StopGradientPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.416 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreventGradientPass is [3] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.424 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of PlaceholderWithDefaultPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.432 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SnapshotPass is [3] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.440 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of GuaranteeConstPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.449 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of VarIsInitializedOpPass is [6] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.457 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ParallelConcatStartOpPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.465 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of IdentityPass is [4] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.488 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::PrunePass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.502 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareOptimize::HcclMemcpyPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.538 [graph_prepare.cc:1988][EVENT]191228 PrepareDynShape:[GEPERFTRACE] The time cost of Prepare::PrepareOptimize is [345] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.528.551 [graph_manager.cc:1065][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareDynShape is [1062] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.542.456 [graph_manager.cc:1077][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraph is [13884] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.542.523 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PrepareRunningFormatRefiner::VariablePrepareOpPass is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.542.575 [graph_manager.cc:1080][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.preparer.PrepareRunningFormatRefiner is [85] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.566 [graph_manager.cc:1081][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeOriginalGraphJudgeInsert is [3965] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.603 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of SubexpressionMigrationPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.618 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of UnusedArgsCleanPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.630 [graph_manager.cc:1082][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::SubexpressionMigration is [34] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.662 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeInputMemcpyPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.677 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SwitchDataEdgesBypass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.693 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::ConstantFuseSamePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.799 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CSEBeforeFuseDataNodesWithCommonInputPass is [93] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.815 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::FuseDataNodesWithCommonInputPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.866 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::CommonSubexpressionEliminationPass is [40] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.882 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::PermutePass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.923 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::SameTransdataBreadthFusionPass is [29] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.944 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::VariableOpPass is [8] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.546.993 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpWithoutReshapeFusionPass is [37] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.027 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::TransOpBreadthFusionPass is [21] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.044 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::DataFlowPreparePass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.058 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_1::MergeUnknownShapeNPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.068 [graph_manager.cc:2700][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_1 is [412] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.203 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of EnterPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.218 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AddNPass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.227 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchDeadBranchElimination is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.243 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of SwitchLogicRemovePass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.253 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of MergePass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.262 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CastRemovePass is [23] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.270 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransposeTransDataPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.278 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [3] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.287 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransOpSymmetryEliminationPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.295 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of TransOpNearbyAllreduceFusionPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.303 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReplaceWithEmptyConstPass is [7] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.311 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionComputePass is [6] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.319 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [13] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.327 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [4] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.335 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of UselessControlOutRemovePass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.345 [graph_manager.cc:2741][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_2 is [258] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.354 [graph_manager.cc:2752][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of extern constant folding is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.377 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::Migration is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.391 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ArgsClean is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.409 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::PrunePass is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.425 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::NextIterationPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.438 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ControlTriggerPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.452 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MergeToStreamMergePass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.471 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SwitchToStreamSwitchPass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.486 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::AttachStreamLabelPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.506 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::MultiBatchPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.517 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::SubgraphMultiDimsPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.530 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::IteratorOpPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.541 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::VariableRefUselessControlOutDeletePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.560 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::ReshapeRecoveryPass is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.573 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage1_3::RemoveSameConstPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.583 [graph_manager.cc:2810][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1_3 is [210] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.613 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of IdentityPass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.625 [graph_manager.cc:2821][EVENT]191228 OptimizeStage1:[GEPERFTRACE] The time cost of GraphPrepare::node_pass is [34] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.654 [graph_manager.cc:1087][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage1 is [1005] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.795 [graph_manager.cc:1088][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeAfterStage1 is [125] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.836 [graph_manager.cc:1089][EVENT]191228 PreRunOptimizeOriginalGraph:[GEPERFTRACE] The time cost of GraphManager::GraphUtilsEx::InferShapeInNeed is [20] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.855 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of PreRun::CtrlEdgeTransferPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.873 [graph_manager.cc:1097][EVENT]191228 PreRunOptimizeOriginalGraph:PreRun:PreRunOptimizeOriginalGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.547.894 [graph_manager.cc:3325][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::StagePartition is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.548.850 [engine_place.cc:144][EVENT]191228 Run:The time cost of AIcoreEngine::CheckSupported is [843] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.548.875 [engine_place.cc:144][EVENT]191228 Run:The time cost of DNN_VM_GE_LOCAL_OP_STORE::CheckSupported is [10] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.548.884 [engine_place.cc:144][EVENT]191228 Run:The time cost of DNN_VM_RTS_OP_STORE::CheckSupported is [9] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.548.967 [graph_manager.cc:3351][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::GraphPartitionDynamicShape is [1059] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.548.986 [graph_manager.cc:3364][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::CompositeEngine is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.046 [engine_partitioner.cc:1139][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [18] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.064 [engine_partitioner.cc:1142][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.255 [engine_partitioner.cc:1148][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [170] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.302 [engine_partitioner.cc:1155][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [31] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.354 [engine_partitioner.cc:1164][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [38] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.391 [graph_manager.cc:3405][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::Partition1 is [391] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.549.412 [graph_manager.cc:3412][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPreProc is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.081 [graph_manager.cc:3422][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubGraph is [8652] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.114 [graph_manager.cc:3428][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::SetSubgraphPostProc is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.241 [graph_manager.cc:3467][EVENT]191228 SubgraphPartitionAndOptimization:[GEPERFTRACE] The time cost of OptimizeSubgraph::MergeSubGraph is [107] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.259 [graph_manager.cc:3377][EVENT]191228 OptimizeSubgraph:[GEPERFTRACE] The time cost of OptimizeSubgraph::SubgraphPartitionAndOptimization::AtomicEngine is [9260] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.276 [graph_manager.cc:1106][EVENT]191228 PreRunOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeSubgraph is [10388] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.289 [graph_manager.cc:1115][EVENT]191228 PreRunOptimizeSubGraph:PreRun:PreRunOptimizeSubGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.313 [graph_manager.cc:1130][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.ReplacePrecompiledNodeWithOmGraph is [6] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.347 [graph_manager.cc:1131][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::stages.optimizer.OptimizeWholeGraph is [20] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.372 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::LinkGenMaskNodesPass is [7] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.390 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::HcclContinuousMemcpyPass is [6] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.400 [graph_manager.cc:2837][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses is [37] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.482 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ConstantFoldingPass is [14] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.495 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of ReshapeRemovePass is [1] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.505 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of CondRemovePass is [2] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.513 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of BitcastPass is [3] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.530 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of AssignRemovePass is [6] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.538 [base_pass.cc:339][EVENT]191228 Run:[GEPERFTRACE] The time cost of DimensionAdjustPass is [7] micro second, call num is [4] [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.548 [graph_manager.cc:2864][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::MergedGraphNameToPasses is [131] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.560 [graph_manager.cc:2872][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::RemoveIsolatedConst is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.579 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::MultiBatchPass is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.594 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::RefIdentityDeleteOpPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.611 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::VariableRefDeleteOpPass is [5] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.626 [compile_nodes_pass.cc:88][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.636 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::CompileNodesPass is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.646 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::SwapSpacePass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.727 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::InputOutputConnectionIdentifyPass is [71] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.756 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::AtomicAddrCleanPass is [16] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.770 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::AfterMergePasses::EndOfSequenceAddControlPass is [2] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.784 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::SubgraphPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.797 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize::AttachStreamLabelPass is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.808 [graph_manager.cc:2927][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of OptimizeStage2::ControlAttrOptimize is [231] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.820 [graph_manager.cc:2937][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of ModelBuilder::AssignFunctionalLabels is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.834 [graph_manager.cc:2943][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of MemcpyAddrAsyncPass::Run. is [4] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.558.845 [graph_manager.cc:2950][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of BufferPoolMemoryPass::Run. is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.009 [graph_manager.cc:2958][EVENT]191228 OptimizeStage2:[GEPERFTRACE] The time cost of ParallelGroupPass::Run. is [40] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.040 [graph_manager.cc:1132][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::OptimizeStage2 is [680] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.117 [graph_manager.cc:1135][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::GetCompilerStages(graph_node->GetGraphId()).optimizer.OptimizeGraphBeforeBuild is [55] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.148 [graph_manager.cc:2975][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of HandleMemoryRWConflict is [15] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.177 [graph_manager.cc:2981][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of MemLayoutConflictOptimizer::Run. is [16] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.192 [pass_manager.cc:82][EVENT]191228 Run:[GEPERFTRACE] The time cost of OptimizeStage2::SetFftsPlusAttrPass is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.203 [graph_manager.cc:2986][EVENT]191228 MemConflictProc:[GEPERFTRACE] The time cost of SetFftsPlusAttrPass::last_passes.Run is [14] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.212 [graph_manager.cc:1136][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::MemConflictProc is [79] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.326 [graph_manager.cc:3555][EVENT]191228 Build:[GEPERFTRACE] The time cost of GraphManager::RecoverIrDefinitionAndModifyAippData is [82] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.414 [engine_partitioner.cc:1139][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionInitialize is [17] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.430 [engine_partitioner.cc:1142][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionMarkClusters is [3] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.546 [engine_partitioner.cc:1148][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSplitSubGraphs is [106] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.578 [engine_partitioner.cc:1155][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionSortSubGraphs is [20] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.619 [engine_partitioner.cc:1164][EVENT]191228 PartitionSubGraph:[GEPERFTRACE] The time cost of EnginePartitioner::PartitionAddPartitionsToGraphNode is [29] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.642 [graph_builder.cc:865][EVENT]191228 SecondPartition:[GEPERFTRACE] The time cost of EnginePartitioner::Partition2 is [262] micro second. [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:41.559.968 [logger.cc:1071] 191228 ModelBindStream: model_id=576, stream_id=833, flag=0. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.559.999 [task_generator.cc:804][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::SetStreamCtx is [90] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.560.064 [task_generator.cc:805][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::PrepareForGenerateTask is [52] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.560.735 [task_generator.cc:814][EVENT]191228 GenerateTask:[GEPERFTRACE] The time cost of TaskGenerator::DoGenerateTask is [656] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.560.749 [task_generator.cc:954][EVENT]191228 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::GenerateTask is [840] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.560.809 [task_generator.cc:967][EVENT]191228 GetTaskInfo:[GEPERFTRACE] The time cost of TaskGenerator::AddModelTaskToModel is [34] micro second. [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:41.560.826 [logger.cc:1084] 191228 ModelUnbindStream: model_id=576, stream_id=833, [INFO] GE(187024,python3.7):2024-01-11-05:30:41.561.077 [graph_manager.cc:1152][EVENT]191228 PreRunAfterOptimizeSubGraph:[GEPERFTRACE] The time cost of GraphManager::Build is [1840] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.561.109 [graph_manager.cc:1164][EVENT]191228 PreRunAfterOptimizeSubGraph:PreRun:PreRunAfterOptimizeSubGraph success. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.561.172 [graph_manager.cc:1271][EVENT]191228 PreRun:[GEPERFTRACE] The time cost of FlowModelBuild is [38101] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.561.185 [graph_manager.cc:1272][EVENT]191228 PreRun:[GEPERFTRACE] GE PreRun End [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.561.492 [atrace_api.c:93](tid:191228) AtraceDestroy start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:41.561.508 [atrace_api.c:95](tid:191228) AtraceDestroy end [INFO] GE(187024,python3.7):2024-01-11-05:30:41.567.716 [graph_converter.cc:838][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CreateMainNode is [1883] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.567.878 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [120] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.470 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [569] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.692 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [197] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.711 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [220] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.941 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [217] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.959 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.568.996 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [26] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.253 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [243] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.349 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [77] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.364 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [93] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.397 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [23] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.407 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [0] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.436 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of ZeroCopy is [19] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.530 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CEM is [84] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.608 [copy_flow_launch_fuse.cc:395][EVENT]191228 Run:[GEPERFTRACE] The time cost of Pass::CopyFlowLaunchFuse is [66] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.619 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of CopyFlowLaunch is [77] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.649 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of TrustOutTensor is [21] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.659 [base_optimizer.cc:70][EVENT]191228 Run:[GEPERFTRACE] The time cost of AicpuFuseHostInputs is [1] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.672 [graph_converter.cc:849][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::RunAllPass is [1919] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.569.941 [graph_converter.cc:853][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::TopologicalSorting is [251] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.570.747 [graph_converter.cc:857][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::AppendGraphLevelData is [789] micro second. [INFO] GE(187024,python3.7):2024-01-11-05:30:41.570.912 [graph_converter.cc:862][EVENT]191228 ConvertComputeGraphToExecuteGraph:[GEPERFTRACE] The time cost of ConvertComputeGraphToExecuteGraph::CalculatePriority is [141] micro second. TotalTime = 0.0968719, [20] [parse]: 0.00147682 [symbol_resolve]: 0.0122833, [1] [Cycle 1]: 0.0122159, [1] [resolve]: 0.0121948 [combine_like_graphs]: 7.99999e-07 [graph_reusing]: 3.04e-06 [meta_unpack_prepare]: 0.00012987 [pre_cconv]: 7.00005e-07 [abstract_specialize]: 0.00378347 [pack_expand]: 1.542e-05 [auto_monad]: 8.335e-05 [inline]: 1.66e-06 [pre_auto_parallel]: 9.36e-06 [pipeline_split]: 2.72e-06 [optimize]: 0.0732987, [35] [py_interpret_to_execute]: 4.45e-06 [rewriter_before_opt_a]: 0.00015804 [opt_a]: 0.0720433, [4] [Cycle 1]: 0.0288677, [30] [expand_dump_flag]: 4.38e-06 [switch_simplify]: 2.27e-05 [a_1]: 0.00037714 [recompute_prepare]: 8.61e-06 [updatestate_depend_eliminate]: 9.62e-06 [updatestate_assign_eliminate]: 6.35999e-06 [updatestate_loads_eliminate]: 6.1e-06 [parameter_eliminate]: 4.86999e-06 [a_2]: 7.373e-05 [accelerated_algorithm]: 5.69e-06 [pynative_shard]: 1.59e-06 [auto_parallel]: 3.45e-06 [parallel]: 8.6e-06 [merge_comm]: 4.13e-06 [allreduce_fusion]: 1.97e-06 [virtual_dataset]: 4.71e-06 [get_grad_eliminate_]: 4.11e-06 [virtual_output]: 3.83e-06 [merge_forward]: 8.27e-06 [cell_reuse_recompute_pass]: 8.70001e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.138e-05 [meta_fg_expand]: 0.00176249, [1] [Cycle 1]: 0.00040292, [1] [resolve]: 0.00038424 [after_resolve]: 1.887e-05 [a_after_grad]: 3.136e-05 [renormalize]: 0.0259497 [real_op_eliminate]: 2.326e-05 [auto_monad_grad]: 2.962e-05 [auto_monad_eliminator]: 4.559e-05 [cse]: 0.00011075 [a_3]: 0.00015549 [Cycle 2]: 0.0368311, [30] [expand_dump_flag]: 2.59e-06 [switch_simplify]: 5.715e-05 [a_1]: 0.00037903 [recompute_prepare]: 9.92e-06 [updatestate_depend_eliminate]: 1.146e-05 [updatestate_assign_eliminate]: 8.29e-06 [updatestate_loads_eliminate]: 8.09e-06 [parameter_eliminate]: 3.17e-06 [a_2]: 0.00012106 [accelerated_algorithm]: 1.184e-05 [pynative_shard]: 1.14e-06 [auto_parallel]: 4.14e-06 [parallel]: 4.24e-06 [merge_comm]: 2.17e-06 [allreduce_fusion]: 1.34e-06 [virtual_dataset]: 7.23e-06 [get_grad_eliminate_]: 6.08e-06 [virtual_output]: 5.76e-06 [merge_forward]: 9.61e-06 [cell_reuse_recompute_pass]: 5.69999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.56e-05 [meta_fg_expand]: 0.0102368, [4] [Cycle 1]: 0.00174692, [1] [resolve]: 0.00172694 [Cycle 1]: 0.00031494, [1] [resolve]: 0.00029662 [Cycle 1]: 0.00040036, [1] [resolve]: 0.00038128 [Cycle 1]: 0.00030959, [1] [resolve]: 0.00029118 [after_resolve]: 5.624e-05 [a_after_grad]: 0.00014122 [renormalize]: 0.0249588 [real_op_eliminate]: 3.546e-05 [auto_monad_grad]: 6.791e-05 [auto_monad_eliminator]: 6.617e-05 [cse]: 0.00016888 [a_3]: 0.00026494 [Cycle 3]: 0.00296456, [30] [expand_dump_flag]: 2.92e-06 [switch_simplify]: 8.943e-05 [a_1]: 0.00066967 [recompute_prepare]: 1.227e-05 [updatestate_depend_eliminate]: 1.421e-05 [updatestate_assign_eliminate]: 1.144e-05 [updatestate_loads_eliminate]: 1.126e-05 [parameter_eliminate]: 3.53e-06 [a_2]: 0.00017391 [accelerated_algorithm]: 1.629e-05 [pynative_shard]: 1.2e-06 [auto_parallel]: 3.97e-06 [parallel]: 4.32001e-06 [merge_comm]: 2.78e-06 [allreduce_fusion]: 1.95e-06 [virtual_dataset]: 9.38e-06 [get_grad_eliminate_]: 8.54e-06 [virtual_output]: 8.43999e-06 [merge_forward]: 1.227e-05 [cell_reuse_recompute_pass]: 4.30002e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.184e-05 [meta_fg_expand]: 3.255e-05 [after_resolve]: 1.204e-05 [a_after_grad]: 1.48e-05 [renormalize]: 0.0014456 [real_op_eliminate]: 1.429e-05 [auto_monad_grad]: 5.37001e-06 [auto_monad_eliminator]: 2.425e-05 [cse]: 0.00010558 [a_3]: 8.081e-05 [Cycle 4]: 0.00080248, [30] [expand_dump_flag]: 1.24999e-06 [switch_simplify]: 9.53001e-06 [a_1]: 0.00017012 [recompute_prepare]: 1.111e-05 [updatestate_depend_eliminate]: 1.423e-05 [updatestate_assign_eliminate]: 1.149e-05 [updatestate_loads_eliminate]: 1.112e-05 [parameter_eliminate]: 1.98e-06 [a_2]: 0.00017204 [accelerated_algorithm]: 1.597e-05 [pynative_shard]: 1.32999e-06 [auto_parallel]: 3.53e-06 [parallel]: 3.68e-06 [merge_comm]: 2.37999e-06 [allreduce_fusion]: 1.62e-06 [virtual_dataset]: 9.5e-06 [get_grad_eliminate_]: 8.84e-06 [virtual_output]: 8.28e-06 [merge_forward]: 1.277e-05 [cell_reuse_recompute_pass]: 4.00003e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.193e-05 [meta_fg_expand]: 9.20999e-06 [after_resolve]: 1.159e-05 [a_after_grad]: 1.561e-05 [renormalize]: 7.0002e-08 [real_op_eliminate]: 8.35e-06 [auto_monad_grad]: 2.11001e-06 [auto_monad_eliminator]: 2.142e-05 [cse]: 5.559e-05 [a_3]: 7.355e-05 [py_interpret_to_execute_after_opt_a]: 3.87e-06 [slice_cell_reuse_recomputed_activation]: 2.75e-06 [rewriter_after_opt_a]: 7.143e-05 [convert_after_rewriter]: 1.78e-05 [order_py_execute_after_rewriter]: 1.278e-05 [opt_b]: 0.00064172, [2] [Cycle 1]: 0.0005416, [7] [b_1]: 0.00047952 [b_2]: 4.07e-06 [updatestate_depend_eliminate]: 3.94e-06 [updatestate_assign_eliminate]: 3e-06 [updatestate_loads_eliminate]: 2.88e-06 [renormalize]: 3.39998e-07 [cse]: 1.506e-05 [Cycle 2]: 9.085e-05, [7] [b_1]: 4.618e-05 [b_2]: 2.78e-06 [updatestate_depend_eliminate]: 2.76e-06 [updatestate_assign_eliminate]: 2.47e-06 [updatestate_loads_eliminate]: 2.26e-06 [renormalize]: 7.99992e-08 [cse]: 9.57e-06 [cconv]: 2.135e-05 [opt_after_cconv]: 5.431e-05, [1] [Cycle 1]: 5.018e-05, [7] [c_1]: 6.09001e-06 [parameter_eliminate]: 1.83e-06 [updatestate_depend_eliminate]: 2.76e-06 [updatestate_assign_eliminate]: 2.24e-06 [updatestate_loads_eliminate]: 2.13e-06 [cse]: 9.91e-06 [renormalize]: 2.70004e-07 [remove_dup_value]: 1.128e-05 [tuple_transform]: 3.932e-05, [1] [Cycle 1]: 3.6e-05, [3] [d_1]: 1.676e-05 [d_2]: 7.56e-06 [renormalize]: 2.00002e-07 [add_cache_embedding]: 1.154e-05 [add_recomputation]: 4.373e-05 [cse_after_recomputation]: 1.891e-05, [1] [Cycle 1]: 1.514e-05, [1] [cse]: 1.089e-05 [environ_conv]: 8.36e-06 [label_micro_interleaved_index]: 2.28e-06 [label_fine_grained_interleaved_index]: 2.2e-06 [assign_add_opt]: 1.43e-06 [slice_recompute_activation]: 2.11001e-06 [micro_interleaved_order_control]: 1.96e-06 [full_micro_interleaved_order_control]: 1.97e-06 [comp_comm_scheduling]: 2.09999e-06 [reorder_send_recv_between_fp_bp]: 2.36e-06 [comm_op_add_attrs]: 1.23e-06 [add_comm_op_reuse_tag]: 8.70001e-07 [overlap_opt_shard_in_pipeline]: 1.3e-06 [grouped_pairwise_exchange_alltoall]: 1.25e-06 [overlap_recompute_and_grad_model_parallel]: 1.53e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.29997e-07 [split_matmul_comm_elemetwise]: 2.13e-06 [split_layernorm_comm]: 1.86e-06 [process_send_recv_for_ge]: 8.19993e-07 [handle_group_info]: 9.20001e-07 [auto_monad_reorder]: 1.965e-05 [get_jit_bprop_graph]: 6.69999e-07 [eliminate_special_op_node]: 0.000472 [validate]: 3.245e-05 [distribtued_split]: 1.25e-06 [task_emit]: 0.00506067 [execute]: 7.22e-06 Sums parse : 0.001477s : 1.75% symbol_resolve.resolve : 0.012195s : 14.47% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000130s : 0.15% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.003783s : 4.49% pack_expand : 0.000015s : 0.02% auto_monad : 0.000083s : 0.10% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000009s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.01% optimize.rewriter_before_opt_a : 0.000158s : 0.19% optimize.opt_a.expand_dump_flag : 0.000011s : 0.01% optimize.opt_a.switch_simplify : 0.000179s : 0.21% optimize.opt_a.a_1 : 0.001596s : 1.89% optimize.opt_a.recompute_prepare : 0.000042s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000050s : 0.06% optimize.opt_a.updatestate_assign_eliminate : 0.000038s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000037s : 0.04% optimize.opt_a.parameter_eliminate : 0.000014s : 0.02% optimize.opt_a.a_2 : 0.000541s : 0.64% optimize.opt_a.accelerated_algorithm : 0.000050s : 0.06% optimize.opt_a.pynative_shard : 0.000005s : 0.01% optimize.opt_a.auto_parallel : 0.000015s : 0.02% optimize.opt_a.parallel : 0.000021s : 0.02% optimize.opt_a.merge_comm : 0.000011s : 0.01% optimize.opt_a.allreduce_fusion : 0.000007s : 0.01% optimize.opt_a.virtual_dataset : 0.000031s : 0.04% optimize.opt_a.get_grad_eliminate_ : 0.000028s : 0.03% optimize.opt_a.virtual_output : 0.000026s : 0.03% optimize.opt_a.merge_forward : 0.000043s : 0.05% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000071s : 0.08% optimize.opt_a.meta_fg_expand : 0.000042s : 0.05% optimize.opt_a.meta_fg_expand.resolve : 0.003080s : 3.65% optimize.opt_a.after_resolve : 0.000099s : 0.12% optimize.opt_a.a_after_grad : 0.000203s : 0.24% optimize.opt_a.renormalize : 0.052354s : 62.12% optimize.opt_a.real_op_eliminate : 0.000081s : 0.10% optimize.opt_a.auto_monad_grad : 0.000105s : 0.12% optimize.opt_a.auto_monad_eliminator : 0.000157s : 0.19% optimize.opt_a.cse : 0.000441s : 0.52% optimize.opt_a.a_3 : 0.000575s : 0.68% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000071s : 0.08% optimize.convert_after_rewriter : 0.000018s : 0.02% optimize.order_py_execute_after_rewriter : 0.000013s : 0.02% optimize.opt_b.b_1 : 0.000526s : 0.62% optimize.opt_b.b_2 : 0.000007s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000007s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000025s : 0.03% optimize.cconv : 0.000021s : 0.03% optimize.opt_after_cconv.c_1 : 0.000006s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000010s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.01% optimize.tuple_transform.d_1 : 0.000017s : 0.02% optimize.tuple_transform.d_2 : 0.000008s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.01% optimize.add_recomputation : 0.000044s : 0.05% optimize.cse_after_recomputation.cse : 0.000011s : 0.01% optimize.environ_conv : 0.000008s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000020s : 0.02% get_jit_bprop_graph : 0.000001s : 0.00% eliminate_special_op_node : 0.000472s : 0.56% validate : 0.000032s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.005061s : 6.00% execute : 0.000007s : 0.01% Time group info: ------[substitution.] 0.015251 471 0.02% : 0.000004s : 5: substitution.float_depend_g_call 0.07% : 0.000011s : 17: substitution.float_tuple_getitem_switch 93.22% : 0.014217s : 41: substitution.getattr_setattr_resolve 0.03% : 0.000005s : 5: substitution.graph_param_transform 0.02% : 0.000003s : 3: substitution.incorporate_call 0.01% : 0.000002s : 3: substitution.incorporate_call_switch 4.31% : 0.000657s : 72: substitution.inline 0.04% : 0.000006s : 12: substitution.less_batch_normalization 0.19% : 0.000029s : 23: substitution.meta_unpack_prepare 0.08% : 0.000013s : 14: substitution.minmaximum_grad 0.02% : 0.000003s : 5: substitution.partial_eliminate 0.01% : 0.000001s : 5: substitution.partial_unused_args_eliminate 0.04% : 0.000006s : 54: substitution.remove_not_recompute_node 0.41% : 0.000062s : 44: substitution.replace_applicator 0.05% : 0.000008s : 26: substitution.replace_old_param 0.02% : 0.000003s : 2: substitution.reset_defer_inline 0.04% : 0.000006s : 8: substitution.set_cell_output_no_recompute 0.05% : 0.000007s : 5: substitution.specialize_transform 0.06% : 0.000010s : 6: substitution.switch_simplify 0.11% : 0.000017s : 5: substitution.transpose_eliminate 0.30% : 0.000045s : 19: substitution.tuple_list_convert_item_index_to_positive 0.12% : 0.000018s : 19: substitution.tuple_list_get_item_const_eliminator 0.16% : 0.000024s : 19: substitution.tuple_list_get_item_depend_reorder 0.46% : 0.000071s : 40: substitution.tuple_list_get_item_eliminator 0.16% : 0.000024s : 19: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.052340 6 92.76% : 0.048549s : 3: renormalize.infer 7.24% : 0.003791s : 3: renormalize.specialize ------[replace.] 0.000847 89 56.49% : 0.000479s : 36: replace.getattr_setattr_resolve 24.48% : 0.000207s : 37: replace.inline 5.13% : 0.000043s : 2: replace.meta_unpack_prepare 7.84% : 0.000066s : 6: replace.switch_simplify 0.53% : 0.000004s : 1: replace.transpose_eliminate 5.53% : 0.000047s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.014716 89 95.96% : 0.014121s : 36: match.getattr_setattr_resolve 3.69% : 0.000543s : 37: match.inline 0.12% : 0.000018s : 2: match.meta_unpack_prepare 0.07% : 0.000010s : 6: match.switch_simplify 0.03% : 0.000004s : 1: match.transpose_eliminate 0.14% : 0.000021s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004082 81 68.79% : 0.002808s : 34: func_graph_cloner_run.FuncGraphClonerGraph 31.21% : 0.001274s : 47: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.019170 257 5.95% : 0.001140s : 104: opt.transform.opt_a 2.58% : 0.000495s : 92: opt.transform.opt_b 78.69% : 0.015084s : 12: opt.transform.opt_resolve 0.56% : 0.000108s : 1: opt.transforms.meta_unpack_prepare 11.99% : 0.002299s : 40: opt.transforms.opt_a 0.02% : 0.000005s : 1: opt.transforms.opt_after_cconv 0.03% : 0.000005s : 2: opt.transforms.opt_b 0.12% : 0.000023s : 2: opt.transforms.opt_trans_graph 0.06% : 0.000011s : 3: opt.transforms.special_op_eliminate . TotalTime = 0.0964158, [20] [parse]: 0.00126642 [symbol_resolve]: 0.0120699, [1] [Cycle 1]: 0.0120124, [1] [resolve]: 0.0119941 [combine_like_graphs]: 6.30003e-07 [graph_reusing]: 3.17e-06 [meta_unpack_prepare]: 0.00014919 [pre_cconv]: 6.69999e-07 [abstract_specialize]: 0.00358416 [pack_expand]: 1.29e-05 [auto_monad]: 6.458e-05 [inline]: 1.3e-06 [pre_auto_parallel]: 6.98e-06 [pipeline_split]: 1.78e-06 [optimize]: 0.0759872, [35] [py_interpret_to_execute]: 4.07e-06 [rewriter_before_opt_a]: 0.00015405 [opt_a]: 0.074751, [4] [Cycle 1]: 0.0294948, [30] [expand_dump_flag]: 3e-06 [switch_simplify]: 2.524e-05 [a_1]: 0.00065744 [recompute_prepare]: 6.94e-06 [updatestate_depend_eliminate]: 8.38e-06 [updatestate_assign_eliminate]: 6.54e-06 [updatestate_loads_eliminate]: 5.54e-06 [parameter_eliminate]: 4.27e-06 [a_2]: 6.899e-05 [accelerated_algorithm]: 5.01e-06 [pynative_shard]: 1.15e-06 [auto_parallel]: 3.29e-06 [parallel]: 5.59e-06 [merge_comm]: 2.55e-06 [allreduce_fusion]: 1.6e-06 [virtual_dataset]: 4.69e-06 [get_grad_eliminate_]: 4.22999e-06 [virtual_output]: 4.06e-06 [merge_forward]: 6.23e-06 [cell_reuse_recompute_pass]: 5.49997e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.081e-05 [meta_fg_expand]: 0.00168084, [1] [Cycle 1]: 0.00040832, [1] [resolve]: 0.00038986 [after_resolve]: 1.953e-05 [a_after_grad]: 3.673e-05 [renormalize]: 0.0264106 [real_op_eliminate]: 2.378e-05 [auto_monad_grad]: 3.013e-05 [auto_monad_eliminator]: 4.25e-05 [cse]: 9.732e-05 [a_3]: 0.00015334 [Cycle 2]: 0.0377741, [30] [expand_dump_flag]: 2.18e-06 [switch_simplify]: 7.054e-05 [a_1]: 0.00084705 [recompute_prepare]: 8.9e-06 [updatestate_depend_eliminate]: 1.073e-05 [updatestate_assign_eliminate]: 8.53e-06 [updatestate_loads_eliminate]: 8.03e-06 [parameter_eliminate]: 3.38e-06 [a_2]: 0.00011549 [accelerated_algorithm]: 1.043e-05 [pynative_shard]: 1.16001e-06 [auto_parallel]: 3.45e-06 [parallel]: 3.86999e-06 [merge_comm]: 2.21e-06 [allreduce_fusion]: 1.31e-06 [virtual_dataset]: 7.2e-06 [get_grad_eliminate_]: 6.44e-06 [virtual_output]: 6.25e-06 [merge_forward]: 9.53e-06 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 1.528e-05 [meta_fg_expand]: 0.0103193, [4] [Cycle 1]: 0.00176777, [1] [resolve]: 0.00174848 [Cycle 1]: 0.0003211, [1] [resolve]: 0.00030281 [Cycle 1]: 0.00040487, [1] [resolve]: 0.00038713 [Cycle 1]: 0.00031164, [1] [resolve]: 0.00029368 [after_resolve]: 5.895e-05 [a_after_grad]: 0.00016433 [renormalize]: 0.0253053 [real_op_eliminate]: 3.763e-05 [auto_monad_grad]: 6.873e-05 [auto_monad_eliminator]: 6.761e-05 [cse]: 0.00018315 [a_3]: 0.00026382 [Cycle 3]: 0.00379531, [30] [expand_dump_flag]: 3.01e-06 [switch_simplify]: 0.00011442 [a_1]: 0.00137465 [recompute_prepare]: 1.163e-05 [updatestate_depend_eliminate]: 1.459e-05 [updatestate_assign_eliminate]: 1.144e-05 [updatestate_loads_eliminate]: 1.103e-05 [parameter_eliminate]: 3.95e-06 [a_2]: 0.00016887 [accelerated_algorithm]: 1.619e-05 [pynative_shard]: 1.22e-06 [auto_parallel]: 4.12e-06 [parallel]: 4.1e-06 [merge_comm]: 2.82e-06 [allreduce_fusion]: 1.94999e-06 [virtual_dataset]: 9.82e-06 [get_grad_eliminate_]: 9.29e-06 [virtual_output]: 8.93e-06 [merge_forward]: 1.259e-05 [cell_reuse_recompute_pass]: 4.30002e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.124e-05 [meta_fg_expand]: 3.237e-05 [after_resolve]: 1.327e-05 [a_after_grad]: 2.184e-05 [renormalize]: 0.00151617 [real_op_eliminate]: 1.546e-05 [auto_monad_grad]: 5.28e-06 [auto_monad_eliminator]: 2.465e-05 [cse]: 0.00010367 [a_3]: 0.00010973 [Cycle 4]: 0.00109205, [30] [expand_dump_flag]: 1.42e-06 [switch_simplify]: 9.34e-06 [a_1]: 0.00045479 [recompute_prepare]: 1.101e-05 [updatestate_depend_eliminate]: 1.468e-05 [updatestate_assign_eliminate]: 1.113e-05 [updatestate_loads_eliminate]: 1.09e-05 [parameter_eliminate]: 2.28e-06 [a_2]: 0.0001697 [accelerated_algorithm]: 1.635e-05 [pynative_shard]: 1.26e-06 [auto_parallel]: 3.36e-06 [parallel]: 3.76999e-06 [merge_comm]: 2.39e-06 [allreduce_fusion]: 1.6e-06 [virtual_dataset]: 9.88e-06 [get_grad_eliminate_]: 9.25e-06 [virtual_output]: 8.87e-06 [merge_forward]: 1.227e-05 [cell_reuse_recompute_pass]: 3.89999e-07 [cell_reuse_handle_not_recompute_node_pass]: 2.108e-05 [meta_fg_expand]: 8.88e-06 [after_resolve]: 1.195e-05 [a_after_grad]: 2.17e-05 [renormalize]: 7.99992e-08 [real_op_eliminate]: 8.96e-06 [auto_monad_grad]: 2.42001e-06 [auto_monad_eliminator]: 2.215e-05 [cse]: 5.682e-05 [a_3]: 7.249e-05 [py_interpret_to_execute_after_opt_a]: 3.73001e-06 [slice_cell_reuse_recomputed_activation]: 1.99e-06 [rewriter_after_opt_a]: 6.977e-05 [convert_after_rewriter]: 1.664e-05 [order_py_execute_after_rewriter]: 1.26e-05 [opt_b]: 0.00064207, [2] [Cycle 1]: 0.00054154, [7] [b_1]: 0.00048116 [b_2]: 3.38e-06 [updatestate_depend_eliminate]: 3.98e-06 [updatestate_assign_eliminate]: 2.9e-06 [updatestate_loads_eliminate]: 2.72e-06 [renormalize]: 2.80001e-07 [cse]: 1.517e-05 [Cycle 2]: 9.13e-05, [7] [b_1]: 4.585e-05 [b_2]: 2.55999e-06 [updatestate_depend_eliminate]: 2.9e-06 [updatestate_assign_eliminate]: 2.44e-06 [updatestate_loads_eliminate]: 2.34e-06 [renormalize]: 7.99992e-08 [cse]: 1.024e-05 [cconv]: 1.542e-05 [opt_after_cconv]: 6.632e-05, [1] [Cycle 1]: 6.192e-05, [7] [c_1]: 1.619e-05 [parameter_eliminate]: 1.81e-06 [updatestate_depend_eliminate]: 2.78e-06 [updatestate_assign_eliminate]: 2.29e-06 [updatestate_loads_eliminate]: 2.19e-06 [cse]: 1.03e-05 [renormalize]: 2.50002e-07 [remove_dup_value]: 8.35e-06 [tuple_transform]: 4.716e-05, [1] [Cycle 1]: 4.361e-05, [3] [d_1]: 2.559e-05 [d_2]: 6.91e-06 [renormalize]: 1.8e-07 [add_cache_embedding]: 8.21e-06 [add_recomputation]: 3.588e-05 [cse_after_recomputation]: 1.905e-05, [1] [Cycle 1]: 1.488e-05, [1] [cse]: 1.066e-05 [environ_conv]: 6.63e-06 [label_micro_interleaved_index]: 1.80001e-06 [label_fine_grained_interleaved_index]: 2.11e-06 [assign_add_opt]: 1.19e-06 [slice_recompute_activation]: 1.64e-06 [micro_interleaved_order_control]: 1.15e-06 [full_micro_interleaved_order_control]: 1.11e-06 [comp_comm_scheduling]: 1.29e-06 [reorder_send_recv_between_fp_bp]: 1.61e-06 [comm_op_add_attrs]: 5.79996e-07 [add_comm_op_reuse_tag]: 6.90001e-07 [overlap_opt_shard_in_pipeline]: 6.49998e-07 [grouped_pairwise_exchange_alltoall]: 5.60001e-07 [overlap_recompute_and_grad_model_parallel]: 1.5e-06 [overlap_grad_matmul_and_grad_allreduce]: 5.29995e-07 [split_matmul_comm_elemetwise]: 1.39e-06 [split_layernorm_comm]: 1.18e-06 [process_send_recv_for_ge]: 5.69999e-07 [handle_group_info]: 5.99997e-07 [auto_monad_reorder]: 1.398e-05 [get_jit_bprop_graph]: 3.05e-06 [eliminate_special_op_node]: 0.00048746 [validate]: 3.212e-05 [distribtued_split]: 1.09e-06 [task_emit]: 0.0025433 [execute]: 4.4e-06 Sums parse : 0.001266s : 1.51% symbol_resolve.resolve : 0.011994s : 14.30% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000003s : 0.00% meta_unpack_prepare : 0.000149s : 0.18% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.003584s : 4.27% pack_expand : 0.000013s : 0.02% auto_monad : 0.000065s : 0.08% inline : 0.000001s : 0.00% pre_auto_parallel : 0.000007s : 0.01% pipeline_split : 0.000002s : 0.00% optimize.py_interpret_to_execute : 0.000004s : 0.00% optimize.rewriter_before_opt_a : 0.000154s : 0.18% optimize.opt_a.expand_dump_flag : 0.000010s : 0.01% optimize.opt_a.switch_simplify : 0.000220s : 0.26% optimize.opt_a.a_1 : 0.003334s : 3.97% optimize.opt_a.recompute_prepare : 0.000038s : 0.05% optimize.opt_a.updatestate_depend_eliminate : 0.000048s : 0.06% optimize.opt_a.updatestate_assign_eliminate : 0.000038s : 0.04% optimize.opt_a.updatestate_loads_eliminate : 0.000036s : 0.04% optimize.opt_a.parameter_eliminate : 0.000014s : 0.02% optimize.opt_a.a_2 : 0.000523s : 0.62% optimize.opt_a.accelerated_algorithm : 0.000048s : 0.06% optimize.opt_a.pynative_shard : 0.000005s : 0.01% optimize.opt_a.auto_parallel : 0.000014s : 0.02% optimize.opt_a.parallel : 0.000017s : 0.02% optimize.opt_a.merge_comm : 0.000010s : 0.01% optimize.opt_a.allreduce_fusion : 0.000006s : 0.01% optimize.opt_a.virtual_dataset : 0.000032s : 0.04% optimize.opt_a.get_grad_eliminate_ : 0.000029s : 0.03% optimize.opt_a.virtual_output : 0.000028s : 0.03% optimize.opt_a.merge_forward : 0.000041s : 0.05% optimize.opt_a.cell_reuse_recompute_pass : 0.000002s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000068s : 0.08% optimize.opt_a.meta_fg_expand : 0.000041s : 0.05% optimize.opt_a.meta_fg_expand.resolve : 0.003122s : 3.72% optimize.opt_a.after_resolve : 0.000104s : 0.12% optimize.opt_a.a_after_grad : 0.000245s : 0.29% optimize.opt_a.renormalize : 0.053232s : 63.46% optimize.opt_a.real_op_eliminate : 0.000086s : 0.10% optimize.opt_a.auto_monad_grad : 0.000107s : 0.13% optimize.opt_a.auto_monad_eliminator : 0.000157s : 0.19% optimize.opt_a.cse : 0.000441s : 0.53% optimize.opt_a.a_3 : 0.000599s : 0.71% optimize.py_interpret_to_execute_after_opt_a : 0.000004s : 0.00% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000070s : 0.08% optimize.convert_after_rewriter : 0.000017s : 0.02% optimize.order_py_execute_after_rewriter : 0.000013s : 0.02% optimize.opt_b.b_1 : 0.000527s : 0.63% optimize.opt_b.b_2 : 0.000006s : 0.01% optimize.opt_b.updatestate_depend_eliminate : 0.000007s : 0.01% optimize.opt_b.updatestate_assign_eliminate : 0.000005s : 0.01% optimize.opt_b.updatestate_loads_eliminate : 0.000005s : 0.01% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000025s : 0.03% optimize.cconv : 0.000015s : 0.02% optimize.opt_after_cconv.c_1 : 0.000016s : 0.02% optimize.opt_after_cconv.parameter_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000010s : 0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000008s : 0.01% optimize.tuple_transform.d_1 : 0.000026s : 0.03% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000008s : 0.01% optimize.add_recomputation : 0.000036s : 0.04% optimize.cse_after_recomputation.cse : 0.000011s : 0.01% optimize.environ_conv : 0.000007s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000001s : 0.00% optimize.full_micro_interleaved_order_control : 0.000001s : 0.00% optimize.comp_comm_scheduling : 0.000001s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000001s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000001s : 0.00% optimize.split_layernorm_comm : 0.000001s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000014s : 0.02% get_jit_bprop_graph : 0.000003s : 0.00% eliminate_special_op_node : 0.000487s : 0.58% validate : 0.000032s : 0.04% distribtued_split : 0.000001s : 0.00% task_emit : 0.002543s : 3.03% execute : 0.000004s : 0.01% Time group info: ------[substitution.] 0.015080 535 0.02% : 0.000003s : 6: substitution.float_depend_g_call 0.07% : 0.000011s : 17: substitution.float_tuple_getitem_switch 93.24% : 0.014061s : 41: substitution.getattr_setattr_resolve 0.03% : 0.000004s : 5: substitution.graph_param_transform 0.01% : 0.000002s : 3: substitution.incorporate_call 0.01% : 0.000001s : 3: substitution.incorporate_call_switch 4.25% : 0.000641s : 78: substitution.inline 0.03% : 0.000005s : 12: substitution.less_batch_normalization 0.22% : 0.000034s : 42: substitution.meta_unpack_prepare 0.09% : 0.000014s : 19: substitution.minmaximum_grad 0.02% : 0.000003s : 6: substitution.partial_eliminate 0.01% : 0.000001s : 5: substitution.partial_unused_args_eliminate 0.04% : 0.000006s : 54: substitution.remove_not_recompute_node 0.44% : 0.000066s : 50: substitution.replace_applicator 0.05% : 0.000007s : 26: substitution.replace_old_param 0.02% : 0.000003s : 2: substitution.reset_defer_inline 0.03% : 0.000004s : 8: substitution.set_cell_output_no_recompute 0.04% : 0.000006s : 5: substitution.specialize_transform 0.05% : 0.000008s : 6: substitution.switch_simplify 0.08% : 0.000011s : 6: substitution.transpose_eliminate 0.30% : 0.000045s : 24: substitution.tuple_list_convert_item_index_to_positive 0.14% : 0.000020s : 24: substitution.tuple_list_get_item_const_eliminator 0.18% : 0.000028s : 24: substitution.tuple_list_get_item_depend_reorder 0.46% : 0.000070s : 45: substitution.tuple_list_get_item_eliminator 0.18% : 0.000027s : 24: substitution.tuple_list_get_set_item_eliminator ------[renormalize.] 0.053219 6 92.90% : 0.049442s : 3: renormalize.infer 7.10% : 0.003777s : 3: renormalize.specialize ------[replace.] 0.000843 89 56.25% : 0.000474s : 36: replace.getattr_setattr_resolve 24.53% : 0.000207s : 37: replace.inline 5.14% : 0.000043s : 2: replace.meta_unpack_prepare 7.98% : 0.000067s : 6: replace.switch_simplify 0.69% : 0.000006s : 1: replace.transpose_eliminate 5.41% : 0.000046s : 7: replace.tuple_list_get_item_eliminator ------[match.] 0.014537 89 96.10% : 0.013970s : 36: match.getattr_setattr_resolve 3.61% : 0.000525s : 37: match.inline 0.11% : 0.000016s : 2: match.meta_unpack_prepare 0.05% : 0.000008s : 6: match.switch_simplify 0.02% : 0.000003s : 1: match.transpose_eliminate 0.11% : 0.000016s : 7: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.004015 81 68.70% : 0.002758s : 34: func_graph_cloner_run.FuncGraphClonerGraph 31.30% : 0.001256s : 47: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 0.020775 587 0.64% : 0.000133s : 2: opt.transform.meta_unpack_prepare 24.89% : 0.005172s : 461: opt.transform.opt_a 0.06% : 0.000013s : 7: opt.transform.opt_after_cconv 2.40% : 0.000499s : 94: opt.transform.opt_b 71.82% : 0.014920s : 12: opt.transform.opt_resolve 0.14% : 0.000028s : 8: opt.transform.opt_trans_graph 0.05% : 0.000011s : 3: opt.transform.special_op_eliminate . ============================== 2 passed in 20.48s ============================== [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.025.701 [status:INIT] [ge_api.cc:463]187024 ~Session:Start to destruct session. [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.026.057 [status:RUNNING] [ge_api.cc:475]187024 ~Session:Session id is 0 [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.026.078 [status:RUNNING] [ge_api.cc:476]187024 ~Session:Destroying session [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.026.953 [status:STOP] [ge_api.cc:491]187024 ~Session:Session Destructor finished [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.026.982 [status:INIT] [ge_api.cc:301]187024 GEFinalize:GEFinalize start [INFO] GE(187024,python3.7):2024-01-11-05:30:44.027.048 [execution_runtime.cc:80][EVENT]187024 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(187024,python3.7):2024-01-11-05:30:44.027.066 [execution_runtime.cc:92][EVENT]187024 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.027.077 [status:RUNNING] [ge_api.cc:313]187024 GEFinalize:Finalizing environment [INFO] TUNE(187024,python3.7):2024-01-11-05:30:44.318.040 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:187024]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(187024,python3.7):2024-01-11-05:30:44.318.079 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:187024]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" [INFO] GE(187024,python3.7):2024-01-11-05:30:44.319.431 [gelib.cc:324][EVENT]187024 SystemFinalize:Online infer finalize GELib success. [TRACE] GE(187024,python3.7):2024-01-11-05:30:44.445.658 [status:STOP] [ge_api.cc:341]187024 GEFinalize:GEFinalize finished [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.895.395 [process_mode_manager.cpp:184][Close][tid:187024] [TsdClient] Close [deviceId=5][sessionId=1] hccp and computer enter [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.895.435 [version_verify.cpp:112][SpecialFeatureCheck][tid:187024] VersionVerify: previous type[7], supported [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.895.469 [process_mode_manager.cpp:192][Close][tid:187024] [TsdClient][deviceId=5] [sessionId=1] wait hccp and computer process close respond [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.926.483 [process_mode_manager.cpp:197][Close][tid:187024] [TsdClient][logicDeviceId_=5]has recv close hccp and computer process respond [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.926.499 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:187024] enter into CloseInHost deviceid[5] [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.926.509 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:187024] host cpu not support [INFO] TDT(187024,python3.7):2024-01-11-05:30:44.926.544 [process_mode_manager.cpp:208][Close][tid:187024] [TsdClient][deviceId=5] [sessionId=1] close hccp and computer process success [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:44.926.558 [atrace_api.c:93](tid:187024) AtraceDestroy start [INFO] ATRACE(187024,python3.7):2024-01-11-05:30:44.926.573 [atrace_api.c:95](tid:187024) AtraceDestroy end [INFO] PROFILING(187024,python3.7):2024-01-11-05:30:44.926.593 [msprofiler_impl.cpp:156] >>> (tid:187024) ProfNotifySetDevice called, is open: 0, devId: 5 [INFO] RUNTIME(187024,python3.7):2024-01-11-05:30:46.622.202 [runtime.cc:1737] 187024 ~Runtime: deconstruct runtime.