============================= test session starts ==============================
platform linux -- Python 3.7.5, pytest-5.4.3, py-1.8.1, pluggy-0.13.1
rootdir: /home/jenkins/mindspore/testcases/testcases/tests/st/ops/ascend/test_aicpu_ops, inifile: /home/jenkins/sault/virtual_test/virtualenv_003/sault/config/pytest.ini
plugins: anyio-3.7.1, xdist-1.32.0, forked-1.1.3
[INFO] ATRACE(131377,python3.7):2024-01-11-05:35:59.109.774 [trace_attr.c:105](tid:131377) platform is 1.
[INFO] ATRACE(131377,python3.7):2024-01-11-05:35:59.109.951 [trace_recorder.c:114](tid:131377) use root path: /home/jenkins/ascend/atrace
[INFO] ATRACE(131377,python3.7):2024-01-11-05:35:59.109.980 [trace_signal.c:133](tid:131377) register signal handler for signo 2 succeed.
[INFO] ATRACE(131377,python3.7):2024-01-11-05:35:59.109.991 [trace_signal.c:133](tid:131377) register signal handler for signo 15 succeed.
[INFO] RUNTIME(131377,python3.7):2024-01-11-05:35:59.519.511 [runtime.cc:1159] 131377 GetAicoreNumByLevel: workingDev_=0
[INFO] RUNTIME(131377,python3.7):2024-01-11-05:35:59.519.590 [runtime.cc:4719] 131377 GetVisibleDevices: ASCEND_RT_VISIBLE_DEVICES param was not set
collected 2 items

test_flatten.py [INFO] TDT(131377,python3.7):2024-01-11-05:36:03.951.727 [process_mode_manager.cpp:109][OpenProcess][tid:131377] [ProcessModeManager] enter into open process deviceId[2] rankSize[0]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.952.572 [process_mode_manager.cpp:379][InitTsdClient][tid:131377] [TsdClient] deviceId[2] begin to init hdc client
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.952.748 [version_verify.cpp:34][SetVersionInfo][tid:131377] VersionVerify: send client version to server
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.952.780 [version_verify.cpp:50][SetVersionInfo][tid:131377] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.952.792 [version_verify.cpp:50][SetVersionInfo][tid:131377] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.097 [version_verify.cpp:66][PeerVersionCheck][tid:131377] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.114 [version_verify.cpp:87][ParseVersionInfo][tid:131377] VersionVerify: pass client version info success
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.123 [hdc_client.cpp:276][CheckHdcConnection][tid:131377] Service[2] create hdc success
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.138 [version_verify.cpp:120][SpecialFeatureCheck][tid:131377] VersionVerify: new type[35], supported
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.180 [process_mode_manager.cpp:748][GetDeviceCheckCode][tid:131377] [TsdClient][deviceId=2] [sessionId=1] wait package info respond
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.282 [process_mode_manager.cpp:379][InitTsdClient][tid:131377] [TsdClient] deviceId[2] begin to init hdc client
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.384 [version_verify.cpp:34][SetVersionInfo][tid:131377] VersionVerify: send client version to server
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.396 [version_verify.cpp:50][SetVersionInfo][tid:131377] send feature_info:{msg_type:35, features:{check before send aicpu package,}}
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.406 [version_verify.cpp:50][SetVersionInfo][tid:131377] send feature_info:{msg_type:37, features:{check before send open qs message,}}
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.527 [version_verify.cpp:66][PeerVersionCheck][tid:131377] VersionVerify: Check client version info, server[1230], client[1230]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.539 [version_verify.cpp:87][ParseVersionInfo][tid:131377] VersionVerify: pass client version info success
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.547 [hdc_client.cpp:276][CheckHdcConnection][tid:131377] Service[2] create hdc success
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.558 [process_mode_manager.cpp:426][ConstructOpenMsg][tid:131377] [TsdClient] tsd get process sign successfully, procpid[131377] signSize[48]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.584 [version_verify.cpp:112][SpecialFeatureCheck][tid:131377] VersionVerify: previous type[6], supported
[INFO] TDT(131377,python3.7):2024-01-11-05:36:03.953.602 [process_mode_manager.cpp:126][OpenProcess][tid:131377] [ProcessModeManager] deviceId[2] sessionId[1] rankSize[0], wait sub process start respond
[INFO] TDT(131377,python3.7):2024-01-11-05:36:04.166.271 [stub_process_mode_nowin.cpp:63][ProcessQueueForMdc][tid:131377] [TsdClient] it is unnecessary of current mode[0] chiptype[1] to grant queue auth to aicpusd
[INFO] TDT(131377,python3.7):2024-01-11-05:36:04.166.304 [stub_process_mode_nowin.cpp:101][OpenInHost][tid:131377] enter into OpenInHost deviceid[2]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:04.166.314 [stub_process_mode_nowin.cpp:105][OpenInHost][tid:131377] host cpu not support
[INFO] TDT(131377,python3.7):2024-01-11-05:36:04.166.322 [process_mode_manager.cpp:156][OpenProcess][tid:131377] [TsdClient][deviceId=2] [sessionId=1] start hccp and computer process success
[INFO] RUNTIME(131377,python3.7):2024-01-11-05:36:04.168.976 [device.cc:340] 131377 Init: isDoubledie:0, topologytype:0
[INFO] RUNTIME(131377,python3.7):2024-01-11-05:36:04.181.132 [npu_driver.cc:5428] 132343 GetDeviceStatus: GetDeviceStatus status=1.
[INFO] ATRACE(131377,python3.7):2024-01-11-05:36:04.181.184 [atrace_api.c:28](tid:131377) AtraceCreate start
[INFO] ATRACE(131377,python3.7):2024-01-11-05:36:04.181.275 [trace_rb_log.c:84](tid:131377) [RUNTIME_ATRACE_DEV2_TS0] create ring buffer success, buffer size : 131152.
[INFO] ATRACE(131377,python3.7):2024-01-11-05:36:04.181.289 [atrace_api.c:32](tid:131377) AtraceCreate end
[INFO] TDT(131377,python3.7):2024-01-11-05:36:04.181.303 [client_manager.cpp:157][SetProfilingCallback][tid:131377] [TsdClient] set profiling callback success
[TRACE] GE(131377,python3.7):2024-01-11-05:36:04.331.727 [status:INIT] [ge_api.cc:144]131377 GEInitializeImpl:GEInitialize start
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:04.556.008 [msprofiler_impl.cpp:156] >>> (tid:131377) ProfNotifySetDevice called, is open: 1, devId: 2
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:04.556.204 [platform.cpp:38] >>> (tid:131377) Profiling platform version: 1.0.
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:04.556.222 [ai_drv_dev_api.cpp:384] >>> (tid:131377) Succeeded to DrvGetApiVersion version: 0x72313
[TRACE] GE(131377,python3.7):2024-01-11-05:36:04.607.601 [status:RUNNING] [ge_api.cc:211]131377 GEInitializeImpl:Initializing environment
[INFO] GE(131377,python3.7):2024-01-11-05:36:04.607.656 [gelib.cc:98][EVENT]131377 Initialize:[GEPERFTRACE] GE Init Start
[INFO] GE(131377,python3.7):2024-01-11-05:36:04.607.920 [gelib.cc:307][EVENT]131377 SystemInitialize:Online infer init GELib success, device id :2
[INFO] DVPP(131377,python3.7):2024-01-11-05:36:04.972.767 [dvpp_engine.cc:41][ENGINE][Initialize:41][tid:131377]dvpp engine do not support
[INFO] TUNE(131377,python3.7):2024-01-11-05:36:04.976.705 [cann_kb_pyfunc_mgr.cpp:72][CANNKB][Tid:131377]"CannKbPyfuncMgr: Enter PyObjectInit, reference_ is 0!"
[INFO] TUNE(131377,python3.7):2024-01-11-05:36:04.976.742 [handle_manager.cpp:115][CANNKB][Tid:131377]"Start to run init functions to load dynamic python lib!"
[INFO] TUNE(131377,python3.7):2024-01-11-05:36:04.976.802 [handle_manager.cpp:407][CANNKB][Tid:131377]"Init functions of loading dynamic python lib end!"
[INFO] TUNE(131377,python3.7):2024-01-11-05:36:04.976.813 [cann_kb_pyfunc_mgr.cpp:24][CANNKB][Tid:131377]"CANN_KB_Py has already been initialized."
[INFO] TUNE(131377,python3.7):2024-01-11-05:36:04.976.881 [cann_kb_pyfunc_mgr.cpp:117][CANNKB][Tid:131377]"CannKbPyfuncMgr: Run PyObjectInit successfully!"
[INFO] HCCL(131377,python3.7):2024-01-11-05:36:17.059.826 [plugin_manager.cc:42][131377]hcom running normal mode.
[INFO] DVPP(131377,python3.7):2024-01-11-05:36:17.060.529 [dvpp_engine.cc:92][ENGINE][GetOpsKernelInfoStores:92][tid:131377]dvpp ops kernel info store do not support
[INFO] DVPP(131377,python3.7):2024-01-11-05:36:17.060.724 [dvpp_engine.cc:69][ENGINE][GetGraphOptimizerObjs:69][tid:131377]dvpp graph optimizer do not support
[INFO] DVPP(131377,python3.7):2024-01-11-05:36:17.731.083 [dvpp_ops_kernel_builder.cc:48][ENGINE][Initialize:48][tid:131377]dvpp ops kernel builder do not support
[INFO] GE(131377,python3.7):2024-01-11-05:36:17.740.489 [gelib.cc:169][EVENT]131377 Initialize:[GEPERFTRACE] The time cost of GELib::Initialize is [13132782] micro second.
[TRACE] GE(131377,python3.7):2024-01-11-05:36:17.832.787 [status:STOP] [ge_api.cc:255]131377 GEInitializeImpl:GEInitialize finished
[TRACE] GE(131377,python3.7):2024-01-11-05:36:17.832.935 [status:INIT] [ge_api.cc:398]131377 Session:Start to construct session.
[TRACE] GE(131377,python3.7):2024-01-11-05:36:17.832.953 [status:RUNNING] [ge_api.cc:408]131377 Session:Creating session
[INFO] GE(131377,python3.7):2024-01-11-05:36:17.833.386 [graph_var_manager.cc:1445][EVENT]131377 SetMemoryMallocSize:Total memory size is 34359738368
[INFO] GE(131377,python3.7):2024-01-11-05:36:17.833.403 [graph_var_manager.cc:1424][EVENT]131377 SetAllMemoryMaxValue:The graph_mem_max_size is 27917287424 and the var_mem_max_size is 5368709120
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:17.836.806 [msprofiler_impl.cpp:156] >>> (tid:131377) ProfNotifySetDevice called, is open: 1, devId: 2
[TRACE] GE(131377,python3.7):2024-01-11-05:36:17.837.725 [status:RUNNING] [ge_api.cc:411]131377 Session:Session id is 0
[TRACE] GE(131377,python3.7):2024-01-11-05:36:17.837.750 [status:STOP] [ge_api.cc:420]131377 Session:Session Constructor finished
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:17.847.465 [platform.cpp:38] >>> (tid:131377) Profiling platform version: 1.0.
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:17.847.512 [ai_drv_dev_api.cpp:384] >>> (tid:131377) Succeeded to DrvGetApiVersion version: 0x72313 [TRACE] GE(131377,python3.7):2024-01-11-05:36:17.847.697 [status:INIT] [ge_api.cc:144]131377 GEInitializeImpl:GEInitialize start TotalTime = 0.151235, [20] [parse]: 0.013689 [symbol_resolve]: 0.0525244, [1] [Cycle 1]: 0.0523348, [1] [resolve]: 0.0522908 [combine_like_graphs]: 1.28e-06 [graph_reusing]: 6.1e-06 [meta_unpack_prepare]: 0.00035082 [pre_cconv]: 4.89e-06 [abstract_specialize]: 0.0687074 [pack_expand]: 1.905e-05 [auto_monad]: 0.0001444 [inline]: 2.25e-06 [pre_auto_parallel]: 2.09e-05 [pipeline_split]: 2.89e-06 [optimize]: 0.00892966, [35] [py_interpret_to_execute]: 0.00028593 [rewriter_before_opt_a]: 0.00023321 [opt_a]: 0.00761442, [2] [Cycle 1]: 0.0022237, [30] [expand_dump_flag]: 5.58999e-06 [switch_simplify]: 0.00014497 [a_1]: 0.00040011 [recompute_prepare]: 3.41e-06 [updatestate_depend_eliminate]: 1.034e-05 [updatestate_assign_eliminate]: 4.9e-06 [updatestate_loads_eliminate]: 4.12e-06 [parameter_eliminate]: 4.15e-06 [a_2]: 7.64e-05 [accelerated_algorithm]: 3.16e-06 [pynative_shard]: 2.2e-06 [auto_parallel]: 6.4e-06 [parallel]: 1.821e-05 [merge_comm]: 1.072e-05 [allreduce_fusion]: 1.98e-06 [virtual_dataset]: 2.87e-06 [get_grad_eliminate_]: 1.96e-06 [virtual_output]: 1.95e-06 [merge_forward]: 5.56e-06 [cell_reuse_recompute_pass]: 9.10004e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.98e-06 [meta_fg_expand]: 4.39e-06 [after_resolve]: 5.16e-06 [a_after_grad]: 2.76999e-06 [renormalize]: 0.00126733 [real_op_eliminate]: 5.89e-06 [auto_monad_grad]: 4.72e-06 [auto_monad_eliminator]: 1.134e-05 [cse]: 2.712e-05 [a_3]: 1.641e-05 [Cycle 2]: 0.00023077, [30] [expand_dump_flag]: 1.36e-06 [switch_simplify]: 2.08e-06 [a_1]: 1.846e-05 [recompute_prepare]: 1.64e-06 [updatestate_depend_eliminate]: 2.82e-06 [updatestate_assign_eliminate]: 2.27e-06 [updatestate_loads_eliminate]: 2.13e-06 [parameter_eliminate]: 
9.60004e-07 [a_2]: 2.616e-05 [accelerated_algorithm]: 2.62e-06 [pynative_shard]: 1.66e-06 [auto_parallel]: 4.52e-06 [parallel]: 4.06e-06 [merge_comm]: 1.88e-06 [allreduce_fusion]: 1.3e-06 [virtual_dataset]: 2.28e-06 [get_grad_eliminate_]: 1.9e-06 [virtual_output]: 1.65e-06 [merge_forward]: 3.13e-06 [cell_reuse_recompute_pass]: 4.39999e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.48e-06 [meta_fg_expand]: 2.21e-06 [after_resolve]: 3.67e-06 [a_after_grad]: 2.31e-06 [renormalize]: 6.99947e-08 [real_op_eliminate]: 1.60999e-06 [auto_monad_grad]: 7.90002e-07 [auto_monad_eliminator]: 3.91001e-06 [cse]: 8.79e-06 [a_3]: 1.229e-05 [py_interpret_to_execute_after_opt_a]: 0.00010025 [slice_cell_reuse_recomputed_activation]: 2.57e-06 [rewriter_after_opt_a]: 0.00017245 [convert_after_rewriter]: 7.54e-06 [order_py_execute_after_rewriter]: 4.11001e-06 [opt_b]: 9.523e-05, [1] [Cycle 1]: 8.985e-05, [7] [b_1]: 4.129e-05 [b_2]: 2.88e-06 [updatestate_depend_eliminate]: 3.1e-06 [updatestate_assign_eliminate]: 2.47e-06 [updatestate_loads_eliminate]: 2.06e-06 [renormalize]: 3.20004e-07 [cse]: 9.99e-06 [cconv]: 2.655e-05 [opt_after_cconv]: 4.868e-05, [1] [Cycle 1]: 4.487e-05, [7] [c_1]: 4.63999e-06 [parameter_eliminate]: 8.49999e-07 [updatestate_depend_eliminate]: 2.11e-06 [updatestate_assign_eliminate]: 1.75e-06 [updatestate_loads_eliminate]: 1.66e-06 [cse]: 7.23e-06 [renormalize]: 3.19997e-07 [remove_dup_value]: 1.231e-05 [tuple_transform]: 4.449e-05, [1] [Cycle 1]: 4.043e-05, [3] [d_1]: 2.269e-05 [d_2]: 5.5e-06 [renormalize]: 1.49994e-07 [add_cache_embedding]: 1.809e-05 [add_recomputation]: 4.592e-05 [cse_after_recomputation]: 1.659e-05, [1] [Cycle 1]: 1.25e-05, [1] [cse]: 7.91e-06 [environ_conv]: 1.508e-05 [label_micro_interleaved_index]: 2.34001e-06 [label_fine_grained_interleaved_index]: 2.44e-06 [assign_add_opt]: 1.54e-06 [slice_recompute_activation]: 2.22999e-06 [micro_interleaved_order_control]: 2.26e-06 [full_micro_interleaved_order_control]: 2.33e-06 [comp_comm_scheduling]: 
2.48e-06 [reorder_send_recv_between_fp_bp]: 2.3e-06 [comm_op_add_attrs]: 1.06e-06 [add_comm_op_reuse_tag]: 9.39996e-07 [overlap_opt_shard_in_pipeline]: 1.02e-06 [grouped_pairwise_exchange_alltoall]: 1.65e-06 [overlap_recompute_and_grad_model_parallel]: 1.96999e-06 [overlap_grad_matmul_and_grad_allreduce]: 1.32e-06 [split_matmul_comm_elemetwise]: 2.32e-06 [split_layernorm_comm]: 1.84e-06 [process_send_recv_for_ge]: 3.35e-06 [handle_group_info]: 1.04e-06 [auto_monad_reorder]: 2.203e-05 [get_jit_bprop_graph]: 4.39999e-07 [eliminate_special_op_node]: 0.00053154 [validate]: 4.952e-05 [distribtued_split]: 1.42e-06 [task_emit]: 0.00594614 [execute]: 8.54e-06 Sums parse : 0.013689s : 9.44% symbol_resolve.resolve : 0.052291s : 36.05% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000006s : 0.00% meta_unpack_prepare : 0.000351s : 0.24% pre_cconv : 0.000005s : 0.00% abstract_specialize : 0.068707s : 47.36% pack_expand : 0.000019s : 0.01% auto_monad : 0.000144s : 0.10% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000021s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000286s : 0.20% optimize.rewriter_before_opt_a : 0.000233s : 0.16% optimize.opt_a.expand_dump_flag : 0.000007s : 0.00% optimize.opt_a.switch_simplify : 0.000147s : 0.10% optimize.opt_a.a_1 : 0.000419s : 0.29% optimize.opt_a.recompute_prepare : 0.000005s : 0.00% optimize.opt_a.updatestate_depend_eliminate : 0.000013s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000007s : 0.00% optimize.opt_a.updatestate_loads_eliminate : 0.000006s : 0.00% optimize.opt_a.parameter_eliminate : 0.000005s : 0.00% optimize.opt_a.a_2 : 0.000103s : 0.07% optimize.opt_a.accelerated_algorithm : 0.000006s : 0.00% optimize.opt_a.pynative_shard : 0.000004s : 0.00% optimize.opt_a.auto_parallel : 0.000011s : 0.01% optimize.opt_a.parallel : 0.000022s : 0.02% optimize.opt_a.merge_comm : 0.000013s : 0.01% optimize.opt_a.allreduce_fusion : 0.000003s : 0.00% optimize.opt_a.virtual_dataset : 
0.000005s : 0.00% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.00% optimize.opt_a.virtual_output : 0.000004s : 0.00% optimize.opt_a.merge_forward : 0.000009s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000012s : 0.01% optimize.opt_a.meta_fg_expand : 0.000007s : 0.00% optimize.opt_a.after_resolve : 0.000009s : 0.01% optimize.opt_a.a_after_grad : 0.000005s : 0.00% optimize.opt_a.renormalize : 0.001267s : 0.87% optimize.opt_a.real_op_eliminate : 0.000007s : 0.01% optimize.opt_a.auto_monad_grad : 0.000006s : 0.00% optimize.opt_a.auto_monad_eliminator : 0.000015s : 0.01% optimize.opt_a.cse : 0.000036s : 0.02% optimize.opt_a.a_3 : 0.000029s : 0.02% optimize.py_interpret_to_execute_after_opt_a : 0.000100s : 0.07% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000172s : 0.12% optimize.convert_after_rewriter : 0.000008s : 0.01% optimize.order_py_execute_after_rewriter : 0.000004s : 0.00% optimize.opt_b.b_1 : 0.000041s : 0.03% optimize.opt_b.b_2 : 0.000003s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000010s : 0.01% optimize.cconv : 0.000027s : 0.02% optimize.opt_after_cconv.c_1 : 0.000005s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000007s : 0.00% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.01% optimize.tuple_transform.d_1 : 0.000023s : 0.02% optimize.tuple_transform.d_2 : 
0.000005s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000018s : 0.01% optimize.add_recomputation : 0.000046s : 0.03% optimize.cse_after_recomputation.cse : 0.000008s : 0.01% optimize.environ_conv : 0.000015s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000002s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000003s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000022s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000532s : 0.37% validate : 0.000050s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.005946s : 4.10% execute : 0.000009s : 0.01% Time group info: ------[substitution.] 
0.050945 287 0.01% : 0.000003s : 1: substitution.depend_value_elim 99.36% : 0.050617s : 61: substitution.getattr_setattr_resolve 0.03% : 0.000014s : 3: substitution.graph_param_transform 0.38% : 0.000194s : 11: substitution.inline 0.18% : 0.000093s : 196: substitution.meta_unpack_prepare 0.00% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.00% : 0.000002s : 4: substitution.remove_not_recompute_node 0.01% : 0.000003s : 2: substitution.replace_old_param 0.02% : 0.000009s : 5: substitution.switch_simplify 0.02% : 0.000009s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.001259 2 66.43% : 0.000837s : 1: renormalize.infer 33.57% : 0.000423s : 1: renormalize.specialize ------[replace.] 0.001001 77 0.75% : 0.000007s : 1: replace.depend_value_elim 84.97% : 0.000850s : 59: replace.getattr_setattr_resolve 6.48% : 0.000065s : 11: replace.inline 6.94% : 0.000069s : 5: replace.switch_simplify 0.87% : 0.000009s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.050639 77 0.01% : 0.000003s : 1: match.depend_value_elim 99.58% : 0.050425s : 59: match.getattr_setattr_resolve 0.38% : 0.000194s : 11: match.inline 0.02% : 0.000009s : 5: match.switch_simplify 0.02% : 0.000009s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005163 29 88.39% : 0.004564s : 16: func_graph_cloner_run.FuncGraphClonerGraph 11.61% : 0.000600s : 13: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.053358 122 0.22% : 0.000118s : 69: opt.transform.opt_a 0.06% : 0.000032s : 23: opt.transform.opt_b 97.93% : 0.052253s : 2: opt.transform.opt_resolve 0.60% : 0.000321s : 1: opt.transforms.meta_unpack_prepare 1.11% : 0.000594s : 20: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.00% : 0.000002s : 1: opt.transforms.opt_b 0.05% : 0.000027s : 2: opt.transforms.opt_trans_graph 0.02% : 0.000008s : 3: opt.transforms.special_op_eliminate TotalTime = 0.183398, [20] [parse]: 0.00194351 [symbol_resolve]: 0.0254318, [1] [Cycle 1]: 0.0252446, [1] [resolve]: 0.0252089 [combine_like_graphs]: 1.3e-06 [graph_reusing]: 5.1e-06 [meta_unpack_prepare]: 0.00036096 [pre_cconv]: 8.40002e-07 [abstract_specialize]: 0.139698 [pack_expand]: 3.293e-05 [auto_monad]: 0.000753 [inline]: 2.73e-06 [pre_auto_parallel]: 1.642e-05 [pipeline_split]: 4.13e-06 [optimize]: 0.00909718, [35] [py_interpret_to_execute]: 8.232e-05 [rewriter_before_opt_a]: 0.00040146 [opt_a]: 0.00789076, [2] [Cycle 1]: 0.00487467, [30] [expand_dump_flag]: 9.97e-06 [switch_simplify]: 0.00023282 [a_1]: 0.00081035 [recompute_prepare]: 4.5e-06 [updatestate_depend_eliminate]: 1.167e-05 [updatestate_assign_eliminate]: 6.06e-06 [updatestate_loads_eliminate]: 5.03e-06 [parameter_eliminate]: 3.83e-06 [a_2]: 9.695e-05 [accelerated_algorithm]: 3.42e-06 [pynative_shard]: 1.85e-06 [auto_parallel]: 5.22999e-06 [parallel]: 9.15e-06 [merge_comm]: 4.48e-06 [allreduce_fusion]: 1.86e-06 [virtual_dataset]: 4.08e-06 [get_grad_eliminate_]: 2.25e-06 [virtual_output]: 2.12e-06 [merge_forward]: 5.1e-06 [cell_reuse_recompute_pass]: 8.2e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.82e-06 [meta_fg_expand]: 5.4e-06 [after_resolve]: 5.6e-06 [a_after_grad]: 2.76e-06 [renormalize]: 0.00339011 [real_op_eliminate]: 6.7e-06 [auto_monad_grad]: 5.32e-06 [auto_monad_eliminator]: 1.312e-05 [cse]: 2.813e-05 [a_3]: 1.931e-05 [Cycle 2]: 0.00026327, [30] [expand_dump_flag]: 1.5e-06 [switch_simplify]: 2.55e-06 [a_1]: 3.152e-05 
[recompute_prepare]: 1.93001e-06 [updatestate_depend_eliminate]: 4.07e-06 [updatestate_assign_eliminate]: 2.91e-06 [updatestate_loads_eliminate]: 2.2e-06 [parameter_eliminate]: 1.4e-06 [a_2]: 2.926e-05 [accelerated_algorithm]: 2.79e-06 [pynative_shard]: 1.95e-06 [auto_parallel]: 4.56e-06 [parallel]: 4.34e-06 [merge_comm]: 2.58e-06 [allreduce_fusion]: 1.27e-06 [virtual_dataset]: 2.49e-06 [get_grad_eliminate_]: 3.24e-06 [virtual_output]: 2.18e-06 [merge_forward]: 3.12e-06 [cell_reuse_recompute_pass]: 4.70005e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.88e-06 [meta_fg_expand]: 2.58e-06 [after_resolve]: 3.93e-06 [a_after_grad]: 2.64e-06 [renormalize]: 5.99975e-08 [real_op_eliminate]: 2.25e-06 [auto_monad_grad]: 8.70001e-07 [auto_monad_eliminator]: 4.81e-06 [cse]: 1.125e-05 [a_3]: 1.402e-05 [py_interpret_to_execute_after_opt_a]: 9.41e-06 [slice_cell_reuse_recomputed_activation]: 2.58e-06 [rewriter_after_opt_a]: 0.00023112 [convert_after_rewriter]: 8.75e-06 [order_py_execute_after_rewriter]: 5.42e-06 [opt_b]: 0.000109, [1] [Cycle 1]: 0.00010255, [7] [b_1]: 4.656e-05 [b_2]: 4.19001e-06 [updatestate_depend_eliminate]: 4.51e-06 [updatestate_assign_eliminate]: 2.52e-06 [updatestate_loads_eliminate]: 2.21e-06 [renormalize]: 4.30002e-07 [cse]: 1.34e-05 [cconv]: 2.565e-05 [opt_after_cconv]: 5.243e-05, [1] [Cycle 1]: 4.824e-05, [7] [c_1]: 5.66e-06 [parameter_eliminate]: 9.09997e-07 [updatestate_depend_eliminate]: 3.17e-06 [updatestate_assign_eliminate]: 1.85e-06 [updatestate_loads_eliminate]: 1.87e-06 [cse]: 8.94e-06 [renormalize]: 1.90004e-07 [remove_dup_value]: 1.232e-05 [tuple_transform]: 3.805e-05, [1] [Cycle 1]: 3.402e-05, [3] [d_1]: 1.529e-05 [d_2]: 6.91e-06 [renormalize]: 1.39997e-07 [add_cache_embedding]: 1.217e-05 [add_recomputation]: 4.151e-05 [cse_after_recomputation]: 1.797e-05, [1] [Cycle 1]: 1.41e-05, [1] [cse]: 9.86e-06 [environ_conv]: 8.1e-06 [label_micro_interleaved_index]: 2.95e-06 [label_fine_grained_interleaved_index]: 2.41e-06 [assign_add_opt]: 
1.46e-06 [slice_recompute_activation]: 2.55e-06 [micro_interleaved_order_control]: 1.72e-06 [full_micro_interleaved_order_control]: 1.96e-06 [comp_comm_scheduling]: 2.16e-06 [reorder_send_recv_between_fp_bp]: 2.18e-06 [comm_op_add_attrs]: 1.12e-06 [add_comm_op_reuse_tag]: 9.90003e-07 [overlap_opt_shard_in_pipeline]: 1.21e-06 [grouped_pairwise_exchange_alltoall]: 1.37e-06 [overlap_recompute_and_grad_model_parallel]: 2.11e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.2e-07 [split_matmul_comm_elemetwise]: 2.27999e-06 [split_layernorm_comm]: 1.79e-06 [process_send_recv_for_ge]: 7.7e-07 [handle_group_info]: 9.5e-07 [auto_monad_reorder]: 1.578e-05 [get_jit_bprop_graph]: 3.99996e-07 [eliminate_special_op_node]: 0.00054181 [validate]: 3.163e-05 [distribtued_split]: 1.27e-06 [task_emit]: 0.00520318 [execute]: 7.66e-06 Sums parse : 0.001944s : 1.08% symbol_resolve.resolve : 0.025209s : 14.03% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000005s : 0.00% meta_unpack_prepare : 0.000361s : 0.20% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.139698s : 77.74% pack_expand : 0.000033s : 0.02% auto_monad : 0.000753s : 0.42% inline : 0.000003s : 0.00% pre_auto_parallel : 0.000016s : 0.01% pipeline_split : 0.000004s : 0.00% optimize.py_interpret_to_execute : 0.000082s : 0.05% optimize.rewriter_before_opt_a : 0.000401s : 0.22% optimize.opt_a.expand_dump_flag : 0.000011s : 0.01% optimize.opt_a.switch_simplify : 0.000235s : 0.13% optimize.opt_a.a_1 : 0.000842s : 0.47% optimize.opt_a.recompute_prepare : 0.000006s : 0.00% optimize.opt_a.updatestate_depend_eliminate : 0.000016s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000009s : 0.00% optimize.opt_a.updatestate_loads_eliminate : 0.000007s : 0.00% optimize.opt_a.parameter_eliminate : 0.000005s : 0.00% optimize.opt_a.a_2 : 0.000126s : 0.07% optimize.opt_a.accelerated_algorithm : 0.000006s : 0.00% optimize.opt_a.pynative_shard : 0.000004s : 0.00% optimize.opt_a.auto_parallel : 0.000010s : 0.01% 
optimize.opt_a.parallel : 0.000013s : 0.01% optimize.opt_a.merge_comm : 0.000007s : 0.00% optimize.opt_a.allreduce_fusion : 0.000003s : 0.00% optimize.opt_a.virtual_dataset : 0.000007s : 0.00% optimize.opt_a.get_grad_eliminate_ : 0.000005s : 0.00% optimize.opt_a.virtual_output : 0.000004s : 0.00% optimize.opt_a.merge_forward : 0.000008s : 0.00% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000013s : 0.01% optimize.opt_a.meta_fg_expand : 0.000008s : 0.00% optimize.opt_a.after_resolve : 0.000010s : 0.01% optimize.opt_a.a_after_grad : 0.000005s : 0.00% optimize.opt_a.renormalize : 0.003390s : 1.89% optimize.opt_a.real_op_eliminate : 0.000009s : 0.00% optimize.opt_a.auto_monad_grad : 0.000006s : 0.00% optimize.opt_a.auto_monad_eliminator : 0.000018s : 0.01% optimize.opt_a.cse : 0.000039s : 0.02% optimize.opt_a.a_3 : 0.000033s : 0.02% optimize.py_interpret_to_execute_after_opt_a : 0.000009s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000231s : 0.13% optimize.convert_after_rewriter : 0.000009s : 0.00% optimize.order_py_execute_after_rewriter : 0.000005s : 0.00% optimize.opt_b.b_1 : 0.000047s : 0.03% optimize.opt_b.b_2 : 0.000004s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000005s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000013s : 0.01% optimize.cconv : 0.000026s : 0.01% optimize.opt_after_cconv.c_1 : 0.000006s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000009s : 0.00% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000012s : 0.01% optimize.tuple_transform.d_1 : 0.000015s : 0.01% optimize.tuple_transform.d_2 : 0.000007s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.01% optimize.add_recomputation : 0.000042s : 0.02% optimize.cse_after_recomputation.cse : 0.000010s : 0.01% optimize.environ_conv : 0.000008s : 0.00% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000003s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000016s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000542s : 0.30% validate : 0.000032s : 0.02% distribtued_split : 0.000001s : 0.00% task_emit : 0.005203s : 2.90% execute : 0.000008s : 0.00% Time group info: ------[substitution.] 
0.024268 312 0.01% : 0.000003s : 2: substitution.depend_value_elim 97.68% : 0.023704s : 61: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 4: substitution.graph_param_transform 1.72% : 0.000417s : 24: substitution.inline 0.39% : 0.000096s : 196: substitution.meta_unpack_prepare 0.01% : 0.000002s : 4: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 4: substitution.remove_not_recompute_node 0.01% : 0.000002s : 2: substitution.replace_old_param 0.07% : 0.000018s : 3: substitution.reshape_eliminate 0.05% : 0.000013s : 11: substitution.switch_simplify 0.03% : 0.000006s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.003383 2 71.26% : 0.002411s : 1: renormalize.infer 28.74% : 0.000972s : 1: renormalize.specialize ------[replace.] 0.001041 97 1.15% : 0.000012s : 2: replace.depend_value_elim 73.74% : 0.000768s : 59: replace.getattr_setattr_resolve 12.93% : 0.000135s : 24: replace.inline 11.18% : 0.000116s : 11: replace.switch_simplify 1.00% : 0.000010s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.024012 97 0.01% : 0.000003s : 2: match.depend_value_elim 98.17% : 0.023573s : 59: match.getattr_setattr_resolve 1.74% : 0.000417s : 24: match.inline 0.05% : 0.000013s : 11: match.switch_simplify 0.03% : 0.000006s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.010615 83 87.27% : 0.009264s : 57: func_graph_cloner_run.FuncGraphClonerGraph 12.73% : 0.001351s : 26: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.026862 122 0.54% : 0.000145s : 69: opt.transform.opt_a 0.14% : 0.000037s : 23: opt.transform.opt_b 93.81% : 0.025199s : 2: opt.transform.opt_resolve 1.23% : 0.000330s : 1: opt.transforms.meta_unpack_prepare 4.14% : 0.001113s : 20: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000003s : 1: opt.transforms.opt_b 0.08% : 0.000020s : 2: opt.transforms.opt_trans_graph 0.04% : 0.000010s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0795835, [20] [parse]: 0.00198476 [symbol_resolve]: 0.0250912, [1] [Cycle 1]: 0.0249134, [1] [resolve]: 0.0248815 [combine_like_graphs]: 1.09e-06 [graph_reusing]: 5.25e-06 [meta_unpack_prepare]: 0.00037228 [pre_cconv]: 7.79997e-07 [abstract_specialize]: 0.0400032 [pack_expand]: 2.222e-05 [auto_monad]: 0.00010348 [inline]: 2.05e-06 [pre_auto_parallel]: 1.239e-05 [pipeline_split]: 2.7e-06 [optimize]: 0.00633545, [35] [py_interpret_to_execute]: 0.0003015 [rewriter_before_opt_a]: 0.00022409 [opt_a]: 0.00519999, [2] [Cycle 1]: 0.00208744, [30] [expand_dump_flag]: 5.58e-06 [switch_simplify]: 0.00012004 [a_1]: 0.00037304 [recompute_prepare]: 3.24e-06 [updatestate_depend_eliminate]: 7.95e-06 [updatestate_assign_eliminate]: 4.42e-06 [updatestate_loads_eliminate]: 3.86e-06 [parameter_eliminate]: 3.32e-06 [a_2]: 7.464e-05 [accelerated_algorithm]: 2.91e-06 [pynative_shard]: 1.9e-06 [auto_parallel]: 4.8e-06 [parallel]: 9.37e-06 [merge_comm]: 4.25e-06 [allreduce_fusion]: 1.83001e-06 [virtual_dataset]: 2.78001e-06 [get_grad_eliminate_]: 1.88001e-06 [virtual_output]: 2.17e-06 [merge_forward]: 4.66e-06 [cell_reuse_recompute_pass]: 7.99999e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.38e-06 [meta_fg_expand]: 4.58999e-06 [after_resolve]: 4.93e-06 [a_after_grad]: 2.36e-06 [renormalize]: 0.00121449 [real_op_eliminate]: 6.15e-06 [auto_monad_grad]: 5.18e-06 [auto_monad_eliminator]: 1.099e-05 [cse]: 2.522e-05 [a_3]: 1.609e-05 [Cycle 2]: 0.00022912, [30] [expand_dump_flag]: 1.38e-06 [switch_simplify]: 2.27999e-06 
[a_1]: 1.697e-05 [recompute_prepare]: 1.76e-06 [updatestate_depend_eliminate]: 2.86e-06 [updatestate_assign_eliminate]: 2.25e-06 [updatestate_loads_eliminate]: 2.08e-06 [parameter_eliminate]: 1.17e-06 [a_2]: 2.636e-05 [accelerated_algorithm]: 2.3e-06 [pynative_shard]: 1.4e-06 [auto_parallel]: 3.67e-06 [parallel]: 3.53e-06 [merge_comm]: 2.02e-06 [allreduce_fusion]: 1.3e-06 [virtual_dataset]: 2.16e-06 [get_grad_eliminate_]: 2.04e-06 [virtual_output]: 1.77e-06 [merge_forward]: 2.9e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 7.05e-06 [meta_fg_expand]: 2.07e-06 [after_resolve]: 3.52e-06 [a_after_grad]: 2.42001e-06 [renormalize]: 5.99975e-08 [real_op_eliminate]: 1.67e-06 [auto_monad_grad]: 8.39995e-07 [auto_monad_eliminator]: 4.12e-06 [cse]: 8.59e-06 [a_3]: 1.254e-05 [py_interpret_to_execute_after_opt_a]: 9.23e-06 [slice_cell_reuse_recomputed_activation]: 2.28e-06 [rewriter_after_opt_a]: 0.00014582 [convert_after_rewriter]: 7.64e-06 [order_py_execute_after_rewriter]: 4.89e-06 [opt_b]: 9.574e-05, [1] [Cycle 1]: 9.067e-05, [7] [b_1]: 4.148e-05 [b_2]: 3.56e-06 [updatestate_depend_eliminate]: 3.13e-06 [updatestate_assign_eliminate]: 2.60001e-06 [updatestate_loads_eliminate]: 2.06e-06 [renormalize]: 5.30003e-07 [cse]: 9.46e-06 [cconv]: 2.551e-05 [opt_after_cconv]: 4.734e-05, [1] [Cycle 1]: 4.37e-05, [7] [c_1]: 4.68999e-06 [parameter_eliminate]: 9.39996e-07 [updatestate_depend_eliminate]: 2.06e-06 [updatestate_assign_eliminate]: 1.79e-06 [updatestate_loads_eliminate]: 1.66e-06 [cse]: 6.87e-06 [renormalize]: 2.09999e-07 [remove_dup_value]: 1.135e-05 [tuple_transform]: 3.416e-05, [1] [Cycle 1]: 3.064e-05, [3] [d_1]: 1.355e-05 [d_2]: 5.75e-06 [renormalize]: 2.00002e-07 [add_cache_embedding]: 1.121e-05 [add_recomputation]: 4.025e-05 [cse_after_recomputation]: 1.665e-05, [1] [Cycle 1]: 1.221e-05, [1] [cse]: 7.65e-06 [environ_conv]: 5.35999e-06 [label_micro_interleaved_index]: 2.58e-06 [label_fine_grained_interleaved_index]: 2.48e-06 
[assign_add_opt]: 1.49e-06 [slice_recompute_activation]: 2.12e-06 [micro_interleaved_order_control]: 2.11e-06 [full_micro_interleaved_order_control]: 2.04e-06 [comp_comm_scheduling]: 2.45e-06 [reorder_send_recv_between_fp_bp]: 2.57e-06 [comm_op_add_attrs]: 1.16e-06 [add_comm_op_reuse_tag]: 1.02e-06 [overlap_opt_shard_in_pipeline]: 1.07e-06 [grouped_pairwise_exchange_alltoall]: 1.48e-06 [overlap_recompute_and_grad_model_parallel]: 1.89e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.90002e-07 [split_matmul_comm_elemetwise]: 2.22e-06 [split_layernorm_comm]: 1.73e-06 [process_send_recv_for_ge]: 1.04e-06 [handle_group_info]: 1e-06 [auto_monad_reorder]: 1.675e-05 [get_jit_bprop_graph]: 3.89999e-07 [eliminate_special_op_node]: 0.00051956 [validate]: 2.584e-05 [distribtued_split]: 1.31e-06 [task_emit]: 0.00485367 [execute]: 9.27e-06 Sums parse : 0.001985s : 2.62% symbol_resolve.resolve : 0.024882s : 32.83% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000005s : 0.01% meta_unpack_prepare : 0.000372s : 0.49% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.040003s : 52.77% pack_expand : 0.000022s : 0.03% auto_monad : 0.000103s : 0.14% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000012s : 0.02% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000302s : 0.40% optimize.rewriter_before_opt_a : 0.000224s : 0.30% optimize.opt_a.expand_dump_flag : 0.000007s : 0.01% optimize.opt_a.switch_simplify : 0.000122s : 0.16% optimize.opt_a.a_1 : 0.000390s : 0.51% optimize.opt_a.recompute_prepare : 0.000005s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000011s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000007s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000006s : 0.01% optimize.opt_a.parameter_eliminate : 0.000004s : 0.01% optimize.opt_a.a_2 : 0.000101s : 0.13% optimize.opt_a.accelerated_algorithm : 0.000005s : 0.01% optimize.opt_a.pynative_shard : 0.000003s : 0.00% optimize.opt_a.auto_parallel : 0.000008s : 
0.01% optimize.opt_a.parallel : 0.000013s : 0.02% optimize.opt_a.merge_comm : 0.000006s : 0.01% optimize.opt_a.allreduce_fusion : 0.000003s : 0.00% optimize.opt_a.virtual_dataset : 0.000005s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.01% optimize.opt_a.virtual_output : 0.000004s : 0.01% optimize.opt_a.merge_forward : 0.000008s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000013s : 0.02% optimize.opt_a.meta_fg_expand : 0.000007s : 0.01% optimize.opt_a.after_resolve : 0.000008s : 0.01% optimize.opt_a.a_after_grad : 0.000005s : 0.01% optimize.opt_a.renormalize : 0.001215s : 1.60% optimize.opt_a.real_op_eliminate : 0.000008s : 0.01% optimize.opt_a.auto_monad_grad : 0.000006s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000015s : 0.02% optimize.opt_a.cse : 0.000034s : 0.04% optimize.opt_a.a_3 : 0.000029s : 0.04% optimize.py_interpret_to_execute_after_opt_a : 0.000009s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000146s : 0.19% optimize.convert_after_rewriter : 0.000008s : 0.01% optimize.order_py_execute_after_rewriter : 0.000005s : 0.01% optimize.opt_b.b_1 : 0.000041s : 0.05% optimize.opt_b.b_2 : 0.000004s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000001s : 0.00% optimize.opt_b.cse : 0.000009s : 0.01% optimize.cconv : 0.000026s : 0.03% optimize.opt_after_cconv.c_1 : 0.000005s : 0.01% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000007s : 0.01% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.01% optimize.tuple_transform.d_1 : 0.000014s : 0.02% optimize.tuple_transform.d_2 : 0.000006s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000040s : 0.05% optimize.cse_after_recomputation.cse : 0.000008s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000003s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000017s : 0.02% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000520s : 0.69% validate : 0.000026s : 0.03% distribtued_split : 0.000001s : 0.00% task_emit : 0.004854s : 6.40% execute : 0.000009s : 0.01% Time group info: ------[substitution.] 
0.023750 287 0.01% : 0.000003s : 1: substitution.depend_value_elim 98.71% : 0.023443s : 61: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 3: substitution.graph_param_transform 0.74% : 0.000176s : 11: substitution.inline 0.42% : 0.000100s : 196: substitution.meta_unpack_prepare 0.01% : 0.000001s : 3: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 4: substitution.remove_not_recompute_node 0.01% : 0.000003s : 2: substitution.replace_old_param 0.03% : 0.000008s : 5: substitution.switch_simplify 0.04% : 0.000009s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.001207 2 60.40% : 0.000729s : 1: renormalize.infer 39.60% : 0.000478s : 1: renormalize.specialize ------[replace.] 0.000843 77 0.80% : 0.000007s : 1: replace.depend_value_elim 84.35% : 0.000711s : 59: replace.getattr_setattr_resolve 7.28% : 0.000061s : 11: replace.inline 6.58% : 0.000056s : 5: replace.switch_simplify 0.99% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.023506 77 0.01% : 0.000003s : 1: match.depend_value_elim 99.17% : 0.023311s : 59: match.getattr_setattr_resolve 0.75% : 0.000176s : 11: match.inline 0.03% : 0.000008s : 5: match.switch_simplify 0.04% : 0.000009s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005057 28 87.26% : 0.004413s : 15: func_graph_cloner_run.FuncGraphClonerGraph 12.74% : 0.000644s : 13: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.025937 122 0.45% : 0.000117s : 69: opt.transform.opt_a 0.13% : 0.000032s : 23: opt.transform.opt_b 95.89% : 0.024872s : 2: opt.transform.opt_resolve 1.33% : 0.000344s : 1: opt.transforms.meta_unpack_prepare 2.08% : 0.000540s : 20: opt.transforms.opt_a 0.01% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000002s : 1: opt.transforms.opt_b 0.07% : 0.000017s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000008s : 3: opt.transforms.special_op_eliminate TotalTime = 0.143027, [20] [parse]: 0.00201637 [symbol_resolve]: 0.025476, [1] [Cycle 1]: 0.0252873, [1] [resolve]: 0.025253 [combine_like_graphs]: 1.55e-06 [graph_reusing]: 5.32e-06 [meta_unpack_prepare]: 0.00036505 [pre_cconv]: 7.99999e-07 [abstract_specialize]: 0.100706 [pack_expand]: 0.00015731 [auto_monad]: 0.00016802 [inline]: 2.27e-06 [pre_auto_parallel]: 1.435e-05 [pipeline_split]: 4.05e-06 [optimize]: 0.00822881, [35] [py_interpret_to_execute]: 6.405e-05 [rewriter_before_opt_a]: 0.00034254 [opt_a]: 0.00715645, [2] [Cycle 1]: 0.00434629, [30] [expand_dump_flag]: 7.47e-06 [switch_simplify]: 0.00018233 [a_1]: 0.0007009 [recompute_prepare]: 3.88e-06 [updatestate_depend_eliminate]: 1.023e-05 [updatestate_assign_eliminate]: 5.12e-06 [updatestate_loads_eliminate]: 4.45e-06 [parameter_eliminate]: 3.36e-06 [a_2]: 9.646e-05 [accelerated_algorithm]: 3.18e-06 [pynative_shard]: 2.06e-06 [auto_parallel]: 4.68001e-06 [parallel]: 8.35001e-06 [merge_comm]: 4.02e-06 [allreduce_fusion]: 2.84e-06 [virtual_dataset]: 3.12999e-06 [get_grad_eliminate_]: 2.27e-06 [virtual_output]: 2.01e-06 [merge_forward]: 5.66e-06 [cell_reuse_recompute_pass]: 8.59996e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.76e-06 [meta_fg_expand]: 5.01e-06 [after_resolve]: 5.63999e-06 [a_after_grad]: 2.61e-06 [renormalize]: 0.00302229 [real_op_eliminate]: 6.93e-06 [auto_monad_grad]: 4.72e-06 [auto_monad_eliminator]: 1.306e-05 [cse]: 3.933e-05 [a_3]: 1.978e-05 [Cycle 2]: 0.00025739, [30] [expand_dump_flag]: 1.45e-06 [switch_simplify]: 2.82e-06 
[a_1]: 2.924e-05 [recompute_prepare]: 1.94e-06 [updatestate_depend_eliminate]: 3.76e-06 [updatestate_assign_eliminate]: 2.78e-06 [updatestate_loads_eliminate]: 2.42001e-06 [parameter_eliminate]: 1.25e-06 [a_2]: 2.992e-05 [accelerated_algorithm]: 2.74e-06 [pynative_shard]: 1.39e-06 [auto_parallel]: 4.23e-06 [parallel]: 3.27e-06 [merge_comm]: 2.29e-06 [allreduce_fusion]: 1.18e-06 [virtual_dataset]: 3.52e-06 [get_grad_eliminate_]: 2.09e-06 [virtual_output]: 1.95e-06 [merge_forward]: 3.1e-06 [cell_reuse_recompute_pass]: 4.19997e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.22999e-06 [meta_fg_expand]: 2.31e-06 [after_resolve]: 3.91001e-06 [a_after_grad]: 2.70001e-06 [renormalize]: 6.00048e-08 [real_op_eliminate]: 1.97e-06 [auto_monad_grad]: 1.03e-06 [auto_monad_eliminator]: 4.76e-06 [cse]: 1.128e-05 [a_3]: 1.38e-05 [py_interpret_to_execute_after_opt_a]: 9.61999e-06 [slice_cell_reuse_recomputed_activation]: 2.49e-06 [rewriter_after_opt_a]: 0.00017763 [convert_after_rewriter]: 8.25e-06 [order_py_execute_after_rewriter]: 4.67e-06 [opt_b]: 0.00010559, [1] [Cycle 1]: 0.00010024, [7] [b_1]: 4.538e-05 [b_2]: 4.33e-06 [updatestate_depend_eliminate]: 3.32e-06 [updatestate_assign_eliminate]: 2.36e-06 [updatestate_loads_eliminate]: 2.27e-06 [renormalize]: 5.4e-07 [cse]: 1.364e-05 [cconv]: 2.495e-05 [opt_after_cconv]: 5.395e-05, [1] [Cycle 1]: 5.015e-05, [7] [c_1]: 5.62e-06 [parameter_eliminate]: 1.01e-06 [updatestate_depend_eliminate]: 2.39e-06 [updatestate_assign_eliminate]: 1.98e-06 [updatestate_loads_eliminate]: 1.87e-06 [cse]: 9.63e-06 [renormalize]: 2.39997e-07 [remove_dup_value]: 1.335e-05 [tuple_transform]: 3.808e-05, [1] [Cycle 1]: 3.437e-05, [3] [d_1]: 1.59e-05 [d_2]: 6.94e-06 [renormalize]: 1.39997e-07 [add_cache_embedding]: 1.127e-05 [add_recomputation]: 4.247e-05 [cse_after_recomputation]: 1.76e-05, [1] [Cycle 1]: 1.302e-05, [1] [cse]: 8.91e-06 [environ_conv]: 8.53e-06 [label_micro_interleaved_index]: 2.57e-06 [label_fine_grained_interleaved_index]: 2.62e-06 
[assign_add_opt]: 1.74e-06 [slice_recompute_activation]: 2.58e-06 [micro_interleaved_order_control]: 1.8e-06 [full_micro_interleaved_order_control]: 2.16001e-06 [comp_comm_scheduling]: 2.15e-06 [reorder_send_recv_between_fp_bp]: 2.11e-06 [comm_op_add_attrs]: 1.16e-06 [add_comm_op_reuse_tag]: 9.59997e-07 [overlap_opt_shard_in_pipeline]: 1e-06 [grouped_pairwise_exchange_alltoall]: 1.39001e-06 [overlap_recompute_and_grad_model_parallel]: 1.95e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.09995e-07 [split_matmul_comm_elemetwise]: 2.48e-06 [split_layernorm_comm]: 2.28e-06 [process_send_recv_for_ge]: 1.24001e-06 [handle_group_info]: 1.11e-06 [auto_monad_reorder]: 1.444e-05 [get_jit_bprop_graph]: 4.20005e-07 [eliminate_special_op_node]: 0.00052765 [validate]: 3.017e-05 [distribtued_split]: 1.11001e-06 [task_emit]: 0.00505939 [execute]: 7.4e-06 Sums parse : 0.002016s : 1.45% symbol_resolve.resolve : 0.025253s : 18.10% combine_like_graphs : 0.000002s : 0.00% graph_reusing : 0.000005s : 0.00% meta_unpack_prepare : 0.000365s : 0.26% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.100706s : 72.17% pack_expand : 0.000157s : 0.11% auto_monad : 0.000168s : 0.12% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000014s : 0.01% pipeline_split : 0.000004s : 0.00% optimize.py_interpret_to_execute : 0.000064s : 0.05% optimize.rewriter_before_opt_a : 0.000343s : 0.25% optimize.opt_a.expand_dump_flag : 0.000009s : 0.01% optimize.opt_a.switch_simplify : 0.000185s : 0.13% optimize.opt_a.a_1 : 0.000730s : 0.52% optimize.opt_a.recompute_prepare : 0.000006s : 0.00% optimize.opt_a.updatestate_depend_eliminate : 0.000014s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000008s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000007s : 0.00% optimize.opt_a.parameter_eliminate : 0.000005s : 0.00% optimize.opt_a.a_2 : 0.000126s : 0.09% optimize.opt_a.accelerated_algorithm : 0.000006s : 0.00% optimize.opt_a.pynative_shard : 0.000003s : 0.00% optimize.opt_a.auto_parallel : 
0.000009s : 0.01% optimize.opt_a.parallel : 0.000012s : 0.01% optimize.opt_a.merge_comm : 0.000006s : 0.00% optimize.opt_a.allreduce_fusion : 0.000004s : 0.00% optimize.opt_a.virtual_dataset : 0.000007s : 0.00% optimize.opt_a.get_grad_eliminate_ : 0.000004s : 0.00% optimize.opt_a.virtual_output : 0.000004s : 0.00% optimize.opt_a.merge_forward : 0.000009s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000012s : 0.01% optimize.opt_a.meta_fg_expand : 0.000007s : 0.01% optimize.opt_a.after_resolve : 0.000010s : 0.01% optimize.opt_a.a_after_grad : 0.000005s : 0.00% optimize.opt_a.renormalize : 0.003022s : 2.17% optimize.opt_a.real_op_eliminate : 0.000009s : 0.01% optimize.opt_a.auto_monad_grad : 0.000006s : 0.00% optimize.opt_a.auto_monad_eliminator : 0.000018s : 0.01% optimize.opt_a.cse : 0.000051s : 0.04% optimize.opt_a.a_3 : 0.000034s : 0.02% optimize.py_interpret_to_execute_after_opt_a : 0.000010s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000178s : 0.13% optimize.convert_after_rewriter : 0.000008s : 0.01% optimize.order_py_execute_after_rewriter : 0.000005s : 0.00% optimize.opt_b.b_1 : 0.000045s : 0.03% optimize.opt_b.b_2 : 0.000004s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000001s : 0.00% optimize.opt_b.cse : 0.000014s : 0.01% optimize.cconv : 0.000025s : 0.02% optimize.opt_after_cconv.c_1 : 0.000006s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000010s : 
0.01% optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000013s : 0.01% optimize.tuple_transform.d_1 : 0.000016s : 0.01% optimize.tuple_transform.d_2 : 0.000007s : 0.00% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000011s : 0.01% optimize.add_recomputation : 0.000042s : 0.03% optimize.cse_after_recomputation.cse : 0.000009s : 0.01% optimize.environ_conv : 0.000009s : 0.01% optimize.label_micro_interleaved_index : 0.000003s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000003s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000014s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000528s : 0.38% validate : 0.000030s : 0.02% distribtued_split : 0.000001s : 0.00% task_emit : 0.005059s : 3.63% execute : 0.000007s : 0.01% Time group info: ------[substitution.] 
0.024313 306 0.01% : 0.000003s : 2: substitution.depend_value_elim 97.88% : 0.023797s : 61: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 4: substitution.graph_param_transform 1.52% : 0.000371s : 20: substitution.inline 0.41% : 0.000099s : 196: substitution.meta_unpack_prepare 0.01% : 0.000002s : 4: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 4: substitution.remove_not_recompute_node 0.01% : 0.000002s : 2: substitution.replace_old_param 0.07% : 0.000016s : 3: substitution.reshape_eliminate 0.04% : 0.000010s : 9: substitution.switch_simplify 0.02% : 0.000006s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.003015 2 69.85% : 0.002106s : 1: renormalize.infer 30.15% : 0.000909s : 1: renormalize.specialize ------[replace.] 0.000938 91 1.32% : 0.000012s : 2: replace.depend_value_elim 76.65% : 0.000719s : 59: replace.getattr_setattr_resolve 11.65% : 0.000109s : 20: replace.inline 9.49% : 0.000089s : 9: replace.switch_simplify 0.89% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.024060 91 0.01% : 0.000003s : 2: match.depend_value_elim 98.38% : 0.023670s : 59: match.getattr_setattr_resolve 1.54% : 0.000371s : 20: match.inline 0.04% : 0.000010s : 9: match.switch_simplify 0.02% : 0.000006s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.009572 71 87.03% : 0.008330s : 49: func_graph_cloner_run.FuncGraphClonerGraph 12.97% : 0.001242s : 22: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.026747 122 0.54% : 0.000145s : 69: opt.transform.opt_a 0.14% : 0.000036s : 23: opt.transform.opt_b 94.38% : 0.025243s : 2: opt.transform.opt_resolve 1.25% : 0.000336s : 1: opt.transforms.meta_unpack_prepare 3.55% : 0.000949s : 20: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000003s : 1: opt.transforms.opt_b 0.08% : 0.000021s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000009s : 3: opt.transforms.special_op_eliminate TotalTime = 0.143253, [20] [parse]: 0.00189217 [symbol_resolve]: 0.0246276, [1] [Cycle 1]: 0.0244771, [1] [resolve]: 0.0244533 [combine_like_graphs]: 9.59997e-07 [graph_reusing]: 4.37e-06 [meta_unpack_prepare]: 0.00033044 [pre_cconv]: 7.59996e-07 [abstract_specialize]: 0.101509 [pack_expand]: 2.696e-05 [auto_monad]: 0.00062832 [inline]: 2.34e-06 [pre_auto_parallel]: 1.406e-05 [pipeline_split]: 3.35e-06 [optimize]: 0.00811609, [35] [py_interpret_to_execute]: 5.416e-05 [rewriter_before_opt_a]: 0.0003252 [opt_a]: 0.00706227, [2] [Cycle 1]: 0.00419644, [30] [expand_dump_flag]: 7.42e-06 [switch_simplify]: 0.00017328 [a_1]: 0.00062088 [recompute_prepare]: 3.98e-06 [updatestate_depend_eliminate]: 9.74e-06 [updatestate_assign_eliminate]: 5.05e-06 [updatestate_loads_eliminate]: 4.59e-06 [parameter_eliminate]: 3.8e-06 [a_2]: 9.577e-05 [accelerated_algorithm]: 3.57e-06 [pynative_shard]: 1.95e-06 [auto_parallel]: 4.03e-06 [parallel]: 8.86e-06 [merge_comm]: 3.65e-06 [allreduce_fusion]: 2.57e-06 [virtual_dataset]: 3.39e-06 [get_grad_eliminate_]: 2.09e-06 [virtual_output]: 2.07e-06 [merge_forward]: 5.43999e-06 [cell_reuse_recompute_pass]: 7.59996e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.18e-06 [meta_fg_expand]: 4.69e-06 [after_resolve]: 5.26e-06 [a_after_grad]: 2.66e-06 [renormalize]: 0.00298061 [real_op_eliminate]: 6.98e-06 [auto_monad_grad]: 5.43e-06 [auto_monad_eliminator]: 1.317e-05 [cse]: 2.739e-05 [a_3]: 1.992e-05 [Cycle 2]: 0.00025707, [30] [expand_dump_flag]: 1.8e-06 [switch_simplify]: 2.57e-06 [a_1]: 
3.184e-05 [recompute_prepare]: 2.08e-06 [updatestate_depend_eliminate]: 4.17e-06 [updatestate_assign_eliminate]: 3.19001e-06 [updatestate_loads_eliminate]: 2.36e-06 [parameter_eliminate]: 1.37e-06 [a_2]: 2.859e-05 [accelerated_algorithm]: 2.72e-06 [pynative_shard]: 2e-06 [auto_parallel]: 4.69e-06 [parallel]: 4.48e-06 [merge_comm]: 2.21001e-06 [allreduce_fusion]: 1.35e-06 [virtual_dataset]: 2.55e-06 [get_grad_eliminate_]: 2.66e-06 [virtual_output]: 1.94e-06 [merge_forward]: 2.96e-06 [cell_reuse_recompute_pass]: 5.79996e-07 [cell_reuse_handle_not_recompute_node_pass]: 6.05e-06 [meta_fg_expand]: 2.44e-06 [after_resolve]: 4.3e-06 [a_after_grad]: 2.44e-06 [renormalize]: 5.99975e-08 [real_op_eliminate]: 2.43e-06 [auto_monad_grad]: 1.02e-06 [auto_monad_eliminator]: 4.57e-06 [cse]: 9.69e-06 [a_3]: 1.381e-05 [py_interpret_to_execute_after_opt_a]: 1.046e-05 [slice_cell_reuse_recomputed_activation]: 2.24e-06 [rewriter_after_opt_a]: 0.00019455 [convert_after_rewriter]: 8.64e-06 [order_py_execute_after_rewriter]: 5.75e-06 [opt_b]: 0.00010235, [1] [Cycle 1]: 9.658e-05, [7] [b_1]: 4.424e-05 [b_2]: 3.72e-06 [updatestate_depend_eliminate]: 4.1e-06 [updatestate_assign_eliminate]: 2.35999e-06 [updatestate_loads_eliminate]: 2.14e-06 [renormalize]: 2.89998e-07 [cse]: 1.227e-05 [cconv]: 2.685e-05 [opt_after_cconv]: 5.183e-05, [1] [Cycle 1]: 4.796e-05, [7] [c_1]: 5.27001e-06 [parameter_eliminate]: 9.90003e-07 [updatestate_depend_eliminate]: 3.08e-06 [updatestate_assign_eliminate]: 1.98e-06 [updatestate_loads_eliminate]: 1.85e-06 [cse]: 8.69e-06 [renormalize]: 2.00002e-07 [remove_dup_value]: 1.143e-05 [tuple_transform]: 3.847e-05, [1] [Cycle 1]: 3.436e-05, [3] [d_1]: 1.505e-05 [d_2]: 7.15e-06 [renormalize]: 1.60006e-07 [add_cache_embedding]: 1.211e-05 [add_recomputation]: 4.118e-05 [cse_after_recomputation]: 1.732e-05, [1] [Cycle 1]: 1.288e-05, [1] [cse]: 8.77e-06 [environ_conv]: 7.97e-06 [label_micro_interleaved_index]: 2.21e-06 [label_fine_grained_interleaved_index]: 2.25e-06 
[assign_add_opt]: 1.75e-06 [slice_recompute_activation]: 2.37e-06 [micro_interleaved_order_control]: 1.7e-06 [full_micro_interleaved_order_control]: 1.91e-06 [comp_comm_scheduling]: 2.34e-06 [reorder_send_recv_between_fp_bp]: 1.88e-06 [comm_op_add_attrs]: 1.04e-06 [add_comm_op_reuse_tag]: 9.60004e-07 [overlap_opt_shard_in_pipeline]: 1.08e-06 [grouped_pairwise_exchange_alltoall]: 1.42e-06 [overlap_recompute_and_grad_model_parallel]: 1.75e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.30004e-07 [split_matmul_comm_elemetwise]: 1.99e-06 [split_layernorm_comm]: 1.74e-06 [process_send_recv_for_ge]: 7.79997e-07 [handle_group_info]: 1.27e-06 [auto_monad_reorder]: 1.599e-05 [get_jit_bprop_graph]: 4.39999e-07 [eliminate_special_op_node]: 0.00060051 [validate]: 3.057e-05 [distribtued_split]: 1.44e-06 [task_emit]: 0.00510396 [execute]: 9e-06 Sums parse : 0.001892s : 1.35% symbol_resolve.resolve : 0.024453s : 17.51% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000004s : 0.00% meta_unpack_prepare : 0.000330s : 0.24% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.101509s : 72.68% pack_expand : 0.000027s : 0.02% auto_monad : 0.000628s : 0.45% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000014s : 0.01% pipeline_split : 0.000003s : 0.00% optimize.py_interpret_to_execute : 0.000054s : 0.04% optimize.rewriter_before_opt_a : 0.000325s : 0.23% optimize.opt_a.expand_dump_flag : 0.000009s : 0.01% optimize.opt_a.switch_simplify : 0.000176s : 0.13% optimize.opt_a.a_1 : 0.000653s : 0.47% optimize.opt_a.recompute_prepare : 0.000006s : 0.00% optimize.opt_a.updatestate_depend_eliminate : 0.000014s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000008s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000007s : 0.00% optimize.opt_a.parameter_eliminate : 0.000005s : 0.00% optimize.opt_a.a_2 : 0.000124s : 0.09% optimize.opt_a.accelerated_algorithm : 0.000006s : 0.00% optimize.opt_a.pynative_shard : 0.000004s : 0.00% optimize.opt_a.auto_parallel : 0.000009s : 
0.01% optimize.opt_a.parallel : 0.000013s : 0.01% optimize.opt_a.merge_comm : 0.000006s : 0.00% optimize.opt_a.allreduce_fusion : 0.000004s : 0.00% optimize.opt_a.virtual_dataset : 0.000006s : 0.00% optimize.opt_a.get_grad_eliminate_ : 0.000005s : 0.00% optimize.opt_a.virtual_output : 0.000004s : 0.00% optimize.opt_a.merge_forward : 0.000008s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000012s : 0.01% optimize.opt_a.meta_fg_expand : 0.000007s : 0.01% optimize.opt_a.after_resolve : 0.000010s : 0.01% optimize.opt_a.a_after_grad : 0.000005s : 0.00% optimize.opt_a.renormalize : 0.002981s : 2.13% optimize.opt_a.real_op_eliminate : 0.000009s : 0.01% optimize.opt_a.auto_monad_grad : 0.000006s : 0.00% optimize.opt_a.auto_monad_eliminator : 0.000018s : 0.01% optimize.opt_a.cse : 0.000037s : 0.03% optimize.opt_a.a_3 : 0.000034s : 0.02% optimize.py_interpret_to_execute_after_opt_a : 0.000010s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000002s : 0.00% optimize.rewriter_after_opt_a : 0.000195s : 0.14% optimize.convert_after_rewriter : 0.000009s : 0.01% optimize.order_py_execute_after_rewriter : 0.000006s : 0.00% optimize.opt_b.b_1 : 0.000044s : 0.03% optimize.opt_b.b_2 : 0.000004s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000004s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000012s : 0.01% optimize.cconv : 0.000027s : 0.02% optimize.opt_after_cconv.c_1 : 0.000005s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.cse : 0.000009s : 0.01% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000011s : 0.01% optimize.tuple_transform.d_1 : 0.000015s : 0.01% optimize.tuple_transform.d_2 : 0.000007s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000012s : 0.01% optimize.add_recomputation : 0.000041s : 0.03% optimize.cse_after_recomputation.cse : 0.000009s : 0.01% optimize.environ_conv : 0.000008s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000002s : 0.00% optimize.assign_add_opt : 0.000002s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000016s : 0.01% get_jit_bprop_graph : 0.000000s : 0.00% eliminate_special_op_node : 0.000601s : 0.43% validate : 0.000031s : 0.02% distribtued_split : 0.000001s : 0.00% task_emit : 0.005104s : 3.65% execute : 0.000009s : 0.01% Time group info: ------[substitution.] 
0.023471 306 0.01% : 0.000003s : 2: substitution.depend_value_elim 98.14% : 0.023035s : 61: substitution.getattr_setattr_resolve 0.02% : 0.000005s : 4: substitution.graph_param_transform 1.26% : 0.000296s : 20: substitution.inline 0.39% : 0.000091s : 196: substitution.meta_unpack_prepare 0.01% : 0.000002s : 4: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 4: substitution.remove_not_recompute_node 0.01% : 0.000003s : 2: substitution.replace_old_param 0.08% : 0.000018s : 3: substitution.reshape_eliminate 0.04% : 0.000010s : 9: substitution.switch_simplify 0.02% : 0.000006s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.002973 2 70.54% : 0.002097s : 1: renormalize.infer 29.46% : 0.000876s : 1: renormalize.specialize ------[replace.] 0.000913 91 1.26% : 0.000012s : 2: replace.depend_value_elim 76.87% : 0.000702s : 59: replace.getattr_setattr_resolve 11.63% : 0.000106s : 20: replace.inline 9.36% : 0.000085s : 9: replace.switch_simplify 0.88% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.023222 91 0.01% : 0.000003s : 2: match.depend_value_elim 98.64% : 0.022906s : 59: match.getattr_setattr_resolve 1.28% : 0.000296s : 20: match.inline 0.04% : 0.000010s : 9: match.switch_simplify 0.02% : 0.000006s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.009400 72 87.16% : 0.008193s : 50: func_graph_cloner_run.FuncGraphClonerGraph 12.84% : 0.001207s : 22: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.025829 122 0.55% : 0.000142s : 69: opt.transform.opt_a 0.14% : 0.000035s : 23: opt.transform.opt_b 94.65% : 0.024446s : 2: opt.transform.opt_resolve 1.18% : 0.000306s : 1: opt.transforms.meta_unpack_prepare 3.34% : 0.000862s : 20: opt.transforms.opt_a 0.02% : 0.000004s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000002s : 1: opt.transforms.opt_b 0.08% : 0.000021s : 2: opt.transforms.opt_trans_graph 0.04% : 0.000011s : 3: opt.transforms.special_op_eliminate TotalTime = 0.0900887, [20] [parse]: 0.0020242 [symbol_resolve]: 0.0248694, [1] [Cycle 1]: 0.0247095, [1] [resolve]: 0.0246753 [combine_like_graphs]: 1.41e-06 [graph_reusing]: 4.84e-06 [meta_unpack_prepare]: 0.00033176 [pre_cconv]: 8.40002e-07 [abstract_specialize]: 0.0556275 [pack_expand]: 2.266e-05 [auto_monad]: 0.00042745 [inline]: 2.11e-06 [pre_auto_parallel]: 1.332e-05 [pipeline_split]: 3.56e-06 [optimize]: 0.0060147, [35] [py_interpret_to_execute]: 9.547e-05 [rewriter_before_opt_a]: 0.00023901 [opt_a]: 0.00511374, [2] [Cycle 1]: 0.00247063, [30] [expand_dump_flag]: 4.98e-06 [switch_simplify]: 0.0001274 [a_1]: 0.00041768 [recompute_prepare]: 3.19001e-06 [updatestate_depend_eliminate]: 8.07e-06 [updatestate_assign_eliminate]: 4.45e-06 [updatestate_loads_eliminate]: 4.88001e-06 [parameter_eliminate]: 2.81999e-06 [a_2]: 7.562e-05 [accelerated_algorithm]: 2.4e-06 [pynative_shard]: 1.99e-06 [auto_parallel]: 3.67e-06 [parallel]: 8.72e-06 [merge_comm]: 3.39e-06 [allreduce_fusion]: 1.76e-06 [virtual_dataset]: 2.2e-06 [get_grad_eliminate_]: 1.58e-06 [virtual_output]: 1.62e-06 [merge_forward]: 3.82e-06 [cell_reuse_recompute_pass]: 7.90002e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.09001e-06 [meta_fg_expand]: 4.77e-06 [after_resolve]: 4.73e-06 [a_after_grad]: 1.91e-06 [renormalize]: 0.0015634 [real_op_eliminate]: 5.62e-06 [auto_monad_grad]: 4.45999e-06 [auto_monad_eliminator]: 1.064e-05 [cse]: 2.16e-05 [a_3]: 1.314e-05 [Cycle 2]: 0.00020011, [30] [expand_dump_flag]: 1.28e-06 [switch_simplify]: 1.72e-06 
[a_1]: 1.201e-05 [recompute_prepare]: 1.24001e-06 [updatestate_depend_eliminate]: 2.34e-06 [updatestate_assign_eliminate]: 1.85e-06 [updatestate_loads_eliminate]: 1.91e-06 [parameter_eliminate]: 9.79999e-07 [a_2]: 1.903e-05 [accelerated_algorithm]: 1.78e-06 [pynative_shard]: 1.8e-06 [auto_parallel]: 3.29e-06 [parallel]: 3.47e-06 [merge_comm]: 1.76e-06 [allreduce_fusion]: 1.01e-06 [virtual_dataset]: 2.27e-06 [get_grad_eliminate_]: 1.41e-06 [virtual_output]: 1.31e-06 [merge_forward]: 2.16e-06 [cell_reuse_recompute_pass]: 4.30002e-07 [cell_reuse_handle_not_recompute_node_pass]: 5.43e-06 [meta_fg_expand]: 1.77e-06 [after_resolve]: 3.57e-06 [a_after_grad]: 1.8e-06 [renormalize]: 6.99947e-08 [real_op_eliminate]: 1.21e-06 [auto_monad_grad]: 8.79998e-07 [auto_monad_eliminator]: 2.88e-06 [cse]: 6.99001e-06 [a_3]: 9.35e-06 [py_interpret_to_execute_after_opt_a]: 7.88e-06 [slice_cell_reuse_recomputed_activation]: 2.78e-06 [rewriter_after_opt_a]: 0.00015913 [convert_after_rewriter]: 7.21e-06 [order_py_execute_after_rewriter]: 4.25e-06 [opt_b]: 8.226e-05, [1] [Cycle 1]: 7.694e-05, [7] [b_1]: 3.136e-05 [b_2]: 3.26e-06 [updatestate_depend_eliminate]: 2.54e-06 [updatestate_assign_eliminate]: 1.67e-06 [updatestate_loads_eliminate]: 1.59e-06 [renormalize]: 4.1e-07 [cse]: 8.72e-06 [cconv]: 2.306e-05 [opt_after_cconv]: 4.397e-05, [1] [Cycle 1]: 4.047e-05, [7] [c_1]: 3.96001e-06 [parameter_eliminate]: 8.90002e-07 [updatestate_depend_eliminate]: 1.66e-06 [updatestate_assign_eliminate]: 1.22e-06 [updatestate_loads_eliminate]: 1.22e-06 [cse]: 5.31e-06 [renormalize]: 2.80001e-07 [remove_dup_value]: 1.001e-05 [tuple_transform]: 3.076e-05, [1] [Cycle 1]: 2.72e-05, [3] [d_1]: 1.182e-05 [d_2]: 4.8e-06 [renormalize]: 1.70003e-07 [add_cache_embedding]: 9.17e-06 [add_recomputation]: 2.877e-05 [cse_after_recomputation]: 1.304e-05, [1] [Cycle 1]: 9.45e-06, [1] [cse]: 5.12e-06 [environ_conv]: 5.17e-06 [label_micro_interleaved_index]: 2.16e-06 [label_fine_grained_interleaved_index]: 2.55e-06 
[assign_add_opt]: 1.41e-06 [slice_recompute_activation]: 2.38e-06 [micro_interleaved_order_control]: 1.71e-06 [full_micro_interleaved_order_control]: 1.85e-06 [comp_comm_scheduling]: 2.33e-06 [reorder_send_recv_between_fp_bp]: 2.06e-06 [comm_op_add_attrs]: 1.21e-06 [add_comm_op_reuse_tag]: 9e-07 [overlap_opt_shard_in_pipeline]: 1.2e-06 [grouped_pairwise_exchange_alltoall]: 1.24e-06 [overlap_recompute_and_grad_model_parallel]: 1.74e-06 [overlap_grad_matmul_and_grad_allreduce]: 7.60003e-07 [split_matmul_comm_elemetwise]: 2.27e-06 [split_layernorm_comm]: 1.71e-06 [process_send_recv_for_ge]: 7.79997e-07 [handle_group_info]: 9.5e-07 [auto_monad_reorder]: 1.286e-05 [get_jit_bprop_graph]: 6.69999e-07 [eliminate_special_op_node]: 0.00048674 [validate]: 2.109e-05 [distribtued_split]: 1.16e-06 [task_emit]: 9.60004e-07 [execute]: 8.2e-07 Sums parse : 0.002024s : 2.33% symbol_resolve.resolve : 0.024675s : 28.43% combine_like_graphs : 0.000001s : 0.00% graph_reusing : 0.000005s : 0.01% meta_unpack_prepare : 0.000332s : 0.38% pre_cconv : 0.000001s : 0.00% abstract_specialize : 0.055628s : 64.10% pack_expand : 0.000023s : 0.03% auto_monad : 0.000427s : 0.49% inline : 0.000002s : 0.00% pre_auto_parallel : 0.000013s : 0.02% pipeline_split : 0.000004s : 0.00% optimize.py_interpret_to_execute : 0.000095s : 0.11% optimize.rewriter_before_opt_a : 0.000239s : 0.28% optimize.opt_a.expand_dump_flag : 0.000006s : 0.01% optimize.opt_a.switch_simplify : 0.000129s : 0.15% optimize.opt_a.a_1 : 0.000430s : 0.50% optimize.opt_a.recompute_prepare : 0.000004s : 0.01% optimize.opt_a.updatestate_depend_eliminate : 0.000010s : 0.01% optimize.opt_a.updatestate_assign_eliminate : 0.000006s : 0.01% optimize.opt_a.updatestate_loads_eliminate : 0.000007s : 0.01% optimize.opt_a.parameter_eliminate : 0.000004s : 0.00% optimize.opt_a.a_2 : 0.000095s : 0.11% optimize.opt_a.accelerated_algorithm : 0.000004s : 0.00% optimize.opt_a.pynative_shard : 0.000004s : 0.00% optimize.opt_a.auto_parallel : 0.000007s : 
0.01% optimize.opt_a.parallel : 0.000012s : 0.01% optimize.opt_a.merge_comm : 0.000005s : 0.01% optimize.opt_a.allreduce_fusion : 0.000003s : 0.00% optimize.opt_a.virtual_dataset : 0.000004s : 0.01% optimize.opt_a.get_grad_eliminate_ : 0.000003s : 0.00% optimize.opt_a.virtual_output : 0.000003s : 0.00% optimize.opt_a.merge_forward : 0.000006s : 0.01% optimize.opt_a.cell_reuse_recompute_pass : 0.000001s : 0.00% optimize.opt_a.cell_reuse_handle_not_recompute_node_pass : 0.000011s : 0.01% optimize.opt_a.meta_fg_expand : 0.000007s : 0.01% optimize.opt_a.after_resolve : 0.000008s : 0.01% optimize.opt_a.a_after_grad : 0.000004s : 0.00% optimize.opt_a.renormalize : 0.001563s : 1.80% optimize.opt_a.real_op_eliminate : 0.000007s : 0.01% optimize.opt_a.auto_monad_grad : 0.000005s : 0.01% optimize.opt_a.auto_monad_eliminator : 0.000014s : 0.02% optimize.opt_a.cse : 0.000029s : 0.03% optimize.opt_a.a_3 : 0.000022s : 0.03% optimize.py_interpret_to_execute_after_opt_a : 0.000008s : 0.01% optimize.slice_cell_reuse_recomputed_activation : 0.000003s : 0.00% optimize.rewriter_after_opt_a : 0.000159s : 0.18% optimize.convert_after_rewriter : 0.000007s : 0.01% optimize.order_py_execute_after_rewriter : 0.000004s : 0.00% optimize.opt_b.b_1 : 0.000031s : 0.04% optimize.opt_b.b_2 : 0.000003s : 0.00% optimize.opt_b.updatestate_depend_eliminate : 0.000003s : 0.00% optimize.opt_b.updatestate_assign_eliminate : 0.000002s : 0.00% optimize.opt_b.updatestate_loads_eliminate : 0.000002s : 0.00% optimize.opt_b.renormalize : 0.000000s : 0.00% optimize.opt_b.cse : 0.000009s : 0.01% optimize.cconv : 0.000023s : 0.03% optimize.opt_after_cconv.c_1 : 0.000004s : 0.00% optimize.opt_after_cconv.parameter_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_depend_eliminate : 0.000002s : 0.00% optimize.opt_after_cconv.updatestate_assign_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.updatestate_loads_eliminate : 0.000001s : 0.00% optimize.opt_after_cconv.cse : 0.000005s : 0.01% 
optimize.opt_after_cconv.renormalize : 0.000000s : 0.00% optimize.remove_dup_value : 0.000010s : 0.01% optimize.tuple_transform.d_1 : 0.000012s : 0.01% optimize.tuple_transform.d_2 : 0.000005s : 0.01% optimize.tuple_transform.renormalize : 0.000000s : 0.00% optimize.add_cache_embedding : 0.000009s : 0.01% optimize.add_recomputation : 0.000029s : 0.03% optimize.cse_after_recomputation.cse : 0.000005s : 0.01% optimize.environ_conv : 0.000005s : 0.01% optimize.label_micro_interleaved_index : 0.000002s : 0.00% optimize.label_fine_grained_interleaved_index : 0.000003s : 0.00% optimize.assign_add_opt : 0.000001s : 0.00% optimize.slice_recompute_activation : 0.000002s : 0.00% optimize.micro_interleaved_order_control : 0.000002s : 0.00% optimize.full_micro_interleaved_order_control : 0.000002s : 0.00% optimize.comp_comm_scheduling : 0.000002s : 0.00% optimize.reorder_send_recv_between_fp_bp : 0.000002s : 0.00% optimize.comm_op_add_attrs : 0.000001s : 0.00% optimize.add_comm_op_reuse_tag : 0.000001s : 0.00% optimize.overlap_opt_shard_in_pipeline : 0.000001s : 0.00% optimize.grouped_pairwise_exchange_alltoall : 0.000001s : 0.00% optimize.overlap_recompute_and_grad_model_parallel : 0.000002s : 0.00% optimize.overlap_grad_matmul_and_grad_allreduce : 0.000001s : 0.00% optimize.split_matmul_comm_elemetwise : 0.000002s : 0.00% optimize.split_layernorm_comm : 0.000002s : 0.00% optimize.process_send_recv_for_ge : 0.000001s : 0.00% optimize.handle_group_info : 0.000001s : 0.00% auto_monad_reorder : 0.000013s : 0.01% get_jit_bprop_graph : 0.000001s : 0.00% eliminate_special_op_node : 0.000487s : 0.56% validate : 0.000021s : 0.02% distribtued_split : 0.000001s : 0.00% task_emit : 0.000001s : 0.00% execute : 0.000001s : 0.00% Time group info: ------[substitution.] 
0.023622 287 0.01% : 0.000003s : 2: substitution.depend_value_elim 98.67% : 0.023307s : 61: substitution.getattr_setattr_resolve 0.02% : 0.000004s : 2: substitution.graph_param_transform 0.83% : 0.000197s : 13: substitution.inline 0.37% : 0.000087s : 196: substitution.meta_unpack_prepare 0.01% : 0.000001s : 2: substitution.partial_unused_args_eliminate 0.01% : 0.000002s : 2: substitution.remove_not_recompute_node 0.01% : 0.000003s : 2: substitution.replace_old_param 0.04% : 0.000009s : 6: substitution.switch_simplify 0.04% : 0.000009s : 1: substitution.tuple_list_get_item_eliminator ------[renormalize.] 0.001556 2 66.87% : 0.001041s : 1: renormalize.infer 33.13% : 0.000516s : 1: renormalize.specialize ------[replace.] 0.000839 81 1.27% : 0.000011s : 2: replace.depend_value_elim 81.97% : 0.000688s : 59: replace.getattr_setattr_resolve 8.57% : 0.000072s : 13: replace.inline 7.19% : 0.000060s : 6: replace.switch_simplify 1.00% : 0.000008s : 1: replace.tuple_list_get_item_eliminator ------[match.] 0.023393 81 0.01% : 0.000003s : 2: match.depend_value_elim 99.07% : 0.023175s : 59: match.getattr_setattr_resolve 0.84% : 0.000197s : 13: match.inline 0.04% : 0.000009s : 6: match.switch_simplify 0.04% : 0.000009s : 1: match.tuple_list_get_item_eliminator ------[func_graph_cloner_run.] 0.005788 44 87.39% : 0.005058s : 29: func_graph_cloner_run.FuncGraphClonerGraph 12.61% : 0.000730s : 15: func_graph_cloner_run.FuncGraphSpecializer ------[meta_graph.] 0.000000 0 ------[manager.] 0.000000 0 ------[pynative] 0.000000 0 ------[others.] 
0.025700 122 0.40% : 0.000102s : 69: opt.transform.opt_a 0.09% : 0.000023s : 23: opt.transform.opt_b 95.97% : 0.024665s : 2: opt.transform.opt_resolve 1.18% : 0.000304s : 1: opt.transforms.meta_unpack_prepare 2.26% : 0.000581s : 20: opt.transforms.opt_a 0.01% : 0.000003s : 1: opt.transforms.opt_after_cconv 0.01% : 0.000002s : 1: opt.transforms.opt_b 0.06% : 0.000015s : 2: opt.transforms.opt_trans_graph 0.03% : 0.000006s : 3: opt.transforms.special_op_eliminate .. ============================== 2 passed in 21.95s ============================== [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.102.349 [status:INIT] [ge_api.cc:463]131377 ~Session:Start to destruct session. [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.102.432 [status:RUNNING] [ge_api.cc:475]131377 ~Session:Session id is 0 [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.102.444 [status:RUNNING] [ge_api.cc:476]131377 ~Session:Destroying session [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.103.707 [status:STOP] [ge_api.cc:491]131377 ~Session:Session Destructor finished [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.103.761 [status:INIT] [ge_api.cc:301]131377 GEFinalize:GEFinalize start [INFO] GE(131377,python3.7):2024-01-11-05:36:21.103.862 [execution_runtime.cc:80][EVENT]131377 FinalizeExecutionRuntime:Execution runtime finalize begin. [INFO] GE(131377,python3.7):2024-01-11-05:36:21.103.881 [execution_runtime.cc:92][EVENT]131377 FinalizeExecutionRuntime:Execution runtime finalized. [TRACE] GE(131377,python3.7):2024-01-11-05:36:21.103.892 [status:RUNNING] [ge_api.cc:313]131377 GEFinalize:Finalizing environment [INFO] TUNE(131377,python3.7):2024-01-11-05:36:21.402.094 [cann_kb_pyfunc_mgr.cpp:127][CANNKB][Tid:131377]"CannKbPyfuncMgr: enter PyObjectDeinit function, reference_[1]" [INFO] TUNE(131377,python3.7):2024-01-11-05:36:21.402.158 [cann_kb_pyfunc_mgr.cpp:138][CANNKB][Tid:131377]"CannKbPyfuncMgr: PyObjectDeinit function end successfully!" 
[INFO] GE(131377,python3.7):2024-01-11-05:36:21.403.583 [gelib.cc:324][EVENT]131377 SystemFinalize:Online infer finalize GELib success.
[TRACE] GE(131377,python3.7):2024-01-11-05:36:21.725.834 [status:STOP] [ge_api.cc:341]131377 GEFinalize:GEFinalize finished
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.923.833 [process_mode_manager.cpp:184][Close][tid:131377] [TsdClient] Close [deviceId=2][sessionId=1] hccp and computer enter
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.923.900 [version_verify.cpp:112][SpecialFeatureCheck][tid:131377] VersionVerify: previous type[7], supported
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.923.948 [process_mode_manager.cpp:192][Close][tid:131377] [TsdClient][deviceId=2] [sessionId=1] wait hccp and computer process close respond
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.954.992 [process_mode_manager.cpp:197][Close][tid:131377] [TsdClient][logicDeviceId_=2]has recv close hccp and computer process respond
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.955.050 [stub_process_mode_nowin.cpp:151][CloseInHost][tid:131377] enter into CloseInHost deviceid[2]
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.955.062 [stub_process_mode_nowin.cpp:154][CloseInHost][tid:131377] host cpu not support
[INFO] TDT(131377,python3.7):2024-01-11-05:36:21.955.106 [process_mode_manager.cpp:208][Close][tid:131377] [TsdClient][deviceId=2] [sessionId=1] close hccp and computer process success
[INFO] ATRACE(131377,python3.7):2024-01-11-05:36:21.955.124 [atrace_api.c:93](tid:131377) AtraceDestroy start
[INFO] ATRACE(131377,python3.7):2024-01-11-05:36:21.955.147 [atrace_api.c:95](tid:131377) AtraceDestroy end
[INFO] PROFILING(131377,python3.7):2024-01-11-05:36:21.955.174 [msprofiler_impl.cpp:156] >>> (tid:131377) ProfNotifySetDevice called, is open: 0, devId: 2
[INFO] RUNTIME(131377,python3.7):2024-01-11-05:36:23.542.940 [runtime.cc:1737] 131377 ~Runtime: deconstruct runtime.