openEuler/kernel
[OLK-6.6] qos_timer deadlock when cpuhp offline
Done
#IB7GK5
Bug
Kai opened this issue on 2024-11-27 17:04
**[Defect description]** A hard lockup occurred in the long-term stability test environment.

**I. Defect information**

**[Affected OS version]** OS: openeulerversion=openEuler-22.03-LTS compiletime=2022-03-30-16-23-56 gccversion=10.3.1-10.oe2203 kernelversion=5.10.0-60.18.0.50.oe2203 openjdkversion=1.8.0.312.b07-11.oe2203

**[Kernel version]** kernel: 6.6.0

**[Affected software and version]**

**[Environment information]**
Hardware information
- Intel(R) Xeon(R) CPU E5-2698 v3 @ 2.30GHz

Software information

Network information

**[Steps to reproduce]**
1. Enable CONFIG_KASAN, CONFIG_LOCKDEP, CONFIG_LOCK_STAT, and CONFIG_DEBUG_SPINLOCK, then build and install the kernel.
2. Run the scheduler long-term stability test.

**[Actual result]**

**[Expected result]**

**[Other relevant attachments]**
[25224.214659] smpboot: Booting Node 0 Processor 1 APIC 0x2 [25245.406522] watchdog: Watchdog detected hard LOCKUP on cpu 52 [25245.406529] Modules linked in: my_hrtimer(OE) mutex_test(OE) nfsv3 nfs_acl nfs lockd grace fscache netfs tls dns_resolver brd ntfs exfat vfat fat btrfs xor raid6_pq xfs loop sctp ip6_udp_tunnel udp_tunnel xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat xt_MASQUERADE nf_conntrack_netlink nfnetlink xt_addrtype iptable_mangle iptable_nat xt_conntrack nf_nat nf_conntrack br_netfilter nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c overlay ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables tun bridge stp llc rfkill sunrpc intel_rapl_msr intel_rapl_common sb_edac x86_pkg_temp_thermal coretemp kvm_intel kvm irqbypass ipmi_si rapl joydev ses mei_me ipmi_devintf enclosure intel_cstate ipmi_msghandler scsi_transport_sas mei i2c_i801 pcspkr intel_pch_thermal intel_uncore sg lpc_ich i2c_smbus acpi_power_meter drm fuse ext4 mbcache jbd2 sd_mod t10_pi crc64_rocksoft_generic crc64_rocksoft crc64 crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ixgbe sha512_ssse3 ahci libahci wdat_wdt [25245.406692] mdio dca libata tg3 megaraid_sas [last unloaded: mutex_test(OE)] [25245.406703] irq event stamp: 5883352 [25245.406705] hardirqs last enabled at (5883351): [<ffffffffb9e3bc8d>] _raw_spin_unlock_irqrestore+0x2d/0x40 [25245.406721] hardirqs last disabled at (5883352): [<ffffffffb9e2b445>] __schedule+0x1045/0x1800 [25245.406730]
softirqs last enabled at (5882168): [<ffffffffb896f34c>] handle_softirqs+0x3bc/0x4c0 [25245.406740] softirqs last disabled at (5882163): [<ffffffffb896f82a>] irq_exit_rcu+0xea/0x120 [25245.406748] CPU: 52 PID: 226274 Comm: python Kdump: loaded Tainted: G S W OE 6.6.0+ #1 [25245.406755] Hardware name: Huawei RH2288H V3/BC11HGSA0, BIOS 3.35 10/20/2016 [25245.406758] RIP: 0010:__asan_load4+0x4c/0x90 [25245.406767] Code: 80 ff ff 48 39 c7 72 42 48 8d 47 03 48 89 c2 83 e2 07 48 83 fa 02 76 1b 48 b9 00 00 00 00 00 fc ff df 48 c1 e8 03 0f b6 04 08 <84> c0 75 37 c3 cc cc cc cc 48 b9 00 00 00 00 00 fc ff df 48 89 fe [25245.406773] RSP: 0018:ffff88b367c0f720 EFLAGS: 00000806 [25245.406777] RAX: 0000000000000000 RBX: ffff88de7c209700 RCX: dffffc0000000000 [25245.406782] RDX: 0000000000000003 RSI: 0000000000000002 RDI: ffff88de7c209708 [25245.406785] RBP: ffff88de7b208640 R08: ffffffffb9e3cafd R09: ffffed1bcf6410c8 [25245.406789] R10: ffff88de7b208643 R11: 0000000000000000 R12: 0000000000d40000 [25245.406792] R13: 1ffff1166cf81ee6 R14: ffff88de7de09700 R15: ffff88de7c209708 [25245.406796] FS: 00007f66045fe640(0000) GS:ffff88de7c000000(0000) knlGS:0000000000000000 [25245.406801] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [25245.406805] CR2: 00007f5b21000010 CR3: 00000031f9400006 CR4: 00000000001706e0 [25245.406808] Call Trace: [25245.406811] <NMI> [25245.406814] ? watchdog_hardlockup_check.part.0+0xa4/0x170 [25245.406824] ? __perf_event_overflow+0x1d3/0x310 [25245.406834] ? handle_pmi_common+0x2a8/0x470 [25245.406845] ? __pfx_handle_pmi_common+0x10/0x10 [25245.406853] ? __pfx_native_set_fixmap+0x10/0x10 [25245.406861] ? apei_read+0x115/0x120 [25245.406870] ? __pfx_apei_read+0x10/0x10 [25245.406876] ? memcpy_fromio+0x22/0xd0 [25245.406885] ? ghes_copy_tofrom_phys+0x74/0x130 [25245.406892] ? __ghes_peek_estatus.isra.0+0x8b/0xf0 [25245.406900] ? look_up_lock_class+0x56/0x130 [25245.406905] ? ghes_in_nmi_queue_one_entry.constprop.0+0x98/0x3f0 [25245.406913] ? 
lock_acquire+0x256/0x3b0 [25245.406923] ? intel_bts_interrupt+0x7d/0x1b0 [25245.406931] ? intel_pmu_handle_irq+0x158/0x320 [25245.406938] ? perf_event_nmi_handler+0x39/0x60 [25245.406947] ? nmi_handle+0x103/0x300 [25245.406958] ? default_do_nmi+0x45/0x110 [25245.406966] ? exc_nmi+0x100/0x190 [25245.406972] ? end_repeat_nmi+0x16/0x67 [25245.406984] ? native_queued_spin_lock_slowpath+0x46d/0x580 [25245.406992] ? __asan_load4+0x4c/0x90 [25245.406998] ? __asan_load4+0x4c/0x90 [25245.407004] ? __asan_load4+0x4c/0x90 [25245.407010] </NMI> [25245.407012] <TASK> [25245.407013] native_queued_spin_lock_slowpath+0x46d/0x580 [25245.407020] ? __pfx_native_queued_spin_lock_slowpath+0x10/0x10 [25245.407026] ? find_held_lock+0x8c/0xa0 [25245.407031] ? local_clock_noinstr+0xb/0xb0 [25245.407038] do_raw_spin_lock+0x19b/0x1b0 [25245.407045] ? __pfx_do_raw_spin_lock+0x10/0x10 [25245.407049] ? __pfx_do_raw_spin_trylock+0x10/0x10 [25245.407055] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.407065] _raw_spin_lock_nested+0x61/0x80 [25245.407073] raw_spin_rq_lock_nested+0x11/0x20 [25245.407078] steal_from+0x10d/0x3c0 [25245.407088] try_steal+0x457/0x6a0 [25245.407096] ? __pfx_try_steal+0x10/0x10 [25245.407102] ? qos_smt_check_siblings_status+0xc2/0x100 [25245.407111] ? lock_is_held_type+0x9d/0x110 [25245.407118] pick_next_task_fair+0x4c7/0x1090 [25245.407129] pick_next_task+0x68/0x1b0 [25245.407138] __schedule+0x2f1/0x1800 [25245.407144] ? hrtimer_start_range_ns+0x71/0x100 [25245.407152] ? rb_insert_color+0x2a/0x270 [25245.407162] ? __pfx___schedule+0x10/0x10 [25245.407168] ? __lock_release+0x142/0x290 [25245.407173] ? futex_wait_queue+0x5c/0xc0 [25245.407182] ? __pfx___lock_release+0x10/0x10 [25245.407188] ? __hrtimer_start_range_ns+0x2bc/0x530 [25245.407196] ? mark_held_locks+0x23/0x90 [25245.407205] schedule+0x8e/0x120 [25245.407212] futex_wait_queue+0x83/0xc0 [25245.407219] futex_wait+0x1fa/0x3e0 [25245.407227] ? __pfx_futex_wait+0x10/0x10 [25245.407234] ? 
__pfx_get_futex_key+0x10/0x10 [25245.407242] ? __pfx_hrtimer_wakeup+0x10/0x10 [25245.407248] ? mark_lock.part.0+0x8a/0x620 [25245.407259] ? __lock_release+0x142/0x290 [25245.407268] do_futex+0x196/0x230 [25245.407275] ? __pfx_do_futex+0x10/0x10 [25245.407280] ? __pfx___lock_release+0x10/0x10 [25245.407286] ? clockevents_program_event+0x75/0x140 [25245.407294] __se_sys_futex+0xc2/0x270 [25245.407302] ? __pfx___se_sys_futex+0x10/0x10 [25245.407308] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.407315] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.407321] ? ktime_get_coarse_real_ts64+0x8e/0x100 [25245.407330] do_syscall_64+0x6c/0x120 [25245.407336] entry_SYSCALL_64_after_hwframe+0x78/0xe2 [25245.407342] RIP: 0033:0x7f6606287cfa [25245.407348] Code: 24 08 e8 c9 f8 ff ff 4c 8b 54 24 18 45 31 c0 44 89 e2 89 c5 8b 74 24 08 48 8b 7c 24 10 41 b9 ff ff ff ff b8 ca 00 00 00 0f 05 <89> ef 48 89 44 24 08 e8 1a f9 ff ff 48 8b 44 24 08 e9 68 ff ff ff [25245.407352] RSP: 002b:00007f66045fdb10 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca [25245.407358] RAX: ffffffffffffffda RBX: 00007f66045fdc60 RCX: 00007f6606287cfa [25245.407361] RDX: 0000000000000000 RSI: 0000000000000089 RDI: 00007f660695b04c [25245.407365] RBP: 0000000000000000 R08: 0000000000000000 R09: 00000000ffffffff [25245.407368] R10: 00007f66045fdc60 R11: 0000000000000246 R12: 0000000000000000 [25245.407371] R13: 00007f660695b04c R14: 0000000000000000 R15: 00007f660695b050 [25245.407380] </TASK> [25245.407383] Kernel panic - not syncing: Hard LOCKUP [25245.407387] CPU: 52 PID: 226274 Comm: python Kdump: loaded Tainted: G S W OE 6.6.0+ #1 [25245.407392] Hardware name: Huawei RH2288H V3/BC11HGSA0, BIOS 3.35 10/20/2016 [25245.407395] Call Trace: [25245.407397] <NMI> [25245.407399] dump_stack_lvl+0x47/0x80 [25245.407407] panic+0x20a/0x410 [25245.407413] ? __pfx_panic+0x10/0x10 [25245.407422] ? 
nmi_panic+0x27/0x70 [25245.407428] nmi_panic+0x5a/0x70 [25245.407433] watchdog_hardlockup_check.part.0+0x15f/0x170 [25245.407442] __perf_event_overflow+0x1d3/0x310 [25245.407450] handle_pmi_common+0x2a8/0x470 [25245.407460] ? __pfx_handle_pmi_common+0x10/0x10 [25245.407468] ? __pfx_native_set_fixmap+0x10/0x10 [25245.407474] ? apei_read+0x115/0x120 [25245.407481] ? __pfx_apei_read+0x10/0x10 [25245.407486] ? memcpy_fromio+0x22/0xd0 [25245.407493] ? ghes_copy_tofrom_phys+0x74/0x130 [25245.407500] ? __ghes_peek_estatus.isra.0+0x8b/0xf0 [25245.407507] ? look_up_lock_class+0x56/0x130 [25245.407512] ? ghes_in_nmi_queue_one_entry.constprop.0+0x98/0x3f0 [25245.407519] ? lock_acquire+0x256/0x3b0 [25245.407527] ? intel_bts_interrupt+0x7d/0x1b0 [25245.407534] intel_pmu_handle_irq+0x158/0x320 [25245.407542] perf_event_nmi_handler+0x39/0x60 [25245.407550] nmi_handle+0x103/0x300 [25245.407558] default_do_nmi+0x45/0x110 [25245.407565] exc_nmi+0x100/0x190 [25245.407571] end_repeat_nmi+0x16/0x67 [25245.407578] RIP: 0010:__asan_load4+0x4c/0x90 [25245.407583] Code: 80 ff ff 48 39 c7 72 42 48 8d 47 03 48 89 c2 83 e2 07 48 83 fa 02 76 1b 48 b9 00 00 00 00 00 fc ff df 48 c1 e8 03 0f b6 04 08 <84> c0 75 37 c3 cc cc cc cc 48 b9 00 00 00 00 00 fc ff df 48 89 fe [25245.407588] RSP: 0018:ffff88b367c0f720 EFLAGS: 00000806 [25245.407592] RAX: 0000000000000000 RBX: ffff88de7c209700 RCX: dffffc0000000000 [25245.407596] RDX: 0000000000000003 RSI: 0000000000000002 RDI: ffff88de7c209708 [25245.407599] RBP: ffff88de7b208640 R08: ffffffffb9e3cafd R09: ffffed1bcf6410c8 [25245.407603] R10: ffff88de7b208643 R11: 0000000000000000 R12: 0000000000d40000 [25245.407606] R13: 1ffff1166cf81ee6 R14: ffff88de7de09700 R15: ffff88de7c209708 [25245.407613] ? native_queued_spin_lock_slowpath+0x46d/0x580 [25245.407621] ? __asan_load4+0x4c/0x90 [25245.407628] ? __asan_load4+0x4c/0x90 [25245.407634] </NMI> [25245.407636] <TASK> [25245.407637] native_queued_spin_lock_slowpath+0x46d/0x580 [25245.407643] ? 
__pfx_native_queued_spin_lock_slowpath+0x10/0x10 [25245.407649] ? find_held_lock+0x8c/0xa0 [25245.407654] ? local_clock_noinstr+0xb/0xb0 [25245.407661] do_raw_spin_lock+0x19b/0x1b0 [25245.407666] ? __pfx_do_raw_spin_lock+0x10/0x10 [25245.407671] ? __pfx_do_raw_spin_trylock+0x10/0x10 [25245.407678] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.407685] _raw_spin_lock_nested+0x61/0x80 [25245.407692] raw_spin_rq_lock_nested+0x11/0x20 [25245.407697] steal_from+0x10d/0x3c0 [25245.407705] try_steal+0x457/0x6a0 [25245.407713] ? __pfx_try_steal+0x10/0x10 [25245.407718] ? qos_smt_check_siblings_status+0xc2/0x100 [25245.407726] ? lock_is_held_type+0x9d/0x110 [25245.407732] pick_next_task_fair+0x4c7/0x1090 [25245.407742] pick_next_task+0x68/0x1b0 [25245.407749] __schedule+0x2f1/0x1800 [25245.407755] ? hrtimer_start_range_ns+0x71/0x100 [25245.407760] ? rb_insert_color+0x2a/0x270 [25245.407769] ? __pfx___schedule+0x10/0x10 [25245.407775] ? __lock_release+0x142/0x290 [25245.407780] ? futex_wait_queue+0x5c/0xc0 [25245.407786] ? __pfx___lock_release+0x10/0x10 [25245.407793] ? __hrtimer_start_range_ns+0x2bc/0x530 [25245.407800] ? mark_held_locks+0x23/0x90 [25245.407809] schedule+0x8e/0x120 [25245.407815] futex_wait_queue+0x83/0xc0 [25245.407823] futex_wait+0x1fa/0x3e0 [25245.407830] ? __pfx_futex_wait+0x10/0x10 [25245.407837] ? __pfx_get_futex_key+0x10/0x10 [25245.407844] ? __pfx_hrtimer_wakeup+0x10/0x10 [25245.407850] ? mark_lock.part.0+0x8a/0x620 [25245.407861] ? __lock_release+0x142/0x290 [25245.407870] do_futex+0x196/0x230 [25245.407876] ? __pfx_do_futex+0x10/0x10 [25245.407881] ? __pfx___lock_release+0x10/0x10 [25245.407887] ? clockevents_program_event+0x75/0x140 [25245.407894] __se_sys_futex+0xc2/0x270 [25245.407901] ? __pfx___se_sys_futex+0x10/0x10 [25245.407907] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.407914] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.407920] ? 
ktime_get_coarse_real_ts64+0x8e/0x100 [25245.407928] do_syscall_64+0x6c/0x120 [25245.407934] entry_SYSCALL_64_after_hwframe+0x78/0xe2 [25245.407940] RIP: 0033:0x7f6606287cfa [25245.407943] Code: 24 08 e8 c9 f8 ff ff 4c 8b 54 24 18 45 31 c0 44 89 e2 89 c5 8b 74 24 08 48 8b 7c 24 10 41 b9 ff ff ff ff b8 ca 00 00 00 0f 05 <89> ef 48 89 44 24 08 e8 1a f9 ff ff 48 8b 44 24 08 e9 68 ff ff ff [25245.407948] RSP: 002b:00007f66045fdb10 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca [25245.407952] RAX: ffffffffffffffda RBX: 00007f66045fdc60 RCX: 00007f6606287cfa [25245.407956] RDX: 0000000000000000 RSI: 0000000000000089 RDI: 00007f660695b04c [25245.407958] RBP: 0000000000000000 R08: 0000000000000000 R09: 00000000ffffffff [25245.407961] R10: 00007f66045fdc60 R11: 0000000000000246 R12: 0000000000000000 [25245.407964] R13: 00007f660695b04c R14: 0000000000000000 R15: 00007f660695b050 [25245.407973] </TASK> [25245.407981] [25245.407982] ================================ [25245.407984] WARNING: inconsistent lock state [25245.407986] 6.6.0+ #1 Tainted: G S W OE [25245.407988] -------------------------------- [25245.407990] inconsistent {INITIAL USE} -> {IN-NMI} usage. 
[25245.407993] python/226274 [HC1[1]:SC0[0]:HE0:SE1] takes: [25245.407998] ffffffffbb27ab58 (&nmi_desc[0].lock){....}-{2:2}, at: __register_nmi_handler+0x71/0x1e0 [25245.408012] {INITIAL USE} state was registered at: [25245.408014] __lock_acquire+0x2dd/0x9a0 [25245.408020] lock_acquire+0x152/0x3b0 [25245.408025] _raw_spin_lock_irqsave+0x40/0x90 [25245.408031] __register_nmi_handler+0x71/0x1e0 [25245.408036] init_hw_perf_events+0x188/0x510 [25245.408044] do_one_initcall+0xbb/0x390 [25245.408052] do_pre_smp_initcalls+0x61/0xf0 [25245.408057] kernel_init_freeable+0x22e/0x330 [25245.408062] kernel_init+0x1b/0x1e0 [25245.408068] ret_from_fork+0x30/0x50 [25245.408074] ret_from_fork_asm+0x1b/0x30 [25245.408079] irq event stamp: 5883352 [25245.408081] hardirqs last enabled at (5883351): [<ffffffffb9e3bc8d>] _raw_spin_unlock_irqrestore+0x2d/0x40 [25245.408089] hardirqs last disabled at (5883352): [<ffffffffb9e2b445>] __schedule+0x1045/0x1800 [25245.408096] softirqs last enabled at (5882168): [<ffffffffb896f34c>] handle_softirqs+0x3bc/0x4c0 [25245.408103] softirqs last disabled at (5882163): [<ffffffffb896f82a>] irq_exit_rcu+0xea/0x120 [25245.408110] other info that might help us debug this: [25245.408111] Possible unsafe locking scenario: [25245.408113] CPU0 [25245.408114] ---- [25245.408115] lock(&nmi_desc[0].lock); [25245.408119] <Interrupt> [25245.408120] lock(&nmi_desc[0].lock); [25245.408123] *** DEADLOCK *** [25245.408124] 2 locks held by python/226274: [25245.408127] #0: ffffffffbb6e38c0 (rcu_read_lock){....}-{1:2}, at: try_steal+0x149/0x6a0 [25245.408139] #1: ffff88de7b208658 (&rq->__lock){-.-.}-{2:2}, at: raw_spin_rq_lock_nested+0x11/0x20 [25245.408151] stack backtrace: [25245.408153] CPU: 52 PID: 226274 Comm: python Kdump: loaded Tainted: G S W OE 6.6.0+ #1 [25245.408158] Hardware name: Huawei RH2288H V3/BC11HGSA0, BIOS 3.35 10/20/2016 [25245.408161] Call Trace: [25245.408163] <NMI> [25245.408165] dump_stack_lvl+0x47/0x80 [25245.408171] lock_acquire+0x2e2/0x3b0 
[25245.408178] ? __pfx_lock_acquire+0x10/0x10 [25245.408185] ? __register_nmi_handler+0x71/0x1e0 [25245.408191] ? look_up_lock_class+0x56/0x130 [25245.408196] ? irq_work_claim+0x1e/0x40 [25245.408204] ? irq_work_queue+0xb/0x40 [25245.408210] _raw_spin_lock_irqsave+0x40/0x90 [25245.408217] ? __register_nmi_handler+0x71/0x1e0 [25245.408223] __register_nmi_handler+0x71/0x1e0 [25245.408230] ? __asan_memcpy+0x39/0x60 [25245.408239] nmi_shootdown_cpus+0x71/0x100 [25245.408247] kdump_nmi_shootdown_cpus+0x11/0x20 [25245.408256] native_machine_crash_shutdown+0x38/0xa0 [25245.408264] __crash_kexec+0xcf/0x1e0 [25245.408271] ? __pfx___crash_kexec+0x10/0x10 [25245.408278] ? lock_release+0x75/0x160 [25245.408286] ? __crash_kexec+0x88/0x1e0 [25245.408293] ? __crash_kexec+0x1c3/0x1e0 [25245.408303] ? panic+0x213/0x410 [25245.408308] ? __pfx_panic+0x10/0x10 [25245.408315] ? nmi_panic+0x27/0x70 [25245.408321] ? nmi_panic+0x5a/0x70 [25245.408326] ? watchdog_hardlockup_check.part.0+0x15f/0x170 [25245.408333] ? __perf_event_overflow+0x1d3/0x310 [25245.408340] ? handle_pmi_common+0x2a8/0x470 [25245.408348] ? __pfx_handle_pmi_common+0x10/0x10 [25245.408356] ? __pfx_native_set_fixmap+0x10/0x10 [25245.408361] ? apei_read+0x115/0x120 [25245.408367] ? __pfx_apei_read+0x10/0x10 [25245.408373] ? memcpy_fromio+0x22/0xd0 [25245.408379] ? ghes_copy_tofrom_phys+0x74/0x130 [25245.408386] ? __ghes_peek_estatus.isra.0+0x8b/0xf0 [25245.408392] ? look_up_lock_class+0x56/0x130 [25245.408396] ? ghes_in_nmi_queue_one_entry.constprop.0+0x98/0x3f0 [25245.408404] ? lock_acquire+0x256/0x3b0 [25245.408411] ? intel_bts_interrupt+0x7d/0x1b0 [25245.408418] ? intel_pmu_handle_irq+0x158/0x320 [25245.408424] ? perf_event_nmi_handler+0x39/0x60 [25245.408430] ? nmi_handle+0x103/0x300 [25245.408438] ? default_do_nmi+0x45/0x110 [25245.408444] ? exc_nmi+0x100/0x190 [25245.408449] ? end_repeat_nmi+0x16/0x67 [25245.408457] ? native_queued_spin_lock_slowpath+0x46d/0x580 [25245.408464] ? 
__asan_load4+0x4c/0x90 [25245.408469] ? __asan_load4+0x4c/0x90 [25245.408475] ? __asan_load4+0x4c/0x90 [25245.408481] </NMI> [25245.408482] <TASK> [25245.408484] ? native_queued_spin_lock_slowpath+0x46d/0x580 [25245.408489] ? __pfx_native_queued_spin_lock_slowpath+0x10/0x10 [25245.408495] ? find_held_lock+0x8c/0xa0 [25245.408500] ? local_clock_noinstr+0xb/0xb0 [25245.408506] ? do_raw_spin_lock+0x19b/0x1b0 [25245.408511] ? __pfx_do_raw_spin_lock+0x10/0x10 [25245.408516] ? __pfx_do_raw_spin_trylock+0x10/0x10 [25245.408522] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.408529] ? _raw_spin_lock_nested+0x61/0x80 [25245.408535] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.408540] ? steal_from+0x10d/0x3c0 [25245.408547] ? try_steal+0x457/0x6a0 [25245.408553] ? __pfx_try_steal+0x10/0x10 [25245.408559] ? qos_smt_check_siblings_status+0xc2/0x100 [25245.408566] ? lock_is_held_type+0x9d/0x110 [25245.408572] ? pick_next_task_fair+0x4c7/0x1090 [25245.408580] ? pick_next_task+0x68/0x1b0 [25245.408586] ? __schedule+0x2f1/0x1800 [25245.408592] ? hrtimer_start_range_ns+0x71/0x100 [25245.408597] ? rb_insert_color+0x2a/0x270 [25245.408605] ? __pfx___schedule+0x10/0x10 [25245.408611] ? __lock_release+0x142/0x290 [25245.408617] ? futex_wait_queue+0x5c/0xc0 [25245.408623] ? __pfx___lock_release+0x10/0x10 [25245.408629] ? __hrtimer_start_range_ns+0x2bc/0x530 [25245.408636] ? mark_held_locks+0x23/0x90 [25245.408644] ? schedule+0x8e/0x120 [25245.408650] ? futex_wait_queue+0x83/0xc0 [25245.408657] ? futex_wait+0x1fa/0x3e0 [25245.408663] ? __pfx_futex_wait+0x10/0x10 [25245.408670] ? __pfx_get_futex_key+0x10/0x10 [25245.408676] ? __pfx_hrtimer_wakeup+0x10/0x10 [25245.408681] ? mark_lock.part.0+0x8a/0x620 [25245.408691] ? __lock_release+0x142/0x290 [25245.408699] ? do_futex+0x196/0x230 [25245.408705] ? __pfx_do_futex+0x10/0x10 [25245.408710] ? __pfx___lock_release+0x10/0x10 [25245.408716] ? clockevents_program_event+0x75/0x140 [25245.408722] ? __se_sys_futex+0xc2/0x270 [25245.408728] ? 
__pfx___se_sys_futex+0x10/0x10 [25245.408734] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.408741] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.408747] ? ktime_get_coarse_real_ts64+0x8e/0x100 [25245.408754] ? do_syscall_64+0x6c/0x120 [25245.408759] ? entry_SYSCALL_64_after_hwframe+0x78/0xe2 [25245.408770] </TASK> [25245.410131] kexec: Bye!

**[Defect details and analysis reference links]**
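The log above was pasted as one continuous run, which makes the call chain hard to follow. As a reading aid only, here is a minimal Python sketch that splits such a flattened dmesg dump on its `[seconds.micros]` timestamps and lists the frames inside `<TASK>` that are not prefixed with `?` (the kernel marks frames it could not confirm while unwinding with `?`). The helper names `split_entries` and `reliable_task_frames` are hypothetical, and the sample string is a short excerpt of the log in this issue, not the full trace.

```python
import re

# A dmesg timestamp such as "[25245.406522]". Bracketed addresses like
# "[<ffffffffb9e3bc8d>]" do not match because they lack the digits.digits form.
TIMESTAMP = re.compile(r"\[\s*\d+\.\d+\]")

# A stack frame such as "steal_from+0x10d/0x3c0". Frames prefixed with "? "
# do not match, because the pattern is anchored at the start of the message.
FRAME = re.compile(r"^[\w.]+\+0x[0-9a-f]+/0x[0-9a-f]+$")


def split_entries(flat_log):
    """Split a flattened dmesg dump into (timestamp, message) pairs."""
    matches = list(TIMESTAMP.finditer(flat_log))
    entries = []
    for i, match in enumerate(matches):
        # Each message runs up to the next timestamp (or the end of the text).
        end = matches[i + 1].start() if i + 1 < len(matches) else len(flat_log)
        entries.append((match.group(0), flat_log[match.end():end].strip()))
    return entries


def reliable_task_frames(entries):
    """Return the frames between <TASK> and </TASK> that carry no '?' prefix."""
    frames = []
    in_task = False
    for _, message in entries:
        if message == "<TASK>":
            in_task = True
        elif message == "</TASK>":
            in_task = False
        elif in_task and FRAME.match(message):
            frames.append(message)
    return frames


# Short excerpt taken from the log in this issue.
sample = (
    "[25245.407012] <TASK> "
    "[25245.407013] native_queued_spin_lock_slowpath+0x46d/0x580 "
    "[25245.407020] ? __pfx_native_queued_spin_lock_slowpath+0x10/0x10 "
    "[25245.407038] do_raw_spin_lock+0x19b/0x1b0 "
    "[25245.407078] steal_from+0x10d/0x3c0 "
    "[25245.407088] try_steal+0x457/0x6a0 "
    "[25245.407380] </TASK>"
)

for frame in reliable_task_frames(split_entries(sample)):
    print(frame)
```

Run against the full log, the same filter should reduce the first `<TASK>` section to the chain from `entry_SYSCALL_64_after_hwframe` through `__schedule`, `pick_next_task_fair`, `try_steal`, and `steal_from` down to the spinning `native_queued_spin_lock_slowpath`, which is where the watchdog caught CPU 52.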
__asan_load4+0x4c/0x90 [25245.408469] ? __asan_load4+0x4c/0x90 [25245.408475] ? __asan_load4+0x4c/0x90 [25245.408481] </NMI> [25245.408482] <TASK> [25245.408484] ? native_queued_spin_lock_slowpath+0x46d/0x580 [25245.408489] ? __pfx_native_queued_spin_lock_slowpath+0x10/0x10 [25245.408495] ? find_held_lock+0x8c/0xa0 [25245.408500] ? local_clock_noinstr+0xb/0xb0 [25245.408506] ? do_raw_spin_lock+0x19b/0x1b0 [25245.408511] ? __pfx_do_raw_spin_lock+0x10/0x10 [25245.408516] ? __pfx_do_raw_spin_trylock+0x10/0x10 [25245.408522] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.408529] ? _raw_spin_lock_nested+0x61/0x80 [25245.408535] ? raw_spin_rq_lock_nested+0x11/0x20 [25245.408540] ? steal_from+0x10d/0x3c0 [25245.408547] ? try_steal+0x457/0x6a0 [25245.408553] ? __pfx_try_steal+0x10/0x10 [25245.408559] ? qos_smt_check_siblings_status+0xc2/0x100 [25245.408566] ? lock_is_held_type+0x9d/0x110 [25245.408572] ? pick_next_task_fair+0x4c7/0x1090 [25245.408580] ? pick_next_task+0x68/0x1b0 [25245.408586] ? __schedule+0x2f1/0x1800 [25245.408592] ? hrtimer_start_range_ns+0x71/0x100 [25245.408597] ? rb_insert_color+0x2a/0x270 [25245.408605] ? __pfx___schedule+0x10/0x10 [25245.408611] ? __lock_release+0x142/0x290 [25245.408617] ? futex_wait_queue+0x5c/0xc0 [25245.408623] ? __pfx___lock_release+0x10/0x10 [25245.408629] ? __hrtimer_start_range_ns+0x2bc/0x530 [25245.408636] ? mark_held_locks+0x23/0x90 [25245.408644] ? schedule+0x8e/0x120 [25245.408650] ? futex_wait_queue+0x83/0xc0 [25245.408657] ? futex_wait+0x1fa/0x3e0 [25245.408663] ? __pfx_futex_wait+0x10/0x10 [25245.408670] ? __pfx_get_futex_key+0x10/0x10 [25245.408676] ? __pfx_hrtimer_wakeup+0x10/0x10 [25245.408681] ? mark_lock.part.0+0x8a/0x620 [25245.408691] ? __lock_release+0x142/0x290 [25245.408699] ? do_futex+0x196/0x230 [25245.408705] ? __pfx_do_futex+0x10/0x10 [25245.408710] ? __pfx___lock_release+0x10/0x10 [25245.408716] ? clockevents_program_event+0x75/0x140 [25245.408722] ? __se_sys_futex+0xc2/0x270 [25245.408728] ? 
__pfx___se_sys_futex+0x10/0x10 [25245.408734] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.408741] ? ktime_get_coarse_real_ts64+0xea/0x100 [25245.408747] ? ktime_get_coarse_real_ts64+0x8e/0x100 [25245.408754] ? do_syscall_64+0x6c/0x120 [25245.408759] ? entry_SYSCALL_64_after_hwframe+0x78/0xe2 [25245.408770] </TASK> [25245.410131] kexec: Bye! **【缺陷详情及分析指导参考链接】**
Status
Done
Labels
sig/Kernel