-
When running a bleeding edge nvim (which has a libuv using io_uring) I was getting "hung task" kernel panics. After much anguish, we find this was due to the related The response of chromium upstream maintainers is to start disabling "hung task" panics in various places. Is this the correct course of action? A quick survey of Google for D-state kworkers seems to indicate that (apart from my problem) this is almost always indicative of a major bug that requires a reboot. I wonder if a better approach would be to mark such Full context: |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 6 replies
-
This SIGSTOP issue has been closed a year ago. It's marked as false positive and not worth fixing. |
Beta Was this translation helpful? Give feedback.
-
Thank you for considering
On Sun, Jun 11, 2023, 7:27 PM Jens Axboe ***@***.***> wrote:
It's not, but of course if you have the setting on that causes a panic on
(what appears to be) a hung task, then it can turn into a problem. All that
trace is telling you that here's the stack trace for this process, and it's
been stuck uninterruptibly for X amount of time. Obviously it'd be better
if we didn't have this oddity to begin with. It's really because the
kworker is waiting on requests to get finished, but someone sent a SIGSTOP
to the tasks that are supposed to finish them. This is what is causing the
kworker to appear stalled.
As such, there's obviously nothing wrong by disabling panic on a hung task.
The problem is that the system may have real hung tasks and want to panic
on that condition, and potentially reboot.
That's what I was trying to express in my original message. Thank you for
rephrasing :)
Let me ponder this for a bit. Since the kworker exit part doesn't take
signals to begin with, a quick work-around would be to have the ring exit
work just sleep interruptibly instead. We can still warn if that takes
longer than we think, but at least we would not have D state workers which
is what is triggering this to begin with.
Thanks for your consideration! I saw your patch and it's sounds like it's
just what I needed.
I wonder whether anyone in the kernel would like to consider this case. The
same line of reasoning would apply, I'd think. I think it's work asking but
I don't actually know where I should ask.
—
… Reply to this email directly, view it on GitHub
<#878 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAE4KSGLL2DK7XJOCTSWF7DXKZ5BRANCNFSM6AAAAAAZCPOYJ4>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
It's not, but of course if you have the setting on that causes a panic on (what appears to be) a hung task, then it can turn into a problem. All that trace is telling you that here's the stack trace for this process, and it's been stuck uninterruptibly for X amount of time. Obviously it'd be better if we didn't have this oddity to begin with. It's really because the kworker is waiting on requests to get finished, but someone sent a SIGSTOP to the tasks that are supposed to finish them. This is what is causing the kworker to appear stalled.
As such, there's obviously nothing wrong by disabling panic on a hung task. The problem is that the system may have real hung tasks and want to panic o…