-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: Child node service reports as active, but seems hung and is stale in Netdata Cloud #16809
Comments
I experience the very same issue with a netdata parent setup. Some logs:
|
@shodanshok stable version too? I think the problem is fixed in the latest. |
Yes, I am using the latest stable version (1.44) |
FYI, since I posted the issue, I've tried playing around with a few options/plugins, and it seems disabling the |
@asteinlein Are you still having this issue after updating to v1.44.2? |
I'm still running v1.44.1. Shouldn't it automatically update? ( |
Check your auto updates
Apt-get update just fetches changes in the repo, not upgrading any piece of software. |
I've upgraded and enabled the |
Bug description
After a few hours of running the client, the agent it seems to somehow die/stop collecting and reporting data, even though
systemctl status netdata
reports the state as active.The log seems to indicate that a child was killed somehow, and that collectors/replication have stopped, but I have no idea why nor why the agent seems to still run. When I finally tried to stop the client with
systemctl stop netplan
, it hung for a long time, before it was eventually automatically killed.Expected behavior
It should continue to run, collect metrics and have the metrics show up in Netdata Cloud.
Steps to reproduce
Installation method
kickstart.sh
System info
Netdata build info
Additional info
systemctl status netdata
when the agent had hung/stopped reporting:I then tried inspecting logs from Netdata, but I'm a bit confused why I'm not seeing the messages reported above. Instead, all I'm seeing with
journalctl -u netdata
is:Then tried to stop Netdata with
systemctl stop netdata
which hung, but was eventually killed:The text was updated successfully, but these errors were encountered: