-
Notifications
You must be signed in to change notification settings - Fork 9
Description
Description of the problem
When I open Events page of Tendrl ui, I see events like:
Job finished successfully (job_id: 4207477c-8101-4921-b48a-f66c4d028cb8)
I don't immediately see what kind of job it is.
This could be especially confusing when I see lot of events like that, without any hint what's wrong (if anything):
Note that in the screenshot above, the message about successfully finished job repeats after few minutes.
When I tried to dig deeper and on the tendrl server machine tried:
# grep -R 4207477c-8101-4921-b48a-f66c4d028cb8 /var/log/
/var/log/tendrl/node-agent/node-agent.log:Mar 12 15:56:49 mbukatov-usm1-server tendrl-node-agent: 2018-03-12 15:56:49.766151+00:00 - node_agent - /usr/lib/python2.7/site-packages/tendrl/commons/jobs/__init__.py:169 - process_job - INFO - Node (76bc408b-e51d-4530-8b30-29ee1f153e60)(type: node)(tags: [u'tendrl/node_76bc408b-e51d-4530-8b30-29ee1f153e60', u'tendrl/integration/monitoring', u'tendrl/central-store', u'tendrl/server', u'tendrl/monitor', u'tendrl/node']) will not process job-4207477c-8101-4921-b48a-f66c4d028cb8 (tags: tendrl/node_6f6e2269-bcf4-4889-82c7-9ba8ed8fb152)
/var/log/messages:Mar 12 15:56:49 mbukatov-usm1-server journal: 2018-03-12 15:56:49.766151+00:00 - node_agent - /usr/lib/python2.7/site-packages/tendrl/commons/jobs/__init__.py:169 - process_job - INFO - Node (76bc408b-e51d-4530-8b30-29ee1f153e60)(type: node)(tags: [u'tendrl/node_76bc408b-e51d-4530-8b30-29ee1f153e60', u'tendrl/integration/monitoring', u'tendrl/central-store', u'tendrl/server', u'tendrl/monitor', u'tendrl/node']) will not process job-4207477c-8101-4921-b48a-f66c4d028cb8 (tags: tendrl/node_6f6e2269-bcf4-4889-82c7-9ba8ed8fb152)
I see only single log message related to this (with two occurrences though, one in node agent and other in messages log) and I read it as:
Node 76bc408b-e51d-4530-8b30-29ee1f153e60 will not process job 4207477c-8101-4921-b48a-f66c4d028cb8
Which doesn't help me much with debugging of the event showed above, as it contradicts the original message (job finished successfully).
Expected Result
Event description may contain more details, eg. job type, to improve information delivered to the user.
Moreover we will need a description of the job id and how to use it for debugging. In my case, I'm unable to find any useful details for the event to go further.
Version
On Storage Servers:
# rpm -qa | egrep '(gluster|tendrl)'
glusterfs-api-4.1dev-0.115.git685d440.el7.centos.x86_64
glusterfs-events-4.1dev-0.115.git685d440.el7.centos.x86_64
tendrl-gluster-integration-1.6.1-1.el7.centos.noarch
tendrl-node-agent-1.6.1-1.el7.centos.noarch
python2-gluster-4.1dev-0.115.git685d440.el7.centos.x86_64
tendrl-collectd-selinux-1.5.4-2.el7.centos.noarch
glusterfs-fuse-4.1dev-0.115.git685d440.el7.centos.x86_64
glusterfs-server-4.1dev-0.115.git685d440.el7.centos.x86_64
glusterfs-geo-replication-4.1dev-0.115.git685d440.el7.centos.x86_64
tendrl-commons-1.6.1-1.el7.centos.noarch
glusterfs-libs-4.1dev-0.115.git685d440.el7.centos.x86_64
glusterfs-client-xlators-4.1dev-0.115.git685d440.el7.centos.x86_64
glusterfs-cli-4.1dev-0.115.git685d440.el7.centos.x86_64
tendrl-selinux-1.5.4-2.el7.centos.noarch
glusterfs-4.1dev-0.115.git685d440.el7.centos.x86_64
On Tendrl server:
# rpm -qa | egrep '(gluster|tendrl)'
tendrl-grafana-plugins-1.6.1-1.el7.centos.noarch
tendrl-monitoring-integration-1.6.1-1.el7.centos.noarch
tendrl-notifier-1.6.0-1.el7.centos.noarch
tendrl-api-httpd-1.6.1-1.el7.centos.noarch
tendrl-selinux-1.5.4-2.el7.centos.noarch
tendrl-node-agent-1.6.1-1.el7.centos.noarch
tendrl-ui-1.6.1-1.el7.centos.noarch
tendrl-grafana-selinux-1.5.4-2.el7.centos.noarch
tendrl-commons-1.6.1-1.el7.centos.noarch
tendrl-api-1.6.1-1.el7.centos.noarch