Discussion:
Some graphs working, some not in 4.2 -- how to debug?
Daniel Rich
2012-09-14 15:39:59 UTC
Permalink
Daniel Rich [http://community.zenoss.org/people/sjthespian] created the discussion

"Some graphs working, some not in 4.2 -- how to debug?"

To view the discussion, visit: http://community.zenoss.org/message/68583#68583

--------------------------------------------------------------
I am getting a Zenoss 4.2 server up and running to replace an aging Nagios server but have run into a problem. Most things seem to be working just fine but the majority of my metrics are not graphing.

For example, I have a Linux server running an Apache server. I get events just fine when things go down and the server models correctly. My Apache and NTP server graphs look great, but the load, CPU and Memory graphs are all showing nan. SNMP works just fine and I can both query and walk the OIDs that it should be querying. The network interfaces are in a similar state, it can query the mac address and interface state but the graphs are all showing nan.

If all of the graphs were broken I would be suspecting rrdcached or some problem with the rrd libs, but since they are working I'm assuming it is some problem with the snmp collection run. The only errors I see in the zenperfsnmp logs are for a completely different device, nothing for this one.

Any suggestions on where I can look or what I can do to debug this?
--------------------------------------------------------------

Reply to this message by replying to this email -or- go to the discussion on Zenoss Community
[http://community.zenoss.org/message/68583#68583]

Start a new discussion in zenoss-users by email
[discussions-community-forums-zenoss--***@community.zenoss.org] -or- at Zenoss Community
[http://community.zenoss.org/choose-container!input.jspa?contentType=1&containerType=14&container=2003]
COMPUTER KING
2012-09-14 18:13:17 UTC
Permalink
COMPUTER KING [http://community.zenoss.org/people/c0mputerking] created the discussion

"Re: Some graphs working, some not in 4.2 -- how to debug?"

To view the discussion, visit: http://community.zenoss.org/message/68593#68593

--------------------------------------------------------------
I have a similar problem, however i am getting no graphing data for any metrics would really like to see some bandwidth useage.  Here is my post with a simlar problem.

http://community.zenoss.org/message/68566#68566 http://community.zenoss.org/message/68566#68566
--------------------------------------------------------------

Reply to this message by replying to this email -or- go to the discussion on Zenoss Community
[http://community.zenoss.org/message/68593#68593]

Start a new discussion in zenoss-users by email
[discussions-community-forums-zenoss--***@community.zenoss.org] -or- at Zenoss Community
[http://community.zenoss.org/choose-container!input.jspa?contentType=1&containerType=14&container=2003]
jcurry
2012-09-16 18:00:49 UTC
Permalink
jcurry [http://community.zenoss.org/people/jcurry] created the discussion

"Re: Some graphs working, some not in 4.2 -- how to debug?"

To view the discussion, visit: http://community.zenoss.org/message/68601#68601

--------------------------------------------------------------
Do you have any data files created?  As the zenoss user, navigate to $ZENHOME/perf/Devices.  Under there, you should have a directory for each device.  Navigate down your test device directory.  At this level, you should see a number of rrd files for the whole device - stuff like CPU and Memory data files should show up here.

rrd datafiles for interfaces and filesystems are further down the directory hierarchy under os.

What device class are these devices in??  /Server/Linux would be a good start and should automatically have modeler plugins that collect configuration data for interfaces, filesystems, routes,....  This device should also automatically collect stuff like CPU and memory. Do you have components for filesystems and interfaces (indented items in left-hand menu) for your test Linux device?

It is the device class that determines the default modeler plugins (check from a device's left-hand menu Modeler Plugins menu) and the default performance Monitoring Templates (bottom of left-hand menu).

Collecting performance information for COMPONENTS (like interfaces and filesystems) should happen automatically.  You should not (and must not) manually bind component templates.  If you can see interface components for your device, select an interface and then use the Display dropdown in the middle of the main panel to show the component template for this interface that should be collecting performance information.

Cheers,
Jane
--------------------------------------------------------------

Reply to this message by replying to this email -or- go to the discussion on Zenoss Community
[http://community.zenoss.org/message/68601#68601]

Start a new discussion in zenoss-users by email
[discussions-community-forums-zenoss--***@community.zenoss.org] -or- at Zenoss Community
[http://community.zenoss.org/choose-container!input.jspa?contentType=1&containerType=14&container=2003]
Daniel Rich
2012-09-17 16:58:36 UTC
Permalink
Daniel Rich [http://community.zenoss.org/people/sjthespian] created the discussion

"Re: Some graphs working, some not in 4.2 -- how to debug?"

To view the discussion, visit: http://community.zenoss.org/message/68615#68615

--------------------------------------------------------------
The odd thing is, some of the graphs work and RRD files exist for all of them (i.e. mem and load). However, the non "app" rrds haven't updated since 13-Sep. Here is what I see there at the moment:
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotLogging.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotOpen.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotReadingRequest.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotSendingReply.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotWaiting.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_totalAccesses.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_totalKBytes.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_bytesPerReq.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_cpuLoad.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotDNSLookup.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 apache_slotKeepAlive.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 ntp_offset.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 HttpMonitor_size.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 HttpMonitor_time.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 17 09:54 NtpMonitor_offset.rrd
drwxr-x--- 4 zenoss zenoss  4096 Sep 13 13:50 os
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssCpuRawWait_ssCpuRawWait.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssCpuSystem_ssCpuSystem.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssCpuUser_ssCpuUser.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssIORawReceived_ssIORawReceived.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssIORawSent_ssIORawSent.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 memAvailSwap_memAvailSwap.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 memCached_memCached.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 memAvailReal_memAvailReal.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 memBuffer_memBuffer.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 laLoadInt15_laLoadInt15.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 laLoadInt1_laLoadInt1.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 sysUpTime_sysUpTime.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 laLoadInt5_laLoadInt5.rrd
-rw-r--r-- 1 zenoss zenoss 35432 Sep 13 13:48 ssCpuIdle_ssCpuIdle.rrd

The devices are subclasses of /Server/Linux, created just to make adding a couple of extra monitoring templates easier. All of the components I expect to see are there, network, memory, file systems, processors, etc.

Is there a way to run a debug collection run of a device? That way I could at least potentially see what is failing. Running "zencommand run -d <hostname> -v 10" doesn't give me any obvious smoking guns.
--------------------------------------------------------------

Reply to this message by replying to this email -or- go to the discussion on Zenoss Community
[http://community.zenoss.org/message/68615#68615]

Start a new discussion in zenoss-users by email
[discussions-community-forums-zenoss--***@community.zenoss.org] -or- at Zenoss Community
[http://community.zenoss.org/choose-container!input.jspa?contentType=1&containerType=14&container=2003]
Daniel Rich
2012-09-17 17:00:19 UTC
Permalink
Daniel Rich [http://community.zenoss.org/people/sjthespian] created the discussion

"Re: Some graphs working, some not in 4.2 -- how to debug?"

To view the discussion, visit: http://community.zenoss.org/message/68616#68616

--------------------------------------------------------------
Another interesting datapoint - even the mem and load graphs on my Zenoss server itself aren't updating. however, the mysql and NTP graphs are.
--------------------------------------------------------------

Reply to this message by replying to this email -or- go to the discussion on Zenoss Community
[http://community.zenoss.org/message/68616#68616]

Start a new discussion in zenoss-users by email
[discussions-community-forums-zenoss--***@community.zenoss.org] -or- at Zenoss Community
[http://community.zenoss.org/choose-container!input.jspa?contentType=1&containerType=14&container=2003]
Loading...