DA monitoring tools

assam_siddibapa

Verified User
Joined
Jan 29, 2011
Messages
10
Hello , I am quite new with DA . Had some questions .

I have DA installed on centos with xen . I am looking for some monitoring tools . After a little browing i found that MRTG and cacti are good . Who do you suggest and if i follow the instructions on there documationation will it work ?

Thank you
 
If you want a monitor tool that notify you about services downtime from one or more server, i would suggest nagios.

MRTG as far as i know create graph, not notify about downtime, cacti aswell (but ive never studied it deeply).

Nagios for sure is one of the best monitoring tool opensource ive ever seen.

Regards
 
Hello,

If you decide to use MRTG, I'd recommend using munin instead, it's more modern and efficient, and can send warnings and alerts to EMAIL/SMS.


Hello , I am quite new with DA . Had some questions .

I have DA installed on centos with xen . I am looking for some monitoring tools . After a little browing i found that MRTG and cacti are good . Who do you suggest and if i follow the instructions on there documationation will it work ?

Thank you
 
If you want a monitor tool that notify you about services downtime from one or more server, i would suggest nagios.

MRTG as far as i know create graph, not notify about downtime, cacti aswell (but ive never studied it deeply).

Nagios for sure is one of the best monitoring tool opensource ive ever seen.

Regards

I have nagios installed on a dedicated VPS and I find I am getting more false alarms than real server downtimes ..

Any suggestions?

It seems every time Nagios emails me to a server issue, if I go and try the server, 95% of the time its a false alarm ...

I used to pay $60 a month for an online monitoring service, and was highly recommended nagios by several friends, but it has been more headache than worth for me

Any help/suggestions would be greatly appreciated.

Tim
 
Actually ive never had false-positive, my suggestion is that maybe for some reason server desnt reply to all request (i suppose you use nrpe to check remote servers) and thats why he make false positive.

Check your connectivity, firewall rules, nrpe config (with xinetd i suppose).

Regards
 
False reports generally occur because of latency or delivery problems on the Internet. You can resolve the issue by monitoring from multiple locations, and only sending the email if all locations report inability to reach the server. You should also set your monitor system to not report any downtime if nothing can be reached; that could be a problem with it's own connectivity.

You should set up your system to monitor once a minute but only consider the server (or service) as down if it fails at least two checks a minute apart from all your checkpoints.

One of the site monitor companies uses us to host a system which monitors their monitor servers. They don't get reported as down if all of them appear down, because that most likely means our connectivity is down. It's the Occam's Razor (wikipedia.org) principle.

Jeff
 
Back
Top