Questions tagged [monitoring]

Applications or appliances that observe machines, systems and networks to find problems and notify administrators.

591 votes
4 answers
333k views

What exactly do the colors in htop status bars mean?

By default, htop shows colored status bars for processors, memory, and swap. From left to right, the bars are colored green, blue, yellow, and red depending on some thresholds. What does it mean when ...
tobym's user avatar
  • 6,361
187 votes
73 answers
247k views

What tool do you use to monitor your servers?

For a more comprehensive list of monitoring tools and their features, check out this Wikipedia page. As the question states, what are the most commonly used tools for this task and what are ...
60 votes
13 answers
89k views

How to monitor a Windows log file in real time? [closed]

On Windows, how can I easily monitor a log file and see updates to the file in real time? Basically, the same functionality as tail -f log_file on Unix systems. Platform: Windows XP/2003/2008 server [...
dance2die's user avatar
  • 2,031
53 votes
3 answers
178k views

ipmitool - can't find /dev/ipmi0 or /dev/ipmidev/0

Based on a previous question, I installed ipmitool (yum install ipmitool). Even after a reboot, though, I get the following error when trying to run ipmitool power status: Could not open device at /...
warren's user avatar
  • 18.6k
45 votes
7 answers
64k views

View content of memcached

Are there any tools to explore what is currently cached inside a memcached pool? Not so much graphs, but the actual key/values currently stored.
Tom's user avatar
  • 766
44 votes
11 answers
98k views

How to monitor and log the memory/cpu usage of processes over time? [closed]

I am looking for a way to diagnose issues, such as swap death, where a ballooning memory process (such as Apache) fills up swap and kills the whole machine. I'm already using Cacti and I can set up ...
Artem Russakovskii's user avatar
43 votes
5 answers
56k views

monit: check process without pidfile

I'm looking for a way to kill all processes with a given name that have been running for more than X amount of time. I spawn many instances of this particular executable, and sometimes it goes into a ...
Parand's user avatar
  • 738
42 votes
19 answers
9k views

Can anyone recommend a website monitoring service? [closed]

What are people using for website monitoring services? I am referring to a service that I can configure specific hits to my site to monitor if the site is up, and how fast it is responding to the ...
qxotk's user avatar
  • 1,434
42 votes
8 answers
256k views

Get a history of CPU/memory usage

Is there some way to record Task Manager's info about CPU and memory usage to examine later? Or an equivalent tool?
samoz's user avatar
  • 2,043
40 votes
34 answers
8k views

Best smartphone for sysadmins?

I need a cell phone that will help me keep an eye on my servers and services when I am away from my computer/desk/workplace. Which smart phone would you recommend for sysadmins? An SSH client is a ...
39 votes
8 answers
32k views

Colorize Monitoring of Logs

I sometimes monitor apache and php error logs using tail under FreeBSD. Is there any way to get colorized output, either using tail or some other command line app? Alternatively, what is your ...
Ian's user avatar
  • 1,498
38 votes
10 answers
299k views

Testing a website from Linux command line [closed]

I am looking for a tool to test a website from a Linux command line. From the output, I need to know the http response (status codes) but also benchmark the time it takes to download the different ...
Embreau's user avatar
  • 1,287
38 votes
2 answers
17k views

Get notification from supervisord when a job exits

Is there any way supervisord can automatically restart a failed/exited/terminated job and send me a notification email with a dump of the last x lines of log file?
Sebastian Hoitz's user avatar
37 votes
6 answers
115k views

How to see incoming IPs in Linux?

I am wondering what is the command/utility to have a real-time view of incoming IPs to my server, ideally along with the port and connected.
alfish's user avatar
  • 3,137
34 votes
6 answers
25k views

Linux: logwatch(8) is too noisy. How can I control the noise level?

Our Linux systems run the logwatch(8) utility by default. On a RedHat/CentOS/SL system, Logwatch is called by the /etc/cron.daily/ cronjob, which then sends a daily email with the results. These emails ...
Stefan Lasiewski's user avatar
33 votes
4 answers
28k views

How do I configure monit to start a process as a specific user?

Monit runs as root, but I don't want to start my processes (like mysql, mongrel, apache) as root.
Guilherme's user avatar
  • 751
33 votes
24 answers
11k views

What are your most useful/valued Nagios plugins? [closed]

We're expanding our Nagios 3 setup, and are frequently coming across new NRPE or general Nagios plugins to test our existing infrastructure. This is in dribs and drabs though - it would be useful to ...
Mike Pountney's user avatar
30 votes
5 answers
43k views

Do SSDs support SMART?

S.M.A.R.T. (for Self-Monitoring Analysis and Reporting Technology) is a wonderful technology to detect hard drive failure before it really happens. But is S.M.A.R.T. relevant for SSDs?
paulgreg's user avatar
  • 4,164
29 votes
2 answers
10k views

Is there a Windows equivalent of Unix 'CPU steal time'?

In order to assess performance monitoring accuracy on virtualization platforms, the CPU steal time has become an increasingly relevant metric - see EC2 monitoring: the case of stolen CPU for an ...
Steffen Opel's user avatar
  • 5,638
28 votes
2 answers
21k views

What is the difference between OpenTSDB and Graphite?

As far as I can tell, here are the main differences: OpenTSDB does not deteriorate data over time, unlike Graphite where the size of the database is pre-determined. OpenTSDB can store metrics per ...
vedang's user avatar
  • 382
27 votes
6 answers
103k views

A better "top" command for Mac OS X? [closed]

The top command on OS X is pretty crappy. The one included with most Linux distros allows you to change the sort-by column using < and >, there is a coloured mode (by pressing the z key), and a ...
dbr's user avatar
  • 1,568
27 votes
5 answers
122k views

How can I manually run a nagios check from the command line?

When defining and testing new services in nagios I have been restarting nagios, then clicking the service, and rescheduling a check for as soon as possible, then waiting until the check happens. Is ...
cwd's user avatar
  • 2,783
27 votes
6 answers
51k views

How to find out the number of time series stored in Prometheus LevelDB

I'm responsible for maintaining the Prometheus servers in our company. The metrics, however, are provided by the teams. Is there a way to find out the number of time series stored in the Prometheus ...
Tobias Wiesenthal's user avatar
27 votes
8 answers
31k views

How can I see how much bandwidth each Apache Virtual Host is using?

I have Apache set up to serve several Virtual Hosts, and I would like to see how much bandwidth each site uses. I can see how much the entire server uses, but I would like more detailed reports. ...
pkaeding's user avatar
  • 810
25 votes
7 answers
59k views

Enable Ping in Windows Server Firewall?

I've just installed Windows Server 2008 on a server and I'm able to connect through Remote Desktop but can't ping. Do I need to open a special port on the firewall to be able to ping a server?
holiveira's user avatar
  • 889
25 votes
12 answers
55k views

Techniques to Monitor cron tasks?

Are there good techniques for monitoring cron tasks over a cluster? We're starting to use cron to launch tasks at daily intervals. A few ideas for checking out information: Add special application ...
25 votes
3 answers
4k views

Shinken - Anyone using it? [closed]

I've recently discovered Shinken, which is a new implementation of Nagios written in Python. Shinken "divides" Nagios into 5 different types of daemons, each one performing separate tasks. I haven't tried ...
Marco Ramos's user avatar
  • 3,130
24 votes
7 answers
15k views

Continuously monitor logs with tail that are occasionally rotated

We're using tail to continuously monitor several logs, but when a log is rotated the tail for that file will cease. As far as I understand, the problem is that when the log is rotated, there is a new ...
xkcd150's user avatar
  • 928
24 votes
7 answers
30k views

Process Monitor fails to start on a 64 bit computer

When I try to start Process Monitor from SysInternals on some 64 bit Windows 7 machines, the process fails to start. There is no error message. I double click and nothing happens. Other 64 bit Windows ...
Ryan Michela's user avatar
  • 1,147
24 votes
7 answers
34k views

Monitor an incoming SSH session in real time

Is there any Linux software to monitor an incoming SSH session? At a previous job I was told that if you ever needed support from Red Hat, for example, you could have them SSH into your machine and you ...
Richard's user avatar
  • 627
23 votes
9 answers
77k views

Monitor network traffic volume over interface

Is there a way to monitor the traffic (e.g., get a live view of the utilization) over a particular network interface, say eth0? The catch here is that the set of tools on the box is fixed, and is ...
BeeOnRope's user avatar
  • 592
21 votes
9 answers
3k views

Best practices for backup checking?

It is a common situation: an administrator sets up a system for automatic backups and then forgets about it. Only after a system fails does the administrator notice that the backup system had already broken or backups are ...
Kazimieras Aliulis's user avatar
21 votes
1 answer
27k views

htop - show I/O wait percentage

Is there a way to show I/O wait percentage in htop? I can't find a way to enable this. On the following link there is an example of the functionality I need (the "Avg:" row): http://www.concept47....
CyberMuz's user avatar
  • 345
21 votes
5 answers
3k views

What am I looking for in a Monitoring Solution?

This is a Canonical Question about Monitoring Software. Also Related: What tool do you use to monitor your servers? I need to monitor my servers; what do I need to consider when deciding on a ...
Kyle Brandt's user avatar
21 votes
2 answers
20k views

Sending a test/example alert from monit?

I've configured monit tests and alerts — but I want to make sure that everything in my alert stack (outgoing email servers, sms email gateways…) is functioning properly. Is there a handy way to fire ...
John Bachir's user avatar
  • 2,364
20 votes
4 answers
45k views

How to generate netflow data in linux

We have a number of Linux servers for which I would like to capture netflow data to be processed by a netflow analyzer. I've been spoiled by the ease with which Mikrotik routers allow netflow data ...
Richard Keller's user avatar
19 votes
6 answers
21k views

Is Zabbix the right tool for me?

I just want to monitor a small handful of servers (less than 10). From reading various places it sounds like the top leading contenders (for open source at least) are: nagios, munin, zabbix. From ...
hortitude's user avatar
  • 565
19 votes
1 answer
16k views

Monit versus Nagios

Does nagios offer any additional functionality when compared to monit? This answer states that monit is a clumsy tool for situational awareness, which is the reason I'm asking the question.
Kaan's user avatar
  • 358
19 votes
2 answers
109k views

How to find the .pid file for a given process

I'm setting up monit and want to monitor a given python application. Monit does this by looking at the .pid files for processes, but I don't know where this would be. I also tried creating my own ...
Yarin's user avatar
  • 1,366
18 votes
10 answers
5k views

Network Traffic Monitoring

What is the best tool to monitor/analyze network traffic on an entire network (several subnets)? I'm looking for something that will help me troubleshoot bandwidth problems when, for instance, users ...
Brent's user avatar
  • 23k
18 votes
4 answers
3k views

Monitoring production server [closed]

We have 3 dedicated servers, split into several VPSes using OpenVZ. We're using munin to monitor the VPSes with the production sites, and monit on one of the VPSes to make sure it restarts the service ...
Adam Benayoun's user avatar
17 votes
8 answers
11k views

Is there any hosted monitoring solution? [closed]

I know Zabbix and Nagios, but I don't know of any company that hosts those services for others to use. I would like to find a solution that monitors at least some of my public URLs (if they are ...
Jader Dias's user avatar
  • 4,735
17 votes
7 answers
11k views

How can I force a MySQL table to become corrupted?

I have written a simple Nagios plugin that calls mysqlcheck (which checks for corrupted tables) and will give a warning if any are corrupt. However, none of my tables are corrupt now. So I'm not 100% ...
Amandasaurus's user avatar
  • 31.9k
16 votes
3 answers
13k views

Finding short-lived TCP connections owner process

Running tcpdump on local connections to an apache server, I found TCP connections being established and closed immediately every 2 seconds. How do I find which process is responsible for these? ...
pmezard's user avatar
  • 263
16 votes
8 answers
12k views

Nagios alerts by telephone?

I have a couple of nagios servers that currently alert me via SMS and email, but I'd like to add outgoing calls to that. The ideal solution would have the nagios machines call my cellphone and play an ...
Matt Simmons's user avatar
  • 20.5k
16 votes
6 answers
30k views

How do you monitor the health of a mirrored disk in Windows?

I have a Mirrored Dynamic disk on my Windows 2003 Server. How do you monitor the health of the volume? Is there a way to have the server send an email when there is an issue with the volume? Is there ...
NitroxDM's user avatar
  • 635
16 votes
1 answer
9k views

SNMP Deprecated in Windows Server 2012

According to Technet's Features Removed or Deprecated in Windows Server 2012, SNMP is being deprecated: SNMP is deprecated. Instead, use the Common Information Model (CIM), which is supported by ...
Kyle Brandt's user avatar
16 votes
2 answers
10k views

What's the difference between monitoring, tracing and profiling?

I have seen these three words showing up a lot, but don't understand the exact differences between them. For example, collecting CPU utilisation is often called profiling and can also fall into ...
Richard's user avatar
  • 269
15 votes
7 answers
92k views

What Warning and Critical values to use for check_load?

Right now I am using these values: # y = c * p / 100 # y: nagios value # c: number of cores # p: wanted load percent # 4 cores # time 5 minutes 10 minutes 15 minutes # warning: 90% ...
Sandra's user avatar
  • 10.4k
15 votes
2 answers
5k views

What is the strategy for detecting time drift in a Linux-based data centre?

What is the strategy for detecting time drift in an all-Linux data centre? This is a more difficult problem than it seems at first. Time drift can cause serious problems for certain applications ...
Stuart Woodward's user avatar
