Questions tagged [outage]

Power or network failure that causes downtime across a large number of platforms. Blackout.

2 votes
1 answer
104 views

How to check for a faulty device causing short outages in my LAN?

We resolved an issue caused by a broken access point by simply disconnecting it from the LAN. The symptoms we experienced were that now and then, our whole network would get "stuck" for a ...
Theo's user avatar
  • 173
0 votes
1 answer
38 views

What tools do you use to measure your MTTR as the Ops Team?

And do you measure it at all? My problem is that when an outage is alerted, it feels like a waste of time to create a JIRA ticket first, so I start solving it right away. Besides, some outages are solved by ...
Charming Borg's user avatar
2 votes
1 answer
55 views

Tracking Down Source of Network Instability

If the issue happens, it happens at 10am, for about 1 minute between onset and return to stability. It does not happen every day. During the minute, pings go through the roof and packets start to drop,...
Kaz's user avatar
  • 23
0 votes
1 answer
399 views

Will a GKE cluster have downtime when cluster features are enabled/disabled?

I wanted to enable cloud operations for a zonal GKE cluster with just one node. I read lots of guides from Google, but didn't find any mention of an outage if we disable/enable cluster features. ...
Mani Rai's user avatar
  • 101
-1 votes
1 answer
646 views

Why did the Dovecot POP3/IMAP server go down?

The VPS went down at some point. I am looking at the log but do not understand why it went down or how it could be fixed to avoid outages in the future. The logs I have available for the time of the event:...
czlovjek's user avatar
0 votes
1 answer
45 views

Is there any way to understand what type of server downtime I am facing?

I have a remote server that is not physically attended 24x7. Sometimes I have faced downtime due to either network-related issues or power outages. But because I am sitting far from my server I am ...
user3271961's user avatar
3 votes
2 answers
776 views

Unusual DHCP activity after power outage

Over the holiday weekend, one of our clients experienced a power outage. When everything came back online, most devices seemed to be fine, but a few (one of our ESXi hosts and a number of VDIs) could ...
Joel D.'s user avatar
  • 31
0 votes
0 answers
32 views

Where can I get outage updates/alerts for transoceanic cables (and other tier-1 fiber backbones)?

Is there some site that aggregates status updates from the companies that manage the transoceanic fiber cables so that I can get a world view of global internet outages that break connectivity between ...
Michael Altfield's user avatar
0 votes
1 answer
461 views

Ubuntu secondary IP drops out after a few hours

I'm running Ubuntu 18.10 on a VPS. Ever since upgrading (I'm pretty sure) from 16.04, my secondary IP address just stops receiving traffic after it's been up for a few hours. I'll have two pings ...
jorisw's user avatar
  • 103
0 votes
1 answer
153 views

Intermittent drops on switches

I have a lab environment with 7 48-port switches (Ubiquiti ES-48-500W). All of them are connected to 3 16-port aggregation switches (Ubiquiti ES-16-XG) via fiber. All of the switches are brand new, ...
E C's user avatar
  • 99
0 votes
2 answers
795 views

Kafka cluster: Withstanding a network outage

We have a Kafka cluster. And a network. Yay. The network will be unavailable across all racks in our data center for 5-10 minutes (!) because maintenance requires it. I'm concerned that is too long an ...
Eric Horne's user avatar
1 vote
3 answers
276 views

Electricity outage killed 4 out of 6 UPS

We recently had an electricity outage of 1-2 seconds at one of our buildings. In this building we have 6 UPS and 4 of them died while/after the electricity outage and all the servers were powered off. ...
Efekan's user avatar
  • 171
0 votes
0 answers
64 views

External monitoring shows outage in multiple regions & service types. Azure shows no outage

I'm using a service called Monitis to monitor the uptime of some of my web-based resources. Basically, it pings my resources from three geographic locations (West US, East US, and Mid US) and raises ...
JLRishe's user avatar
  • 111
1 vote
1 answer
1k views

Windows Server 2012 after power outage

Had a power outage, server restarted normally. Server is a database server for a hotel, which is connected to an ISP with optic fibre. On the restart, network went public, instead of the default ...
user3726760's user avatar
-2 votes
1 answer
126 views

How to investigate a server outage?

I know, the question is kinda generic, but I really can't be more specific because I simply have no idea what's going on: It has now happened twice (once on our live server and once on our test ...
David Behler's user avatar
3 votes
1 answer
119 views

How to enter past outage timing in OpenNMS

For scheduled outages, we need to enter timings so that these periods will not be counted as downtime. So far so good. However, if someone doesn't define a blackout window and we need to adjust ...
akas's user avatar
  • 31
1 vote
2 answers
4k views

Temporary website outages, fastcgi PHP communication aborted

I've noticed that my web server has occasional 1-5 minute outages every day. I've checked the Apache error log and found the following: [Sun May 10 14:13:19.299784 2015] [fastcgi:error] [pid 2599:...
gijs007's user avatar
  • 117
0 votes
1 answer
649 views

Guest VMs Unknown after power outage on HP VSA

A remote office had a power outage tonight. When the power came up, both the Physical DC and the Physical ESXi Server came up. Connecting to vSphere, I can see the HP-VSA has started, but the other ...
BillN's user avatar
  • 1,503
0 votes
2 answers
4k views

Can Redis RDB corrupt?

The redis.io site states that RDB is less durable than AOF. Does this mean there is some possibility of database corruption, and complete data loss, if power is lost during a save operation?
Paiboon Panusbordee's user avatar
0 votes
1 answer
3k views

How to forward to Amazon S3 when in maintenance mode with nginx using 503?

During website maintenance, it is sometimes necessary to shut down our website. Our current method is to touch a file which will trigger the web server (nginx) to redirect traffic to a maintenance ...
Tom's user avatar
  • 4,307
1 vote
2 answers
177 views

Recurring server outages CentOS on Linode

I am currently managing some servers for a client running close to 40 websites, nearly half of which are WordPress websites. We are currently using 4 VPS from Linode with the sites distributed across ...
unknownperson's user avatar
2 votes
5 answers
438 views

VoIP power outage strategy

In the past 2 years, we have had 4 instances where we have been without power for a period of 4-6 hours - due to construction crews cutting power lines, car accidents involving downed power lines, etc....
AWippler's user avatar
  • 1,075
-2 votes
2 answers
167 views

Exchange failover during power outage

Ok, I have yet to find a definitive answer for this. I have an Exchange server, and am trying to find a good failover solution. If the power goes out, what is the best way for me to hold my email until ...
gobabushka's user avatar
0 votes
1 answer
728 views

What is a "Latency Outage"? [closed]

I'm one of many customers of Windstream, the only internet provider in my area, and just after having hit the 1-year mark of being a customer, my 10M internet connection became about a 512k connection....
Nathan Wheeler's user avatar
2 votes
1 answer
1k views

BES 5.x issues when connecting to Exchange 2010 SP2 RU4

Ever since we updated from SP1 RU4 to SP2 RU4 we have noticed that our BES devices will simply stop receiving email. This has occurred at least 5 times in the past few weeks. Today, while speaking ...
makerofthings7's user avatar
7 votes
1 answer
12k views

I had a power outage. Now MySQL's lock file won't go away. What do you suggest?

I do freelance IT consulting for various clients, both in Toronto, Canada, and worldwide. A client recently experienced a power failure. Now they've been having various problems with a Slackware 12....
jasonspiro's user avatar
2 votes
2 answers
638 views

Network outage - Mapping, Checking network, BGP, Traceroute, RIPE

First, I'm not able to name my question properly, so this will be adjusted. I recently experienced an international network gap, meaning that some part of the worldwide network was unavailable. I'm able ...
Marek Sebera's user avatar
1 vote
1 answer
81 views

MySQL password no longer working

Does anyone know if a power outage could somehow have corrupted the MySQL logins? We had an outage the other day, and my MySQL logins no longer work. The mysql process seems to be running fine. Any idea ...
steve's user avatar
  • 525
1 vote
2 answers
100 views

SMTP Servers: How well supported is automatic reattempt when receiving connection refused?

When a mail server gives Connection Refused, emails will come through in future attempts assuming the mail server comes back online. How well supported is this? Do all mail servers support this? Is it ...
700 Software's user avatar
  • 2,253
2 votes
1 answer
2k views

Actions to take during/after a power outage

We're in the process of replacing the shelving in our server room, and I found a piece of paper that had been covered over which lists various actions to take during/after a power outage: ADSL modem ...
Scott's user avatar
  • 1,173
1 vote
1 answer
3k views

How to determine the cause of service outages on VMware virtual hosts

I am trying to determine the cause of outages I have experienced on an irregular basis with several of my virtual servers, which run on VMware ESXi 4. I have 12 virtual servers spread across 2 ESXi host ...
m3z's user avatar
  • 161
1 vote
1 answer
834 views

Migrating away from Rackspace (slicehost) to Amazon

After yet another Rackspace outage (ongoing as I type this, no ETA) I'm forced to look at options. Rackspace has simply not delivered the uptime that my customers expect and I'll admit that I'm ...
Disco's user avatar
  • 375
0 votes
1 answer
584 views

Cisco 851 (IOS) router: FastEthernet 4 (WAN) got the shutdown flag

At a customer location there was a Cisco 851 router (which uses IOS). The PCs on location were all of a sudden unable to connect. We came on site and found that FastEthernet 4 (the WAN port) was "...
700 Software's user avatar
  • 2,253
7 votes
6 answers
472 views

What's the major outage you've been part of?

Outages are some of the things that we try to avoid but they're inevitable: they happen (very rarely, we hope) and we have to know how to deal with them (and learn from them). So, what's the major ...
Marco Ramos's user avatar
  • 3,130
14 votes
1 answer
1k views

What HTTP status should I return during temporary site outage/downtime?

I'm going to be taking down my website for an upgrade to the code. I'd like to have a temporary downtime page display during the upgrade. For the sake of preventing issues with bots attempting to ...
Matt Huggins's user avatar
14 votes
4 answers
6k views

Documenting an outage for a post-mortem review

We had a rather serious outage this past week affecting several services which put us out of our SLA with customers. Now that everything has been resolved, I am conducting a post-mortem review. From ...
5 votes
3 answers
897 views

How do I update DNS without causing an outage?

If I change a computer's IP address, it can take a long time for ISPs to stop caching the results. Is there a way to mitigate this if I plan ahead?
jldugger's user avatar
  • 14.4k
3 votes
5 answers
2k views

What's the best way to notify network users of outages or maintenance?

There are times when one of our applications is down for maintenance and we'd like to let our users know about it before they start flooding our help desk with calls. What's the best way to notify ...
Dubs's user avatar
  • 188