CRITICAL ISSUE- 520 Error – The Invisible Killer (Cannot Reach Origin Server)
-
Hi,
I wrote this out more or less the same on the Slack channel and nobody responded.
I love this plugin and Quic Cloud CDN service speed but there is one singular absolute deal-breaker issue, recurring intermittently, flying under the radar which is the 520 errors happening without being able to know where or when they happen.
When the 520 Error happens, it’s an error page that says the site can’t be reached. The site is down basically, but only for certain regions. It’s usually because my hosting company has blocked the IP of a regional server, in my case it’s usually the server NA-US-SJC-81 but it could be any server/location or more than one. This means that entire regions of the country going thru that CDN server basically cannot reach our site for long periods of time (weeks/months), the site is effectively down, even through it looks up for us from our location, and we lose 100% of sales in the effected region(s) during this downtime, which can be a huge loss.
What’s worse, there is no way to know when this happens to respond quickly – no warning that it’s happening – no notification in our account dashboard – no SMS or Email notification sent, nothing. It simply quietly dies, as does our revenue for that entire area, and we unknowingly bleed losses until by some chance we realize it. In one case, it was because we were running Google Page Speed Insights, which must have used this CDN server, and was coming back with the unreachable error page instead of our homepages. Since our support for our users is via our website, none of our users in that area can where it’s happening even tell us that it’s happening, they just get lost as a customer. It’s a brick wall of down time and lost customers with no warnings and no visibility.
This is an invisible killer.
We can’t see it happening, we get no warning it’s happening, and all the while it’s just absolutely killing our service/sales for whole cities of people.
Seeing how the entire livelihood of our websites is put in the hands of your servers being up and able to reach the website to serve it, I would think that knowing when that primary function is failing is pretty important to all users. I also have to imagine that right now, many users have no idea that it’s happening, but it’s happening to them as well, as we did not know either for our entire holiday rush season, where we incurred [tens of] thousands of dollars in losses with no warnings. As far as our Quic Cloud dashboard goes, our credits are on auto-refill and everything is running fine, when in fact it’s horribly broken for millions of potential users.
We (CDN Customers) desperately need visibility on this issue to continue using the service without incurring huge losses. We’ve already lost thousands and thousands of dollars in sales to regional downtime that we cannot see from our location – and nobody seems to care? Support’s ultimate answer is basically, then don’t use the service or switch hosting companies. Isn’t there a better solution to just provide visibility and/or notifications on this issue, rather than sweep it under the rug and hurt your customers business or lose customers?
Even right now, I have no way of knowing is my website up or not at every location for more than 10 active websites. I’ve signed up for Pingdom and tried other Geo-Located uptime monitors as an attempt to get my own visibility on the issue, but that’s not seeming to catch the issue either while it’s happening for some reason.
There are so many ways you could address this which are not complex, either bringing visibility for users, or all out preventing downtime. Such as:
- Notification in the dashboard when 520 errors are occurring letting us know a regional server is broken.
- Email / SMS Notification of the same.
- Allow users to customize their own Error 520 page so that we could add our own google analytics event, or pixel that would track this error
- Provide an error log in the dashboard somewhere (like most hosting companies have) so we can see all errors that are happening
- In the case of a 520 Error, have the server check with the next closest regional CDN and route through there instead (eliminating downtime!)
- A simple diagnostic tool that can run all of the active regional CDN servers and check if they can reach the origin server OK with a checkmark or X green or red for each server.
- This is what Support does manually for me when I report this on Slack, and then I can quickly resolve this with the hosting company. But Quic Cloud Support says there is no way for us (CDN customers) to check this for ourselves, so the response time on this issue ends up being really bad (weeks? months?).
Please provide a way to get visibility on whether or not the CDN is down regionally for any given website using your service, because otherwise your service is by far the fastest, and best I’ve seen for optimizing WordPress.
I’m more than happy to stop paying 3rd party uptime monitors and pay for it within the Quic Cloud dashboard if that’s what it requires to simply get a report of which of our enabled CDN servers are OK, and what servers are down/broken connections to the origin site.
I even have some enterprise level customers that I’d love to bring on the service, but I simply can’t knowing that we have uptime SLAs and this is going on. More lost business for Quic Cloud, too, so it’s not just affecting my business, but it’s preventing me from bringing clients to yours as well.
Please help!
- The topic ‘CRITICAL ISSUE- 520 Error – The Invisible Killer (Cannot Reach Origin Server)’ is closed to new replies.