Both Tender and Lighthouse will be down for up to 30 minutes tonight from 8pm Pacific while we reboot some critical servers. While we can reboot most of the instances without causing downtime (in fact, we already have), there are some services that don’t handle failover particularly gracefully and so it will be cleaner for us to take the apps down completely rather than try to do it ‘live’.  For example, we could easily fail over our redis cluster to the slave instances (they’re already used for reads) but there’s a risk that a few minutes of writes would be lost.

We apologize for any inconvenience due to the late notice - we’re not entirely happy about it either.

  1. entpblog posted this