OpsDB runs on two servers, named prodxxxxx-opsdb01.geant.net and prodxxxxx-opsdb02.geant.net, where xxxxx = prod, uat, or test.
First Steps
If for any reason the system becomes unavailable, the first action is to make sure we have switched from the ‘Primary’ (01) instance of OpsDB to the ‘Secondary’ (02) instance. This allows general users to continue working on OpsDB while we investigate why it went down.
...
If for any reason the primary instance of OpsDB becomes unavailable, we need to change the OPDSB.dante.net URL to resolve to the secondary instance of our OpsDB server (which we call ‘Secondary’ or ‘02’): Prod02.geant.net. Software development would not normally be involved in this; the exception is that if we noticed ‘01’ had gone down, we might request the switch via DevOps.
The action taken by DevOps would be as follows:
...
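Once a switch has been requested, it can be useful to confirm which instance the OpsDB URL currently resolves to. The following is a minimal Python sketch, not the team's actual tooling. It assumes the switch is made in DNS and that the primary and secondary servers have distinct IP addresses. The function names resolve_ips and active_instance are hypothetical, introduced only for illustration.

```python
import socket
from typing import Callable, Set


def resolve_ips(hostname: str) -> Set[str]:
    """Return the set of IP addresses that hostname currently resolves to.

    Returns an empty set if the name cannot be resolved.
    """
    try:
        infos = socket.getaddrinfo(hostname, None)
    except socket.gaierror:
        return set()
    # Each entry is (family, type, proto, canonname, sockaddr);
    # sockaddr[0] is the IP address as a string.
    return {info[4][0] for info in infos}


def active_instance(
    service_name: str,
    primary_host: str,
    secondary_host: str,
    resolver: Callable[[str], Set[str]] = resolve_ips,
) -> str:
    """Report whether service_name points at the primary or secondary host.

    Assumption: the two hosts have distinct IP addresses, so comparing
    resolved addresses is enough to tell them apart.
    """
    service_ips = resolver(service_name)
    if service_ips & resolver(secondary_host):
        return "secondary"
    if service_ips & resolver(primary_host):
        return "primary"
    return "unknown"
```

For example, calling active_instance with the OpsDB URL and the 01 and 02 hostnames for the relevant environment should return "secondary" once the switch has taken effect. A result of "unknown" means the URL resolved to neither server (or did not resolve at all) and needs a closer look. Cached DNS entries can delay the change being visible from a given machine.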