Incident Management Process (draft)

Identify the affected users and stakeholders

Communicate information about the incident to the affected users and stakeholders

Have the relevant team members investigate the issue

Create an Incident Report

Index

Severity

Data Loss

| Service | Start Date | End Date | Severity | Data Loss | Incident Page |
|---|---|---|---|---|---|
| WordPress | | | | | Production wordpress site outage 2018-02-13 |
| WordPress | | | | | Production wordpress site outage 2018-02-22 |
| WordPress | | | | | Production wordpress site outage 2018-03-25 |
| Dashboard | | | | | Production Dashboard Outage 2018-06-18 |
| Staff IDP | | | | | |
| Sympa | | | | | Production Sympa Service Outage 2018-08-03 |
| Dashboard | | | | | Production Dashboard Outage 2018-07-11 |
| DNS | | | | | DNS Outage 2019-02-27 |
| SharePoint | | | | | SharePoint Outage 2019-02-07 |
| Dashboard | | | | | Production Dashboard Outage 2019-07-16 |
| Dashboard | | | | | Production Dashboard Outage 2019-07-27 |
| SharePoint | | | | | SharePoint Outage 2020-01-08 |
| SharePoint | | | | | RSS Feed in Jobs page Geant.org was down - 17/01/2020 |
| BRIAN | | | | | Brian Outage 2020-01-26 |
| Cacti | | | | | Cacti production incident - 06-03-2020 |
| Cacti | | | | | Cacti Production Instance - July 2020 |
| HAProxy | | | | | Haproxy Outage 2021-03-17 |
| ProxySQL | | | | | ProxySQL Outage 2021-07-12 |
| EMS | | | | | EMS - 2022-03-14 - Service Outage |
| EMS(DNS) | | | | | EMS - 2022-04-20 - Service Degradation |
| Dashboard | | | | | Production Dashboard - 2022-05-15 - Service Outage |
| PostgreSQL(VMWare) | | | | | PostgreSQL - 2022-05-30 - Wide-scale Service Outage |
| BRIAN | | | | | BRIAN - 2023-02-26/27 - Service Outage |
| BRIAN | | | | | BRIAN - 2022-05-30/31 - Service Outage |
| BRIAN | | | | | BRIAN 2023-11-16/17 Data Collection Outage |
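
If the index is ever kept in machine-readable form, each row could be modelled as a small record. The following is only a minimal sketch under stated assumptions: the class name IncidentEntry, its field names, and the missing_fields helper are hypothetical, chosen to mirror the index columns (Service, Start Date, End Date, Severity, Data Loss, Incident Page). The severity scale is not defined on this page, so it is treated as free text here.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class IncidentEntry:
    """One row of the incident index (hypothetical model of the table columns)."""
    service: str
    incident_page: str
    start_date: Optional[date] = None  # None while the cell is still empty
    end_date: Optional[date] = None
    severity: Optional[str] = None     # assumption: free text, scale not defined here
    data_loss: Optional[bool] = None

    def missing_fields(self) -> List[str]:
        """Return the names of the columns that have not been filled in yet."""
        optional = ("start_date", "end_date", "severity", "data_loss")
        return [name for name in optional if getattr(self, name) is None]


# Example row taken from the index; the four detail columns are empty there.
entry = IncidentEntry(service="DNS", incident_page="DNS Outage 2019-02-27")
print(entry.missing_fields())
```

A helper like missing_fields could be used to list index rows that still need their dates, severity, or data-loss status recorded after the incident report is written.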


All Incident Documents