Maintaining Mission Critical Systems in a 24/7 Environment. Peter M. Curtis
1.4 Critical Environment Workflow and Change Management
To assure reliable operation, a Critical Environment Workflow and Change Management Process must be established and followed. Commensurate roles and responsibilities for the Engineering, Technology, and Security groups must be developed, implemented, and adhered to in order to manage both planned and unplanned events and their associated risks.
The Critical Environment (CE) is defined as the “physical space and the systems within a facility that are uniquely configured, sized and dedicated to supporting specific critical business operations as defined by the user”. A facility may contain many such rooms and areas. Some are located within the building's structure, while others are located outside it. Regardless of where a CE is located, it has an immediate impact on the client’s ability to maintain business operations and continuity. Examples of CE areas are listed in Table 1.2.
Table 1.2 Critical Areas
Data Centers | Server Rooms |
---|---|
Operations Center | Business Continuity and Technology Recovery Rooms |
Electrical Switchgear Rooms | Tape Silo and StorageTek Rooms |
Network Equipment Rooms (NER) | Local Area Network (LAN) Rooms |
Intermediate Distribution Frames (IDF) | Business Operations Control Rooms |
Main Distribution Frames (MDF) | Uninterruptible Power Supply (UPS) Rooms |
Main Equipment Rooms (MER) | Command Centers |
Telecom Rooms (TR) | Chiller Rooms and Thermal Energy Storage Spaces |
Switching and Hub Rooms | Building Management, Monitoring, and Automation Centers |
Voice Telephone and Data Closets | Mechanical Equipment Rooms |
Standby Emergency Power (SEP) Generator and Switchgear Rooms | |
Critical infrastructure systems are prevalent throughout a facility. Depending on the facility's size, there could be many redundant systems supporting the same critical environment. Knowing which systems could impact the client’s critical functions and operations is paramount. Some of these systems are listed in Table 1.3.
Table 1.3 Critical Systems
Compressed Air Systems | Telephone and Fiber Optic Communications Systems |
---|---|
Utility Power Feeder Systems | Standby Emergency Power (SEP) Systems |
Diesel Engine and Boiler Fuel Systems | Glycol Systems |
Fire/Life Safety Systems | Environmental Control Systems (chillers, CRACs, etc.) |
Natural Gas Supply Systems | Water Service Systems |
Electrical Distribution and Grounding Systems | Building Management Systems (BMS) |
Condenser Water Systems | Boilers |
Uninterruptible Power Supply (UPS) Systems | |
1.4.1 Change Management
Change Management is a process for managing and communicating change across relevant functions and business units so that procedures and processes remain integrated. Note that during emergency situations, established emergency response and escalation procedures shall be followed.
Work in the critical environment ranges from simple or routine cleaning and inspection tasks to very complex and detailed preventive maintenance, corrective maintenance, or construction efforts. Whatever the scope, it is essential to take an orderly and thorough approach to work planning and execution. In every instance where work is planned in the critical environment, all departments must ensure that risk to company operations is thoroughly assessed and that appropriate risk mitigation is in place while the work is performed.
The level of detail required in a MOP (Method of Procedure) shall be correlated to the complexity of the work and the magnitude of the potential risk. Relatively complex or high-risk work shall be meticulously detailed in the MOP. The detail required for less complex work would not necessarily be as extensive. The bottom line: a properly developed, reviewed, and approved MOP will result in reduced risk to business operations.
Required Change Request Information includes:
Who is doing the work?
What systems will be affected?
Which areas of the building will be affected?
Is there redundancy for the system being disrupted?
Detailed procedures for the proposed task.
Is assistance needed from other Lines of Business?
What hardware will be moved, added, or changed?
How long is the task duration?
If an outage is required:
How long will power be out?
Can any critical (high-risk) points in the process be identified?
Will those systems be protected by UPS, generators, or other redundancy?
What kind of backup systems are available if a problem arises?
Will utility power be taken down?
Are other feeds to the building affected?
If redundancy is to be reduced, what redundancy will be lost, and for how long?
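The change request questions above can be captured as a simple completeness check. The following is a minimal sketch, not a prescribed format: the class names, field names, and the `unanswered` helper are hypothetical and chosen only to mirror the questions in the list. It assumes each answer is recorded as free text and that the outage questions apply only when an outage is required.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class OutageDetails:
    """Answers needed only when the work requires an outage (hypothetical names)."""
    power_out_duration: str = ""
    high_risk_points: str = ""
    protection_during_outage: str = ""  # UPS, generators, or other redundancy
    backup_systems: str = ""
    utility_power_taken_down: str = ""
    other_feeds_affected: str = ""
    redundancy_lost_and_duration: str = ""


@dataclass
class ChangeRequest:
    """One field per required change request question (hypothetical names)."""
    who: str = ""
    systems_affected: str = ""
    areas_affected: str = ""
    redundancy_available: str = ""
    detailed_procedures: str = ""
    assistance_from_other_lobs: str = ""
    hardware_changes: str = ""
    task_duration: str = ""
    outage: Optional[OutageDetails] = None  # None means no outage is required


def _blank(record) -> List[str]:
    """Return the names of text fields that are empty or whitespace only."""
    names = []
    for f in fields(record):
        value = getattr(record, f.name)
        if isinstance(value, str) and not value.strip():
            names.append(f.name)
    return names


def unanswered(request: ChangeRequest) -> List[str]:
    """List every question that still lacks an answer."""
    missing = _blank(request)
    if request.outage is not None:
        missing += ["outage." + name for name in _blank(request.outage)]
    return missing


if __name__ == "__main__":
    # Illustrative values only; they do not describe a real request.
    request = ChangeRequest(
        who="Electrical contractor",
        systems_affected="UPS module A",
        outage=OutageDetails(power_out_duration="2 hours"),
    )
    for item in unanswered(request):
        print("Missing:", item)
```

A reviewer could decline to approve a request until `unanswered` returns an empty list. How much detail each answer needs still depends on the complexity and risk of the work, which a check like this cannot judge.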
Escalation Procedures
The