Maintaining Mission Critical Systems in a 24/7 Environment. Peter M. Curtis

Чтение книги онлайн.

Читать онлайн книгу Maintaining Mission Critical Systems in a 24/7 Environment - Peter M. Curtis страница 19

Автор:
Жанр:
Серия:
Издательство:
Maintaining Mission Critical Systems in a 24/7 Environment - Peter M. Curtis

Скачать книгу

strategies with implementation steps means no time is wasted in a recovery scenario. The focus is to implement the plan quickly, and successfully in order to accomplish this people must be properly trained. Is the person you hired three months ago up to this task? The right strategies implemented will effectively mitigate damages, minimize disruptions, reduce the cost of downtime, and remove the threat to life safety.

      To assure reliable operation a Critical Environment Workflow and Change Management Process must be established and followed. Commensurate Roles and Responsibilities of the Engineering, Technology and Security groups will be developed, implemented, and adhered to in order to manage both planned and unplanned events and associated risks.

Data Centers Server Rooms
Operations Center Business Continuity and Technology Recovery Rooms
Electrical Switchgear Rooms Tape Silo and StorageTek Rooms
Network Equipment Rooms (NER) Local Area Network (LAN) Rooms
Intermediate Distribution Frames (IDF) Business Operations Control Rooms
Main Distribution Frames (MDF) Uninterruptible Power Supply (UPS) Rooms
Main Equipment Rooms (MER) Command Centers
Telecom Rooms (TR) Chiller Rooms and Thermal Energy Storage Spaces
Switching and Hub Rooms Building Management, Monitoring, and Automation Centers
Voice Telephone and Data Closets Mechanical Equipment Rooms
Standby Emergency Power (SEP) Generator and Switchgear Rooms

      Critical infrastructure systems are prevalent throughout a facility. Depending on the facility's size, there could be many redundant systems supporting the same critical environment. Knowing which systems that could impact the clients’ critical function/operation is paramount. Some of these systems are listed in Table 1.3.

Compressed Air Systems Telephone and Fiber Optic Communications Systems
Utility Power Feeder Systems Standby Emergency Power (SEP) Systems
Diesel Engine and Boiler Fuel Systems Glycol Systems
Fire/Life Safety Systems Environmental Control Systems (chillers, CRACs, etc.)
Natural Gas Supply Systems Water Service Systems
Electrical Distribution and Grounding Systems Building Management Systems (BMS)
Condenser Water Systems Boilers
Uninterruptible Power Supply (UPS) Systems

      1.4.1 Change Management

      Change Management is a process for managing and communicating change across relevant functions and business units to ensure and deliver integration of procedures and processes. Note that during emergency situations, established emergency response and escalation procedures shall be followed.

      When work is contemplated for accomplishment within the critical environment, ranging from certain simple or routine cleaning and inspection tasks to very complex and detailed preventive maintenance, corrective maintenance or construction efforts, it is essential that an orderly and thorough approach to work planning and execution be undertaken. In every instance where work is planned in the critical environment, all departments must ensure that risk to company operations is thoroughly assessed and that appropriate risk mitigation is in place while the work is performed.

      The level of detail required in a MOP (Method of Procedure) shall be correlated to the complexity of the work and magnitude of the potential risk. Relatively complex or high‐risk work shall be meticulously detailed in the MOP. The detail required for less complex work would not necessarily be as extensive. The bottom line: a properly developed, reviewed, and approved MOP will result in reduced risk to business operations. Required Change Request Information includes:

       Who is doing the work?

       What systems will be affected?

       Which areas of the building will be affected?

       Is there redundancy for the system being disrupted?

       Detailed procedures for the proposed task.

       Is assistance needed from other Lines of Business?

       What hardware will be moved, added, or changed?

       How long is the task duration?If an outage is required:

       How long will power be out?

       Are there any critical points during the process (High Risk) that can be identified?

       Will those systems be protected by UPS, generators, or other redundancy?

       What kind of backup systems are available if a problem arises?

       Will utility power be taken down?

       Are other feeds to the building affected?

       If redundancy is to be reduced, what redundancy will be lost, and for how long?

      Escalation Procedures

      The

Скачать книгу