Communication and Coordination Failures in the Process Industries

Similar documents
LEARNING FROM TEXAS CITY REFINERY ( BP )

White Paper: CCPS Process Safety Metrics Review Considerations from an ASM Perspective

Functional Versus Schematic Overview Displays: Impact on Operator Situation Awareness in Process Monitoring

ANALYSIS OF HUMAN FACTORS FOR PROCESS SAFETY: APPLICATION OF LOPA-HF TO A FIRED FURNACE. Paul Baybutt Primatech Inc. and

SIL DETERMINATION AND PROBLEMS WITH THE APPLICATION OF LOPA

AVOID CATASTROPHIC SITUATIONS: EXPERT FIRE AND GAS CONSULTANCY OPTIMIZES SAFETY

AVOID CATASTROPHIC SITUATIONS: EXPERT FIRE AND GAS CONSULTANCY OPTIMIZES SAFETY

Human Performance Models for Response to Alarm Notifications in the Process Industries: An Industrial Case Study

Dangerous Assumptions

DynAMo Alarm & Operations Management

Branch Fire Rescue Services

Executive Summary of Qualifications

Effective Alarm Management for Dynamic and Vessel Control Systems

2013 Honeywell Users Group Americas. Jeff England & Dal Vernon Reising Successes & Challenges for Designing Effective Level 1 and Level 2 Displays

BRIDGING THE SAFE AUTOMATION GAP PART 1

Retrospective Hazard Review

PERFORMANCE METRICS FOR SAFE LASER OPERATIONS AT SLAC NATIONAL ACCELERATOR LABORATORY 1 Paper # 203

Petroleum Gas Systems - 1 -

Your Partner in Safety Solutions

FORT MYERS BEACH FIRE DEPARTMENT JOB DESCRIPTION

Rutgers Environmental Health and Safety (REHS)

The Contra Costa EMS System and Fire Station Closures: Impact and Mitigation

LANDSCAPE ARCHITECTURAL ASSISTANT, 7931 LANDSCAPE ARCHITECTURAL ASSOCIATE, 7932 LANDSCAPE ARCHITECT, 7929

New requirements for IEC best practice compliance

HONEYWELL VOICE SOLUTIONS FOR MAINTENANCE AND INSPECTION Sophisticated Inspection Documentation and Voice-Compliant Processing

BP TEXAS CITY REFINERY DISASTER

NECA S System Approach to Electrical Safety for NFPA 70E

PATTERNS OF FIREFIGHTER FIREGROUND INJURIES

WHAT IS FIRE DISASTER? Fire Disaster is an occurrence or an incident of such magnitude which creates a situation in which the normal life is suddenly

Overview of Control System Design

Corporate. Management. Line. Management. Plant. Manager HSSE. Maintenance Manager. (Plant. Operations Manager. Process Safety) Plant.

Audio Verification and Notification Procedures

Fire and Gas Detection and Mitigation Systems

SIOUX FALLS FIRE RESCUE STRATEGIC PLAN

Flint Canyon. September 6, 2012 Wildland Fire Investigation Report. 1 P a g e. Flint Canyon Fire Investigation Report

A GUIDE TO THE REGULATORY REFORM ORDER (FIRE SAFETY) 2005 (RRO)

Fire Prevention and Emergency Preparedness Plan for Black Oak Wind Farm

HOLLY HILL FIRE DEPARTMENT

Terminal Commissioning Case Study- Terminal A Logan International Airport

Chesapeake AIHA /ASSE Educational Seminar. Captain Stephen M. Hardesty Howard County Department Of Fire & Rescue Services Special Operations

The investigation of accidents and incidents in complex railway tunnel systems

FIRE PROTECTION PROGRAM

BCGEU JOB DESCRIPTION MINISTRY OF FORESTS

Dust Hazard Assessments

risk management and assessment for business Lessons Learned from Real World Application of Bow-tie Method 2010 International Symposium

Fire Investigation

The Abnormal Situation Management Consortium: Past and Future of Abnormal Situation Management

Managing the Lifecycle of Independent Protection Layers

Air Compressor Electrical Fire

Incident Summary (Reference # )

DYNAMO ALARM MANAGEMENT

Fired Heater Flooding

Centers for Regional Excellence. Downriver Fire Authority. Organization Committee Objective. Operations Committee Report 2007

COUNTY OF SAN DIEGO BOARD OF SUPERVISORS 1600 PACIFIC HIGHWAY, ROOM 335, SAN DIEGO, CALIFORNIA AGENDA ITEM

DEEP GEOLOGIC REPOSITORY PROJECT JOINT REVIEW PANEL RESPONSE TO INFORMATION REQUEST EIS

Evolving Fire Safety towards holistic Safety A Proposal. Lance Rütimann Senior Manager Industry Affairs, Siemens Switzerland Ltd.

STAFF REPORT SAUSALITO CITY COUNCIL

Y. ORMIERES. Fire risk analysis method for nuclear installations

ADIPEC 2013 Technical Conference Manuscript

Fire and Emergency Services Newfoundland and Labrador. Public Hearings Proposed Lower Churchill Hydroelectric Generation Project March 2011

EM-3.1 SAFETY ALERT HEAVY EQUIPMENT VEHICLE FIRES AND SUPPRESSION SYSTEMS

Washington Metropolitan Area Transit Authority (WMATA)

Root Cause Analysis Report Tesoro Golden Eagle Refinery Refinery Power Outage

4.13 Security and System Safety

MNEC NFPA 72 WHITE PAPER

Colorado Division of Fire Prevention and Control. Colorado Counties Inc. June 5, 2018

HYDROCARBONENGINEERING June June 2015

Crisis Response Plan Greenwood Center 207 West Park Avenue Greenwood, MS

2012 Honeywell Pacific Users Group. Sus tain.ability.

A Comprehensive Approach to Leak Detection

Introduction. 1 The Global Customer Committee Charter

NEW CD WARP CONTROL SYSTEM FOR THE CORRUGATING INDUSTRY

West Yorkshire Fire & Rescue Authority. Fire Protection Policy

Beth Israel Deaconess Medical Center BIDMC Manual

Are we using the wrong design fires for tall building fire strategies, in light of recent full height fires?

SAN JOSE FIRE FIGHTERS

EP ISSUES FOR INDEPENDENT SPENT FUEL STORAGE AND DECOMMISSIONING FACILITIES. Scenario Development For Shutdown And Defueled Nuclear Power Stations

EMERGENCY ACTION PLANS HOW READY ARE YOU?

Tree Removal Procedure

Village Enhancement Scheme Barton under Needwood

Chapter 1 Test. Directions: Write the correct letter on the blank before each question.

Critical Condition Management on a Corporate Scale. Lyondell Is a Major Global Chemical Company

Engine room fire caused by failed fuel pipe

Professional Security Officer Program(PSOP)

ELEVATOR RESCUE PROCEDURE

Tom Miesner Principal Pipeline Knowledge & Development

2013 Compliance Report RCW 52.33

How the United States is reducing its firefighter fatalities

SNS Modulator Fire of Friday, March 21, 2008 Fire Marshal Review

Clinical Alarm Effectiveness: Recognizing & Mitigating Risk to Patient Care

Practical Methods for Process Safety Management

FIRE DEPARTMENT FIRE CHIEF. Deputy Fire Chief Operations Section. Assistant Fire Chief Planning & Budget Section

Texas Fire Chiefs Association DOCUMENT SUBMISSION FORM

Mission. Vision. To set the standard for excellence and innovation in public safety. Department Description

Create Policy Options Draft Plan Plan Approval. Public Consultation Events. Phase 2

Eliminating Condensation and Resulting Corrosion. Sulphur Tail Gas and De-Gas Lines

Glenborough at Easton Land Use Master Plan

Where Technology Shapes Solutions. Alarm management : Wasn t that problem already solved years ago?

Interim Advice Note 76 / 06 ASSESSMENT PART 1 AIMS AND OBJECTIVES OF ENVIRONMENTAL ASSESSMENT. Contents

STAMP-Based Analysis of SBS Tank 731 Overflow Accident

Transcription:

Communication and Coordination s in the Process Industries 52 nd Annual HFES Meeting Jason Laberge Honeywell Advanced Technology Golden Valley, MN Peter Bullemer Human Centered Solutions Independence, MN Stephen Whitlow Honeywell Advanced Technology Golden Valley, MN September 25, 2008

Introduction and Motivation Process industries (Wikipedia, 2008) - involve extraction of raw materials, their transport and their transformation (conversion) into other products by means of physical, mechanical and/or chemical processes using different technologies - Examples: refineries, chemical plants, gas facilities 2 HONEYWELL

Introduction and Motivation Communication and coordination breakdowns are an important source of failures in the process industry (Laberge & Goknur, 2006) - Weak leadership - Poor control room design - Closed communication culture - Deficient work processes - Situation and work environment constraints Nature of these breakdowns and their relative frequency is unknown 3 HONEYWELL

Research Objective Identify common communication and coordination failures and root causes in the process industries Analyze incident reports to determine: - s = what happened, nature of the breakdown in communication and coordination - causes = reasons why the failure occurred Why analyze incident reports: - Incident reports provide a rich description of how failures and root causes contribute to real-life accident - Precedent in other industries to analyze incident reports for human factors issues (e.g., aviation, transportation) 4 HONEYWELL

Research Process - Overview Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s A systematic research approach was developed 5 HONEYWELL

Methods Identify Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s 6 HONEYWELL

Methods Identify We could not analyze all the available incident reports - Our goal was to identify a sample of incident reports that represent diverse process industries from multiple public and private company sources Search criteria: - lead to an abnormal situation (i.e., injury, production interruption, equipment damage, environmental release) - be described in enough detail so that the sequence of events, conditions, and outcomes could be understood - have an identified (documented in the report) or hypothesized (based on our own judgment) communication and coordination failure Search results: - 32 public incidents - 8 site proprietary incidents 7 HONEYWELL

Methods Prioritize Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s 8 HONEYWELL

Methods Prioritize The incidents were subjectively rated by the research team and were approved by industry representatives: 1 = Latent failure 3 = Contributing failure 9 = Causal failure Industry 1 = General industrial 3 = Nuclear, Offshore 9 = Refining, Chemical Detail 1 = Insufficient detail (website) 3 = Moderate detail (case study, digest) 9 = Complete detail (full incident report) Recency 1 = Before 1990 3 = 1990 to 1999 9 = Since 2000 Based on this rating scheme, 14 incidents (10 public, 4 company proprietary) were selected for analysis - This sample size was considered sufficient to establish a preliminary understanding of the basic causes of incidents associated with communications and coordination failures 9 HONEYWELL

Methods Cause Analysis Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s 10 HONEYWELL

Methods Cause Analysis Tap (www.tap.com) was used to complete the root cause analysis (Paradies & Unger, 2000) We used Tap because it: - is a structured approach to incident investigations - is based on sound process safety management principles and lessons learned (CCPS, 2003) - is systematic and work process driven - is robust and well grounded in human factors and systems - has credibility in both research and industry settings - is generic and not specific to a domain or problem space Tap is robust for this kind of analysis 11 HONEYWELL

Methods Cause Analysis 1. Determine Sequence of Events 2. Identify s 3. Analyze 4. Review With Technical Team 12 HONEYWELL

Methods Cause Analysis West Diversion Box high level alarm clears Blowdown Drum and Stack high level alarm activates (1st time) Around 3/23/2005 1:20pm Should have alarmed 2 minutes earlier Blowdown Drum and Stack high level alarm malfunctioned 3/23/2005 1:20pm After 3/23/2005 1:20pm Heavy Raffinate product flow to tankage 21,600bpd @ 1:27pm s = something that occurred prior to the incident, which if corrected, would have either prevented the incident from occurring, significantly mitigated its Heavy Raffinate product consequences, or reduced the likelihood that the flow to tankage incident would have occurred. 26,700bpd @ 1:24pm Incident = worst thing that happened, reason for investigation Around 3/23/2005 1:21-27pm 3/23/2005 1:29pm S Day Shift Board Operator monitors process Explosion Site Emergency Response Team responded Day Shift Board Operator monitors process Night Shift Board Operator enters site T Events = what happened Reflux Drum level at 0, starts to rise 15 killed, 170 injured Initiated search and rescue actvities Raffinate Splitter pressure 23psig; tray 13 temp 200deg; overhead temp 180deg @ 1:21pm Trailers damaged Feed preheat to Raffinate Splitter 267deg, starts to fall ISOM unit damaged Reflux Drum level at 50%; falls slightly @ 1:22pm 2 nd fires and hydrocarbon releases Condition = details related to the event 13 HONEYWELL

Methods Cause Analysis A conceptual model was developed to provide common operational definitions for failures (Laberge, 2008) - Communication failures are any problem involving the content, type, timing, or medium of communication - Coordination failures are any problem where two or more people must successfully interact to complete a job Communication and coordination failures are broad 14 HONEYWELL

Methods Cause Analysis Each failure was subject to detailed root cause analysis using the Tap root cause tree 1.Communication Level Question Answer Result If people had communicated more effectively, would the issue have been prevented? YES, MORE EFFECTIVE COMMUNICATION OF PROCESS SAFETY INFORMATION Continue to level 1.1 in Communication branch 1.1 No communication or not timely Was an issue caused by failure to communicate? YES, MANAGEMENT FAILED TO COMMUNICATE PSM INFORMATION TO PERSONNEL Continue to Level 1.1.1 in No communication or not timely branch 1.1.1 Communication system need improvement (NI) Was the system inadequate? YES, EMAIL USED BUT WAS INEFFECTIVE, COMMS DURING MEETINGS AND MEMOS W/SIGN OFF BUT NO PROOF WAS FOUND THAT THIS ACTUALLY OCCURRED cause of failure 1.1.2 Late communication Were communications provided too late because events happened too fast to allow time for communications? 1.2 Turnover NI Did incorrect, incomplete, or otherwise inadequate verbal or written turnover of information during shift/watch relief cause or fail to prevent an error? 15 HONEYWELL NO NO Exclude as root cause Do not proceed down branch

Methods Cause Analysis Two investigation team members reviewed all the incident reports, SnapCharts, list of failures, and root cause analyses The two-person team discussed differences of opinion and came to a consensus on the sequence of events, failures, and root causes before analyzing another incident This consensus process provided a quality control mechanism to increase the consistency of the results and the reliability of the findings across incidents 16 HONEYWELL

Methods Identify Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s 17 HONEYWELL

Methods Identify 207 individual failures from all the incidents were clustered into common failure modes - failures highlight common problems that were shared across incidents - failures represent the shared problem elements that can be used to develop solutions to prevent future incidents failures = systemic problems for the industry 18 HONEYWELL

Methods Identify A taxonomy of failure modes was developed failure mode taxonomy was developed using a conceptual model (Laberge, 2008) Four team members independently clustered the individual failures - Average agreement (inter-rater reliability) was 70% - The team discussed where there was disagreement and came to a consensus before proceeding 19 HONEYWELL

Results Mode Analysis 5 common failure modes were: 100% 80% 60% 40% 20% 31% 14% 13% 12% 10% 20% 0% Planning activities Individual and team execution Work direction and supervision Communication functional groups Activity assessment Other common failures 80% of total Coordination related failures are more common 20 HONEYWELL

Methods Identify Sample of s START 1.Identify 2.Prioritize 3. Cause Analysis 4.Identify 5. Cause Profiles END Public Incident Site Criteria Tap Cluster Analysis Sample of s 21 HONEYWELL

Results root causes show why failures occurred across incidents Significant contributor (>15%) Substantial contributor (>10%) Moderate contributor (>5%) Not a contributor (0%) Combined for 5 Planning activities Individual and team execution Work direction and supervision Communication between functional groups Cause % % % % % % Activity assessment No SPAC 12.2% 20.4% 8.6% 7.8% 15.2% Crew teamwork needs improvement 11.1% 7.4% 15.5% 17.6% 6.5% 12.1% SPAC not followed 8.8% 7.4% 19.0% 7.8% 9.1% No communication 8.4% 6.5% 5.9% 32.6% No supervision 7.4% 12.1% 19.6% 15.2% SPAC Standards, Policies, Administrative Controls 22 HONEYWELL

Discussion Process industry companies interested in addressing the top 5 common failure modes should consider the following causes: - Ineffective standards, policies, administrative controls (SPAC) Enforcement, coverage, clarity, and accountability - Lack of communication No communication particularly between management, leaders, and employees; poor communication systems - Poor crew teamwork Not questioning problems, focusing on one problem and losing sight of overall status, person-in-charge leaves problems uncorrected - No supervision Person-in-charge does not provide support, coverage, or oversight vary; comprehensive solutions are required 23 HONEYWELL

Discussion The ASM Consortium is investigating the following solution areas to address the common failures and root causes identified in this project: - Team training (CRM-like) - Requirements for effective team communication and coordination - Best practices for leaders and supervisors - Collaboration technologies to support team coordination - Effective work processes (example of a SPAC) for team activities like work permitting, incident investigations 24 HONEYWELL

Limitations were mostly public from U.S. companies - The sample may not fully represent the process industries - A new ASM Consortium study is in progress to expand the sample size Tap is a subjective method - Developed systematic research approach - Mitigated to some degree through consensus building Incident reports were the only source of information - The consensus building approach and the use of operational definitions for both root causes and common failure modes was a mitigation technique to ensure the analysis was as systematic and objective as possible 25 HONEYWELL

Future Research Analysis that goes beyond communication and coordination activities to examine operations practices more generally - Could identify relative causes for problems more generally - May identify additional research areas or solution opportunities Compile and analyze near miss incidents - A near miss is an occurrence in which an accident (that is, property damage, environmental impact, or human loss) or an operational interruption could have plausibly resulted if circumstances had been slightly different (CCPS, 2003, p. 61) - Near miss reporting is a largely untapped source of information on failures and root causes (CCPS, 2003) - Other industries (e.g., aviation, medical) use near miss reporting to proactively identify problems and develop effective solutions before incidents occur 26 HONEYWELL

Acknowledgments Thanks to the HFES reviewers for their insightful comments This study was funded by the ASM Consortium, a Honeywell-led research and development consortium Questions? 27 HONEYWELL

www.asmconsortium.org www.honeywell.com