Published on 31.01.13 in Vol 2, No 1 (2013): Jan-Jun
Preprints (earlier versions) of this paper are available at http://preprints.jmir.org/preprint/2402, first published Oct 16, 2012.
Evaluation of User Interface and Workflow Design of a Bedside Nursing Clinical Decision Support System
Background: Clinical decision support systems (CDSS) are important tools to improve health care outcomes and reduce preventable medical adverse events. However, the effectiveness and success of CDSS depend on their implementation context and usability in complex health care settings. As a result, usability design and validation, especially in real world clinical settings, are crucial aspects of successful CDSS implementations.
Objective: Our objective was to develop a novel CDSS to help frontline nurses better manage critical symptom changes in hospitalized patients, hence reducing preventable failure to rescue cases. A robust user interface and implementation strategy that fit into existing workflows was key for the success of the CDSS.
Methods: Guided by a formal usability evaluation framework, UFuRT (user, function, representation, and task analysis), we developed a high-level specification of the product that captures key usability requirements and is flexible to implement. We interviewed users of the proposed CDSS to identify requirements, and listed the functions and operations the system must perform. We then designed visual and workflow representations of the product to perform the operations.
The user interface and workflow design were evaluated via heuristic and end user performance evaluation. The heuristic evaluation was done after the first prototype, and its results were incorporated into the product before the end user evaluation was conducted. First, we recruited 4 evaluators with strong domain expertise to study the initial prototype. Heuristic violations were coded and rated for severity. Second, after development of the system, we assembled a panel of nurses, consisting of 3 licensed vocational nurses and 7 registered nurses, to evaluate the user interface and workflow via simulated use cases. We recorded whether each session was successfully completed and its completion time. Each nurse was asked to use the National Aeronautics and Space Administration (NASA) Task Load Index to self-evaluate the amount of cognitive and physical burden associated with using the device.
Results: A total of 83 heuristic violations were identified in the studies. The distribution of the heuristic violations and their average severity are reported. The nurse evaluators successfully completed all 30 sessions of the performance evaluations. All nurses were able to use the device after a single training session. On average, the nurses took 111 seconds (SD 30 seconds) to complete the simulated task. The NASA Task Load Index results indicated that the work overhead on the nurses was low. In fact, most of the burden measures were consistent with zero. The only potentially significant burden was temporal demand, which was consistent with the primary use case of the tool.
Conclusions: The evaluation has shown that our design was functional and met the requirements demanded by the nurses’ tight schedules and heavy workloads. The user interface embedded in the tool provided compelling utility to the nurse with minimal distraction.
Interact J Med Res 2013;2(1):e4
- clinical decision support systems;
- user-computer interface;
- software design;
- human computer interaction;
- usability testing;
- heuristic evaluations;
- software performance;
- patient-centered care
Usability Issues in Clinical Decision Support Systems
Clinical decision support systems (CDSS) are important tools to improve health care outcomes and reduce preventable medical adverse events [, ]. In the US, CDSS is one of the key requirements for the government mandated meaningful use of electronic medical record (EMR) adoption [ ]. It was suggested that smart, portable, point-of-care, and interoperable technology solutions could help reduce inefficiencies and improve patient safety and outcomes for nurses [ ].
However, the effectiveness and success of CDSS depend on their implementation context and usability in complex health care settings (eg, ). Studies have shown that different CDSS implementations often yield very different clinical outcomes (eg, [ , ]). A study found that a home grown CDSS designed specifically for a hospital out-performed 31 other similar CDSS deployments included in the study [ ]. A multi-site study indicated that nurses routinely over-ride CDSS recommendations that do not fit their local practice, leading to a potential increase of errors [ ].
In particular, CDSS implementations often suffer from poor usability, which directly impacts their adoption and effectiveness. For instance, user interface (UI) workarounds have been shown to greatly diminish the effectiveness of widely used CDSSs [, ]. While many CDSSs rely on alert/reminder-based user interactions to prompt the clinician to correct potential guideline violations, alert fatigue is a common issue for those systems (eg, [ ]). A study showed that physicians who received CDSS alerts were only slightly more likely to take appropriate actions than those who did not [ ]. In the area of diagnostic decision support, it has been demonstrated that the accuracy of diagnostic aid tools depends on their UI. Tools that require simple copying and pasting from free text medical records yield more accurate results than tools that require the physician to extract and categorize information from the medical records [ , ]. As a result, usability design and validation, especially in real world clinical settings, are crucial aspects of successful CDSS implementation.
In this study, we developed a novel CDSS for the CHRISTUS St. Michael health system (a 350-bed acute care hospital) to help frontline nurses better manage critical symptom changes in hospitalized patients. The CDSS is currently undergoing clinical pilots inside the hospital. The goal of the CDSS was to reduce preventable failure to rescue (FTR) cases in the hospital. Since the nursing work environment is subject to constant interruptions and is error prone, a robust UI and implementation strategy that fit into the existing workflow was crucial to the success of the system.
In this paper, we will discuss the design, evaluation, implementation, and validation of the CDSS UI. We will present several innovations in nursing CDSS UI design, especially on large touch screen devices. The internal algorithmic design and the validation of decision rules, however, are beyond the scope of this paper. In the next section, we will start with a brief clinical background of the nursing CDSS tool.
Nursing Decision Support for Early Detection of Critical Changes
Early Symptom Recognition and Response
FTR is a leading patient safety indicator, with the highest incidence rate among all indicators according to a recent large-scale study. In 2010, the FTR measure was included as one of the Inpatient Prospective Payment System measures by the Centers for Medicare & Medicaid Services, which directly affects hospitals’ reimbursements [ ].
FTRs are often considered preventable because the symptoms of a deteriorating patient could present hours before the rescue starts. Examples of such critical symptom changes include a patient’s complaint of new pain, a mental status change, and difficulty breathing. Studies have indicated that many FTRs could have been averted if the critical symptoms in patients were captured, evaluated, and communicated early.
It was suggested that the nurses’ early recognition, evaluation, and decision making of symptom signs could play an important role in FTR [, ]. A study conducted in a surgical oncology population indicated that many complications are detectable by nurses and can be managed with timely intervention [ ]. It was suggested that 23,000 in-hospital cardiac arrests in the UK could be prevented every year if early signs of symptoms were detected and acted upon [ ]. A 2009 study demonstrated that an early symptom recognition and response system could help improve outcome of sepsis and septic shock, which have hard-to-detect symptoms [ ].
Simply detecting and evaluating the critical symptom changes is not enough. The potential complication must be communicated to the rest of the clinical team, and be escalated to the right team members in order to organize effective interventions. It was argued that FTRs are often caused by the failure to communicate . Interventions such as the rapid response team (RRT) have demonstrated effectiveness in reducing FTRs when the issues are escalated on time [ , ]. In fact, the national deployment of RRT has the explicit purpose of supporting nurses in managing critical changes before coding arrest [ ]. It was also suggested that escalating to surgical residents could improve rescue success rates [ ], indicating that the optimal path of escalation needs to be selected by the nurses as part of the decision-making process.
Role of Frontline Nurses in Symptom Evaluations and Rapid Response Interventions
Frontline nurses are often the first to notice critical symptom changes. Their decisions at the point-of-care are crucial factors determining whether FTR events can be reduced. However, at the same time, nurses are ill equipped to manage critical symptom changes in hospitals.
The frontline nursing staff in most hospitals have very high workloads, need to manage extensive multitasking, and are fatigued [, ]. The fatigue has been demonstrated to negatively impact nurses’ cognitive performance [ ], including symptom evaluations. In fact, studies have shown a strong anti-correlation between nursing staffing levels and medical error rates [ ].
The average skill and training levels of nurses do not adequately prepare them to evaluate potentially complex symptom changes. A study found that a 10% increase in the proportion of nurses holding a bachelor’s degree was associated with a 5% decrease in the odds of FTR . Furthermore, most diagnostic aid CDSSs, such as differential diagnostic tools and diagnostic reminder tools, were designed for physicians to use in office settings, as opposed to nurses at the bedside.
While the RRT is a proven effective intervention for FTR, RRT resources can be under-utilized  because the nurses do not feel comfortable activating the RRT. Better communication has been shown to improve RRT utilization [ ]. It has been suggested that mandatory RRT activation helps reduce cardiorespiratory arrests outside of critical care areas in a hospital [ ].
The hierarchical structure in hospitals is known to impede the nurse decision-making process. Nurses are often discouraged from communicating and escalating problems. While hospitals across the nation have implemented teamwork frameworks, such as TeamSTEPPS [ ], emergency communication between nurses and physicians is still often error prone and requires standardization [ ].
Design of CDSS
A specially designed CDSS could potentially help the nurse address the above issues related to critical symptom changes and FTRs. Such a CDSS requires special design considerations for two reasons.
First, the system must be tailored to the nurses’ training and cognitive levels, and generate action items that are appropriate for the nurse. Most floor nurses have gone through less than 4 years of medical training after high school, and they do not have independent authority to treat the patient without the physician’s prescription.
Second, the system must be adapted to the fast paced workflow during a rescue operation. The tool must be ubiquitous, instant on, and provide useful feedback in mere minutes. The application should enhance real-time communication across team members, as opposed to bringing in another computer that impedes face-to-face communication.
Both challenges highlight the need for a novel design, and formal evaluation of the system UI and workflow.
Cognitive Design of UI
Human-computer interaction and workflow designs are crucial for the success of clinical informatics projects. A large body of research has been devoted to methods and techniques for evaluating the usability of systems.
Early efforts focused on creating human models and breaking down tasks into small pieces that could be directly measured and optimized for user performance. For instance, the goals, operators, methods, and selection rules family of frameworks [- ] are widely used to model human users as information processors. They break down user actions (eg, every key stroke), and measure time consumed in each step to evaluate the overall effectiveness of the UI. However, such frameworks do not take into account the intrinsic difficulty of the task and the functionality of the UI. They are very good at evaluating systems that predominantly require movement operations, but are less effective in evaluating systems with heavy cognitive tasks.
For cognitive systems, analysis of the UI itself is a key aspect of usability design, because UI design often has a deterministic effect on user performance. Research in cognitive theory has indicated that different visual representation of the same underlying work problem could produce dramatically different user performance in terms of ability to complete tasks correctly and productivity [, ]. A well-known example is that Arabic numerals are much easier to add and multiply than their equivalent Roman numerals.
Furthermore, complex work often requires collaboration of multiple users. It was demonstrated that cognition can be distributed across multiple users working on the same system [- ]. Hence, another important aspect of usability design is to evaluate each user’s goals and functions, and then translate them into a cohesive UI.
A popular design approach that works with the above cognitive design principles is the work-centered design (WCD) [, ]. WCD treats the UI as an aid for the user to achieve a specific work task. It conceptualizes steps for knowledge capture, requirement analysis, aiding design, and evaluation, which is a process followed closely in modern software development.
A particularly interesting application of distributed cognition and WCD in the medical informatics field is the UFuRT (user, function, representation, and task analysis) [- ] framework. For this project, we decided to use the UFuRT framework as a guide for usability design. The primary reason for us to choose UFuRT is its successful track record in design and evaluation of medical information technology (IT) products [ - ]. Its usability evaluation process consists of 4 major steps:
- User analysis is used to identify users and stakeholders of the work product, and document their needs and objectives. The user requirements are translated into system design requirements in this process.
- Function analysis aims to generate an essential description of the work. The UFuRT process calls for a 4-step analysis to detail the dimensions, constraints, relations, and finally operations.
- Representational analysis is the design process to identify and determine the implementation representations of relations among the dimensions identified in the functional analysis. The representation includes UIs and workflows for different types of users of the system. Representational analysis is a crucial step of the design process since it has been convincingly demonstrated that different representations of the same task can have very different impacts on the user’s efficiency and productivity [ ]. The ease-of-use of the UI is also one of the major factors driving adoption of any technology product [ ].
- Task analysis identifies the steps a specific user performs on a specific representation in order to carry out an operation.
In the context of our project, we used the UFuRT framework to analyze software requirements and inform the specification. Hence, we focused on user analysis and the UI design aspects of representational analysis. We performed a high-level functional analysis and did not perform task analysis in the design stage. The reason was that complete functional and task analyses require full knowledge of every detail of the product, which would not provide enough flexibility for our iterative software development process.
Design Goals and System Requirements
The overall objective of the system was to help prevent patient safety events during critical changes. Through interviews with hospital-based clinicians, we have specifically identified symptom evaluation and escalation as the 2 main functional goals of the CDSS.
Improve Symptom Recognition and Evaluation
While nurses do not make diagnoses, they are the first to recognize and evaluate the patient symptom changes. Based on their evaluation, they would decide how to (or whether to) coordinate further care, and their evaluation results are often accepted by the team as the basis of a formal diagnosis.
Existing diagnostic CDSS tools provide a proven framework to help reduce errors in diagnostic evaluation, and improve documentation of the clinical findings that lead to diagnoses. Specifically, the CDSS needs to provide 2 core functionalities.
Provide Just-in-Time Medical Content to the Nurse
For many critical symptom changes, there are multiple possible diagnoses. An example is that a hospitalized patient suddenly feels chest pain. The chest pain could be an indicator of heart attack, which needs to be attended to by a cardiologist or surgery team immediately; or the chest pain could indicate reflux or indigestion, which is a rather common condition that is simple to treat.
The frontline nurses typically do not have enough medical training and experience to thoroughly evaluate those potential diagnostic outcomes. The CDSS should provide specific instructions for the nurse to follow, and then make recommendations on what to do next. For instance, it should provide specific instructions on whom to call and what to say during the call for each potential diagnosis. The system does not replace human decision-making or training, but it provides support to help nurses deal with complicated emergent situations to the best of their capabilities.
Reduce Common Cognitive Errors
Common cognitive errors that lead to diagnostic errors include premature closure, anchoring, confirmatory bias, and framing . Those errors happen because the clinicians ignore certain findings or give certain other findings too much weight. Studies have indicated that cognitive errors such as premature closure are the most common cause of diagnostic errors made by clinicians [ ]. A key design goal of the CDSS was to help reduce those common cognitive errors.
To reduce framing and premature closure, the CDSS should encourage and prompt the clinicians to check all possible diagnostic outcomes, especially severe outcomes that lead to FTRs. The CDSS should also prompt the clinicians to verify all important symptoms and findings related to major diagnostic outcomes to minimize missed diagnoses.
To reduce anchoring or confirmatory bias, the CDSS should present an objective estimate of likely diagnoses and suggested clinical actions based on the current findings. The objective probability estimate could reduce the user’s reliance on preconceived decision biases.
Facilitate Team Communication
Teamwork is one of the few proven approaches to improve patient safety and care quality in hospitals [, ]. Particularly, our system should be designed to increase the utilization of the RRT, and improve communication between nurses and physicians.
As we discussed in the clinical background, RRT is an effective approach to help reduce FTR when it is deployed correctly. Our CDSS aimed to improve the effectiveness of the RRT by activating RRT early and making RRT mandatory when the nurse detects certain warning signs.
The CDSS needs to provide an easy and non-intrusive way to automatically alert the RRT at appropriate times. The RRT consists of more experienced clinicians, and they can decide whether or when to respond to those alerts. At the same time, it is important for the CDSS to clearly notify the nurse when it sends alerts to the RRT and the status of the alerts. The user must feel that he/she is in full control in order to effectively utilize the system.
If the floor nurse determines that the patient needs assistance from a physician, he/she would call the physician and explain the situation. The conversation could be a frustrating experience for both the nurse and the physician due to different expectations. That could result in the physician losing confidence in the nursing staff, and nurses delaying calls to physicians. The system should provide tools to help nurses communicate better with physicians in emergency situations.
Development of the Software Specification
We used the UFuRT framework as a conceptual guide to develop the software specification for the CDSS tool. Specifically, we identified users of the system, and documented use case stories for each user (ie, user analysis). We identified high-level functions the system must perform to meet the user requirement (ie, functional analysis). And finally, we created visual representations of the UI that can best accomplish those functions (ie, representational analysis). The UFuRT task analysis was not conducted at the design stage. Instead, the tasks were evaluated as part of the user evaluation process described later in this article.
Users of the proposed CDSS were members of the clinician team responsible for rescuing patients in the hospital. They included floor nurses, RRT nurses, and physicians. The user roles described in this section were based on interviews with hospital clinicians.
The primary users of the CDSS were the floor nurses. The system presented information and actions that were appropriate to the floor nurses. Specifically, the system could not present medical content that required MD-level training to understand, or ask nurses to make diagnostic decisions on their own. The CDSS also could not instruct the nurse to perform clinical actions that he/she was not authorized or qualified to do, such as performing advanced examinations, ordering labs, or writing prescriptions. Furthermore, a key characteristic in the floor nurse’s work environment is that they are very busy and have established workflows. The system added minimal overhead to the existing workflows.
If the floor nurse detected a potential problem, the RRT nurse was the next escalation step. RRT nurses are typically paged by the hospital internal communication system, and hence the CDSS must support paging the RRT. The system should give RRT nurses more options as they have the authority to perform standing orders on patients. Finally, when the RRT nurse arrived at the bedside, in order to minimize errors at the hand-off of care, it was important for the CDSS to have clear documentation on the findings and actions that have been performed by the floor nurse so far.
The physician in charge of the patient should be notified when there is a probable problem with the patient. The system should provide accurate and concise summaries of the patient condition for the nurse to read to the physician when talking on the phone.
Once the user requirements were determined, we developed a list of high-level functions the system must perform. Please note that we did not create a detailed catalog of functions at this stage of development. Instead, we focused on high-level operations in order to provide implementation flexibility. Key operations of the system include the following:
- Identify the symptom change that triggers the use of the system
- Identify a list of potential diagnoses
- Identify a list of potential clinical findings that will reject or affirm those diagnoses
- Enter clinical findings
- Re-evaluate the probabilities for each diagnosis after each finding
- Repeat for all findings until a diagnosis becomes highly likely
- Identify the action items for this diagnosis
- Identify the escalation path for this diagnosis
- Perform operations required in the action items list
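The operations above can be sketched as a small loop. The paper does not describe the decision algorithm (it is explicitly out of scope), so the sketch below swaps in a plain naive Bayes update purely for illustration. The `Rule` class, the function names, the 0.9 threshold, and every probability are hypothetical values invented for the example and carry no clinical meaning.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: P(finding is present | diagnosis)."""
    diagnosis: str
    finding: str
    p_yes: float


def reevaluate(priors, rules, answers):
    """Recompute diagnosis probabilities from all yes/no answers so far.

    Findings are treated as conditionally independent (naive Bayes),
    which is an assumption of this sketch, not of the paper.
    """
    likelihood = {(r.diagnosis, r.finding): r.p_yes for r in rules}
    scores = {}
    for dx, prior in priors.items():
        score = prior
        for finding, present in answers.items():
            p = likelihood.get((dx, finding), 0.5)  # 0.5 = uninformative
            score *= p if present else 1.0 - p
        scores[dx] = score
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("answers rule out every diagnosis")
    return {dx: s / total for dx, s in scores.items()}


def evaluate(priors, rules, answer_stream, threshold=0.9):
    """Enter findings one by one until a diagnosis becomes highly likely."""
    answers = {}
    probs = reevaluate(priors, rules, answers)
    for finding, present in answer_stream:
        answers[finding] = present
        probs = reevaluate(priors, rules, answers)
        top = max(probs, key=probs.get)
        if probs[top] >= threshold:
            return top, probs
    return None, probs


# Invented illustrative numbers only.
PRIORS = {"heart attack": 0.3, "reflux": 0.7}
RULES = [
    Rule("heart attack", "pain radiates to arm", 0.8),
    Rule("reflux", "pain radiates to arm", 0.1),
    Rule("heart attack", "shortness of breath", 0.7),
    Rule("reflux", "shortness of breath", 0.1),
]

if __name__ == "__main__":
    stream = [("pain radiates to arm", True), ("shortness of breath", True)]
    print(evaluate(PRIORS, RULES, stream))
```

Once a diagnosis crosses the threshold, the remaining operations (looking up its action items and escalation path) would reduce to lookups keyed by the diagnosis name.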
In addition, we have also identified non-essential operations that were related to the specific design of the system. Such operations included user login to the system with badge number, synchronization of the device content with online repositories, user entry of the patients’ room number, and user configuration of the device for display options.
The UI of the product was designed to address operations listed in the previous section. It aimed to present a familiar and non-intrusive interface to the user at the point-of-care. In this section, we describe key features of the UI.
Mobility Through a Consumer Tablet Device
We decided to implement the UI on a touch screen consumer tablet device. The reason behind choosing a tablet device was that it can be accessed anytime, anywhere, and could be carried around by the clinician or be made available at the bedside. The tablet device was connected to the hospital secure WiFi system to access medical records, alert RRT and other teams, and update clinical content as needed.
The choice of a consumer tablet, as opposed to a dedicated medical device, was due to two reasons. First, the consumer device was much cheaper to deploy. A consumer iPad costs less than one third of a special purpose tablet PC on the market. Second, the consumer device featured a UI that the nurses were already familiar with due to their use of similar devices at home.
The most widely used and user-friendly consumer tablet device on the market is the Apple iPad, which we chose as the implementation platform for the CDSS device.
Dynamic Checklist Design
Most existing diagnostic decision support tools use decision trees or text-based free form search [ ] to generate potential diagnoses. We determined that neither approach was suitable for nurses in emergency situations. Decision trees are slow and make it hard to recover from accidental input errors. Text-based data entry is very slow on a mobile device.
Instead, we decided to use another UI metaphor that is commonly used in hospital environments—the medical checklist. The main UI of the system was a dynamic checklist for the nurse to go over and examine clinical findings related to the patient. Checklists have been shown to reduce medical errors [, ], and could help prevent several categories of cognitive errors (outlined in Section 3.1.2 of [ ]). UI is important for checklists. Effective checklists need to be prioritized, short, highly usable, and integrated into the clinician workflow [ ].
The accompanying figure shows a split panel screen with 2 lists. This is the screen that the nurse sees when he/she selects a critical change (eg, "chest pain" or "mental status change"). The checklist to the right is a list of measurements and observations the nurse needs to perform in order to evaluate the patient. The list was ordered based on the priority and potential impact of each finding. The nurses were encouraged to work on the high priority tasks at the top of the list first.
The list on the left shows potential causes for the patient's critical change (ie, the diagnostic outcomes). The causes were listed in order of their probabilities based on the current findings from the checklist items on the right panel.
All the user needed to do was to follow the checklist and enter a simple yes/no answer to the findings. With each yes/no answer, the system automatically recalculated and redisplayed the diagnostic outcome probabilities and the priorities of the remaining checklist items.
The nurses could go through the findings checklist in any order. The nurses could also undo any choices to go back to any previous state. That allowed the nurses to pick and choose tasks that happen to fit the existing workflow at any point of the process. There was no need to interrupt the flow just to provide a finding required by the software.
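A minimal sketch of this interaction model follows, assuming the checklist state is simply the set of answers entered so far plus a history stack for undo. The class name `DynamicChecklist`, the `priority` callback, and the example findings and weights are hypothetical; the paper does not say how the real application stores state or ranks items.

```python
class DynamicChecklist:
    """Checklist that accepts answers in any order and supports undo."""

    def __init__(self, findings, priority):
        self._findings = list(findings)
        # priority(finding, answers) -> number; higher means ask sooner.
        self._priority = priority
        self._answers = {}
        self._history = []  # snapshots of earlier answer states

    def answer(self, finding, present):
        """Record a yes/no answer for any unanswered or answered finding."""
        if finding not in self._findings:
            raise KeyError(finding)
        self._history.append(dict(self._answers))
        self._answers[finding] = bool(present)

    def undo(self):
        """Return to the state before the most recent answer, if any."""
        if self._history:
            self._answers = self._history.pop()

    @property
    def answers(self):
        return dict(self._answers)

    def remaining(self):
        """Unanswered findings, re-ranked against the current answers."""
        todo = [f for f in self._findings if f not in self._answers]
        return sorted(
            todo,
            key=lambda f: self._priority(f, self._answers),
            reverse=True,
        )


# Invented static weights; a real ranking would depend on the answers.
WEIGHTS = {"pain radiates to arm": 3, "blood pressure low": 2, "recent meal": 1}


def static_priority(finding, answers):
    return WEIGHTS[finding]


if __name__ == "__main__":
    checklist = DynamicChecklist(WEIGHTS, static_priority)
    checklist.answer("recent meal", False)  # lowest priority item first
    print(checklist.remaining())
    checklist.undo()
    print(checklist.remaining())
```

Because the ranking is recomputed from the current answers on every call, answering out of order or undoing a choice never leaves the list in an inconsistent state.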
This differs from the typical decision tree or flow chart decision models, where the workflow is dictated by the software system.
The CDSS was connected to the hospital communication system, and it automatically sent out pages to the RRT as the nurse works on the patient. The RRT members could then decide whether to intervene depending on how severe the patient condition was as reported by the nurse through the device.
If the RRT decided to intervene, they could simply take over the CDSS device, which has documentation of the findings the nurse had already completed.
The CDSS provided a standard list of items for the nurses to go through with the physicians when a likely diagnosis emerged. The nurse action lists were customized for each diagnostic outcome, and included orders the nurses should anticipate from the physicians. The nurses could get a head start by preparing for those orders while trying to reach the physician, saving time for the patient rescue.
The action items were reviewed and approved by the physicians in the hospital, and they were designed to enable physicians to make quick decisions over the phone.
Implementation of the CDSS
The CDSS system was implemented as a client-server computer application. The main component of the system was an iPad application developed in Objective C using the Apple iOS software development kit. The iPad application provided all the UI elements described in the design, and it was the only UI device the nurses needed to interact with during the patient evaluation process. The iPad application contained a SQLite-based relational database to store decision rules, medical content, user credentials, and usage logs. The application required access to the hospital’s secure WiFi network in order to send paging messages to the RRT members. Except for the RRT page, the iPad device could function entirely without network connectivity, and only needed to occasionally synchronize with the backend database for content updates.
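To make the on-device storage concrete, here is a rough sketch of what a SQLite store for decision rules and usage logs might look like. It uses Python's `sqlite3` module rather than the Objective C used in the actual application, and the table names, column names, and helper functions are all hypothetical; the paper does not publish its schema.

```python
import sqlite3

# Hypothetical schema: not taken from the paper.
SCHEMA = """
CREATE TABLE diagnosis (id INTEGER PRIMARY KEY, name TEXT NOT NULL UNIQUE);
CREATE TABLE finding (id INTEGER PRIMARY KEY, prompt TEXT NOT NULL UNIQUE);
CREATE TABLE rule (
    diagnosis_id INTEGER NOT NULL REFERENCES diagnosis(id),
    finding_id INTEGER NOT NULL REFERENCES finding(id),
    weight REAL NOT NULL,
    PRIMARY KEY (diagnosis_id, finding_id)
);
CREATE TABLE usage_log (
    id INTEGER PRIMARY KEY,
    badge_number TEXT NOT NULL,
    room_number TEXT NOT NULL,
    finding_id INTEGER NOT NULL REFERENCES finding(id),
    answer INTEGER NOT NULL CHECK (answer IN (0, 1)),
    logged_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
"""


def open_store(path=":memory:"):
    """Create the local store; works offline because it is a local file."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript(SCHEMA)
    return conn


def log_answer(conn, badge_number, room_number, finding_id, present):
    """Append one yes/no answer to the usage log."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO usage_log (badge_number, room_number, finding_id, answer)"
            " VALUES (?, ?, ?, ?)",
            (badge_number, room_number, finding_id, int(present)),
        )


def findings_for(conn, diagnosis_name):
    """Finding prompts linked to a diagnosis, strongest weight first."""
    rows = conn.execute(
        "SELECT f.prompt FROM rule r"
        " JOIN finding f ON f.id = r.finding_id"
        " JOIN diagnosis d ON d.id = r.diagnosis_id"
        " WHERE d.name = ? ORDER BY r.weight DESC",
        (diagnosis_name,),
    )
    return [prompt for (prompt,) in rows]
```

Keeping rules and logs in a local database is what would let the device keep working without network connectivity, with only content synchronization and RRT paging needing the network.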
The second component of the system was an online content management system (CMS) to manage the decision rules, medical content, and authorized users and devices. The system was designed as a Web application built on Java Enterprise Edition running on Tomcat and MySQL database servers. The interface with the iPad device was programmed as RESTful XML Web services. The CMS had a human UI that visualized the content and allowed CRUD (create, retrieve, update, and delete) operations on the content items from any Web browser. Proper user authorization was enforced in the CMS so that only users with certain roles (eg, physicians and managers) could update the content. The accompanying figure shows a screenshot of the CMS Web page that allowed reviewers to associate findings and actions with diagnoses into clinical rules.
The CMS also provided an interface for the physician reviewers to review cases based on the usage log of the iPad device. That supplemented the brief information recorded in formal medical records and provided insights into how to improve the system in the future.
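As an illustration of the content synchronization between the device and the CMS over XML Web services, the sketch below parses a content payload and decides whether the local copy is stale. The XML element and attribute names (`content`, `rule`, `version`, `weight`) are invented for this example; the paper does not document the actual message format.

```python
import xml.etree.ElementTree as ET

# Hypothetical payload shape; the real service format is not published.
SAMPLE = """<content version="7">
  <rule diagnosis="heart attack" finding="pain radiates to arm" weight="0.8"/>
  <rule diagnosis="reflux" finding="pain radiates to arm" weight="0.1"/>
</content>"""


def parse_content(xml_text):
    """Return (version, [(diagnosis, finding, weight), ...]) from a payload."""
    root = ET.fromstring(xml_text)
    version = int(root.get("version"))
    rules = [
        (r.get("diagnosis"), r.get("finding"), float(r.get("weight")))
        for r in root.findall("rule")
    ]
    return version, rules


def needs_update(local_version, xml_text):
    """True when the server payload is newer than the local content."""
    version, _ = parse_content(xml_text)
    return version > local_version


if __name__ == "__main__":
    print(parse_content(SAMPLE))
    print(needs_update(6, SAMPLE))
```

A version stamp on the payload is one simple way to support occasional synchronization: the device only replaces its local rules when the server reports a newer version.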
In the next two sections, we will discuss evaluations and validations we performed on the CDSS, especially the iPad UI.
The UI and workflow design of the product was evaluated using heuristic evaluation and performance-based end user evaluation. The heuristic evaluation was done after the first prototype, and its results were incorporated into the product before the performance-based evaluation was conducted.
Heuristic evaluation is a formal UI evaluation method designed to uncover potential problems in a product [- ]. It is particularly well suited for prototype and early stage products as a discounted alternative to full usability testing [ ]. A heuristic study is typically conducted by 3-5 independent expert evaluators who are trained on UIs. Studies have suggested that 3 expert evaluators can uncover 80-90% of usability problems that would have been uncovered by a full usability study from end users [ ]. In health care IT, heuristic evaluation has been successfully used to evaluate UIs for products ranging from EMRs [ ] to medical devices [ , ].
In this project, we incorporated heuristic evaluation into the iterative product design and development process. Based on the functional requirements outlined earlier in this paper, we built a first prototype, conducted heuristic evaluation, and then improved the prototype by addressing the heuristic violations identified by the evaluators.
It has been demonstrated that evaluators who are experts in both UI design and the specific application domain tend to be most effective in identifying heuristic violations. Since a key requirement for our product was to cause minimal disruption to the clinical workflow, we believed that evaluators with strong domain expertise were crucial. We recruited 4 evaluators to study the initial prototype. JL is an information scientist trained in usability evaluation and technology adoption. She is an associate professor at Texas State University. CM is a registered nurse and hospital quality management specialist. She has over 5 years of experience with RRTs in hospitals. She received training from JL to conduct heuristic evaluation. RM is a registered nurse with 20 years of experience, including 5 years in the RRT. She received training from JL to conduct heuristic evaluation. CE is a registered nurse with 15 years of experience, including 5 years in the RRT. He received training from JL to conduct heuristic evaluation.
The evaluators went through all UI elements in the application and evaluated them against the 10 heuristics for computer software. The heuristic violations were coded and documented. They were then rated for severity by all evaluators in the team. The severity was rated on a scale of 0 to 4, where a score of 0 meant that the issue was not a usability problem at all, 1 was a cosmetic problem only that did not need to be fixed unless extra time was available, 2 was a minor usability problem whose fix was given low priority, 3 was a major usability problem that was important to fix and was given high priority, and 4 was a release-blocking issue that was imperative to fix before the product could be released.
The heuristic violations were entered into an issue tracking system for the engineering team. The product reached its first release after all heuristic violations rated 3 and above were fixed.
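The severity scale and the release rule above can be sketched as follows. This is an illustration under stated assumptions rather than the issue tracking system used in the project: the `Violation` class is hypothetical, the per-evaluator ratings in the example are invented, and averaging the ratings is only one plausible way to arrive at a consensus severity.

```python
from dataclasses import dataclass
from statistics import mean

# Severity scale as described in the text.
SEVERITY_LABELS = {
    0: "not a usability problem",
    1: "cosmetic problem",
    2: "minor usability problem",
    3: "major usability problem",
    4: "release-blocking problem",
}

# Violations rated 3 and above had to be fixed before the first release.
RELEASE_THRESHOLD = 3


@dataclass
class Violation:
    heuristic: str
    place: str
    ratings: list  # one integer rating (0-4) per evaluator

    def severity(self) -> float:
        """Consensus severity, assumed here to be the mean of the ratings."""
        for rating in self.ratings:
            if rating not in SEVERITY_LABELS:
                raise ValueError(f"rating {rating} is outside the 0-4 scale")
        return mean(self.ratings)


def must_fix_before_release(violations):
    """Return the violations whose consensus severity meets the threshold."""
    return [v for v in violations if v.severity() >= RELEASE_THRESHOLD]


# Hypothetical ratings from 4 evaluators for two issues.
issues = [
    Violation("Visibility of system status", "Start", [4, 4, 4, 3]),
    Violation("Consistency and standards", "Outcome", [1, 1, 1, 1]),
]
blocking = must_fix_before_release(issues)
```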
Once the first release of the system was developed, we assembled a panel of nurses to evaluate the UI and workflow via simulated use cases. The panel consisted of 10 nurses from our target user group in the hospital. The panelists had varied educational backgrounds and experience levels. There were 3 licensed vocational nurses and 7 registered nurses on the panel. All of them were non-rapid response nurses working full time on the floor. Their work experience ranged from 1 to 39 years, with a median of 23 years. The simulation study was conducted as follows.
- The nurse enters a patient room to meet the study monitor. The monitor gives a trigger symptom verbally to the nurse.
- The nurse goes back to the station and fetches the tablet device. On the way, he/she enters his/her badge number and the room number, and selects the trigger symptom from a list.
- When the nurse enters the room again, he/she can go through the checklist in any order. The nurse verbally asks the monitor the questions on the checklist, and the monitor provides a yes/no answer.
- When the nurse has received enough information, he/she decides on a likely diagnostic outcome for the patient.
- The nurse reads aloud each of the action items associated with the diagnostic outcome.
The process was repeated 3 times for each nurse. The tablet device automatically logged usage during the sessions.
We recorded whether each nurse successfully completed each session. The first session for each nurse was considered a training session to get the nurse familiar with the device, and was not included in the evaluation results. The success criterion was that the nurse walked through the entire process and reached the action items without external help.
Completion Time Evaluation
For each session, we recorded the entire duration from the time the nurse walked into the room to the point where the nurse finished reading the action items. The completion time was an estimate of how much overhead time the use of the device added to the whole workflow. Since the product was designed to help nurses make quick decisions in urgent situations, it was crucial that the tool did not introduce too much overhead on its own. The evaluation criterion for the tool was that it should add less than 5 minutes of overhead to the existing clinical workflows.
NASA Task Load Index
After each session, the nurse was asked to use the National Aeronautics and Space Administration (NASA) Task Load Index to self-evaluate the amount of cognitive and physical burden associated with using the device. The NASA Task Load Index is a validated instrument for evaluating the burden of multiple tasks a user has to perform in parallel. It is well suited for the use scenario of this application, where the user is required to multitask. The NASA Task Load Index has been successfully applied in evaluating health care IT products in the past. The evaluation criterion for the released product was that the task load introduced by the tool should be minimal.
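As an illustration of how a NASA Task Load Index response can be summarized, the sketch below computes the unweighted ("raw") score as the mean of the six standard subscales, each rated from 0 to 100. The text does not state whether the weighted or the raw variant was used in this study, so the choice of the raw variant, the function name, and the example ratings are assumptions.

```python
# The six standard NASA Task Load Index subscales.
SUBSCALES = (
    "mental_demand",
    "physical_demand",
    "temporal_demand",
    "performance",
    "effort",
    "frustration",
)


def raw_tlx(ratings: dict) -> float:
    """Return the unweighted mean of the six subscale ratings (0-100 each)."""
    missing = set(SUBSCALES) - set(ratings)
    if missing:
        raise ValueError(f"missing subscales: {sorted(missing)}")
    for name in SUBSCALES:
        if not 0 <= ratings[name] <= 100:
            raise ValueError(f"{name} must be between 0 and 100")
    return sum(ratings[name] for name in SUBSCALES) / len(SUBSCALES)


# Hypothetical response in which only temporal demand is nonzero.
example = {name: 0 for name in SUBSCALES}
example["temporal_demand"] = 30
score = raw_tlx(example)
```

Reporting the subscales individually, as the results in this paper do, preserves the distinction between temporal demand and the other burden measures that an overall score would hide.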
Key Issues Identified in Heuristic Evaluation
The first table below lists a few examples of the heuristic violations identified by the evaluators. Each issue was categorized under one of the 10 common software application heuristics, identified by the place in the software product where it occurred, and assigned a severity based on the consensus rating by the evaluators.
A total of 83 heuristic violations were identified in the study. The two tables that follow the examples list the distribution of the heuristic violations and their average severity.
The released version of the product had all heuristic violations rated 3 and above fixed. In this study, heuristic evaluation conducted by experts improved the usability of the product.
Performance-Based Evaluation Results
The 10 nurses on the panel successfully completed all 30 sessions of the performance evaluations. All nurses were able to use the device after a single training session with the instructor.
For each nurse, we took the median completion time from the 3 sessions, and then calculated the mean and standard deviation across the 10 nurses. On average, the nurses took 111 seconds (SD 30 seconds) to complete the simulated task. That is well within the 5 minutes overhead goal that we had set.
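The summary procedure described above (median per nurse, then mean and standard deviation across nurses) can be sketched as follows. The session times in the example are hypothetical and are not the study data, and the use of the sample standard deviation is an assumption, since the text does not specify which estimator was used.

```python
from statistics import mean, median, stdev

# Evaluation criterion from the text: less than 5 minutes of added overhead.
OVERHEAD_LIMIT_SECONDS = 5 * 60


def summarize_completion_times(times_by_nurse: dict) -> tuple:
    """Take each nurse's median session time, then the mean and sample SD across nurses."""
    per_nurse_medians = [median(times) for times in times_by_nurse.values()]
    return mean(per_nurse_medians), stdev(per_nurse_medians)


# Hypothetical completion times in seconds, 3 sessions per nurse.
sessions = {
    "nurse_a": [150, 100, 110],
    "nurse_b": [140, 120, 130],
    "nurse_c": [90, 95, 100],
}
average, spread = summarize_completion_times(sessions)
meets_goal = average < OVERHEAD_LIMIT_SECONDS
```

Taking the median within each nurse limits the influence of a single unusually slow session, such as a first attempt, on that nurse's contribution to the mean.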
The NASA Task Load Index results indicated that the work overhead on the nurses was low. In fact, most of the burden measures were consistent with zero. The only potentially significant burden was temporal demand, which is consistent with the primary use case of the tool. The tool was designed for the nurses to go over the symptom and vital signs checklists quickly, hence it exerts natural temporal pressure on its users.
|Heuristics violated||Place of occurrence||Severity||Usability problem description|
|Visibility of system status||Start||3.8||When syncing the application, there was no way to know if it will take 15 seconds or 10 minutes. It would be nice to know that it will take approximately 1 minute or show a percent completion.|
|Match between system and the real world||Outcome||3.4||List the outcomes as percentages instead of just a number without percentages.|
|User control and freedom||Checklist||4||The user should have the ability to change an answer once it has gone down to the list of answered questions. I can see frustration with the process if you have to completely start over to change an answer.|
|Consistency and standards||Outcome||1||Color code should be far apart along the visible spectrum so that the outcome can be clearly distinguished.|
|Error prevention||Checklist||4||Have the user confirmation when backing out of a screen that would cause the user to have to reenter all data.|
|Recognition rather than recall||Checklist||2||Abbreviations are used in the checklist. It should follow a simple primary rule.|
|Flexibility and efficiency of use||Checklist||3||If we add future triggers, there needs to be a way to ensure that when the keyboard displays that it does not cover the last triggers. Currently it is not a problem but should build this into system now.|
|Aesthetic and minimalist design||Outcome||3||There were too many "start over" displays currently. It would be simpler to have 1 button with a drop down screen listing the options: trigger, patient, or user. The questions also need to be reviewed by Dr. Finley and the RRT as currently there are a few questions that ask the same thing, but are just worded differently, and duplicating the questions is unnecessary.|
|Help user recognize, diagnose, and recover from errors||Start||4||When a user accidentally hit the home button on iPad, the system will close without any warning and all data will be lost. Restarting within 1 minute allows you to get back to where you were. Otherwise the program will close.|
|Documentation and help||Outcome||3||The outcomes are in different colors. I am not sure that the staff will know what the color-coding means. Define the color scheme.|
|Heuristics violated||Count of usability problem description|
|Aesthetic and minimalist design||4|
|Consistency and standards||10|
|Documentation and Help||13|
|Flexibility and efficiency of use||4|
|Help user recognize, diagnose, and recover from errors||12|
|Match between system and the real world||10|
|Recognition rather than recall||4|
|User control and freedom||8|
|Visibility of system status||12|
|Heuristics violated||Average of severity|
|Aesthetic and minimalist design||2.25|
|Consistency and standards||1.49|
|Documentation and Help||3.01|
|Flexibility and efficiency of use||2.88|
|Help user recognize, diagnose, and recover from errors||2.48|
|Match between system and the real world||2.50|
|Recognition rather than recall||2.20|
|User control and freedom||3.13|
|Visibility of system status||2.93|
We have demonstrated that the usability of the CDSS is suitable for nurses in hospital environments. However, the ultimate success of the CDSS tool depends on many factors beyond usability, such as training and culture. For the next phase of the project, we have received generous funding from the Center for Medicare and Medicaid Innovation and CHRISTUS Health System to deploy the CDSS in 17 acute and long-term care facilities in a 3-year clinical deployment. The direct measurement of FTR cases and preventable complications at the deployment sites will provide the ultimate validation of the efficacy of the tool in improving patient safety and hospital care.
In this paper, we discussed the UI design and evaluation of a new decision support tool for nurses. The system was designed to help nurses recognize and escalate early warning signs of patient deterioration in acute care settings. The system will be used by floor nurses to evaluate patients on a daily basis. It will automatically alert the RRT when probable diagnoses are reached.
Using the established cognitive design framework UFuRT as a guide, we were able to identify key requirements for the product, create a high-level functional specification, and then translate those functions into UI designs. During the implementation of the product, we performed heuristic evaluation to iteratively identify 83 usability issues, and fixed all issues rated as severe. These design and implementation approaches are applicable to many different types of software development projects.
After the product was developed, we validated the design by performing end user usability tests, including performance tests and NASA Task Load Index evaluation. The evaluation has shown that our design was functional and met the requirements demanded by the nurses’ tight schedules and heavy workloads.
UI design and implementation were critical factors contributing to successful deployment of the CDSS tools, but they were not the only factors. In follow-up research, we will deploy the solution in a working hospital environment, and evaluate the clinical outcome measures to determine the barriers and efficacy of the overall solution.
Conflicts of Interest
- Osheroff JA, Teich JM, Middleton B, Steen EB, Wright A, Detmer DE. A roadmap for national action on clinical decision support. J Am Med Inform Assoc 2007 Apr;14(2):141-145 [FREE Full text] [CrossRef] [Medline]
- Bryan C, Boren SA. The use and effectiveness of electronic clinical decision support tools in the ambulatory/primary care setting: a systematic review of the literature. Inform Prim Care 2008;16(2):79-91. [Medline]
- Centers for Medicare and Medicaid Services. 2011. Eligible Professional Meaningful Use Core Measures URL: http://www.cms.gov/Regulations-and-Guidance/Legislation/EHRIncentivePrograms/downloads/13ClinicalSummaries.pdf [accessed 2012-10-16] [WebCite Cache]
- Bolton LB, Gassert CA, Cipriano PF. Technology solutions can make nursing care safer and more efficient. J Healthc Inf Manag 2008;22(4):24-30. [Medline]
- Mack EH, Wheeler DS, Embi PJ. Clinical decision support systems in the pediatric intensive care unit. Pediatr Crit Care Med 2009 Jan;10(1):23-28. [CrossRef] [Medline]
- Ammenwerth E, Schnell-Inderst P, Machan C, Siebert U. The effect of electronic prescribing on medication errors and adverse drug events: a systematic review. J Am Med Inform Assoc 2008 Oct;15(5):585-600 [FREE Full text] [CrossRef] [Medline]
- Wolfstadt JI, Gurwitz JH, Field TS, Lee M, Kalkar S, Wu W, et al. The effect of computerized physician order entry with clinical decision support on the rates of adverse drug events: a systematic review. J Gen Intern Med 2008 Apr;23(4):451-458 [FREE Full text] [CrossRef] [Medline]
- Shojania KG, Jennings A, Mayhew A, Ramsay C, Eccles M, Grimshaw J. Effect of point-of-care computer reminders on physician behaviour: a systematic review. CMAJ 2010 Mar 23;182(5):E216-E225 [FREE Full text] [CrossRef] [Medline]
- Dowding D, Mitchell N, Randell R, Foster R, Lattimer V, Thompson C. Nurses' use of computerised clinical decision support systems: a case site analysis. J Clin Nurs 2009 Apr;18(8):1159-1167. [CrossRef] [Medline]
- Zhou X, Ackerman MS, Zheng K, Schoville R. A case study of CPOE adoption and use: work-arounds and their social-technical implications. In: AMIA Annu Symp Proc. 2008 Presented at: p. 1195.
- Koppel R, Wetterneck T, Telles JL, Karsh BT. Workarounds to barcode medication administration systems: their occurrences, causes, and threats to patient safety. J Am Med Inform Assoc 2008 Aug;15(4):408-423 [FREE Full text] [CrossRef] [Medline]
- Shah NR, Seger AC, Seger DL, Fiskio JM, Kuperman GJ, Blumenfeld B, et al. Improving acceptance of computerized prescribing alerts in ambulatory care. J Am Med Inform Assoc 2006 Feb;13(1):5-11 [FREE Full text] [CrossRef] [Medline]
- Judge J, Field TS, DeFlorio M, Laprino J, Auger J, Rochon P, et al. Prescribers' responses to alerts during medication ordering in the long term care setting. J Am Med Inform Assoc 2006 Aug;13(4):385-390 [FREE Full text] [CrossRef] [Medline]
- Berner ES, Webster GD, Shugerman AA, Jackson JR, Algina J, Baker AL, et al. Performance of four computer-based diagnostic systems. N Engl J Med 1994 Jun 23;330(25):1792-1796. [CrossRef] [Medline]
- Graber ML, Mathew A. Performance of a web-based clinical diagnosis support system for internists. J Gen Intern Med 2008 Jan;23 Suppl 1:37-40 [FREE Full text] [CrossRef] [Medline]
- Kalisch BJ, Aebersold M. Interruptions and multitasking in nursing care. Jt Comm J Qual Patient Saf 2010 Mar;36(3):126-132. [Medline]
- Reed K, May R. HealthGrades. 2011 Mar. HealthGrades Patient Safety in American Hospitals Study URL: https://www.cpmhealthgrades.com/CPM/assets/File/HealthGradesPatientSafetyInAmericanHospitalsStudy2011.pdf [accessed 2012-10-16] [WebCite Cache]
- Centers for Medicare & Medicaid Services. 2011. Overview Acute Inpatient PPS URL: https://www.cms.gov/Medicare/Medicare-Fee-for-Service-Payment/AcuteInpatientPPS/index.html?redirect=/AcuteInpatientPPS/ [accessed 2012-10-16] [WebCite Cache]
- Tait D. Nursing recognition and response to signs of clinical deterioration. Nurs Manag (Harrow) 2010 Oct;17(6):31-35. [Medline]
- Sandroni C, Ferro G, Santangelo S, Tortora F, Mistura L, Cavallaro F, et al. In-hospital cardiac arrest: survival depends mainly on the effectiveness of the emergency response. Resuscitation 2004 Sep;62(3):291-297. [CrossRef] [Medline]
- Friese CR, Aiken LH. Failure to rescue in the surgical oncology population: implications for nursing and quality improvement. Oncol Nurs Forum 2008 Sep;35(5):779-785 [FREE Full text] [CrossRef] [Medline]
- Hodgetts TJ, Kenward G, Vlachonikolis IG, Payne S, Castle N. The identification of risk factors for cardiac arrest and formulation of activation criteria to alert a medical emergency team. Resuscitation 2002 Aug;54(2):125-131. [Medline]
- Funk D, Sebat F, Kumar A. A systems approach to the early recognition and rapid administration of best practice therapy in sepsis and septic shock. Curr Opin Crit Care 2009 Aug;15(4):301-307. [CrossRef] [Medline]
- Classen JL. Is failure to rescue really failure to communicate? Champion the move from reactive process to proactive model. Nurs Manage 2010 Jul;41(7):38-41. [CrossRef] [Medline]
- Chen J, Bellomo R, Flabouris A, Hillman K, Finfer S, MERIT Study Investigators for the Simpson Centre, ANZICS Clinical Trials Group. The relationship between early emergency team calls and serious adverse events. Crit Care Med 2009 Jan;37(1):148-153. [CrossRef] [Medline]
- Schmid A, Hoffman L, Happ MB, Wolf GA, DeVita M. Failure to rescue: a literature review. J Nurs Adm 2007 Apr;37(4):188-198. [CrossRef] [Medline]
- Berwick DM, Calkins DR, McCannon CJ, Hackbarth AD. The 100,000 lives campaign: setting a goal and a deadline for improving health care quality. JAMA 2006 Jan 18;295(3):324-327. [CrossRef] [Medline]
- Goldfarb M, Cavaretta M. Surgical resident bedside rescue successes. J Surg Educ 2010 Apr;67(2):95-98. [CrossRef] [Medline]
- Surani S, Murphy J, Shah A. Sleepy nurses: are we willing to accept the challenge today? Nurs Adm Q 2007 Jun;31(2):146-151. [CrossRef] [Medline]
- Graves K, Simmons D. Reexamining fatigue: implications for nursing practice. Crit Care Nurs Q 2009 Jun;32(2):112-115. [CrossRef] [Medline]
- Kane RL, Shamliyan TA, Mueller C, Duval S, Wilt TJ. The association of registered nurse staffing levels and patient outcomes: systematic review and meta-analysis. Med Care 2007 Dec;45(12):1195-1204. [CrossRef] [Medline]
- Aiken LH, Clarke SP, Cheung RB, Sloane DM, Silber JH. Educational levels of hospital nurses and surgical patient mortality. JAMA 2003 Sep 24;290(12):1617-1623 [FREE Full text] [CrossRef] [Medline]
- Bagshaw SM, Mondor EE, Scouten C, Montgomery C, Slater-MacLean L, Jones DA, Capital Health Medical Emergency Team Investigators. A survey of nurses' beliefs about the medical emergency team system in a canadian tertiary hospital. Am J Crit Care 2010 Jan;19(1):74-83 [FREE Full text] [CrossRef] [Medline]
- Cziraki K, Lucas J, Rogers T, Page L, Zimmerman R, Hauer LA, et al. Communication and relationship skills for rapid response teams at hamilton health sciences. Healthc Q 2008;11(3 Spec No):66-71 [FREE Full text] [Medline]
- Jones CM, Bleyer AJ, Petree B. Evolution of a rapid response system from voluntary to mandatory activation. Jt Comm J Qual Patient Saf 2010 Jun;36(6):266-70, 241. [Medline]
- Leach LS, Mayo A, O'Rourke M. How RNs rescue patients: a qualitative study of RNs' perceived involvement in rapid response teams. Qual Saf Health Care 2010 Oct;19(5).
- Stead K, Kumar S, Schultz TJ, Tiver S, Pirone CJ, Adams RJ, et al. Teams communicating through STEPPS. Med J Aust 2009 Jun 1;190(11 Suppl):S128-S132. [Medline]
- Mackintosh N, Sandall J. Overcoming gendered and professional hierarchies in order to facilitate escalation of care in emergency situations: the role of standardised communication protocols. Soc Sci Med 2010 Nov;71(9):1683-1686. [CrossRef] [Medline]
- Card SK, Moran TP, Newell A. The Psychology of Human-Computer Interaction. Hillsdale, NJ: Lawrence Erlbaum Associates; 1983.
- Schrepp M, Fischer P. GOMS models to evaluate the efficiency of keyboard navigation in web units. Eminds-International Journal of Human Computer Interaction 2007;1(2):33-46.
- Gray WD, John BE, Atwood ME. The Precis of Project Ernestine or an overview of a validation of GOMS. 1992 Presented at: SIGCHI conference on Human factors in computing systems; 1992; Monterey, California, US.
- Simon HA, Hayes JR. The understanding process: Problem isomorphs. Cognitive Psychology 1976;8:165.
- Marr D. Vision. In: Vision. San Francisco, CA: W. H. Freeman; 1982.
- Zhang J, Norman DA. Representations in distributed cognitive tasks. Cognitive Science 1994;18:87-122.
- Zhang J, Norman DA. A representational analysis of numeration systems. Cognition 1995 Dec;57(3):271-295. [Medline]
- Wright P, Fields R, Harrison M. Analyzing Human-Computer Interaction as distributed cognition: The resources model. Human Computer Interaction 2000;15(1):1.
- Eggleston RG. Work-centered design: a cognitive engineering approach to system design. 2003 Presented at: Human factors and ergonomics society 47th annual meeting; 2003; Denver, Colorado, US p. 263.
- Butler K, Zhang J, Esposito C, Bahrami A, Hebron R, Kieras D. Work-centered design: A case study of a mixed initiative scheduler. 2007 Presented at: CHI; 2007; San Jose, California, US.
- Zhang J, Butler K. UFuRT: A work-centered framework and process for design and evaluation of information systems. In: Proceedings of HCI International. 2007 Presented at: HCI International; 2007; Beijing, China.
- Zhang J, Patel VL, Johnson KA, Malin J, Smith JW. Designing human-centered distributed information systems. IEEE Intelligent Systems 2002;17(5):42-47.
- Butler K, Zhang J, Esposito C, Bahrami A, Hebron R, Kieras D. Work-centered design: A case study of a mixed initiative scheduler. 2007 Presented at: CHI; 2007; San Jose, California, US p. 747-756.
- Nahm M, Zhang J. Operationalization of the UFuRT methodology for usability analysis in the clinical research data management domain. J Biomed Inform 2009 Apr;42(2):327-333 [FREE Full text] [CrossRef] [Medline]
- Saitwal H, Feng X, Walji M, Patel V, Zhang J. Assessing performance of an Electronic Health Record (EHR) using Cognitive Task Analysis. Int J Med Inform 2010 Jul;79(7):501-506. [CrossRef] [Medline]
- Zhang Z, Walji M, Patel VL, Gimbel R, Zhang J. Functional Analysis of Interfaces in US Military Electronic Health Record System using UFuRT Framework. 2009 Presented at: AMIA; 2009; San Francisco, California, US.
- Zhang J, Norman DA. Representations in distributed cognitive tasks. Cognitive Science 1994;18:87-122.
- Davis FD. Perceived usefulness, perceived ease of use, and user acceptance of information technology. MIS Quarterly 1989;13(3):319-340.
- Amundson DE, Seda G. Look out doctor, you may be getting framed: heuristics in medical decision-making. Mil Med 2008;173(9):2-5.
- Graber ML, Franklin N, Gordon R. Diagnostic error in internal medicine. Arch Intern Med 2005 Jul 11;165(13):1493-1499. [CrossRef] [Medline]
- Weaver SJ, Rosen MA, DiazGranados D, Lazzara EH, Lyons R, Salas E, et al. Does teamwork improve performance in the operating room? A multilevel evaluation. Jt Comm J Qual Patient Saf 2010 Mar;36(3):133-142. [Medline]
- Barnett GO, Cimino JJ, Hupp JA, Hoffer EP. DXplain. An evolving diagnostic decision-support system. JAMA 1987 Jul 3;258(1):67-74. [Medline]
- Hales B, Terblanche M, Fowler R, Sibbald W. Development of medical checklists for improved quality of patient care. Int J Qual Health Care 2008 Feb;20(1):22-30.
- Pronovost P, Needham D, Berenholtz S, Sinopoli D, Chu H, Cosgrove S, et al. An intervention to decrease catheter-related bloodstream infections in the ICU. N Engl J Med 2006 Dec 28;355(26):2725-2732.
- Ely JW, Graber ML, Croskerry P. Checklists to Reduce Diagnostic Errors. Acad Med 2011 Jan 18.
- Winters BD, Gurses AP, Lehmann H, Sexton JB, Rampersad CJ, Pronovost PJ. Clinical review: checklists - translating evidence into practice. Crit Care 2009;13(6):210.
- Nielsen J, Molich R. Heuristic evaluation of user interfaces. 1990 Presented at: ACM CHI; 1990 p. 249-256.
- Nielsen J, Mack R. Usability inspection methods. In: Usability inspection methods. New York: Wiley; 1994.
- Nielsen J. Usability engineering. In: Usability engineering. Boston: AP Professional; 1994.
- Zhang J, Johnson TR, Patel VL, Paige DL, Kubose T. Using usability heuristics to evaluate patient safety of medical devices. J Biomed Inform 2003;36(1-2):23-30. [Medline]
- Nielsen J. Finding usability problems through heuristic evaluation. 1992 May Presented at: ACM CHI Conference; 1992; Monterey, CA, US p. 373-380.
- Sox CM, Gribbons WM, Loring BA, Mandl KD, Batista R, Porter SC. Patient-centered design of an information management module for a personally controlled health record. J Med Internet Res 2010;12(3):e36 [FREE Full text] [CrossRef] [Medline]
- Graham MJ, Kubose TK, Jordan D, Zhang J, Johnson TR, Patel VL. Heuristic evaluation of infusion pumps: implications for patient safety in Intensive Care Units. Int J Med Inform 2004 Nov;73(11-12):771-779. [CrossRef] [Medline]
- NASA. NASA Task Load Index (TLX) Version 1.0 User's Guide. Moffett Field, CA: NASA Ames Research Center; 1985.
- Sox CM, Gribbons WM, Loring BA, Mandl KD, Batista R, Porter SC. Patient-centered design of an information management module for a personally controlled health record. J Med Internet Res 2010;12(3):e36 [FREE Full text] [CrossRef] [Medline]
|CDSS: clinical decision support systems|
|CMS: content management system|
|EMR: electronic medical record|
|FTR: failure to rescue|
|IT: information technology|
|NASA: National Aeronautics and Space Administration|
|RRT: rapid response team|
|UFuRT: user, function, representation, and task analysis|
|UI: user interface|
|WCD: work-centered design|
Edited by G Eysenbach; submitted 16.10.12; peer-reviewed by O Anya; comments to author 09.11.12; revised version received 10.12.12; accepted 29.12.12; published 31.01.13.
©Michael Juntao Yuan, George Mike Finley, Ju Long, Christy Mills, Ron Kim Johnson. Originally published in the Interactive Journal of Medical Research (http://www.i-jmr.org/), 31.01.2013.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Interactive Journal of Medical Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.i-jmr.org/, as well as this copyright and license information must be included.