Flores, Diasio, and Saif:

Introduction

The quality of colon cleansing is a key determinant of colonoscopy quality as it relates to polyp detection rates, complete examinations and overall efficiency. Inadequate preparations lead to lower adenoma detection rates (ADRs), longer insertion and withdrawal times, repeat colonoscopy, and increased costs [1-3]. Thus, it is intuitively assumed that the better a colon preparation is, the higher the ADR will be [4]. More recently, however, a few studies have shown that the quality of colon preparation is not necessarily associated with the ADR [5-7].

The Boston bowel preparation scale (BBPS) is a widely used bowel cleanliness scoring system. Each colon segment—right, transverse and left colon—is scored from 0-3, after washing the colon wall during the procedure. The overall score is calculated by adding scores from all three segments, with 9 being the highest score, representing an excellent prep. The BBPS is a validated prep scoring system that has been shown to have good intra- and interobserver reliability [8-10]. Furthermore, a BPPS segment score of less than 2 has been associated with missed lesions [11].

The impact of the BBPS and ADR at each individual colon segment has not been previously addressed for all BBPS scores. In this study, we used the total and segmental BBPS scores to analyze the association between the quality of the preparation and the corresponding ADR, serrated polyp detection rate (SDR), advanced adenoma detection rate (AADR), and polyp detection rate (PDR).

The primary objective of this study was to determine the association between the segmental or overall ADR and SDR and the segmental and total BBPS scores. In addition, we sought to determine the association between the segmental or overall AADR and PDR and the segmental and total BBPS scores.

Patients and methods

We performed a retrospective review of all screening colonoscopies performed at an academic center by 21 board-certified gastroenterologists, with and without trainees, from January to December 2013. The study was approved by the Mayo Clinic Institutional Review Board.

We collected demographic information, including patients’ sex, age, body mass index (BMI), and diabetes mellitus status. We excluded all incomplete colonoscopies, i.e., colonoscopies without cecal intubation. We have been using the BBPS scoring system for colon preparation at our institution since 2011, and all endoscopists at our institution know how to use this scoring system. In addition, pictorial description of each BBPS score is provided in each room in our endoscopy suite. We excluded colonoscopies without a BBPS score. For each colonoscopy, we collected the BBPS per segment (right colon, transverse and left colon segments) and the number of polyps removed from each segment. We calculated detection rates for each total BBPS value (scores 1-9). In addition, we also assigned categories to total BBPS score, as follows: poor (1-4), fair (5), good (6-7) and excellent (8-9). BBPS segment scores were assigned as poor (0), fair (1), good (2), and excellent (3). As the number of segmental observations with poor scores was limited, we combined poor and fair bowel preparations.

We defined the right colon as the cecum, ascending colon, and hepatic flexure. The left colon was defined as the splenic flexure, descending colon, sigmoid colon and rectum. We recorded the time of cecal intubation and the time at withdrawal. Withdrawal time was calculated as the difference between the time to cecal intubation from the time of scope removal. All pathology reports from each colonoscopy were reviewed. The type and size of polyps identified from each colon segment was recorded. Advanced adenoma was defined as polyps ≥10 mm, villous or tubulovillous adenomas, or polyps with high grade dysplasia or carcinomas. Serrated polyps included traditional serrated adenomas, sessile serrated adenomas, and hyperplastic polyps, following the World Health Organization’s classification of tumors of the colon and rectum [12]. SDR was calculated with and without inclusion of hyperplastic polyps.

ADR was calculated as the number of colonoscopies with at least one adenoma divided by the total number of colonoscopies. SDR was calculated as the number of colonoscopies with at least one serrated polyp divided by the total number of colonoscopies. AADR was calculated as the number of colonoscopies with at least one advanced adenoma divided by the total number of colonoscopies. PDR was calculated as the number of colonoscopies with at least one polyp divided by the total number of colonoscopies. Overall and segmental ADR, SDR, AADR, and PDR were calculated by total and segmental prep scores and by location.

Statistical analysis

Exact binomial confidence intervals were computed for ADR, SDR, AADR, and PDR at the 95% level of confidence. Chi-squared tests were used to individually analyze the rates across bowel prep categories. Logistic regression was used to individually analyze the detection rates of adenomas, serrated polyps, advanced adenomas and all polyps across bowel prep scores using BBPS as a continuous variable. In addition, the logistic regression analyses were repeated to control, for age, sex, BMI, diabetes status, and withdrawal time. Wald confidence intervals for the odds ratios (OR) were calculated at the 95% confidence level. Interquartile range (IQR) was calculated as the difference between the 75th percentile and the 25th percentile. A Kruskal-Wallis test was implemented to analyze the difference in withdrawal time between bowel prep categories. To correct for multiple testing, the unadjusted P-values were calculated along with P-values that have been adjusted to reflect a false discovery rate of 0.05. P-values less than 0.05 were considered statistically significant; all analyses were performed using SAS 9.4 (SAS Institute Inc.; Cary, NC).

Results

There were 2203 complete colonoscopies. Colonoscopies without a prep description (n=20) and those without a BBPS score (n=192) were excluded. A total of 1991 patients were included in this study: 1022 (51%) were female, 969 (49%) were male. The overall mean age (±SD) was 59±8.9 years (males 59.03±9.2; females 58.97±13.4). Using the total BBPS scores, 134 colonoscopies (6.7%) had a poor prep, 272 colonoscopies (13.7%) had a fair prep, 1033 (51.9%) had a good prep and 552 (27.7%) had an excellent prep; 1 patient had a total BBPS of 1 (0.05%), 4 patients had a total BBPS of 2 (0.2%), 24 patients had a total BBPS of 3 (1.21%), 105 patients had a BBPS of 4 (5.27%), 272 patients had a total BBPS of 5 (13.66%), 567 patients had a total BBPS of 6 (28.48%), 466 patients had a total of BBPS of 7 (23.41%), 339 patients had a total BBPS of 8 (17.03%), and 213 patients had a total BBPS of 9 (10.7%). The overall ADR was 37.5% (95% confidence interval [CI] 35.3-39.6), which is 12.5% above the current target ADR benchmark.

Table 1 shows that there is a significant decrease in the ADR, SDR (with hyperplastic polyps) and PDR amongst bowel preparation scores using BBPS as a continuous variate, as the bowel prep score increased. Fig. 1 shows that BBPS scores of 5, 6 and 7 had higher ADRs than scores of 8 and 9. There was a high detection rate of 50% in patients with a total BBPS score of 2, limited by the very small sample size of 4 colonoscopies with a total BBPS of 2 (0.2%). Table 2 shows that there was a significant difference in the ADR, SDR (with hyperplastic polyps), and PDR, with a score of excellent having a significantly lower detection rate than the other bowel preparation categories (good and poor/fair). The estimates of ADR, AADR, and SDR (without hyperplastic polyps) were highest for prep scores of good, while the estimates of PDR and SDR (with hyperplastic polyps) were highest for prep scores of poor/fair.

Table 1 Overall adenoma detection rate (ADR), advanced adenoma detection rate (AADR), serrated polyp detection rate (SDR), and polyp detection rate (PDR) by bowel prep score

Figure 1 Boston bowel preparation scale (BBPS) as a continuous variable with adenoma detection rate (ADR) n=1990, as the outlier observation was removed (BBPS=1). ADR weighted by frequency of the BBPS

Table 2 Overall adenoma detection rate (ADR), advanced adenoma detection rate (AADR), serrated polyp detection rate (SDR), and polyp detection rate (PDR) by bowel prep categories

Tables 3 and 4 show the AADR, ADR and SDR (with and without hyperplastic polyps) for each colon segment. Table 3 shows the odds ratios for each colon segment using BBPS as a continuous variable, with a significant decrease in the odds of an adenoma being detected in the right colon (OR 0.79 [CI 0.66-0.94), P=0.04), a significant decrease in the odds of a serrated polyp with hyperplastic polyps being detected in the left colon (OR 0.78, [CI 0.65-0.92] P=0.019) and a significant decrease in the odds of an advanced adenoma being detected in the transverse colon (OR 0.37 [CI 0.17-0.8), P=0.04) for every unit increase in BBPS. Table 4 shows that there was a significant difference in the ADR in the right colon, with the ADR dropping significantly with an excellent bowel prep score (P=0.045). There was a significant difference in the SDR (with hyperplastic polyps) in the left colon; the SDR again dropped significantly, with an excellent prep score having the lowest estimated detection rate (P=0.041).

Table 3 Advanced adenoma detection rate (AADR), adenoma detection rate (ADR) and serrated polyp detection rate (SDR) by colon segment and by bowel prep score

Table 4 Advanced adenoma detection rate (AADR), adenoma detection rate (ADR) and serrated polyp detection rate (SDR) by colon segment and by segmental bowel prep score

In Tables 5 and 6, we used logistic regression to adjust for withdrawal time, age, BMI, presence of diabetes mellitus, and sex, using BBPS as a continuous variable. Our results remained consistent, with detection rates of ADR, SDR (with hyperplastic polyps), and PDR decreasing as the total BBPS score increased.

Table 5 Logistic regression model1,3 results for total Boston bowel preparation scale scores and overall advanced adenoma detection rate (AADR), adenoma detection rate (ADR), serrated polyp (with hyperplastic polyps) detection rate (SDR w/ HP), serrated polyp (without hyperplastic polyps) detection rate (SDR w/o HP), and polyp detection rate (PDR) (n = 1965)

Table 6 Logistic regression model1,3 results for advanced adenoma detection rate (AADR), adenoma detection rate (ADR) and serrated polyp, including hyperplastic polyps, detection rate (SDR) by colon preparation and colon segment (n = 1965)

The poor/fair prep score category was associated with a median withdrawal time of 12 min (range 2-42, IQR 7 min), good was 12 min (range 3-65, IQR 6 min) and excellent was 10 min (range 3-74, IQR 5 min). To further support our findings, we compared withdrawal times between bowel prep score categories of poor/fair, good and excellent and found that withdrawal time differed significantly (Kruskal-Wallis, P<0.001). The withdrawal time did not differ significantly between prep scores of good and poor/fair (P=0.295), but was significantly different between prep scores of excellent and good (P<0.0001) and between excellent and poor/fair (Wilcoxon-Mann-Whitney, P=0.005). There was also a significant association between ADR and withdrawal time. Colonoscopies with an adenoma detected had a significantly higher median withdrawal time of 14 min (range 2-74, IQR 7 min) compared to examinations without an adenoma detected, where the median withdrawal time was 10 min (range 2-37, IQR 5 min; Wilcoxon-Mann-Whitney, P<0.001). Overall (for ADR, SDR with hyperplastic polyps and PDR), BBPS scores of poor/fair and good had significantly higher withdrawal times than an excellent prep. Finally, we also analyzed the effect of our observation on colonoscopies performed by trainees at our institution. We found no significant difference in the detection rates of AADR (chi-square P=0.44), ADR (chi-square P=0.48), SDR with hyperplastic polyps (chi-square P=0.85), SDR without hyperplastic polyps (chi-square P=0.98), or PDR (chi-square P=0.21) when trainees were or were not included.

Discussion

Our study compared the BBPS with the ADR, SDR, AADR and PDR for all colon segments. We used total BBPS scores as a continuous variable with total scores from 0 to 9, and we also compared prep categories of poor/fair, good and excellent using the overall BBPS. Because of the limited number of observations with a poor segmental BBPS (segmental BBPS score of 0), we combined BBPS scores equivalent to poor and fair together (BBPS scores of <6) comparing them to scores of good and excellent (BBPS scores ≥6). This was based on the Calderwood et al study regarding the adequacy of BBPS in predicting endoscopists’ behavior for follow-up intervals, where a total BBPS score ≥6 provided a standardized definition for adequate 10-year follow up [9]. In addition, a recent study showed that a segmental score of 1 has a significantly higher rate of missed adenomas compared with segmental scores of 2 or 3 [11].

We found lower detection rates of adenomas, sessile serrated polyps, advanced adenomas and polyps across all bowel preparation scores as the prep score increased. A score of excellent had lower detection rates for these lesions than a score of good or poor/fair in all colon segments. We also found that a score of excellent in the right colon had a significantly lower ADR than the two other bowel preparation categories. Our results remained consistent even when we adjusted for withdrawal time, age, BMI, presence of diabetes, and sex. The addition of hyperplastic polyps to the serrated polyp detection rate was significant amongst bowel prep classes but was not significant when hyperplastic polyps were removed from the SDR calculation. This is probably because there were more hyperplastic polyps than serrated adenomas. When poor and fair prep score categories were separated out, our results remained consistent (Supplemental Table 1). Furthermore, as a measure of the total polyps per colonoscopy, the relative polyp rate per colonoscopy (RPR) was also analyzed using the poor/fair, good and excellent categories and adjusted for withdrawal time (Supplemental Table 2) with a higher RPR for poor/fair preps compared with good preps (RPR 0.80, 95%CI 0.70-0.91; P<0.001) and excellent preps (RPR 0.71, 95%CI 0.61-0.83; P<0.001).

It is well known that adenomatous polyps are precursors of colorectal cancer (CRC). The detection and removal of colon polyps has been shown to decrease the incidence of CRC [13-15]. The ADR is an important predictor of interval CRC and is an important quality metric with a current benchmark of above 25% [13,14-16]. Higher ADRs are associated with a reduced risk of both proximal and distal cancers in men and women [13]. Bowel preparation quality is also another important colonoscopy quality indicator. Poor bowel preparation has been associated with prolonged procedure times, a higher risk of complications and incomplete procedures, and has been shown to reduce the detection of colonic lesions [17-18]. One study reported an adenoma miss rate of 42% for colonoscopies with suboptimal preparation repeated within three years after the index colonoscopy [17]. Another prospective observational study found that colonoscopies with a segmental BBPS of 1 have a higher miss rate of adenomas larger than 5 mm compared with segmental scores of 2 or 3 [11]. Current guidelines recommend that bowel preparation must be adequate in at least 85% of screening colonoscopies to allow the use of recommended surveillance or screening intervals [13]. The BBPS has been established as a validated preparation score for standardizing the definition of the level of cleanliness amongst endoscopists [9]. Unlike the Aronchick or Ottawa Bowl Prep scores, which assess the quality of the bowel prep prior to cleaning and suctioning, the BBPS has been shown to have excellent intra- and interobserver reliability for each colonic segment after cleaning [19]. While bowel preparation is an important determinant of the quality of a colonoscopy, it has become apparent that the relationship between quality and the detection of lesions in clinical practice is more complex. Several studies have shown that there is not necessarily an association between bowel preparation quality and ADR [5-7,20-22].

Clark et al [5], using the Aronchick bowel preparation scale for standardization, performed a meta-analysis and systematic review of the impact of bowel preparation on adenoma detection and found that ADRs were higher in either excellent/good or fair quality prep compared to low quality preps. However, there was no significant difference between intermediate quality (fair) preps and high quality (excellent/good) preps. In addition, there was no difference in the ADRs between good and excellent preps. Another study using the New Hampshire Colonoscopy registry with descriptive prep quality scores found no difference in the total or proximal ADR or the SDR between excellent/good and fair preps [7]. In addition, although Sherer et al found poor preps had a lower detection rate of diminutive adenoma (≤5 mm) and advanced adenomas, there was no statistical difference in the ADRs of diminutive, small (5-9 mm) and advanced adenomas in colonoscopies with fair preps versus good/excellent preps [20]. Another study using the Aronchick scale found that, while fair and poor preparations had significantly lower colonoscopy completion rates, there was no difference in the ADR between bowel preparations. Colonoscopies with fair preparation were more likely to find at least one polyp than those with excellent or poor preps [21]. Finally, Calderwood et al used the BBPS to categorize bowel cleanliness and found that ADR and PDR decreased as the degree of bowel cleanliness increased from good to excellent [6]. These authors found that a segment score of 2 compared with 3 was associated with higher PDR and ADR in all segments of the colon [6].

Two prior studies had found no difference in the SDR in the proximal colon amongst a preparation score of poor, fair and excellent/good [7,22]. In our study, the overall SDR was higher with poor/fair scores than in other prep scores; and in the left colon, a score of poor/fair had a significantly higher SDR (when hyperplastic polyps were included) compared with the other two bowel preparation categories.

There are several reasons why an excellent prep may have a lower rate of detection of colonic lesions [6,7]. One reason is that residual stool may help the endoscopist pay attention to the mucosa and thus identify lesions, which may otherwise not be seen. In the case of serrated polyps, residual stool may attach to the mucus caps and make these polyps more conspicuous in less clean colons. In addition, it is possible that the quality of inspection decreases at the best level of bowel preparation because the endoscopist has a false sense of confidence. Another theory is that endoscopists may rate a colon preparation as excellent when no polyps are found [6].

Although several studies have previously reported a non-linear relationship with prep quality [5-7], to the best of our knowledge, this is the first study to look at the effect of bowel preparation on detection rates in each colon segment using the BBPS. There are other strengths of our study. We used a prep scoring system, the BBPS, which has been validated with consistent intra- and interobserver variability. In addition, we had the pathological data available for each colonoscopy in a single tertiary institution with uniform agreement amongst pathologists on the definition of sessile serrated polyps. Our results remained consistent using the BBPS scores categorized as poor/fair (BBPS score of 0-5), good (BBPS scores of 6 and 7) and excellent (BBPS scores of 8 and 9), as well as using the total BBPS as a continuous variable. We adjusted our results using withdrawal time, age, BMI, presence of diabetes mellitus and sex, and our results remained consistent, with detection rates of adenomas and polyps decreasing as BBPS scores increased. Our results demonstrate generalizability, in that some recent studies have provided similar messages, also suggesting that ADR does not necessarily increase as the colon prep quality increases. It is important to note that this phenomenon is observed despite different bowel preparation scoring methods. We acknowledge the limitations of this single-center study. Other potential limitations of our study are its retrospective nature and small sample size compared with similar studies. We also acknowledge that there is only a small difference in overall ADR and PDR across bowel prep scores; nevertheless, the change was statistically significant. In addition, we did not collect data on any techniques, such as water infusion or image enhancement techniques that may have been used by endoscopists to improve adenoma detection.

Our findings have implications for screening and surveillance colonoscopies. In patients with excellent preps, care must be taken to meticulously inspect the mucosa. Colonoscopists must not be falsely reassured that an adequate inspection has occurred simply because of a high quality colonic preparation. Our findings reinforce the need for meticulous inspection regardless of the prep quality.

In conclusion, using the BBPS score, we have shown that the ADR, SDR with hyperplastic polyps, and PDR significantly decrease as the bowel preparation quality increases.

Summary Box

What is already known:

Inadequate colon preparation for colonoscopies leads to a lower adenoma detection rate

What the new findings are:

Using the Boston bowel preparation scale score, we show that the adenoma detection rate, serrated polyp detection rate with hyperplastic polyps, and polyp detection rate significantly decrease as the bowel preparation quality increases
A possible explanation for this unexpected discrepancy may be related to longer and better visualization of the mucosa when cleansing and suctioning is necessary

Is the level of cleanliness using segmental Boston bowel preparation scale associated with a higher adenoma detection rate?

Abstract

Introduction

Patients and methods

Statistical analysis

Results

Discussion

References