Whole genome sequencing of Streptococcus pneumoniae: development, evaluation and verification of targets for serogroup and serotype prediction using an automated pipeline

نویسندگان

  • Georgia Kapatai
  • Carmen L. Sheppard
  • Ali Al-Shahib
  • David J. Litt
  • Anthony P. Underwood
  • Timothy G. Harrison
  • Norman K. Fry
چکیده

Streptococcus pneumoniae typically express one of 92 serologically distinct capsule polysaccharide (cps) types (serotypes). Some of these serotypes are closely related to each other; using the commercially available typing antisera, these are assigned to common serogroups containing types that show cross-reactivity. In this serotyping scheme, factor antisera are used to allocate serotypes within a serogroup, based on patterns of reactions. This serotyping method is technically demanding, requires considerable experience and the reading of the results can be subjective. This study describes the analysis of the S. pneumoniae capsular operon genetic sequence to determine serotype distinguishing features and the development, evaluation and verification of an automated whole genome sequence (WGS)-based serotyping bioinformatics tool, PneumoCaT (Pneumococcal Capsule Typing). Initially, WGS data from 871 S. pneumoniae isolates were mapped to reference cps locus sequences for the 92 serotypes. Thirty-two of 92 serotypes could be unambiguously identified based on sequence similarities within the cps operon. The remaining 60 were allocated to one of 20 'genogroups' that broadly correspond to the immunologically defined serogroups. By comparing the cps reference sequences for each genogroup, unique molecular differences were determined for serotypes within 18 of the 20 genogroups and verified using the set of 871 isolates. This information was used to design a decision-tree style algorithm within the PneumoCaT bioinformatics tool to predict to serotype level for 89/94 (92 + 2 molecular types/subtypes) from WGS data and to serogroup level for serogroups 24 and 32, which currently comprise 2.1% of UK referred, invasive isolates submitted to the National Reference Laboratory (NRL), Public Health England (June 2014-July 2015). PneumoCaT was evaluated with an internal validation set of 2065 UK isolates covering 72/92 serotypes, including 19 non-typeable isolates and an external validation set of 2964 isolates from Thailand (n = 2,531), USA (n = 181) and Iceland (n = 252). PneumoCaT was able to predict serotype in 99.1% of the typeable UK isolates and in 99.0% of the non-UK isolates. Concordance was evaluated in UK isolates where further investigation was possible; in 91.5% of the cases the predicted capsular type was concordant with the serologically derived serotype. Following retesting, concordance increased to 99.3% and in most resolved cases (97.8%; 135/138) discordance was shown to be caused by errors in original serotyping. Replicate testing demonstrated that PneumoCaT gave 100% reproducibility of the predicted serotype result. In summary, we have developed a WGS-based serotyping method that can predict capsular type to serotype level for 89/94 serotypes and to serogroup level for the remaining four. This approach could be integrated into routine typing workflows in reference laboratories, reducing the need for phenotypic immunological testing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Whole Genome Sequencing of 39 Invasive Streptococcus pneumoniae Sequence Type 199 Isolates Revealed Switches from Serotype 19A to 15B

Streptococcus pneumoniae is a major pathogen that causes different invasive pneumococcal diseases (IPD). The pneumococcal polysaccharide capsule is a main virulence factor. More than 94 capsule types have been described, but only a limited number of capsule types accounted for the majority of IPD cases before the introduction of pneumococcal vaccines. After the introduction of the conjugated pn...

متن کامل

Comparison of sequential multiplex PCR, sequetyping and whole genome sequencing for serotyping of Streptococcus pneumoniae

Streptococcus pneumoniae is one of the major causes of pneumonia, meningitis and other pneumococcal infections in young children and elders. Determination of circulating S. pneumoniae serotypes is an essential service by public health laboratories for the monitoring of putative serotype replacement following the introduction of pneumococcal conjugate vaccines (PCVs) and of the efficacy of the i...

متن کامل

Serotyping of Streptococcus pneumoniae Based on Capsular Genes Polymorphisms

Streptococcus pneumoniae serotype epidemiology is essential since serotype replacement is a concern when introducing new polysaccharide-conjugate vaccines. A novel PCR-based automated microarray assay was developed to assist in the tracking of the serotypes. Autolysin, pneumolysin and eight genes located in the capsular operon were amplified using multiplex PCR. This step was followed by a tagg...

متن کامل

Direct detection and prediction of all pneumococcal serogroups by target enrichment-based next-generation sequencing.

Despite the availability of standard methods for pneumococcal serotyping, there is room for improvement in the available methods, in terms of throughput, multiplexing capacity, and the number of serotypes identified. We describe a target enrichment-based next-generation sequencing method applied to nasopharyngeal samples for direct detection and serogroup prediction of all known serotypes of St...

متن کامل

Selective and Genetic Constraints on Pneumococcal Serotype Switching

Streptococcus pneumoniae isolates typically express one of over 90 immunologically distinguishable polysaccharide capsules (serotypes), which can be classified into "serogroups" based on cross-reactivity with certain antibodies. Pneumococci can alter their serotype through recombinations affecting the capsule polysaccharide synthesis (cps) locus. Twenty such "serotype switching" events were ful...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2016