The quantification of representative sequences pipeline for amplicon sequencing : case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia

Gonzalez Tortuero, E ORCID: https://orcid.org/0000-0001-8934-6822, Rusek, J, Petrusek, A, Gießler, S, Lyras, D, Grath, S, Castro‐Monzón, F and Wolinska, J 2015, 'The quantification of representative sequences pipeline for amplicon sequencing : case study on within-population ITS1 sequence variation in a microparasite infecting Daphnia' , Molecular Ecology Resources, 15 (6) , pp. 1385-1395.

Full text not available from this repository. (Request a copy)

Abstract

Next generation sequencing (NGS) platforms are replacing traditional molecular biology protocols like cloning and Sanger sequencing. However, accuracy of NGS platforms has rarely been measured when quantifying relative frequencies of genotypes or taxa within populations. Here we developed a new bioinformatic pipeline (QRS) that pools similar sequence variants and estimates their frequencies in NGS data sets from populations or communities. We tested whether the estimated frequency of representative sequences, generated by 454 amplicon sequencing, differs significantly from that obtained by Sanger sequencing of cloned PCR products. This was performed by analysing sequence variation of the highly variable first internal transcribed spacer (ITS1) of the ichthyosporean Caullerya mesnili, a microparasite of cladocerans of the genus Daphnia. This analysis also serves as a case example of the usage of this pipeline to study within‐population variation. Additionally, a public Illumina data set was used to validate the pipeline on community‐level data. Overall, there was a good correspondence in absolute frequencies of C. mesnili ITS1 sequences obtained from Sanger and 454 platforms. Furthermore, analyses of molecular variance (amova) revealed that population structure of C. mesnili differs across lakes and years independently of the sequencing platform. Our results support not only the usefulness of amplicon sequencing data for studies of within‐population structure but also the successful application of the QRS pipeline on Illumina‐generated data. The QRS pipeline is freely available together with its documentation under GNU Public Licence version 3 at http://code.google.com/p/quantification-representative-sequences

Item Type: Article
Schools: Schools > School of Environment and Life Sciences
Journal or Publication Title: Molecular Ecology Resources
Publisher: Wiley
ISSN: 1755-098X
Related URLs:
Funders: German Research Foundation (DFG‐SPP 1399 and DFG‐SNF), European Science Foundation, Czech Science Foundation, German Science Foundation (DFG)
Depositing User: E Gonzalez Tortuero
Date Deposited: 22 Feb 2021 13:34
Last Modified: 27 Aug 2021 21:50
URI: http://usir.salford.ac.uk/id/eprint/59636

Actions (login required)

Edit record (repository staff only) Edit record (repository staff only)