Publication

ClusTCR

Journal Contribution - e-publication

Subtitle: a Python interface for rapid clustering of large sets of CDR3 sequences with unknown antigen specificity
Motivation: The T-cell receptor (TCR) determines the specificity of a T-cell towards an epitope. As of yet, the rules for antigen recognition remain largely undetermined. Current methods for grouping TCRs according to their epitope specificity remain limited in performance and scalability. Multiple methodologies have been developed, but all of them fail to efficiently cluster large data sets exceeding 1 million sequences. To account for this limitation, we developed ClusTCR, a rapid TCR clustering alternative that efficiently scales up to millions of CDR3 amino acid sequences, without knowledge about their antigen specificity.
Results: Benchmarking comparisons revealed similar accuracy of ClusTCR as compared to other TCR clustering methods, as measured by cluster retention, purity and consistency. ClusTCR offers a drastic improvement in clustering speed, which allows clustering of millions of TCR sequences in just a few minutes through ultra-efficient similarity searching and sequence hashing.
Availability:
Journal: Bioinformatics
ISSN: 1367-4803
Volume: 37
Pages: 4865 - 4867
Publication year: 2021
Keywords: A1 Journal article
Accessibility: Open
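
Illustrative sketch: The Results paragraph attributes the speed-up to similarity searching and sequence hashing but does not describe the mechanism. The block below is a minimal, generic sketch of one hashing-based way to find CDR3 sequences that differ by a single amino acid substitution without comparing all pairs. It is not taken from ClusTCR and should not be read as its implementation; the function name `hamming1_clusters`, the masked-position key and the union-find grouping step are all assumptions made for illustration, as are the example sequences.

```python
from collections import defaultdict


def hamming1_clusters(sequences):
    """Group sequences linked by chains of single-substitution neighbours.

    Hypothetical helper for illustration only; not part of ClusTCR.
    """
    unique = sorted(set(sequences))
    parent = {s: s for s in unique}

    def find(s):
        # Follow parent links to the representative, compressing the path.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[max(ra, rb)] = min(ra, rb)

    # Hash each sequence once per position, with that position removed.
    # Two distinct sequences share a key only if they have the same length
    # and differ at exactly the removed position (Hamming distance 1).
    buckets = defaultdict(list)
    for s in unique:
        for i in range(len(s)):
            buckets[(i, s[:i] + s[i + 1:])].append(s)

    # All members of one bucket are mutual neighbours, so linking each
    # member to the first one is enough to connect them.
    for members in buckets.values():
        for other in members[1:]:
            union(members[0], other)

    clusters = defaultdict(list)
    for s in unique:
        clusters[find(s)].append(s)
    return sorted(clusters.values())


# Made-up example input, not data from the publication.
example = [
    "CASSLGTQYF", "CASSLGAQYF", "CASSPGTQYF",
    "CASRDNEQFF", "CASRDNEQYF", "CSVEETQYF",
]
example_clusters = hamming1_clusters(example)
```

In this sketch each sequence of length L is hashed L times, so the work grows with the total number of residues rather than with the number of sequence pairs, which is the general reason hashing helps at the scale of millions of sequences.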