tigger - tigger
Here we provide a Tool for Immunoglobulin Genotype Elucidation via Rep-Seq (TIgGER). TIgGER inferrs the set of Ig alleles carried by an individual (including any novel alleles) and then uses this set of alleles to correct the initial assignments given to sample sequences by existing tools.
Immunoglobulin Repertoire-Sequencing (Rep-Seq) data is currently the subject of much study. A key step in analyzing these data involves assigning the closest known V(D)J germline alleles to the (often somatically mutated) sample sequences using a tool such as IMGT/HighV-QUEST. However, if the sample utilizes alleles not in the germline database used for alignment, this step will fail. Additionally, this alignment has an associated error rate of ~5 percent, notably among sequences carrying a large number of somatic mutations. The purpose of TIgGER is to address these issues.
Core tigger functions¶
- findNovelAlleles: Detect novel alleles
- plotNovel: Plot evidence of novel alleles
- inferGenotype: Infer an Ig genotype
- plotGenotype: A colorful genotype visualization
- genotypeFasta: Convert a genotype to sequences
- reassignAlleles: Correct allele calls
- getMutatedPositions: Find mutation locations
- getMutCount: Find distance from germline
- findUnmutatedCalls: Subset unmutated sequences
- getPopularMutationCount: Find most common sequence’s mutation count
- insertPolymorphisms: Insert SNPs into a sequence
Input and formatting¶
- readIgFasta: Read a fasta file of Ig sequences
- updateAlleleNames: Correct outdated allele names
- sortAlleles: Sort allele names intelligently
- cleanSeqs: Standardize sequence format
Gadala-Maria et al. (2015) Automated analysis of high-throughput B cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles. PNAS. 112(8):E862-70.