This document was last updated April 2, 2019, by Ragul Gowthaman (ragul@umd.edu). The corresponding principal investigator is Brian Pierce (pierce@umd.edu).
Ragul Gowthaman, Brian G Pierce; TCRmodel: high resolution modeling of T cell receptors from sequence, Nucleic Acids Research, Volume 46, Issue W1, 2 July 2018, Pages W396–W401, https://doi.org/10.1093/nar/gky432
T cell receptors, or TCRs, are immunoglobulin immune receptors found on T cells, and each human is estimated to possess millions of unique TCRs. As with antibodies, TCRs are heterodimers, and diversity is primarily exhibited in loops called complementarity determining region (CDR) loops, which engage foreign antigens. In the case of TCRs, these are typically peptides that are presented by major histocompatibility complex (MHC) proteins. TCRs represent a key element of the cellular immune response, and have major implications in vaccine design, autoimmune disease, and as immunotherapeutics for cancer.
To address the growing need to gain structural insights of TCRs based on their sequences, due in part to advances in immune repertoire sequencing and massive increases in TCR sequence data, we have produced RosettaTCR in the powerful protein modeling program Rosetta. Analogous to the RosettaAntibody program for antibody modeling, RosettaTCR is constructed using all known TCR structures as templates and with optimizations to correctly model TCR architectures.
The TCRmodel server has three main stages:
The database of template TCR structures is updated from the Protein Data Bank on a monthly basis, and currently includes several hundred crystal structures representing over 100 unique TCRs. If no template is identified among TCR structures, CDR loops are obtained from antibody crystal structures, or are modeled de novo.
TCR sequences for modeling can be submitted as amino acid sequences for alpha and beta chains, encompassing at minimum the TCR variable domains. The program will automatically identify germline genes in the input sequences, if there is an exact match in the database.
Template Database
The TCR template database, which is required for running the application, is not available by default for weekly Rosetta releases. The TCR template database directory is located at "Rosetta/database/additional_protocol_data/tcr" in the code base. Users can clone or download the template database as part of the "Rosetta/database/additional_protocol_data" directory.
For example:
git clone git@github.com:RosettaCommons/additional_protocol_data.git
or
git clone https://github.com/RosettaCommons/additional_protocol_data.git
The template database can also be dowloaded from TCRmodel web server from this page: https://tcrmodel.ibbr.umd.edu/links
The default location for the database path is "Rosetta/database/additional_protocol_data/tcr". If placed in a different location, use the flag "-tcr_template_db_path" to provide the path to the template database.
After submission of TCR sequences, the results will provide the parsed CDR loop sequences and identified templates to the user along with the modeled protein as PDB file.
Flag | Description | Type |
---|---|---|
-alpha | Amino acid sequence of TCR alpha chain. | String |
-beta | Amino acid sequence of TCR beta chain. | String |
-minimize_model | Minimize the output model.Default:True | Boolean |
-relax_model | Minimize the output model.Default:False | Boolean |
-refine_tcr_cdr3_loops | Refine the CDR3 loops of Alpha and Beta chain.Default:False | Boolean |
-remodel_tcr_cdr3_loops | Remodel the CDR3 loops of Alpha and Beta chain.Default:False | Boolean |
-remodel_tcr_cdr3a_loop | Remodel the CDR3 loop of alpha chain. Useful if remodeling is required only for the CDR3 loop of alpha chain. Default:False. | Boolean |
-remodel_tcr_cdr3b_loop | Remodel the CDR3 loop of beta chain. Useful if remodeling is required only for the CDR3 loop of beta chain.Default:False. | Boolean |
-refine_all_tcr_cdr_loops | Refine all the CDR loops. Refinement includes CDR1, CDR2, CDR3 & HV4 loops of Alpha and Beta chains. Default:False. | Boolean |
Flag | Description | Type |
---|---|---|
-include_ab_templates | Include antibody templates for modelling. Default false. | Boolean |
-ab_template_db_path | Antibody template database path. | String |
-dump_templates | Dump all template pdb files, Default:False | Boolean |
-tcr_template_db_path | tcr template database path, Default:Use the tcr template databasse located in '$Rosetta/database/additional_protocol_data/tcr' | String |
-template_similarity_cutoff | Similarity cutoff to ignore similar template sequences from template database. Default:100 | Real |
-template_identity_cutoff | Identity cutoff to ignore similar template sequences from template database.Default:100 | Real |
-blastp_identity_cutoff | Identity cutoff to ignore similar template sequences from template database.Default:100 | Real |
-ignore_list | List of PDB id's to ignore as templates.Default:None | File |
-use_alpha_germline_templates | use germline templates for alpha chain, by default germline or framework templates choosen by sequence match. Default=False | Boolean |
-use_beta_germline_templates | use germline templates for beta chain, by default germline or framework templates choosen by sequence match. Default:False | Boolean |
-use_user_templates | use user provided templates for TCR segments, useful for testing and integrations tests. Default:false | Boolean |
-alpha_framework_template_id | template PDB id with chain for alpha chain framework segment, eg. 1kgc_D. Default:None | String |
-beta_framework_template_id | template PDB id with chain for alpha chain framework segment, eg. 1kgc_E. Default:None | String |
-alpha_germline_template_id | template PDB id with chain for alpha chain germline segment, eg. 1kgc_D. Default:None | String |
-beta_germline_template_id | template PDB id with chain for alpha chain germline segment, eg. 1kgc_E. Default:None | String |
-alpha_cdr1_template_id | template PDB id with chain for alpha chain CDR1 segment, eg. 1kgc_D. Default:None | String |
-beta_cdr1_template_id | template PDB id with chain for alpha chain CDR1 segment, eg. 1kgc_E. Default:None | String |
-alpha_cdr2_template_id | template PDB id with chain for alpha chain CDR2 segment, eg. 1kgc_D. Default:None | String |
-beta_cdr2_template_id | template PDB id with chain for alpha chain CDR2 segment, eg. 1kgc_E. Default:None | String |
-alpha_cdr3_template_id | template PDB id with chain for alpha chain CDR3 segment, eg. 1kgc_D. Default:None | String |
-beta_cdr3_template_id | template PDB id with chain for alpha chain CDR3 segment, eg. 1kgc_E. Default:None | String |
-alpha_orientation_template_id | template PDB id with chain for orientation of alpha chain, use along with beta_orientation_template_id, eg. 1kgc_D. Default:None | String |
-beta_orientation_template_id | template PDB id with chain for orientation of beta chain, use along with -alpha_orientation_template_id, eg. 1kgc_E. Default:None | String |
-alpha_framework_template_pdb | template PDB file for alpha chain framework segment. Default:None | String |
-beta_framework_template_pdb | template PDB file for alpha chain framework segment. Default:None | String |
-alpha_germline_template_pdb | template PDB file for alpha chain germline segment. Default:None | String |
-beta_germline_template_pdb | template PDB file alpha chain germline segment. Default:None | String |
-alpha_cdr1_template_pdb | template PDB file alpha chain CDR1 segment. Default:None | String |
-beta_cdr1_template_pdb | template PDB file for alpha chain CDR1 segment. Default:None | String |
-alpha_cdr2_template_pdb | template PDB file for alpha chain CDR2 segment. Default:None | String |
-beta_cdr2_template_pdb | template PDB file for alpha chain CDR2 segment. Default:None | String |
-alpha_cdr3_template_pdb | template PDB file for alpha chain CDR3 segment. Default:None | String |
-beta_cdr3_template_pdb | template PDB file for alpha chain CDR3 segment. Default:None | String |
-alpha_orientation_template_pdb | template PDB file for orientation of alpha chain. Default:None | String |
-beta_orientation_template_pdb | template PDB file for orientation of beta chain. Default:None | String |
-num_cter_overhang_res | Number of C-terminal overhang residues to use for grafting alignment. Default:3 | Integer |
-num_nter_overhang_res | Number of N-terminal overhang residues to use for grafting alignment. Default:3 | Integer |
$ Rosetta/main/source/bin/tcrmodel.macosclangrelease -database Rosetta/main/database -alpha VEQNSGPLSVPEGAIASLNCTYSDRGSQSFFWYRQYSGKSPELIMSIYSNGDKEDGRFTAQLNKASQYVSLLIRDSQPSDSATYLCAVTTDSWGKLQFGAGTQVVVT -beta VTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPNGYNVSRSTTEDFPLRLLSAAPSQTSVYFCASRPGLAGGRPEQYFGPGTRLTVT
$ Rosetta/main/source/bin/tcrmodel.macosclangrelease -database Rosetta/main/database -alpha VEQNSGPLSVPEGAIASLNCTYSDRGSQSFFWYRQYSGKSPELIMSIYSNGDKEDGRFTAQLNKASQYVSLLIRDSQPSDSATYLCAVTTDSWGKLQFGAGTQVVVT -beta VTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPNGYNVSRSTTEDFPLRLLSAAPSQTSVYFCASRPGLAGGRPEQYFGPGTRLTVT -remodel_tcr_cdr3_loops -refine_tcr_cdr3_loops
$ Rosetta/main/source/bin/tcr.macosclangrelease -database Rosetta/main/database -alpha VEQNSGPLSVPEGAIASLNCTYSDRGSQSFFWYRQYSGKSPELIMSIYSNGDKEDGRFTAQLNKASQYVSLLIRDSQPSDSATYLCAVTTDSWGKLQFGAGTQVVVT -beta VTQTPKFQVLKTGQSMTLQCAQDMNHEYMSWYRQDPGMGLRLIHYSVGAGITDQGEVPNGYNVSRSTTEDFPLRLLSAAPSQTSVYFCASRPGLAGGRPEQYFGPGTRLTVT -alpha_germline_template_pdb gma_tmplt_piece.pdb -beta_germline_template_pdb gmb_tmplt_piece.pdb -alpha_cdr3_template_pdb cdr3a_tmplt_piece.pdb -beta_cdr3_template_pdb cdr3b_tmplt_piece.pdb -alpha_orientation_template_pdb ora_tmplt_piece.pdb -beta_orientation_template_pdb orb_tmplt_piece.pdb -use_alpha_germline_templates -use_beta_germline_templates