Learning about antibodies

Last updated: 2023-06-27

Checks: 7 0

Knit directory: muse/

This reproducible R Markdown analysis was created with workflowr (version 1.7.0). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.

R Markdown file: up-to-date

Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

Environment: empty

Great job! The global environment was empty. Objects defined in the global environment can affect the analysis in your R Markdown file in unknown ways. For reproduciblity it’s best to always run the code in an empty environment.

Seed: set.seed(20200712)

The command set.seed(20200712) was run prior to running the code in the R Markdown file. Setting a seed ensures that any results that rely on randomness, e.g. subsampling or permutations, are reproducible.

Session information: recorded

Great job! Recording the operating system, R version, and package versions is critical for reproducibility.

Cache: none

Nice! There were no cached chunks for this analysis, so you can be confident that you successfully produced the results during this run.

File paths: relative

Great job! Using relative paths to the files within your workflowr project makes it easier to run your code on other machines.

Repository version: c90d5b7

Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.

The results in this page were generated with repository version c90d5b7. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .Rhistory
    Ignored:    .Rproj.user/
    Ignored:    r_packages_4.1.2/
    Ignored:    r_packages_4.2.0/
    Ignored:    r_packages_4.2.2/
    Ignored:    r_packages_4.3.0/

Untracked files:
    Untracked:  analysis/cell_ranger.Rmd
    Untracked:  analysis/tss_xgboost.Rmd
    Untracked:  code/multiz100way/
    Untracked:  data/HG00702_SH089_CHSTrio.chr1.vcf.gz
    Untracked:  data/HG00702_SH089_CHSTrio.chr1.vcf.gz.tbi
    Untracked:  data/ncrna_NONCODE[v3.0].fasta.tar.gz
    Untracked:  data/ncrna_noncode_v3.fa
    Untracked:  data/netmhciipan.out.gz
    Untracked:  women.json

Unstaged changes:
    Modified:   analysis/graph.Rmd

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.

These are the previous versions of the repository in which changes were made to the R Markdown (analysis/antibody.Rmd) and HTML (docs/antibody.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File	Version	Author	Date	Message
Rmd	c90d5b7	Dave Tang	2023-06-27	CDR sequences
html	b707032	Dave Tang	2023-06-26	Build site.
Rmd	9161c86	Dave Tang	2023-06-26	Learning about antibodies

Notes based on the study: Development of a humanized monoclonal antibody (MEDI-493) with potent in vitro and in vivo activity against respiratory syncytial virus.

This study describes the generation of a humanised monoclonal antibody, MEDI-493, that recognises a conserved neutralising epitope on the F glycoprotein of RSV. Broad neutralisation of a panel of 57 clinical isolates of the RSV A and B subtypes was demonstrated.

Antibody information

Additional info.

UniProt Accession Number of Target Protein: P03420
Alternative Name(s) of Target: F2; Fusion glycoprotein F0; Protein F; F gycoprotein; Human respiratory syncytial virus; humanized mAb 129
Immunogen: The original antibody was generated by immunizing female BALB/c mice by primary intranasal infection with A2 strain of RSV.
Specificity: This antibody recognizes the site A of the RSV F glycoprotein. This antibody binds the NSELLSLINDMPITNDQKKLMSNN epitope (PMID: 20098425).

Fusion glycoprotein sequence

Fusion glycoprotein F0 of Human respiratory syncytial virus A (strain A2).

>sp|P03420|FUS_HRSVA Fusion glycoprotein F0 OS=Human respiratory syncytial virus A (strain A2) OX=11259 GN=F PE=1 SV=1
MELLILKANAITTILTAVTFCFASGQNITEEFYQSTCSAVSKGYLSALRTGWYTSVITIE
LSNIKENKCNGTDAKVKLIKQELDKYKNAVTELQLLMQSTPPTNNRARRELPRFMNYTLN
NAKKTNVTLSKKRKRRFLGFLLGVGSAIASGVAVSKVLHLEGEVNKIKSALLSTNKAVVS
LSNGVSVLTSKVLDLKNYIDKQLLPIVNKQSCSISNIETVIEFQQKNNRLLEITREFSVN
AGVTTPVSTYMLT NSELLSLINDMPITNDQKKLMSNN VQIVRQQSYSIMSIIKEEVLAYV
VQLPLYGVIDTPCWKLHTSPLCTTNTKEGSNICLTRTDRGWYCDNAGSVSFFPQAETCKV
QSNRVFCDTMNSLTLPSEINLCNVDIFNPKYDCKIMTSKTDVSSSVITSLGAIVSCYGKT
KCTASNKNRGIIKTFSNGCDYVSNKGMDTVSVGNTLYYVNKQEGKSLYVKGEPIINFYDP
LVFPSDEFDASISQVNEKINQSLAFIRKSDELLHNVNAGKSTTNIMITTIIIVIIVILLS
LIAVGLLLYCKARSTPVTLSKDQLSGINNIAFSN

Antibody sequence

Design of humanised VL and VH segments based on murine monoclonal antibody 1129. Human framework regions were derived from K102 for VL. VH framework (FR1) region was derived from Cor; remaining framework regions were derived from CE-1 sequence.

VL.

FR1                     CDR1       FR2             CDR2
DIQMTQSPSTLSASVGDRVTITC KCQLSVGYMH WYQQKPGKAPKLLIY DTSKLAS

FR3                              CDR3      FR4
GVPSRFSGSGSGTEFTLTISSLQPDDFATYYC FQGSGYPFT FGGGTKLEIK

VH.

FR1                            CDR1    FR2            CDR2
QVTLRESGPALVKPTQTLTLTCTFSGFSLS TSGMSVG WIRQPPGKALEWLA DIWWDDKKDYNPSLKS

FR3                              CDR3       FR4
RLTISKDTSKNQVVLKVTNMDPADTATYYCAR SMITNWYFDV WGAGTTVTVSS

Basics

Notes from Analysing antibody sequence for recombinant antibody expression.

An antibody (a.k.a. immunoglobulin) is a large Y-shaped protein produced by B cells. They are used by the immune system to neutralise foreign objects.
Fab region determines its antigen specificity (Fv region and CDR).
Fc region determines the effects.

Each antibody (Ab) only binds to a specific antigen (Ag).
Ab and Ag interact by spatial complementarity.
Ab-Ag interaction are based on weak and non-covalent binding between Ab and Ag:
- Electrostatic interactions
- Hydrogen bonds
- Van der Waals forces
- Hydrophobic interactions
The hinge region allows better binding.
Ab-Ag binding is reversible.

Ab binding residues are mainly located in Complementarity Determining Regions (CDRs)
6 CDRs form the combining site: L1, L2, L3, H1, H2, and H3
CDR-H3 and L3 have a distinctive role in antigen recognition.

The framework region is a subdivision of the variable region (Fab) of the antibody. The variable region is composed of seven amino acid regions, four of which are framework regions (FR1-4) and three of which are hypervariable regions. The framework region makes up about 85% of the variable region. Located on the tips of the Y-shaped molecule, the framework regions are responsible for acting as a scaffold for the complementarity determining regions (CDR), also referred to as hypervariable regions, of the Fab. These CDRs are in direct contact with the antigen and are involved in binding antigen, while the framework regions support the binding of the CDR to the antigen and aid in maintaining the overall structure of the four variable domains on the antibody.

Identifying CDRs by sequence

CDR-L1

Start - approximately residue 24
Residue before - always a Cys
Residue after - always a Trp. Typically Trp-Tyr-Gln, but also Trp-Leu-Gln, Trp-Phe-Gln, Trp-Tyr-Leu
Length - 10 to 17 residues

CDR-L2

Start - always 16 residues after the end of L1
Residue before - generally Ile-Tyr, but also, Val-Tyr, Ile-Lys, Ile-Phe
Length - always 7 residues (except NEW [7FAB], which has a deletion in this region.)

CDR-L3

Start - always 33 residues after end of L2 (except NEW [7FAB], which has the deletion at the end of CDR-L2)
Residue before - always Cys
Residue after - always Phe-Gly-XXX-Gly
Length - 7 to 11 residues

CDR-H1

Start - approximately residue 26 (always 4 after a Cys)
Residue before - always Cys-XXX-XXX-XXX
Residue after - always a Trp. Typically Trp-Val, but also, Trp-Ile, Trp-Ala
Length - 10 to 12 residues

CDR-H2

Start - always 15 residues after the end of Kabat / AbM definition of CDR-H1
Residue before - typically Leu-Glu-Trp-Ile-Gly, but a number of variations
Residue after - Lys/Arg-Leu/Ile/Val/Phe/Thr/Ala-Thr/Ser/Ile/Ala
Length - Kabat definition 16 to 19 residues

CDR-H3

Start - always 33 residues after end of CHR-H2 (always 2 after a Cys)
Residue before - always Cys-XXX-XXX (typically Cys-Ala-Arg)
Residue after - always Trp-Gly-XXX-Gly
Length - 3 to 25 residues

Leader sequence-FR1-CDR1-FR2-CDR2-FR3-CDR3-FR4-Constant region-Stop codon

sessionInfo()

R version 4.3.0 (2023-04-21)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 22.04.2 LTS

Matrix products: default
BLAS:   /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3 
LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.20.so;  LAPACK version 3.10.0

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       

time zone: Etc/UTC
tzcode source: system (glibc)

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] lubridate_1.9.2 forcats_1.0.0   stringr_1.5.0   dplyr_1.1.2    
 [5] purrr_1.0.1     readr_2.1.4     tidyr_1.3.0     tibble_3.2.1   
 [9] ggplot2_3.4.2   tidyverse_2.0.0 workflowr_1.7.0

loaded via a namespace (and not attached):
 [1] sass_0.4.5       utf8_1.2.3       generics_0.1.3   stringi_1.7.12  
 [5] hms_1.1.3        digest_0.6.31    magrittr_2.0.3   timechange_0.2.0
 [9] evaluate_0.20    grid_4.3.0       fastmap_1.1.1    rprojroot_2.0.3 
[13] jsonlite_1.8.5   processx_3.8.1   whisker_0.4.1    ps_1.7.5        
[17] promises_1.2.0.1 httr_1.4.5       fansi_1.0.4      scales_1.2.1    
[21] jquerylib_0.1.4  cli_3.6.1        rlang_1.1.0      munsell_0.5.0   
[25] withr_2.5.0      cachem_1.0.7     yaml_2.3.7       tools_4.3.0     
[29] tzdb_0.3.0       colorspace_2.1-0 httpuv_1.6.9     vctrs_0.6.2     
[33] R6_2.5.1         lifecycle_1.0.3  git2r_0.32.0     fs_1.6.2        
[37] pkgconfig_2.0.3  callr_3.7.3      pillar_1.9.0     bslib_0.4.2     
[41] later_1.3.0      gtable_0.3.3     glue_1.6.2       Rcpp_1.0.10     
[45] xfun_0.39        tidyselect_1.2.0 rstudioapi_0.14  knitr_1.42      
[49] htmltools_0.5.5  rmarkdown_2.21   compiler_4.3.0   getPass_0.2-2