Help

User Guide

Introduction Antibody Search CDR Analysis Sequence Similarity Search Antigen-Antibody Interaction Profiles Sequence-Based Clustering and Tree Construction Downloads Statistics

FAQ

Frequently Asked Questions

Contact

Introduction

Overview

MAAD is an open-access, curated resource hosting over 27,500 antibody and nanobody entries targeting six major human viruses: SARS-CoV-1, SARS-CoV-2, MERS-CoV, Influenza Virus, RSV, and hMPV. Each entry is annotated with:

Target Specificity: Antigen and epitope details.
Sequence Data: Full-length or variable region amino acid/nucleotide sequences.
Functional Characterization: Binding and/or neutralization to viral strain.
Germline Annotations: V/J gene assignments.
CDR Annotations: Annotated under IMGT, Kabat, and Chothia schemes, with length statistics.
Structural Data: Direct links to PDB entries for 3D structures.
Sources: PubMed IDs or patent numbers for original references, plus collection/publication dates.

Antibody Search

Allows users to search MAAD entries using:

Antibody/Nanobody Name (e.g., "PDI-106", "MxR")
Target Virus (e.g., SARS-CoV-1, Influenza)
Germline V/J Gene (e.g., IGHV3-30)
Developmental Origin (e.g., Human, Mouse)
Project ID (e.g., PubMed ID: 37075126, Patent ID: CN100374464)

Clicking on a search result leads to a detailed entry page with all associated metadata, while clicking on a PDB number or project number (such as PMID or patent ID) directs users to the original source on the respective PDB website or database.

Step 1: Click the right-hand slider to choose a search type

Step 2: Type keywords

CDR Analysis

The search will match sequences of identical length, leveraging N-Gram indexing and LIKE queries to identify similar or embedded subsequences:

Input: Provide individual CDR sequences (CDR1/2/3) and select a numbering scheme (IMGT, Kabat, or Chothia) and chain type (Heavy or Light).
Output:
- Matched entries filtered by CDR length or sequence similarity.
- Interactive visualizations:
  - Sequence logo plots (via WebLogo) constructed for CDR regions of identical length.
  - Pairwise alignment visualization with mismatch highlighting for equal-length sequences.

Sequence Similarity Search

MAAD performs BLAST-based sequence alignment, querying both antibody nucleotide and amino acid sequences against the database:

Input: Submit a variable region (VH/VL) nucleic acid or amino acid sequence.
Process: MAAD performs a BLAST search against the database.
Output:
- A ranked list of the top 200 matched entries by sequence similarity.
- Interactive visualizations:
  - CDR and V/J germline genes of the input sequence.
  - V/J gene usage dot plot (germline distribution).
  - Pairwise alignment viewer with mismatch highlighting.

Antigen-Antibody Interaction Profiles

MAAD provides antigen-antibody complex structures with detailed interface annotations and allows direct search by PDB ID on the search page.

Step 1: Click the right-hand slider to choose a search type.

Step 2: View your results.

Sequence-Based Clustering and Tree Construction

Users can input their own heavy chain nucleotide sequences and select target viral antibody sequences for joint analysis. Results can be downloaded from the results page. Tree construction may take some time, usually less than 3 minutes.

Downloads

The Download module allows users to export customized or full datasets from MAAD in standardized .xlsx format. For each entry in MAAD, the following data is available for download:

Sequence Data:
- Full-length amino acid (VH/VL) and corresponding nucleotide sequences (if available).
- GenBank accession IDs (for nucleotide sequences).
CDR Annotation:
- Annotated CDR1/2/3 regions of heavy and light chains using three schemes: IMGT, Kabat, and Chothia.
- CDR lengths under each numbering convention.
Functional Information:
- Confirmed activity against (binding and/or neutralization) target strain.
- Target antigens (e.g., RBD, HA, F protein).
Structural Metadata:
- Resolved structure IDs and direct hyperlinks to PDB entries.
Reference Information:
- Source publication (PubMed ID or patent number).
- Curation date (inclusion and publication timestamp).

Statistics

Provides an overview of the MAAD dataset, enabling users to explore key trends and distributions:

Total number and distribution of antibodies/nanobodies per target antigen (Antibody Distribution).
Usage frequency of V/J genes across different antigens (Genotype Distribution). Genotype frequencies are categorized as: Low: all genotypes ranked within the top 30 by frequency; Medium: the top 50 most prevalent genotypes; High: the top 100 most prevalent genotypes.
CDR1/2/3 length distribution (CDR Statistics).

Frequently Asked Questions

Submission functionality is not yet available. Please contact the MAAD team if you are interested in contributing data.

MAAD includes both experimentally validated antibodies/nanobodies and sequence-only entries (from NGS datasets). Each entry is clearly labeled.

Entries include annotations on binding and/or neutralization activity (if reported), target antigens (e.g., S, RBD, HA, head, F, preF), and developmental origin (e.g., human-derived, murine, or synthetic).

Yes. If available, the nucleotide sequence is provided along with the GenBank accession ID.

This tool compares the user's input sequence against all MAAD entries using BLAST and returns a pairwise alignment that highlights sequence differences (e.g., mutations or insertions).

Yes. The analysis module includes V/J gene usage dot plots for matched entries and sequence logo plots for CDR variability analysis.

The system uses N-Gram indexing and LIKE search to identify:
Similarity: Sequences sharing common N-Gram patterns (substrings of length N), with adjustable thresholding for match precision.
Containment: A sequence is "contained" if it appears as a continuous substring within another sequence.

Contact

For questions, feature requests, or reporting issues, please contact us at:

wangjy@ibp.ac.cn

liyx@ibp.ac.cn