IQVIA_Specifica_White_2024-09-27-1
download-Nov-08-2024-07-24-55-3847-AM

Antibody data packages for your AI models

Choose Specifica for a proven antibody discovery platform to provide diverse, next-generation sequencing (NGS) data to power your AI engines. Using our proprietary naïve and semi-synthetic formats, we can generate data packages to train your AI and ML models.

Specifica delivers:

  • Ready-to-use, off-the-shelf NGS datasets;
  • Data packages produced against your specific target(s), to your requirements;
  • Customized libraries built from your design into our phage or yeast display formats, including selection campaigns against target(s) of your choosing.

Flexible formats and timelines:

  • Data packages derived from scFv, Fab or VHH formats
  • Naïve or semi-synthetic formats
  • Fully diverse or fixed light chain formats
  • Timelines ranging from weeks to a few months

 

 

Train your AI model

Test your AI model

  In-stock data package Target-specific data package Model testing data package
Deliverable Select data provided from one of our in-house discovery campaigns
  • Select data provided from a discovery campaign against your target of interest
  • Semi-synthetic format

A library custom-built to your specifications

  • Up to 6 selections included
  • Additional selections as needed

Specifica’s team can collaborate or provide guidance as needed.

Suitable for Early data submissions to AI and ML models Training of AI and ML models Building a custom library designed from your AI or ML outputs.
Availability

Coming soon

 

Note: Sample datasets (IFN and IL-2) available to illustrate data structure and diversity 

Available now Available now
Timeline for delivery Immediate delivery (once available) 1-4 months (phage to yeast) based on complexity 1-4 months, depending on complexity
All data packages include

Raw FASTQ files and annotated files with labels:

  • Sort concentration
  • Positive / negative binding
  • NGS abundance & enrichment
  • Others (competitor / non-competitor / ligand)

 

AdobeStock_286697660

The Specifica advantage

  • Start at an advantage by training on data from an antibody library already optimized for therapeutic properties:
    • High developabilities, straight from selections
    • High level of starting functionality
    • High diversities
    • High affinities
  • Go beyond training on public repositories by training on large datasets directly relevant to drug discovery against targets of interest.
  • Train on diverse outputs containing many different paratopes against targets of interest.
  • Early-round sequencing of diverse populations optimal for training foundation AI models.
  • Flexible sort options and labels to capture important populations to suit your fine-tuning  needs:
    • Binary: [bind/no bind; competitor/non-competitive binding]
  • Multi-categorical: Includes different sort concentrations (100nM, 50nM, 20nM, 1nM) and/or various output categories such as high, medium, and low-stringency sorting.
  • Start with the experts in antibody engineering and sequencing:

    • >200 individual naïve libraries built using different scaffolds, formats, variable gene sources and diversity
    • Antibody NGS experts demonstrated by high-profile publications
    • > 100 successful campaigns, each of which produces hundreds of distinct antibody clusters
  • Standardized datasets with key NGS fields provided – round-to-round enrichment, population relative frequency.
  • Diverse, NGS datasets from our validated platform
    • Millions of annotated and target-labeled sequences from our in vitro display platform optimal for AI models
AdobeStock_599528321

Whether you need to train, validate or test your model,
Specifica delivers high-quality, curated datasets at each phase.

Get In Touch

Contact Us

Contact Us

If you would like to get in touch with us please use the contact form or email address below.