AIKIUM

AI-DRIVEN NANOBODY DISCOVERY

Aiki
Engage
Data Experience
Technical
LIVE BETA
Aiki logo

Aiki

End-to-end nanobody design

AI-driven nanobody generation that orchestrates Aikium's proprietary in-house models alongside every external tool cleared for commercial use. The recipe for wiring them together β€” target intake from UniProt or PDB, epitope specification in 3D, ensemble design, structural validation, and developability filtering β€” is what turns sequences into high-affinity, selective, developable binders.

One front-end. Ranked candidates exported as CSV, FASTA, and PDB. Designed to grow with each new model we add.

External models orchestrated
RFantibody Β· Nature 2025 DiffAb Β· NeurIPS 2022 IgGM Β· ICLR 2025 ProteinMPNN AntiFold ESM-IF Boltz-2 Β· folding
In-house models in the loop
Aiki-XP Β· expression Aiki-GeNano Β· design Aiki-Sol Β· solubility composite_score_v3 Β· developability
Try Aiki β†’ More capabilities coming soon.

Note: ranked candidates are designs, not predicted binders β€” wet-lab validation remains required (typical hit rates ≀5%). Aiki-Foundry runs that screening for you.

NEW Β· WET LAB

Aiki-Foundry

Focused-library nanobody campaigns, in parallel

Our wet-lab approach has evolved. Instead of one trillion-scale library, we now deploy tens of focused, target-specific nanobody libraries in parallel β€” some drawn from an off-the-shelf collection of known binders and CDRs, others designed end-to-end by Aiki. Proprietary protocols compress every step: library design, assembly, Yotta-display screening, and high-throughput-ELISA validation.

Design
Aiki + off-the-shelf CDRs
Assemble
Proprietary protocols
Screen
Yotta display
Validate
High-throughput ELISA

Many targeted libraries beat one mega-library: deeper sampling of the relevant sequence space, shorter cycles, and parallel programs without competing for screening capacity.

Discuss an Aiki-Foundry campaign β†’ Available as a service program or as a partnership β€” engagement terms negotiable.

Specialized in-house models, available standalone

LIVE BETA

Aiki-XP

Bacterial expression prediction

Ranks a protein within its host's expression distribution by integrating genome context, operon architecture, codon usage, and biophysical features. Trained on 492K genes across 385 species; supports 1,831 host genomes. Tier D reaches Spearman 0.59.

Try Aiki-XP β†’
LIVE BETA

Aiki-GeNano

Nanobody design from any target

Three-stage language-model pipeline that generates ten ranked VHH candidates against linear peptides, IDRs, or soluble domains up to 244 aa. Returns developability scores, MSAs to the NBv1 scaffold, and ESMFold structures. +6.6 Β°C mean Ξ”Tm over supervised baselines.

Try Aiki-GeNano β†’
LIVE BETA

Aiki-Sol

Solubility prediction

Predicts soluble vs. insoluble across five experimental regimes β€” four in-vivo centrifugation speeds (3K–100K Γ—g) plus cell-free β€” returning calibrated per-protocol probabilities. Strips common purification tags before scoring; available as web UI, REST, pip, FASTA batch, or Docker.

Try Aiki-Sol β†’
Aiki-XP
Will it express?
Aiki-GeNano
Design nanobody binders
Aiki-Sol
Will it be soluble?

The three tools cover successive stages of a nanobody campaign β€” predicting expression in a chosen host, designing CDR variants against a target, and filtering candidates by developability before committing to wet-lab work.

The platform behind the models

Aiki and the specialized in-house models are trained on proprietary data from Yotta ML2, our mRNA-display platform engineered over two years to screen full-length protein scaffolds (VHH nanobodies, scFv). Yotta supports up to 1012 physical diversity per library β€” about 1,000Γ— larger than phage display β€” and is specifically optimized for membrane proteins and intrinsically disordered regions. The platform now powers Aiki-Foundry β€” the focused-library wet-lab side of the offering.

1012
Per-library physical diversity Yotta supports
100M+
Training datapoints from real campaigns
50+
Targets screened, including GPCRs and IDRs
8 wk
From target to validated hits

For the comparison against phage display, screening timeline, and platform architecture, see the Yotta ML2 Technology section in the Technical tab.

Founded by scientists from Harvard Medical School, NVIDIA, and 10x Genomics.

Engage

Four ways to work with us, from low-commitment to deep. Terms are flexible β€” service program, partnership, or licensing β€” and we shape the engagement around the program, not around a SKU.

Try the live products

No commitment

Aiki (end-to-end nanobody design) and the three standalone models β€” Aiki-XP, Aiki-GeNano, Aiki-Sol β€” run as hosted web apps with REST and batch endpoints. Aiki-Foundry's wet-lab campaign details are also on the landing tab. Fastest way to size up what we do before talking commercials.

Go to the live products β†’

License data & pre-trained models

~2–4 weeks Β· IP-free for internal use

90M+ sequenced binders across 90 epitopes and 50+ targets, with sequence, enrichment, and target metadata; plus the pre-trained weights behind Aiki-XP, Aiki-GeNano, and Aiki-Sol if you want to fine-tune in-house.

Discuss licensing β†’

Run an Aiki-Foundry campaign

~8–16 weeks to validated hits

One or more focused-library campaigns against your targets: AI-designed or off-the-shelf libraries, Yotta-display screening, HT-ELISA validation, end-to-end. Parallel campaigns across multiple targets are routine. Available as a service program or as a partnership β€” engagement terms negotiable, including IP structure.

Discuss a campaign β†’

Co-develop a program

Strategic Β· milestone-based Β· shared IP

For programs with a clear path to IND: hit discovery through lead optimization, affinity maturation, and developability engineering, with shared IP and milestone-based structure. The deepest form of partnership we offer.

Discuss co-development β†’

Note: our primary focus and validated data are protein scaffolds (VHH nanobodies, scFv). Yotta ML2 has also been tested with peptide libraries β€” if you're working on peptide therapeutics, ask.

Technical specifications for evaluating platform capabilities, library architectures, target coverage, experimental validation data, and AI model performance.

Platform & Libraries Target Coverage Experimental Data AI Models & Benchmarks
Yotta ML2 Technology

Yotta ML2 is a proprietary mRNA display variant optimized for screening protein scaffolds at 10ΒΉΒ² scale. The platform supports diverse scaffolds ranging from short peptides to full-length nanobodies (VHH, ~15 kDa) and engineered single-chain proteins (up to ~25 kDa).

Library assembly protocols accommodate extensive degeneracy across distant sequence regions. For example, SeqR-v3 libraries contain 46 degenerate positions distributed across 160 amino acids, enabling broad sequence space exploration. Custom library design services are available for partner-specified targets, with typical lead times of 8 weeks.

Yotta ML2 vs. Phage Display

Traditional
Phage Display
109
Physical library diversity
  • Β· ~1 billion variants
  • Β· Challenging with membrane targets
  • Β· 24+ week timelines
  • Β· Generic libraries
1000Γ—
larger library
3Γ—
faster to hits
Aikium
Yotta ML2
1012
Physical · virtual diversity 1024
  • Β· Trillion-scale diversity
  • Β· Optimized for membrane targets and IDRs
  • Β· 8-week timelines to validated hits
  • Β· AI-designed, target-specific libraries
Platform Specifications
Current operational parameters and throughput capabilities
24 hrs
Selection Round Duration
One complete cycle per day
24
Parallel Capacity
Simultaneous targets in active screening
10ΒΉΒ²
Library Size
Unique sequences per library

Campaign timelines: 4 weeks from screening initiation to NGS-based hit identification using in-house libraries. Add 4 weeks for preliminary biophysical validation via ELISA.

Data Generation Rate
Protein-protein interaction datapoints collected in 2025
Library Specifications
In-house libraries available for immediate screening. Custom library design available with 8-week lead time.

Nanobody-v1

11M
Binders
48
Targets
126
Length (AA)
22
Degenerate Pos

Nanobody-v2

24M
Binders
47
Targets
124
Length (AA)
24
Degenerate Pos

SeqR-v3

65M
Binders
54
Targets
198
Length (AA)
46
Degenerate Pos
Library Composition
Distribution of binders from VHH libraries (Nb-v1, Nb-v2) and engineered scaffold library (SeqR-v3)
Amino Acid Diversity by Position
Amino Acid Composition Heatmap
Binding Affinity Distribution
Pseudo-Kd estimates from experimental screening data across all libraries (heuristic calculation from number of cycles of selection and NGS read-counts)
Target Portfolio

Screening campaigns have been conducted against approximately 50 distinct protein targets, representing 90 unique epitopes. The portfolio includes GPCRs, single-pass and multipass membrane proteins, and proteins containing intrinsically disordered regions.

Target selection prioritizes membrane proteins and other historically challenging protein classes. Indications represented include oncology, immunology, neurology, and rare diseases.

Target Structural Classification
Distribution by protein class
Disease Area Distribution
Targets by indication category
Detailed Target Characterization

The platform supports diverse antigen presentation modes that maintain native epitope structure and enable selection under physiologically relevantΒ conditions.

Target Distribution by Disease Area and Stage
Targets classified by indication and clinical development status
Experimental Validation

Screening campaigns generate pools of enriched binders that undergo staged validation. Initial enrichment is monitored via next-generation sequencing. Top-enriched candidates are evaluated by ELISA to confirm binding. Selected hits advance to biophysical characterization and functional assays as appropriate.

Hit Rates Across Campaigns
ELISA-confirmed binders from nanobody screens

Observed rates: Mean of 9 validated binders per target across 20+ completed nanobody campaigns (range: 2-20+ per target).

Representative Binding Curve
ELISA titration from GPCR campaign

Typical affinities: Validated hits show Kd values ranging from low nanomolar to sub-micromolar.

Validation workflow: Selection campaigns proceed through 6-12 rounds of alternating negative and positive selection. Enrichment is monitored by NGS. Top 100-500 candidates per target undergo ELISA screening. Confirmed binders advance to SPR or BLI for affinity determination, followed by functional assays when applicable.

AI Model Development

Experimental data from screening campaigns has been used to train machine learning models for binding prediction and library design. External validation using published benchmarks indicates that Aikium data improves model performance on held-out test sets.

Testing methodology: A published nanobody-antigen predictor architecture was retrained using incremental additions of Aikium screening data. Performance was evaluated on the original authors' held-out test set without modification of model architecture or hyperparameters.

External Benchmark Performance
AUC improvement when training data is augmented with Aikium datasets

Reference: Model architecture from Deng et al., Nature Machine Intelligence (2024) . Evaluation performed on authors' test set.

Internal Model Performance
Test set error as a function of training data size (14 held-out epitopes)

Observation: Generalization error decreases systematically with training set size, suggesting transferable binding patterns across epitopes.

Aiki, Aiki-Foundry & specialized models

Aiki (end-to-end nanobody design), Aiki-Foundry (focused-library wet-lab campaigns), and the three standalone models (Aiki-XP for bacterial expression, Aiki-GeNano for nanobody design, Aiki-Sol for solubility) are all featured on the landing tab.

Go to landing tab β†’

Selection Data & Analysis

Actual selection data from completed experimental screening campaigns β€” amino-acid enrichment patterns, binding distributions, and hit diversity across library-target combinations.

Identifiers are anonymized to protect IP. Starting from >10ΒΉΒ²-member libraries, each campaign typically yields over one million binders per target across nanobody (VHH, ~15 kDa) and proprietary SeqR (~25 kDa) scaffolds, spanning ~90 epitopes including GPCRs, membrane proteins, and IDRs. Experimentally validated binders are starred in the UMAP plots.

Note: interactive visualizations display a representative 0.1–1% sample (100k–1M data points) for browser performance. Full datasets available under licensing.

Selection Pressure Analysis
Percentage change in amino acid frequencies when library is screened against each target
Hit Distribution
Number of unique binders per target
Hit Diversity
Number of binder sequence families for each target
Library-wise Binding Affinity Profiles
Pseudo-Kd estimated from Yotta ML2 read-counts
Selectivity of Binders
Binders found in screens to multiple targets
Data Explorer
Filter and explore 100K+ representative data points from screening campaigns

Target Selection

Library

Binding Strength

Validation Status

Target Organism

Target Class

Target Category

Pseudo-Kd Distribution
Loading...
UMAP Sequence Space
0 points

Visualization Options

Size Options

Legend
Statistics
Filtered Results
0 results
Binder ID Target ID Library Pseudo-Kd (nM) Binding Strength Validated