Catalogue · v1.0.4

Reference.

Every public symbol, every config key, every AIRR field. Reference is exhaustive — Concepts explain why, Guides walk you through how, and this is the page you grep when you already know what you're looking for.

38Public classes
182Public functions
~70AIRR truth fields
6CLI commands
{ } Python API 220 Configs 14 AIRR fields 70 ›_ CLI 6
01

Python API.

Everything importable from genairr. Grouped by module, sorted by what you'll touch first.

genairr.experiment core
genairr.pipeline extending
genairr.contracts predicates
genairr.io export
02

Configs.

14 built-in configs. Drop a name into Experiment("…") or fork the YAML and pass a path. Schema is documented per-key.

03

AIRR fields.

~70 fields per record. truth_* is the simulator's ground-truth answer. evidence is what an aligner would output. Use the former to score the latter.

truth · 18 evidence · 22 segments · 14 corruption · 9 meta · 7 Showing first 16. Open the full table →
Field Type Group Source pass Description
sequencestrmetaioFinal nucleotide sequence as written to the AIRR file.
sequence_lengthintmetaEndLossPassLength after all corruption passes.
v_callstrevidencelive-callV allele(s) the evidence-based caller supports, comma-separated when ambiguous; truth-first when the sampled allele survives the tie.
truth_v_callstrtruthSampleAllelePassThe V allele the simulator actually sampled.
truth_d_callstrtruthSampleAllelePassThe D allele actually sampled.
truth_j_callstrtruthSampleAllelePassThe J allele actually sampled.
junctionstrsegmentsAssembleSegmentPassCDR3 nucleotide sequence inclusive of anchors.
junction_aastrsegmentsAssembleSegmentPassCDR3 amino-acid translation.
productivebooltruthcontracts.productiveTrue if in-frame, no stop codon, anchors preserved.
truth_n_mutationsinttruthS5FMutationPassTotal SHM substitution count.
mutation_ratefloattruthS5FMutationPasstruth_n_mutations / sequence_length.
mutation_poslist[int]truthS5FMutationPassPer-substitution position, 0-indexed.
v_identityfloattruthderivedFraction of V positions identical to germline.
corruption_addedboolcorruptionEndLossPass+True if any corruption pass modified the sequence.
n_countintcorruptionNCorruptionPassNumber of ambiguous N bases.
5_prime_lossintcorruptionEndLossPassBases truncated from the 5′ end.
04

CLI.

Six commands installed alongside the Python package. For one-off simulations and CI.

genairr simulate Run a config and write AIRR output. The 90% command. genairr simulate human_igh_naive -n 50000 -o out.tsv
genairr inspect Print a single sequence's full trace — every pass, every event. genairr inspect human_igh_naive --seed 42
genairr validate Run marginal QC on a TSV against an empirical reference panel. genairr validate out.tsv --against oas_human_b
genairr configs List built-in configs with their key parameters. genairr configs --filter human
genairr benchmark Plug-and-play aligner benchmark against simulated truth. genairr benchmark igblast --n 10000
genairr export Convert a saved run between TSV / Parquet / FASTA. genairr export out.tsv -o out.parquet