Chorus API Documentation

This is the authoritative reference for all public APIs (oracle methods, analysis layer, utilities, normalization, and MCP tools). For a one-line-per-method cheat sheet, see METHOD_REFERENCE.md.

Overview
Core Classes
Prediction Methods
Utility Functions
Track Management
Environment Management
Complete Example
Per-Track Normalization
Application layer (chorus.analysis) — high-level variant analysis, discovery, batch scoring, fine-mapping, sequence engineering

Overview

Chorus provides a unified interface for genomic sequence prediction models (oracles). Each oracle predicts regulatory activity from DNA sequences, with support for various genomic manipulations and analyses.

Core Classes

OracleBase

Base class for all oracle implementations. Provides common functionality and defines the interface that all oracles must implement.

class OracleBase(ABC):
    def __init__(self, use_environment: bool = True)

Attributes:

oracle_name (str): Name of the oracle (e.g., 'enformer')
reference_fasta (str): Path to reference genome FASTA file
loaded (bool): Whether the model is loaded
use_environment (bool): Whether to use isolated conda environment

EnformerOracle

Implementation of the Enformer model for predicting gene expression and chromatin states.

class EnformerOracle(OracleBase):
    def __init__(self, use_environment: bool = True, reference_fasta: Optional[str] = None)

Enformer-specific attributes:

sequence_length (int): 393,216 bp input sequence length
target_length (int): 896 bins in output
bin_size (int): 128 bp per bin
Output window: 114,688 bp (896 × 128)
Offset from input edges: 139,264 bp on each side
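
The output window and edge offset follow directly from the three attributes above. A minimal arithmetic check in plain Python; the constant names simply mirror the attribute names and nothing here calls Chorus:

```python
# Enformer geometry, taken from the attribute list above.
SEQUENCE_LENGTH = 393_216  # bp of input sequence
TARGET_LENGTH = 896        # number of output bins
BIN_SIZE = 128             # bp per bin

# The output window is the number of bins times the bin width.
output_window = TARGET_LENGTH * BIN_SIZE

# The window is centered, so the uncovered input splits evenly between both edges.
edge_offset = (SEQUENCE_LENGTH - output_window) // 2

print(output_window, edge_offset)
```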

Prediction Methods

1. predict()

Basic prediction method for DNA sequences or genomic coordinates.

def predict(
    input_data: Union[str, Tuple[str, int, int]],
    assay_ids: List[str],
    create_tracks: bool = False
) -> Dict[str, np.ndarray]

Parameters:

input_data: Either:
- DNA sequence string (must be model's required length)
- Tuple of (chromosome, start, end) for genomic coordinates
assay_ids: List of track identifiers (oracle-specific)
- Enformer: ENCODE IDs (e.g., 'ENCFF413AHU'), CAGE IDs (e.g., 'CNhs11250'), or descriptions (e.g., 'DNase:K562')
create_tracks: Whether to create track files (not implemented)

Returns:

Dictionary mapping track IDs to prediction arrays
Each array has shape (n_bins,), where n_bins = output_length / bin_size (for Enformer, 114,688 / 128 = 896)

Logic:

If input is coordinates, extracts sequence from reference genome
Validates sequence length matches model requirements
Runs model prediction
Returns predictions for requested tracks

Example:

# From sequence
seq = 'ACGT' * 98304  # 393,216 bp for Enformer
predictions = oracle.predict(seq, ['ENCFF413AHU', 'CNhs11250'])

# From coordinates
predictions = oracle.predict(('chrX', 48777634, 48790694), ['ENCFF413AHU'])

2. predict_region_replacement()

Replace a genomic region with a new sequence and predict the effects.

def predict_region_replacement(
    genomic_region: Union[str, pd.DataFrame],
    seq: str,
    assay_ids: List[str],
    create_tracks: bool = False,
    genome: Optional[str] = None
) -> Dict

Parameters:

genomic_region: Region to replace
- String format: "chr1:1000-2000" (1-based, inclusive)
- DataFrame: First row with columns 'chrom', 'start', 'end'
seq: Replacement DNA sequence (must match region length exactly)
assay_ids: List of track identifiers
create_tracks: Whether to save track files
genome: Reference genome path (uses oracle's reference_fasta if None)

Returns: Dictionary with:

raw_predictions: Dict[track_id, np.ndarray] - Raw model outputs
normalized_scores: Dict[track_id, np.ndarray] - Min-max normalized (0-1)
track_objects: List[Track] - Track objects if create_tracks=True
track_files: List[str] - File paths if create_tracks=True
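
For readers unfamiliar with min-max scaling, the normalized_scores entry can be pictured with the sketch below. It is an illustration of the general technique, not Chorus's implementation; the function name and the choice to return zeros for a flat track are assumptions made here.

```python
from typing import List


def min_max_normalize(values: List[float]) -> List[float]:
    """Scale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A flat track has no dynamic range; return zeros instead of dividing by zero.
        # (Assumption for this sketch; the library may handle this case differently.)
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


raw = [2.0, 4.0, 6.0, 10.0]
normalized = min_max_normalize(raw)
print(normalized)
```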

Logic:

Validates replacement sequence length matches region length
Calculates full context window centered on region
Extracts context sequence from reference genome
Replaces specified region within context
Runs prediction on modified full-length sequence
Returns predictions for the output window
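
The steps above can be sketched on a toy string standing in for a chromosome. This is a simplified illustration under stated assumptions: build_replaced_context is a hypothetical helper, not a Chorus function, and a real run reads the context from the reference FASTA instead of an in-memory string.

```python
def build_replaced_context(chrom_seq: str, start: int, end: int,
                           replacement: str, context_len: int) -> str:
    """Return a context_len window centered on the region, with the region replaced.

    start and end are 1-based and inclusive, as in the "chr1:1000-2000" format.
    """
    region_len = end - start + 1
    if len(replacement) != region_len:
        raise ValueError(
            f"replacement is {len(replacement)} bp but the region is {region_len} bp"
        )

    # Convert to 0-based, half-open coordinates for Python slicing.
    region_start0 = start - 1

    # Center the context window on the region.
    left_pad = (context_len - region_len) // 2
    ctx_start0 = region_start0 - left_pad
    ctx_end0 = ctx_start0 + context_len
    if ctx_start0 < 0 or ctx_end0 > len(chrom_seq):
        raise ValueError("context window runs off the chromosome")

    context = chrom_seq[ctx_start0:ctx_end0]
    offset = region_start0 - ctx_start0  # where the region begins inside the context
    return context[:offset] + replacement + context[offset + region_len:]


toy_chrom = "A" * 100
modified = build_replaced_context(toy_chrom, start=41, end=44,
                                  replacement="GGGG", context_len=20)
print(modified)
```

Because the replacement has the same length as the region, the modified sequence keeps the model's required length and every downstream bin stays aligned with the reference.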

Example:

# Replace 200bp region with GATA motif repeats
enhancer = 'GATA' * 50  # 200bp
results = oracle.predict_region_replacement(
    'chr11:5247401-5247600',  # 200 positions (1-based, inclusive)
    enhancer,
    ['ENCFF413AHU']
)

3. predict_region_insertion_at()

Insert a sequence at a specific genomic position.

def predict_region_insertion_at(
    genomic_position: Union[str, pd.DataFrame],
    seq: str,
    assay_ids: List[str],
    create_tracks: bool = False,
    genome: Optional[str] = None
) -> Dict

Parameters:

genomic_position: Insertion point
- String format: "chr1:1000" (1-based)
- DataFrame: First row with columns 'chrom', 'pos'
seq: DNA sequence to insert (any length that fits in context)
assay_ids: List of track identifiers
create_tracks: Whether to save track files
genome: Reference genome path

Returns: Same format as predict_region_replacement()

Logic:

Calculates required flanking sequence sizes
Extracts left flank (before insertion point)
Extracts right flank (after insertion point)
Constructs: left_flank + inserted_seq + right_flank
Ensures total length matches model requirements
Runs prediction on modified sequence
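
A minimal sketch of the flank arithmetic, again on a toy string. build_insertion_context is a hypothetical helper, and placing the insert immediately after the base at the given position is an assumption of this sketch; the text above does not say which side of the position the library uses.

```python
def build_insertion_context(chrom_seq: str, pos: int, insert: str,
                            context_len: int) -> str:
    """Return left_flank + insert + right_flank with total length context_len.

    pos is 1-based; the insert goes immediately after that base (assumption).
    """
    flank_total = context_len - len(insert)
    if flank_total < 0:
        raise ValueError("insert is longer than the model context")

    left_len = flank_total // 2
    right_len = flank_total - left_len  # takes the extra base when flank_total is odd

    if pos - left_len < 0 or pos + right_len > len(chrom_seq):
        raise ValueError("flanks run off the chromosome")

    # In 0-based slicing, chrom_seq[:pos] ends with the base at 1-based position pos.
    left_flank = chrom_seq[pos - left_len:pos]
    right_flank = chrom_seq[pos:pos + right_len]
    return left_flank + insert + right_flank


toy_chrom = "ACGT" * 25  # 100 bp stand-in for a chromosome
inserted = build_insertion_context(toy_chrom, pos=50, insert="NNNN", context_len=20)
print(inserted)
```

Unlike replacement, insertion shortens the flanks to make room, so reference bases at the window edges drop out of the model input.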

Example:

# Insert enhancer at specific position
results = oracle.predict_region_insertion_at(
    'chr11:5247500',
    'GATA' * 50,  # Insert 200bp
    ['CNhs11250']
)

4. predict_variant_effect()

Analyze effects of genetic variants (SNPs, indels).

def predict_variant_effect(
    genomic_region: Union[str, pd.DataFrame],
    variant_position: Union[str, pd.DataFrame],
    alleles: Union[List[str], pd.DataFrame],
    assay_ids: List[str],
    create_tracks: bool = False,
    genome: Optional[str] = None
) -> Dict

Parameters:

genomic_region: Region containing the variant
- Should be large enough for model context
variant_position: Position of variant
- String format: "chr1:1000"
- Must be within genomic_region
alleles: List of alleles to test
- First element is reference allele
- Remaining elements are alternative alleles
- Can also be DataFrame with 'ref' and 'alt' columns
assay_ids: List of track identifiers
create_tracks: Whether to save track files
genome: Reference genome path

Returns: Dictionary with:

predictions: Dict of allele_name → track predictions
- 'reference': predictions for reference allele
- 'alt_1', 'alt_2', etc.: predictions for alternatives
effect_sizes: Dict of alt_allele → track → effect array
- Effect = alternative - reference
track_objects: Dict if create_tracks=True
track_files: Dict if create_tracks=True
variant_info: Summary of variant tested

Logic:

Extracts reference sequence for region
Validates reference allele matches genome
Creates modified sequences for each allele
Runs predictions for all alleles
Calculates effect sizes (alt - ref)
Returns comprehensive results
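
The effect-size step (alt - ref) can be sketched with plain lists in place of NumPy arrays. The dictionary layout follows the Returns section above; the function name effect_sizes and the toy values are illustrative only.

```python
from typing import Dict, List

Predictions = Dict[str, Dict[str, List[float]]]


def effect_sizes(predictions: Predictions) -> Predictions:
    """Compute alt - ref per bin for every alternative allele and track."""
    ref = predictions["reference"]
    effects: Predictions = {}
    for allele, tracks in predictions.items():
        if allele == "reference":
            continue  # the reference is the baseline, not an effect
        effects[allele] = {
            track_id: [a - r for a, r in zip(alt_values, ref[track_id])]
            for track_id, alt_values in tracks.items()
        }
    return effects


toy_predictions = {
    "reference": {"ENCFF413AHU": [1.0, 2.0, 3.0]},
    "alt_1": {"ENCFF413AHU": [1.5, 2.0, 2.0]},
}
toy_effects = effect_sizes(toy_predictions)
print(toy_effects)
```

Positive values mean the alternative allele raises the predicted signal in that bin; negative values mean it lowers it.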

Example:

# Test all possible SNPs at a position
results = oracle.predict_variant_effect(
    'chr11:5247000-5248000',  # ~1 kb region (1,001 bp, inclusive)
    'chr11:5247500',          # Variant position
    ['C', 'A', 'G', 'T'],     # C is reference
    ['ENCFF413AHU']
)

# Access results
ref_pred = results['predictions']['reference']['ENCFF413AHU']
alt1_pred = results['predictions']['alt_1']['ENCFF413AHU']
effect = results['effect_sizes']['alt_1']['ENCFF413AHU']

Utility Functions

Sequence Utilities (chorus.utils.sequence)

extract_sequence()

def extract_sequence(
    genomic_region: str,
    genome: str = "hg38.fa"
) -> str

Extracts DNA sequence from reference genome.

Parameters:

genomic_region: "chr1:1000-2000" format (1-based, inclusive)
genome: Path to indexed FASTA file

Returns:

DNA sequence string (uppercase)

Note: Converts the 1-based, inclusive genomic coordinates to the 0-based, half-open coordinates that pysam expects.
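
The coordinate conversion amounts to subtracting one from the start and leaving the end unchanged. A minimal sketch, assuming a strict "chrom:start-end" string; parse_region is a hypothetical helper, not the library's parser.

```python
import re
from typing import Tuple

REGION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)-(?P<end>\d+)$")


def parse_region(region: str) -> Tuple[str, int, int]:
    """Parse a 1-based inclusive region into (chrom, start0, end0), 0-based half-open."""
    match = REGION_RE.match(region)
    if match is None:
        raise ValueError(f"cannot parse region: {region!r}")
    start, end = int(match["start"]), int(match["end"])
    if start < 1 or end < start:
        raise ValueError(f"invalid coordinates in region: {region!r}")
    # 1-based inclusive [start, end] covers the same bases as 0-based half-open [start-1, end).
    return match["chrom"], start - 1, end


chrom, start0, end0 = parse_region("chr1:1000-2000")
print(chrom, start0, end0, end0 - start0)
```

Note that "chr1:1000-2000" covers 1,001 bases, not 1,000, which matters wherever a sequence must match a region length exactly.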

apply_variant()

def apply_variant(
    reference_seq: str,
    position: int,
    ref: str,
    alt: str
) -> str

Applies a variant to a sequence.

Parameters:

reference_seq: Original DNA sequence
position: 0-based position in sequence
ref: Reference allele (must match sequence)
alt: Alternative allele

Returns:

Modified sequence with variant applied
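
A sketch of what applying a variant involves, covering SNPs and indels with the same slice-and-join step. The name apply_variant_sketch is used to make clear this is an illustration and not the library's code; the case-insensitive reference check is an assumption.

```python
def apply_variant_sketch(reference_seq: str, position: int, ref: str, alt: str) -> str:
    """Replace ref with alt at a 0-based position after checking the reference allele."""
    observed = reference_seq[position:position + len(ref)]
    if observed.upper() != ref.upper():
        raise ValueError(
            f"reference allele {ref!r} does not match sequence {observed!r} "
            f"at position {position}"
        )
    # Everything before the variant + the new allele + everything after the old allele.
    return reference_seq[:position] + alt + reference_seq[position + len(ref):]


seq = "ACGTACGT"
snp = apply_variant_sketch(seq, 2, "G", "T")          # substitution, length unchanged
deletion = apply_variant_sketch(seq, 2, "GT", "G")    # removes one base
insertion = apply_variant_sketch(seq, 2, "G", "GAA")  # adds two bases
print(snp, deletion, insertion)
```

Indels change the sequence length, so a caller feeding a fixed-length model has to trim or extend the flanks afterwards.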

Genome Management (chorus.utils.genome)

get_genome()

def get_genome(genome_name: str = 'hg38') -> Path

Downloads and returns path to reference genome.

Parameters:

genome_name: One of 'hg38', 'hg19', 'mm10', 'mm9', 'dm6', 'ce11'

Returns:

Path object to genome FASTA file

Logic:

Checks if genome already downloaded
Downloads from UCSC if needed
Creates FASTA index
Returns path

Gene Annotations (chorus.utils.annotations)

download_gencode()

def download_gencode(
    version: str = 'v48',
    annotation_type: str = 'basic'
) -> Path

Downloads GENCODE gene annotations.

Parameters:

version: GENCODE version (e.g., 'v48')
annotation_type: 'basic' or 'comprehensive'

Returns:

Path to GTF file

get_gene_tss()

def get_gene_tss(gene_name: str) -> pd.DataFrame

Gets transcription start sites for a gene.

Parameters:

gene_name: Gene symbol (e.g., 'GATA1')

Returns:

DataFrame with columns: transcript_id, chrom, tss, strand, gene_name

Visualization (chorus.utils.visualization)

visualize_chorus_predictions()

def visualize_chorus_predictions(
    predictions: Dict[str, np.ndarray],
    chrom: str,
    start: int,
    track_ids: List[str],
    output_file: Optional[str] = None,
    bin_size: int = 128,
    style: str = 'modern',
    use_pygenometracks: bool = True,
    gtf_file: Optional[str] = None,
    show_gene_names: bool = True
) -> None

Creates publication-quality visualizations of predictions.

Parameters:

predictions: Dict of track_id → prediction array
chrom: Chromosome name
start: Start coordinate
track_ids: List of tracks to plot
output_file: Save to file if provided
bin_size: Bin size for predictions
style: 'modern', 'classic', or 'minimal'
use_pygenometracks: Use pyGenomeTracks if available
gtf_file: Gene annotation file for gene track
show_gene_names: Whether to label genes

Track Management

Track Class

class Track:
    def __init__(
        self,
        name: str,
        assay_type: str,
        cell_type: str,
        data: pd.DataFrame,
        color: Optional[str] = None
    )

Represents a genomic signal track.

Methods:

to_bedgraph(filename): Save as BedGraph
to_bigwig(filename, chrom_sizes): Save as BigWig
normalize(method): Normalize values
smooth(window_size): Smooth signal

save_predictions_as_bedgraph()

def save_predictions_as_bedgraph(
    predictions: Dict[str, np.ndarray],
    chrom: str,
    start: int,
    end: Optional[int] = None,
    output_dir: str = ".",
    prefix: str = "",
    bin_size: Optional[int] = None,
    track_colors: Optional[Dict[str, str]] = None
) -> List[str]

Saves predictions as BedGraph files for genome browser visualization.

Note for Enformer: Automatically handles coordinate mapping from input window to output window.
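
To make the coordinate mapping concrete, the sketch below turns a list of bin values into BedGraph rows (BedGraph intervals are 0-based and half-open). It assumes the given start is the 0-based start of the input window, which is an assumption of this sketch; bins_to_bedgraph_rows is a hypothetical helper and the defaults are the Enformer values quoted earlier.

```python
from typing import List, Sequence, Tuple

BedGraphRow = Tuple[str, int, int, float]


def bins_to_bedgraph_rows(values: Sequence[float], chrom: str, input_start: int,
                          sequence_length: int = 393_216,
                          bin_size: int = 128) -> List[BedGraphRow]:
    """Map per-bin predictions to (chrom, start, end, value) rows."""
    output_span = len(values) * bin_size
    # The output window sits in the middle of the input window.
    offset = (sequence_length - output_span) // 2
    first_bin_start = input_start + offset

    rows = []
    for index, value in enumerate(values):
        bin_start = first_bin_start + index * bin_size
        rows.append((chrom, bin_start, bin_start + bin_size, value))
    return rows


rows = bins_to_bedgraph_rows([0.0] * 896, "chr11", 1_000_000)
print(rows[0], rows[-1])
```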

Environment Management

CLI Commands

# Set up oracle environment
chorus setup --oracle enformer

# Check environment health
chorus health

# List environments
chorus list

# Remove environment
chorus remove --oracle enformer

Programmatic Access

import chorus

# Create oracle with environment
oracle = chorus.create_oracle('enformer', use_environment=True)

# Run code in oracle's environment
result = oracle.run_code_in_environment(
    "import tensorflow; print(tensorflow.__version__)"
)

Complete Example

import chorus
from chorus.utils import get_genome, download_gencode

# Setup
genome = get_genome('hg38')
gtf = download_gencode()
oracle = chorus.create_oracle('enformer', reference_fasta=str(genome))
oracle.load_pretrained_model()

# Define tracks (Enformer-specific)
tracks = ['ENCFF413AHU', 'CNhs11250']  # DNase:K562, CAGE:K562

# 1. Wild-type prediction
wt = oracle.predict(('chr11', 5247000, 5248000), tracks)

# 2. Test enhancer insertion
enhancer = 'GATA' * 50
inserted = oracle.predict_region_insertion_at(
    'chr11:5247500',
    enhancer,
    tracks
)

# 3. Test variant
variant = oracle.predict_variant_effect(
    'chr11:5247000-5248000',
    'chr11:5247500',
    ['C', 'A', 'G', 'T'],  # C is reference
    tracks
)

# 4. Analyze gene expression
expr = oracle.analyze_gene_expression(
    predictions=wt,
    gene_name='HBB',  # Beta-globin
    chrom='chr11',
    start=5247000,
    end=5248000,
    gtf_file=str(gtf),
    cage_track_ids=['CNhs11250']
)

# 5. Save for visualization
oracle.save_predictions_as_bedgraph(
    wt,
    chrom='chr11',
    start=5247000,
    end=5248000,
    output_dir='results'
)

Per-Track Normalization

Chorus uses per-track CDFs to normalize variant effects, activity percentiles, and IGV signal visualization. Each oracle has a single {oracle}_pertrack.npz file containing three CDF matrices:

| CDF | Shape | Used for |
|---|---|---|
| `effect_cdfs` | `(n_tracks, 10000)` | Variant effect percentile (table column) |
| `summary_cdfs` | `(n_tracks, 10000)` | Activity percentile (table column) |
| `perbin_cdfs` | `(n_tracks, 10000)` | IGV per-bin visualization |

get_pertrack_normalizer()

Factory function that returns a PerTrackNormalizer for a given oracle.

from chorus.analysis import get_pertrack_normalizer

norm = get_pertrack_normalizer('enformer')
# Loads ~/.chorus/backgrounds/enformer_pertrack.npz

Parameters:

oracle_name (str): Name of the oracle ('enformer', 'borzoi', 'alphagenome', 'chrombpnet', 'sei', 'legnet')
cache_dir (str, optional): Defaults to ~/.chorus/backgrounds/

Returns:

PerTrackNormalizer instance, or None if no NPZ file exists

PerTrackNormalizer

Key methods (all parameterized by track_id — e.g. an ENCODE identifier like ENCFF833POA):

effect_percentile(oracle, track_id, raw_score, signed=False) → variant effect percentile [0, 1] (or [-1, 1] for signed layers)
activity_percentile(oracle, track_id, raw_signal) → genome-wide activity percentile [0, 1]
perbin_percentile_batch(oracle, track_id, raw_values) → per-bin percentiles [0, 1] for visualization
perbin_floor_rescale_batch(oracle, track_id, raw_values, floor_pctile, peak_pctile, max_value) → linear rescale into [0, max_value] using CDF-derived noise floor and peak threshold (default for IGV)
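
Conceptually, a percentile lookup against a stored CDF is a search in a sorted array. The sketch below assumes each CDF row is a sorted list of background values, which is an assumption about the file layout and not something stated above; percentile_from_cdf is a hypothetical helper, and the real normalizer may interpolate or handle signed scores differently.

```python
from bisect import bisect_right
from typing import Sequence


def percentile_from_cdf(sorted_background: Sequence[float], raw_value: float) -> float:
    """Fraction of background values less than or equal to raw_value, in [0, 1]."""
    if not sorted_background:
        raise ValueError("background CDF is empty")
    # bisect_right counts how many background values are <= raw_value.
    return bisect_right(sorted_background, raw_value) / len(sorted_background)


background = [0.1, 0.2, 0.4, 0.8, 1.6]  # toy stand-in for one 10,000-point CDF row
mid = percentile_from_cdf(background, 0.4)
low = percentile_from_cdf(background, 0.05)
high = percentile_from_cdf(background, 5.0)
print(low, mid, high)
```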

IGV visualization modes

The IGV browser embedded in HTML reports has two modes:

1. Layer-aware floor rescale (default) — for each track:

display = (raw - cdf[floor_pctile]) / (cdf[peak_pctile] - cdf[floor_pctile])

The result is clipped to [0, 3.0]. Sharp signals (CAGE, TF, DNASE) use floor=p95; broad histone marks use floor=p90. All tracks therefore share a comparable scale.
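
The rescale formula can be sketched as follows. The background is again assumed to be a sorted list of values, the nearest-rank quantile lookup is a simplification, and the peak_pctile default of 99 is a placeholder chosen for this example; only the floor percentiles and the 3.0 cap come from the text above.

```python
from typing import List, Sequence


def quantile(sorted_background: Sequence[float], pct: float) -> float:
    """Value at percentile pct (0-100), using a simple nearest-rank lookup."""
    n = len(sorted_background)
    index = min(n - 1, int(pct * n / 100))
    return sorted_background[index]


def floor_rescale(raw_values: Sequence[float], sorted_background: Sequence[float],
                  floor_pctile: float = 95, peak_pctile: float = 99,
                  max_value: float = 3.0) -> List[float]:
    """Apply (raw - floor) / (peak - floor), then clip to [0, max_value]."""
    floor = quantile(sorted_background, floor_pctile)
    peak = quantile(sorted_background, peak_pctile)
    span = peak - floor
    if span <= 0:
        # Degenerate background: nothing can be distinguished from the floor.
        return [0.0 for _ in raw_values]
    return [min(max_value, max(0.0, (v - floor) / span)) for v in raw_values]


background = [float(i) for i in range(100)]  # toy CDF: floor = 95.0, peak = 99.0
display = floor_rescale([90.0, 95.0, 97.0, 99.0, 120.0], background)
print(display)
```

Values at or below the noise floor collapse to 0, a value at the peak threshold maps to 1, and the cap keeps a single extreme bin from flattening the rest of the track.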

2. Raw autoscale — pass igv_raw=True to use original values with per-track autoscale:

from chorus.analysis import build_variant_report

report = build_variant_report(
    variant_result, oracle_name='enformer',
    normalizer=norm, igv_raw=True,  # use raw autoscale in IGV
)

Both modes show the same peak positions; the difference is whether tracks share a common scale (rescale) or each has its own (raw autoscale).

Building per-track CDFs

Use the standalone scripts in scripts/build_backgrounds_<oracle>.py. See scripts/README.md for the full pipeline.

mamba run -n chorus-enformer python scripts/build_backgrounds_enformer.py --part variants  --gpu 0
mamba run -n chorus-enformer python scripts/build_backgrounds_enformer.py --part baselines --gpu 1
mamba run -n chorus              python scripts/build_backgrounds_enformer.py --part merge

The scripts use ~30K positions (random + cCREs + TSSs + gene bodies) and collect 32 random bins per position per track. RNA-seq tracks (Borzoi, AlphaGenome) use exon-precise sampling.

Automatic download

Per-track backgrounds are hosted on HuggingFace at lucapinello/chorus-backgrounds and downloaded automatically the first time get_pertrack_normalizer() is called (or when an oracle is loaded via MCP). No HuggingFace account needed — the dataset is public.

# Explicit download (usually not needed — auto-downloaded on first use)
from chorus.analysis import download_pertrack_backgrounds
download_pertrack_backgrounds("enformer")  # → ~/.chorus/backgrounds/enformer_pertrack.npz

Auto-discovery via MCP

When an oracle is loaded via the MCP server (load_oracle), the corresponding {oracle}_pertrack.npz is auto-loaded by OracleStateManager._auto_load_normalizer(). Tools like analyze_variant_multilayer and discover_variant then use it automatically.

If the per-track NPZ is missing, the manager attempts to download it from HuggingFace, then falls back to legacy per-layer .npy files via QuantileNormalizer.

Legacy: QuantileNormalizer

The older per-layer QuantileNormalizer (one CDF per regulatory layer per oracle, stored as {oracle}_{layer}.npy) is still supported for backwards compatibility. Both normalizer types implement compatible interfaces and the variant analysis code paths handle either via isinstance(normalizer, PerTrackNormalizer) checks.

The in-library chorus.analysis.build_backgrounds.build_variant_backgrounds() function builds legacy per-layer backgrounds for quick test scenarios. For production use, prefer the standalone per-track scripts.

Python API Example

from chorus.analysis import get_normalizer, build_variant_report

# Load normalizer
normalizer = get_normalizer('alphagenome')

# Pass to build_variant_report
report = build_variant_report(
    variant_result,
    oracle_name='alphagenome',
    normalizer=normalizer,
)

Application layer (`chorus.analysis`)

The chorus.analysis module is the high-level surface most users actually want. Each function below takes a loaded oracle and returns a structured report object with to_markdown(), to_dict(), to_dataframe() and (where applicable) to_html() / to_tsv().

Every report accepts an optional analysis_request: AnalysisRequest that preserves the user's original natural-language question and renders it at the top of every output — so a teammate opening an HTML report a month later can tell what was asked.

`build_variant_report(variant_result, oracle_name, gene_name=None, normalizer=None, igv_raw=False, analysis_request=None) -> VariantReport`

Score a variant across all modality-specific layers (chromatin, TF binding, histone, TSS, gene expression, splicing) from a predict_variant_effect() result. Returns a VariantReport whose to_markdown() renders per-layer tables capped at the top 10 tracks per layer (full ordering still in the JSON / DataFrame).

`discover_variant_effects(oracle, oracle_name, variant_position, alleles, top_n_per_layer=10, top_n_cell_types=8, gene_name=None, normalizer=None, output_path=None) -> dict`

Score every track the oracle knows about (thousands for AlphaGenome), rank by effect magnitude, and build a VariantReport for the selected top tracks per layer/cell-type. This is the recommended entry point when you don't want to hand-pick assay_ids. Returns {"report": VariantReport, "total_tracks_scored": int, "selected_tracks": int, ...}.

`discover_and_report(oracle, variant_position, alleles, top_n=5, min_effect=0.15, ...) -> dict`

Two-stage cell-type discovery. First screens all DNASE/ATAC tracks to rank cell types by chromatin effect, then for each of the top top_n cell types runs a full multi-layer analysis. Returns {"hits": [...], "reports": {cell_type: VariantReport}}.

`score_variant_batch(oracle, variants, assay_ids, gene_name=None, normalizer=None, analysis_request=None) -> BatchResult`

Rank a list of variants by effect magnitude. Each entry in variants is a dict with keys chrom, pos, ref, alt, and optional id. Pass assay_ids=None (or []) to score all oracle tracks. Returns a BatchResult with to_markdown(), to_tsv(), to_html(), to_dict() and to_dataframe().
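
The variants argument is a plain list of dictionaries. The sketch below builds one and checks it for the required keys before it would be handed to score_variant_batch(); find_missing_keys is a hypothetical helper, and showing pos as an integer is an assumption, since the text does not state its type.

```python
from typing import Any, Dict, List

REQUIRED_KEYS = ("chrom", "pos", "ref", "alt")  # keys named in the description above


def find_missing_keys(variants: List[Dict[str, Any]]) -> List[str]:
    """Return one message per variant that lacks a required key."""
    problems = []
    for index, variant in enumerate(variants):
        missing = [key for key in REQUIRED_KEYS if key not in variant]
        if missing:
            problems.append(f"variant {index}: missing {', '.join(missing)}")
    return problems


variants = [
    {"chrom": "chr11", "pos": 5247500, "ref": "C", "alt": "A", "id": "example_1"},
    {"chrom": "chr11", "pos": 5247500, "ref": "C", "alt": "G"},  # "id" is optional
]
problems = find_missing_keys(variants)
print(problems)
```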

`prioritize_causal_variants(oracle, lead_variant, ld_variants, assay_ids, gene_name=None, oracle_name=None, weights=None, normalizer=None, analysis_request=None) -> CausalResult`

GWAS fine-mapping: score each LD proxy across all regulatory layers and rank by a composite causal score combining (1) max effect, (2) number of layers affected, (3) directional convergence across layers, (4) baseline activity at the variant site. Weights are configurable via CausalWeights.

`analyze_region_swap(oracle, region, replacement_sequence, assay_ids, gene_name=None, normalizer=None, oracle_name=None) -> VariantReport`

Replace a genomic region with a custom DNA sequence and score the resulting regulatory changes across all layers. The reference-vs-replacement output has the same shape as a variant report so all the same renderers apply.

`simulate_integration(oracle, position, construct_sequence, assay_ids, gene_name=None, normalizer=None, oracle_name=None) -> VariantReport`

Score the disruption caused by inserting a DNA construct at a genomic position — used for AAV/transgene integration site analysis.

`AnalysisRequest`

Dataclass carried on every report. Fields: user_prompt, tool_name, oracle_name, normalizer_name, tracks_requested, cell_types, notes, generated_at. Built automatically by the MCP wrappers when Claude calls any analysis tool; you can also construct one directly and pass it into build_variant_report(..., analysis_request=...) when using the Python API. See chorus/analysis/analysis_request.py.

Notes on Oracle-Specific Behavior

Enformer

Requires exactly 393,216 bp input sequence
Output covers middle 114,688 bp of input
Uses ENCODE and CAGE track identifiers
Supports gene expression analysis via CAGE at TSS

Borzoi

524,288 bp input / 6,144 output bins at 32 bp resolution
Strong distal-gene expression prediction; best choice when the target gene TSS is far from the variant
Uses ENCODE track identifiers

ChromBPNet / BPNet

2,114 bp input at single-base resolution
Separate model per assay/cell-type/TF combination — load with load_oracle("chrombpnet", assay="ATAC", cell_type="K562") or load_oracle("chrombpnet", assay="CHIP", cell_type="K562", TF="GATA1")
Ideal for motif-resolution TF-binding disruption

Sei

4,096 bp input; 40 sequence-class outputs (chromatin state classification)
Use for high-level regulatory element classification, not per-track scoring

LegNet

200 bp input, MPRA-style promoter activity predictions
Scalar output per cell type; no per-bin tracks

AlphaGenome

1 Mb input / 1 bp resolution / 5,731 tracks — the most comprehensive oracle
Requires HuggingFace gated-model access (see main README)
Recommended default for multi-layer variant analysis

Error Handling

Common exceptions:

ModelNotLoadedError: Call load_pretrained_model() first
InvalidSequenceError: Check sequence length and content
InvalidAssayError: Use valid track identifiers for the oracle
InvalidRegionError: Check genomic coordinates
FileFormatError: Ensure genome file is indexed
