Dual-branch deep learning pipeline combining CT scan ROI detection and urine biomarker analysis to aid early pancreatic cancer diagnosis.
Evaluates multiple multimodal fusion strategies (Late, Early, Orthogonal) on synthetic paired datasets to inform future clinical applications.
- Overview
- Architecture
- Data Sources
- Methodology
- Results
- Visualizations
- Limitations & Future Work
- Installation
- Usage
- Project Structure
- Citations
- License
This project explores multimodal fusion of:
- CT Imaging — Processed with YOLOv8 to detect regions of interest (ROI) and CNN for classification.
- Urine Biomarkers — Modeled via a Multi-Layer Perceptron (MLP).
The pipeline tests multiple fusion strategies using synthetic pairing of urine and CT data due to the unavailability of real paired datasets.
Why it matters:
Pancreatic cancer often goes undetected until late stages. Combining imaging with biomarker data could improve early diagnosis — our results provide insight into which fusion strategies may perform best when paired datasets become available.
Steps:
- Data Preprocessing
- CT DICOM → PNG → ROI extraction with YOLO
- Urine biomarker cleaning & normalization
- Unimodal Training
- CT ROI → CNN classifier
- Urine biomarkers → MLP classifier
- Fusion
- Late Fusion (logit averaging)
- Early Fusion (feature concatenation)
- Orthogonal Fusion (modality-specific projection layers)
- Evaluation
- Compare strategies on synthetic paired dataset
- Visualization
- Grad-CAM for CT slices
- SHAP plots for urine features
- CT Imaging: NCI Pancreatic CT dataset (TCIA)
- Urine Biomarkers: Kaggle — Early Pancreatic Cancer Urinary Biomarker Dataset (Link)
- Synthetic pairing generated for fusion experiments.
- ROI detection with YOLOv8 trained on annotated CT scans.
- Classification with ResNet50 fine-tuned for pancreatic tumor detection.
- Preprocessing: Missing value imputation, z-score normalization.
- Classification with MLP (2–3 dense layers + dropout).
- Late Fusion: Combine final prediction logits from both models.
- Early Fusion: Concatenate penultimate layer embeddings.
- Orthogonal Fusion: Learn modality-specific projections before fusion.
| Modality | Model | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| CT only | ResNet50 | <X.XX> |
<X.XX> |
<X.XX> |
| Urine only | MLP | <X.XX> |
<X.XX> |
<X.XX> |
| Fusion (Late) | MLP + CNN | <X.XX> |
<X.XX> |
<X.XX> |
| Fusion (Early) | MLP + CNN | <X.XX> |
<X.XX> |
<X.XX> |
| Fusion (Orth) | MLP + CNN | <X.XX> |
<X.XX> |
<X.XX> |
- Limitations:
- No real paired dataset for urine + CT
- Fusion tested on synthetic pairing
- Future Work:
- Apply on real paired datasets
- Incorporate blood biomarkers
- Deploy in a clinical decision-support setting


