quality-filtering
Here are 5 public repositories matching this topic...
A workflow designed to clean FASTQ files for the SEACONNECT project
Updated Aug 21, 2019 - Python
Sievio turns GitHub, local repos, and web PDFs into clean JSONL for LLM pretraining, fine-tuning, and RAG. It offers structure-aware chunking, reliable Unicode decoding, pluggable QC and safety checks, plus optional dataset cards and deduplication.
Updated Dec 27, 2025 - Python
Automated quality filtering for diabetic retinopathy images using adaptive, medically informed thresholds.
Updated Nov 12, 2025 - Jupyter Notebook
Machine learning quality flags for Gaia DR3 effective temperatures using XGBoost, CatBoost, and LightGBM (MNRAS 2024)
Updated Mar 1, 2026 - Python