CI/CD: Duplicate issue detection and marking #306

@iamdwaseem

🚀 CI/CD Enhancement: Automated Duplicate Issue Detection

📌 Problem

Currently, contributors must manually search through existing issues before opening a new one.

This process:

  • Is time-consuming
  • Is often skipped
  • Leads to duplicate issues
  • Increases manual triaging effort for maintainers

As the repository grows, identifying duplicate issues becomes more difficult and inefficient.


💡 Proposed Solution

Introduce an automated duplicate issue detection system using a GitHub Action and bot-based workflow.

🔄 Workflow Overview

  1. A new issue is opened
  2. A GitHub Action is triggered
  3. The system compares the issue title and description with existing issues
  4. Semantic similarity is calculated
  5. If similar issues are found:
    • A comment is posted listing related issues
    • A label such as possible-duplicate is added

⚠️ The system will only suggest duplicates and will NOT auto-close any issue.
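
Step 5 of the workflow can be sketched as a pure function that turns already-scored candidates into a comment and a label. This is only an illustration under stated assumptions, not the project's implementation: the `Candidate` type, the `build_suggestion` function, the 0.8 default threshold, and the comment wording are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """An existing issue and its similarity to the new issue (hypothetical type)."""
    number: int
    title: str
    score: float  # similarity score, computed elsewhere


def build_suggestion(candidates, threshold=0.8, label="possible-duplicate"):
    """Return (comment_body, labels), or (None, []) when nothing is similar enough.

    The function only produces a suggestion; nothing here closes or edits an issue.
    """
    matches = sorted(
        (c for c in candidates if c.score >= threshold),
        key=lambda c: c.score,
        reverse=True,
    )
    if not matches:
        return None, []
    lines = ["Possibly related issues (suggestion only, please review):"]
    lines += [f"- #{c.number} {c.title} (similarity {c.score:.2f})" for c in matches]
    return "\n".join(lines), [label]
```

Returning `(None, [])` for the no-match case lets the caller skip both the comment and the label, so unrelated issues are left untouched.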


⚙️ Technical Approach (High-Level)

  • Trigger: on: issues: types: [opened]
  • Use semantic similarity (e.g., sentence-transformers)
  • Compute cosine similarity scores
  • Apply configurable similarity threshold
  • Comment with top similar issues
  • Add label for maintainer review
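
The scoring and threshold steps above could look roughly like the following. In a real setup the vectors would come from an embedding model such as sentence-transformers; here they are plain lists of floats so the sketch runs with the standard library alone. The function names, the 0.8 threshold, and the `top_k` default are assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_similar(new_vec, existing, threshold=0.8, top_k=3):
    """Rank existing issues by similarity to the new one.

    `existing` maps issue number -> embedding vector. Returns up to `top_k`
    (issue_number, score) pairs with score >= threshold, best first.
    """
    scored = [(num, cosine_similarity(new_vec, vec)) for num, vec in existing.items()]
    kept = [pair for pair in scored if pair[1] >= threshold]
    kept.sort(key=lambda pair: pair[1], reverse=True)
    return kept[:top_k]
```

Capping the result at `top_k` keeps the bot's comment short even when many issues clear the threshold.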

The threshold and behavior can be configured via repository variables.
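
One way to read such configuration, assuming the workflow passes repository variables to the script as environment variables. The variable names `DUPLICATE_THRESHOLD`, `DUPLICATE_TOP_K`, and `DUPLICATE_LABEL`, the defaults, and the `[0, 1]` range check are hypothetical choices for this sketch, not part of the proposal.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    threshold: float = 0.8             # assumed default
    top_k: int = 3                     # assumed default
    label: str = "possible-duplicate"  # label name taken from the proposal


def load_settings(env=None):
    """Build Settings from environment variables, falling back to the defaults."""
    env = os.environ if env is None else env
    defaults = Settings()
    threshold = float(env.get("DUPLICATE_THRESHOLD", defaults.threshold))
    if not 0.0 <= threshold <= 1.0:
        raise ValueError(f"DUPLICATE_THRESHOLD must be in [0, 1], got {threshold}")
    top_k = int(env.get("DUPLICATE_TOP_K", defaults.top_k))
    if top_k < 1:
        raise ValueError(f"DUPLICATE_TOP_K must be at least 1, got {top_k}")
    label = env.get("DUPLICATE_LABEL", defaults.label)
    return Settings(threshold=threshold, top_k=top_k, label=label)
```

Validating the values at startup means a mistyped variable fails the run immediately instead of silently labelling every issue or none.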


🎯 Expected Impact

  • Reduced duplicate issues
  • Faster issue triaging
  • Cleaner issue tracking
  • Improved contributor experience
  • Reduced maintainer workload

🛡️ Safety Considerations

  • No issue will be automatically closed
  • Maintainers retain full control
  • Only non-destructive suggestions will be made
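
The non-destructive guarantee can be made concrete by limiting the bot to the two GitHub REST endpoints that add a comment and add labels to an issue. The sketch below builds those requests without sending them; the helper names and the example values in the usage note are hypothetical.

```python
import json
import urllib.request

API_ROOT = "https://api.github.com"


def _post(url, payload, token):
    """Create a JSON POST request object; nothing is sent here."""
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Accept": "application/vnd.github+json",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def suggestion_requests(repo, issue_number, comment_body, label, token):
    """Build the two requests the bot would make for one new issue.

    Both only add information (a comment and a label); neither changes
    the issue's open/closed state.
    """
    base = f"{API_ROOT}/repos/{repo}/issues/{issue_number}"
    return [
        _post(f"{base}/comments", {"body": comment_body}, token),
        _post(f"{base}/labels", {"labels": [label]}, token),
    ]
```

For example, `suggestion_requests("owner/repo", 306, "Possibly related: #12", "possible-duplicate", token)` returns two request objects that could then be passed to `urllib.request.urlopen`. Inside a workflow, the token would typically be the job's `GITHUB_TOKEN` with `issues: write` permission.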

🖼️ Screenshots

N/A (CI/CD enhancement proposal)


📝 Record

  • I agree to follow this project's Code of Conduct
  • I want to work on this issue
