Feature/sam2 features by folkien · Pull Request #3 · AISP-PL/ssya

folkien · 2025-07-12T20:09:50Z

No description provided.

Copilot

Pull Request Overview

This PR integrates SAM2-based segmentation to compute and cache embeddings, and adds an interactive PyQt5 GUI for browsing images, viewing detections and masks, and filtering by embedding similarity.

Introduces Sam2Runner to run SAM2 on image crops and pool features into embeddings.
Updates DatasetManager and FeatureIndex to build or load a persistent embedding index.
Implements a PyQt5 GUI (MainWindow and ImageViewer) for image/detection lists and similarity-based filtering.

Reviewed Changes

Copilot reviewed 13 out of 25 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
ssya/widgets/image_viewer.py	Added widget to render images with bounding boxes, masks, and similarity scores.
ssya/models/detection.py	Defined `Detection` dataclass and bounding-box normalization conversion.
ssya/main.py	Built main PyQt5 GUI logic: file/detection lists, sliders, and filter actions.
ssya/helpers/metrics.py	Added `cosine_similarity` utility.
ssya/controllers/sam2_wrapper.py	Wrapped SAM2 inference to produce masks and embeddings.
ssya/controllers/features_index.py	Created `FeatureIndex` for embedding persistence and lookup.
ssya/controllers/dataset_manager.py	Manages dataset loading, annotation parsing, index building/loading.
pyproject.toml	Declared `sam2` dependency.
README.md	Updated project overview, installation, usage, and documentation (in Polish).
.vscode/launch.json	Updated VSCode debug configuration for the new entrypoint.

Comments suppressed due to low confidence (1)

.vscode/launch.json:18

Remove the stray comma on its own line to fix invalid JSON syntax in the launch configuration.

Copilot · 2025-07-12T20:10:52Z

+        for det in detections:
+            if det.embedding is None:
+                for e in self.entries:
+                    if e["image_idx"] == det.image_idx and e["det_idx"] == det.class_id:


This condition compares the stored det_idx against det.class_id, but det_idx represents the detection's position, not its class. Consider comparing e['det_idx'] to the detection's index or storing class_id instead to ensure correct embedding lookup.

Copilot · 2025-07-12T20:10:52Z

+            self.detections[img_path] = [
+                Detection(int(cls), (float(xc), float(yc), float(w), float(h)), img_idx) for cls, xc, yc, w, h in lines
+            ]
+        logger.info("Dataset: %d images (%d with annotations)", len(self.images), len(self.detections))


len(self.detections) returns the total number of images (keys in the dict), not only those with annotations. To accurately log images with annotations, count entries where the detection list is non-empty.

Suggested change

logger.info("Dataset: %d images (%d with annotations)", len(self.images), len(self.detections))

logger.info("Dataset: %d images (%d with annotations)", len(self.images), sum(1 for dets in self.detections.values() if dets))

Copilot · 2025-07-12T20:10:53Z

+        self.lbl = QLabel(alignment=Qt.AlignCenter)
+        self.lbl.setSizePolicy(QSizePolicy.Expanding, QSizePolicy.Expanding)
+        self.lbl.setScaledContents(True)
+        # Fixed with and height set, 1920 - 200 , 1080 - 200


Typo in comment: 'with' should be 'width'.

Suggested change

# Fixed with and height set, 1920 - 200 , 1080 - 200

# Fixed width and height set, 1920 - 200 , 1080 - 200

Copilot · 2025-07-12T20:10:53Z

+        # Fixed with and height set, 1920 - 200 , 1080 - 200
+        self.lbl.setFixedWidth(1620)
+        self.lbl.setFixedHeight(880)


[nitpick] Using hardcoded width and height may limit flexibility; consider making these dimensions configurable or adapting to the parent layout dynamically.

Suggested change

# Fixed with and height set, 1920 - 200 , 1080 - 200

self.lbl.setFixedWidth(1620)

self.lbl.setFixedHeight(880)

# Allow dynamic resizing based on parent layout

self.lbl.setSizePolicy(QSizePolicy.Expanding, QSizePolicy.Expanding)

self.lbl.setScaledContents(True)

folkien added 22 commits July 12, 2025 17:24

Fixes.

30b6a0b

Fixes.

a04ac8c

Fixes.

9aa9499

Fixes.

8250987

Fixes.

c8ea7fa

Fixes.

ace2a4c

Fixes.

1e1b1e5

Fixes.

1bc0331

Fixes.

c442158

Fixes.

331d3f4

fixes.

6d7c691

Fixes.

6a52f8b

Fixes.

213cdb3

Fixes.

b29eb96

Fixes.

e060581

Fixes.

71f6493

Fixes.

621981c

Fixes.

ae9d136

Fixes.

bc94f6c

Fixes.

2392bd4

Updsate.

ccfb302

Fixes.

2b74dab

Copilot AI review requested due to automatic review settings July 12, 2025 20:09

folkien self-assigned this Jul 12, 2025

Copilot AI reviewed Jul 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/sam2 features#3

Feature/sam2 features#3
folkien wants to merge 22 commits intomainfrom
feature/sam2_features

folkien commented Jul 12, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jul 12, 2025

Uh oh!

Copilot AI Jul 12, 2025

Uh oh!

Copilot AI Jul 12, 2025

Uh oh!

Copilot AI Jul 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	logger.info("Dataset: %d images (%d with annotations)", len(self.images), len(self.detections))
	logger.info("Dataset: %d images (%d with annotations)", len(self.images), sum(1 for dets in self.detections.values() if dets))

	# Fixed with and height set, 1920 - 200 , 1080 - 200
	# Fixed width and height set, 1920 - 200 , 1080 - 200

Conversation

folkien commented Jul 12, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jul 12, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants