Feat: phase 3 polish sse#7
Conversation
…re findings

- For both run_scan and run_workspace_scan, the 'scans' table row is now inserted BEFORE the background task starts emitting findings into 'scan_results'. This eliminates the foreign key constraint violation.
- Completion logic now uses UPDATE instead of INSERT ... ON CONFLICT, keeping the logic clean and atomic.
- Added a usage store (apps/web/src/lib/stores/usage.ts) for the reactive sidebar scan count that refreshes after each completed scan.
- Removed stale code paths and unused mut warnings.
…oss server and scanner crates
📝 Walkthrough

This PR introduces severity-based filtering for security scan findings across the stack. Users can now select a minimum severity threshold when initiating scans. The backend filters findings accordingly before returning results, while a new usage tracking store monitors scan history. SSE streaming is refactored for cache-then-live consumption, and the rate limit increases from 10 to 100 requests per IP.

Changes
Sequence Diagram

```mermaid
sequenceDiagram
    participant User as User
    participant Frontend as Frontend App
    participant Store as Usage Store
    participant Server as Server/API
    participant Scanner as Scanner Engine
    participant Database as Database
    User->>Frontend: Select min_severity & initiate scan
    Frontend->>Server: POST /scan (min_severity included)
    Server->>Database: INSERT scans row (early init)
    Server->>Scanner: Run scan with config
    Scanner->>Scanner: Process findings
    Scanner->>Server: Emit ScanEvent::Finding
    loop For each Finding
        Server->>Server: Check severity >= min_severity
        alt Severity meets threshold
            Server->>Database: Cache finding in state.results[scan_id]
            Server->>Frontend: SSE stream Finding event
        else Severity below threshold
            Server->>Server: Skip/filter out finding
        end
    end
    Server->>Database: UPDATE scans row (mark complete)
    Server->>Frontend: SSE stream completion event
    Frontend->>Store: Call refreshScanCount()
    Store->>Server: GET /history
    Server->>Database: Fetch scan history
    Database->>Server: Return history
    Server->>Store: Return history entries
    Store->>Store: Update scanCount state
    Frontend->>Frontend: Reactive UI updates (usage bar, count display)
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs
🚥 Pre-merge checks: ✅ Passed checks (4 passed) | ❌ Failed checks (1 inconclusive)
Review Summary by Qodo

Phase 3 Polish: Fix FK violations, add severity filtering, and improve SSE handling
Walkthrough

Description

• Resolve foreign key violations by inserting scan records before findings
• Add severity filtering for scan results with min_severity parameter
• Implement reactive usage store for sidebar scan count tracking
• Simplify regex patterns and add manual filtering in AI code engine
• Apply consistent code formatting across scanner and server crates
• Improve SSE subscription logic with cached event replay and live streaming
• Increase rate limit from 10 to 100 scans per minute
• Update setup documentation with clearer step-by-step instructions

Diagram

```mermaid
flowchart LR
    A["Scan Request"] -->|"Phase 1: Insert scan record"| B["scans table"]
    A -->|"Phase 2: Start background task"| C["Scanner Engine"]
    C -->|"Stream findings"| D["scan_results table"]
    D -->|"Update counts"| B
    B -->|"Cache results"| E["results store"]
    E -->|"Replay + Live stream"| F["SSE Subscriber"]
    G["min_severity filter"] -->|"Applied during scan"| C
    H["Usage Store"] -->|"Refresh after complete"| I["Sidebar UI"]
```
File Changes

1. crates/scanner/src/engines/ai_code.rs
Code Review by Qodo
1. Direct fetch in +page.svelte
CI Feedback 🧐

A test triggered by this PR failed. Here is an AI-generated analysis of the failure:
```ts
          api_key: aiConfig.apiKey,
          model: aiConfig.model,
          endpoint: aiConfig.endpoint || undefined,
        } : undefined
      } : undefined,
      min_severity: selectedMinSeverity !== "info" ? selectedMinSeverity : undefined
    })
```
1. Direct fetch in +page.svelte 📘 Rule violation ⚙ Maintainability
The scan page modifies a direct fetch request instead of using the typed API client wrapper, reducing consistency and type safety for frontend network calls. This violates the requirement to route frontend network access through apps/web/src/lib/api.ts.
Agent Prompt
## Issue description
The scan page is constructing/modifying request payloads inside an ad-hoc `fetch` flow instead of calling the typed API wrapper.
## Issue Context
Compliance requires frontend network calls to be centralized in `apps/web/src/lib/api.ts` for consistent behavior and typing.
## Fix Focus Areas
- apps/web/src/routes/scan/+page.svelte[62-120]
- apps/web/src/routes/scan/+page.svelte[141-195]
- apps/web/src/lib/api.ts[48-137]
ⓘ Copy this prompt and use it to remediate the issue with your preferred AI generation tools
```ts
let selectedLanguage = $state("python");
let selectedMinSeverity = $state("info");
```
2. No Zod for min_severity 📘 Rule violation ⛨ Security
A new user input (selectedMinSeverity / min_severity) is introduced and sent to the API without Zod-based validation. This weakens input validation guarantees and violates the Zod-only validation requirement for forms/API inputs.
Agent Prompt
## Issue description
A new UI/API input (`min_severity`) is added without Zod validation.
## Issue Context
Compliance requires using Zod schemas for validating all new/changed form inputs and API inputs.
## Fix Focus Areas
- apps/web/src/routes/scan/+page.svelte[23-25]
- apps/web/src/routes/scan/+page.svelte[71-76]
- apps/web/src/routes/scan/+page.svelte[339-352]
- apps/web/src/lib/api.ts[14-20]
```rust
if let Err(e) = sqlx::query(
    "INSERT INTO scans (id, language, target_name, findings_count, severity_counts)
     VALUES ($1, $2, $3, 0, $4)",
)
.bind(scan_id)
.bind(&payload_lang)
.bind("Manual Scan")
.bind(serde_json::to_value(std::collections::HashMap::<String, i32>::new()).unwrap())
.execute(&state.db)
```
3. New serde_json unwrap in server 📘 Rule violation ☼ Reliability
The server introduces new .unwrap() calls when serializing severity_counts, which can panic and crash the API process. This violates the no-unwrap/expect requirement for Rust library/API code.
Agent Prompt
## Issue description
New `.unwrap()` calls were introduced in the API server when serializing JSON for DB writes; these can panic and crash the process.
## Issue Context
This is API/server code (`crates/server`) and must avoid panics by propagating/handling errors.
## Fix Focus Areas
- crates/server/src/main.rs[321-339]
- crates/server/src/main.rs[400-408]
- crates/server/src/main.rs[501-518]
- crates/server/src/main.rs[566-575]
```rust
let (tx, _rx) = broadcast::channel(100);
state.scans.insert(scan_id, tx.clone());
state.results.insert(scan_id, Vec::new());
```
4. Leaked scan state on db fail 🐞 Bug ☼ Reliability
run_scan/run_workspace_scan insert scan_id into in-memory maps before initializing the DB scan row; if the INSERT INTO scans fails, the handler returns early without removing those entries, leaving an orphaned scan channel/results in memory. This can cause memory leaks and SSE subscriptions that never complete for that scan_id.
Agent Prompt
### Issue description
`run_scan` and `run_workspace_scan` insert `scan_id` into `state.scans` and `state.results` before inserting the corresponding row into `scans`. If the DB insert fails, the handler returns an error without removing those map entries, leaving an orphaned in-memory scan state.
### Issue Context
This can leak memory and create “ghost” scans that never progress/complete.
### Fix Focus Areas
- crates/server/src/main.rs[294-339]
- crates/server/src/main.rs[467-518]
### Suggested fix
- Move `state.scans.insert(...)` and `state.results.insert(...)` to *after* the successful `INSERT INTO scans ...`.
- Or, if you keep the current order, ensure you remove the entries on the error return path (and consider adding a small helper to avoid duplicating cleanup between scan types).
```rust
let mut seen_count = 0;
let live_stream = tokio_stream::wrappers::BroadcastStream::new(rx)
    .filter_map(|msg| msg.ok())
    .filter(move |_| {
        // Skip the first N events that were already in our cache clone
        // This is a simple heuristic; a real implementation would use event IDs.
        if seen_count < num_cached {
            seen_count += 1;
            false
        } else {
            true
        }
    })
```
5. SSE drops first live events 🐞 Bug ≡ Correctness
subscribe_to_scan skips the first num_cached messages from the live broadcast stream, but num_cached includes events that occurred before the subscriber connected and therefore will not be present in the broadcast receiver. This can drop the first real-time findings/progress events after a client subscribes.
Agent Prompt
### Issue description
`subscribe_to_scan` computes `num_cached` from the full cached event list and then skips that many events from the live broadcast stream. Because the broadcast receiver does not replay pre-subscription messages, this skip can discard legitimate new events.
### Issue Context
Cache contains *all* events up to the clone time; live stream contains only events *after* subscribe. Skipping `num_cached` assumes live contains duplicates of the entire cache, which is not true.
### Fix Focus Areas
- crates/server/src/main.rs[596-635]
### Suggested fix
- Remove the `seen_count/num_cached` skip filter entirely.
- If you still need de-duplication, add a monotonic event sequence number (or timestamp+nonce) to emitted events, and:
- Send cached events first.
- Then forward only live events with `seq > last_cached_seq`.
- Alternatively, store `last_cached_seq` in the cache clone and filter on that instead of counts.
Actionable comments posted: 11
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (4)
crates/scanner/tests/integration.rs (1)
246-254: ⚠️ Potential issue | 🟠 Major

Test robustness: “sorted by severity” assertion is too strict

The multi-engine test constructs severities, sorts a clone, and then uses assert_eq!(severities, sorted, ...) (Lines 246-254). This requires the entire vector order to match, including the relative order of findings that share the same severity. If there are multiple findings with equal severity, the ordering among them may legitimately differ while still being “severity descending”. A more robust check is to assert monotonic non-increasing severities across adjacent elements.

🛠️ Proposed more robust assertion

```diff
-    let severities: Vec<&Severity> = findings.iter().map(|f| &f.severity).collect();
-    let mut sorted = severities.clone();
-    sorted.sort_by(|a, b| b.cmp(a));
-    assert_eq!(
-        severities, sorted,
-        "Findings should be sorted by severity descending"
-    );
+    let severities: Vec<&Severity> = findings.iter().map(|f| &f.severity).collect();
+    assert!(
+        severities
+            .windows(2)
+            .all(|w| w[0] >= w[1]),
+        "Findings should be sorted by severity descending"
+    );
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/tests/integration.rs` around lines 246 - 254, the current test collects severities into severities (Vec<&Severity>), clones it into sorted and asserts exact equality (assert_eq!(severities, sorted)) which is too strict for equal-severity items; change the assertion to verify non-increasing order instead: iterate adjacent pairs of severities (e.g., using windows or a simple for loop over indices) and assert that each earlier severity >= the next, replacing the assert_eq on the whole vector; update the check near the code that builds severities, sorted, and calls sorted.sort_by (remove or keep sort_by only as needed for comparison) and replace the equality assertion with the monotonicity assertion so equal severities may appear in any relative order.

crates/scanner/src/engines/sca.rs (1)
271-321: ⚠️ Potential issue | 🟠 Major

Potential correctness risk: zip() may silently truncate OSV batch results.

In query_osv, you populate results via:

```rust
for (dep, query_result) in chunk.iter().zip(batch_resp.results.iter()) { ... }
```

If OSV ever returns fewer results than queries (or in the unlikely case ordering/length diverges), zip() will stop early and you’ll fail to push (dep.clone(), vec![]) for the remaining deps—leading to missing/partial findings.

Suggested defensive approach (recommended refactor):

- iterate by index over chunk
- pull batch_resp.results.get(i) and default to None/empty vulns if missing

🔧 Proposed fix sketch

```diff
-for (dep, query_result) in chunk.iter().zip(batch_resp.results.iter()) {
-    let vulns = query_result.vulns.clone().unwrap_or_default();
-    results.push((dep.clone(), vulns));
-}
+for (i, dep) in chunk.iter().enumerate() {
+    let query_result = batch_resp.results.get(i);
+    let vulns = query_result
+        .and_then(|qr| qr.vulns.clone())
+        .unwrap_or_default();
+    results.push((dep.clone(), vulns));
+}
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/src/engines/sca.rs` around lines 271 - 321, the use of zip in query_osv can silently truncate results when OSV returns fewer entries than queries; change the chunk processing to iterate by index (for i in 0..chunk.len()) and for each dep use batch_resp.results.get(i) to obtain the corresponding result (or None) and default to an empty vec when missing, then push (dep.clone(), vulns) for every index so every dependency in the chunk is always accounted for even if the response is shorter or out-of-sync.

apps/web/src/routes/scan/+page.svelte (2)
54-200: 🛠️ Refactor suggestion | 🟠 Major

This route needs to be split before it grows further.

The page is already far past the 200-line Svelte cap, and the new severity controls plus duplicated single/workspace scan flow add even more parallel logic to keep in sync. Please extract the scan form/results pieces or move the shared orchestration into reusable helpers.

As per coding guidelines, apps/web/src/**/*.svelte: Component files max 200 lines — split into smaller components if larger.

Also applies to: 339-393
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@apps/web/src/routes/scan/`+page.svelte around lines 54 - 200, The page exceeds the 200-line Svelte limit and duplicates scan orchestration; extract the scan UI and event handling into smaller components and reusable helpers: move the form and results display into new Svelte components (e.g., ScanForm.svelte and ScanResults.svelte) and factor the SSE/scan orchestration logic into shared functions (e.g., startScan(), subscribeToScanEvents(scan_id) used by runScan and runWorkspaceScan) so runScan and runWorkspaceScan only prepare payloads and call the shared helpers; ensure you preserve symbols scanProgress, scanStatus, findings, isScanning and call refreshScanCount from the shared completion handler.
62-77: 🛠️ Refactor suggestion | 🟠 Major

Keep scan POSTs in the typed API client.

This page still builds both scan requests with raw fetch, which breaks the repo's API-boundary rule and leaves the new min_severity contract duplicated in two places. Move these POSTs into $lib/api.ts and call the typed helper from here.

As per coding guidelines, apps/web/src/**/*.{ts,tsx,svelte}: All fetch calls must go through typed API client functions in apps/web/src/lib/api.ts.

Also applies to: 141-159
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@apps/web/src/routes/scan/`+page.svelte around lines 62 - 77, Replace the inline fetch POSTs with a typed API client function: add a function in apps/web/src/lib/api.ts (e.g., submitScan or createScan) that accepts the same payload fields (code, language, engines, optional ai_config with provider/api_key/model/endpoint, and optional min_severity) and performs the POST to /api/v1/scan returning the typed response; then update apps/web/src/routes/scan/+page.svelte to call that new client function instead of using fetch (do this for both occurrences around lines 62-77 and 141-159) so the min_severity contract and all fetches live in the typed API surface.
🧹 Nitpick comments (4)
crates/scanner/tests/integration.rs (3)
140-148: Suggested: strengthen GitHub token test to validate engine/severity (not just non-empty)

The GitHub token test now only asserts !findings.is_empty() (Line 147). To make this integration test more meaningful (and less dependent on scan ordering/contents), consider asserting at least one finding matches the expected engine and minimum severity (e.g., Engine::Secrets and Severity::High/Critical, whichever is the intended contract).

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/tests/integration.rs` around lines 140 - 148, Update the test secrets_detects_github_token to not only assert findings is non-empty but to assert at least one finding has engine == Engine::Secrets and a severity at or above the intended threshold (e.g., Severity::High or Severity::Critical); locate the test function secrets_detects_github_token and the scan(...) call, iterate/scan the returned findings to check finding.engine == Engine::Secrets and compare finding.severity against Severity::High (or the correct enum variant) and fail the test if no such finding exists.
90-97: Minor: avoid double to_lowercase() in weak-hashing predicate

In sast_detects_weak_hashing, the predicate calls f.title.to_lowercase() twice (Lines 93-95). This allocates twice and is slightly noisy. Cache the lowercase title once inside the closure.

♻️ Suggested tweak

```diff
-    assert!(
-        findings
-            .iter()
-            .any(|f| f.title.to_lowercase().contains("md5")
-                || f.title.to_lowercase().contains("hash")),
-        "Expected a weak-hashing finding"
-    );
+    assert!(
+        findings.iter().any(|f| {
+            let t = f.title.to_lowercase();
+            t.contains("md5") || t.contains("hash")
+        }),
+        "Expected a weak-hashing finding"
+    );
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/tests/integration.rs` around lines 90 - 97, In the sast_detects_weak_hashing test, the predicate currently calls f.title.to_lowercase() twice which causes two allocations; modify the closure to compute let title = f.title.to_lowercase(); once and then use title.contains("md5") || title.contains("hash") so the lowercase string is reused. Update the iterator predicate (the closure over f) to cache the lowercase title before the contains checks to eliminate the duplicate to_lowercase() call.
151-166: Potential brittleness: avoid assuming findings[0] is the RSA private key

In secrets_detects_rsa_private_key, you check non-empty (Line 163) but then assert findings[0].severity == Severity::Critical (Line 164). If the scan can emit multiple findings (even for the same input), [0] could be the “wrong” one and make the test flaky.

Prefer iter().any(...) to assert “there exists a critical RSA-private-key finding” rather than relying on index 0.

♻️ Proposed change

```diff
-    assert!(!findings.is_empty(), "Expected a private key finding");
-    assert_eq!(findings[0].severity, Severity::Critical);
+    assert!(!findings.is_empty(), "Expected a private key finding");
+    assert!(
+        findings.iter().any(|f| f.severity == Severity::Critical),
+        "Expected at least one Critical private key finding"
+    );
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/tests/integration.rs` around lines 151 - 166, the test secrets_detects_rsa_private_key is brittle because it assumes findings[0] is the RSA key; instead update the assertion to check that at least one finding matches the RSA private key criteria (e.g., Severity::Critical and the finding kind or name indicates an RSA private key) by using findings.iter().any(|f| /* predicate */) so the test asserts existence rather than relying on index 0; locate the test function secrets_detects_rsa_private_key and replace the assert_eq!(findings[0].severity, Severity::Critical) with an assert!(findings.iter().any(...)) using a predicate that checks severity and identifies the RSA-private-key finding.

crates/scanner/src/engines/sca.rs (1)
1-437: Run formatting + clippy as required by repo guidelines.

The PR appears to be formatting/polish-heavy; please ensure the repository requirement is met before merge:

- cargo fmt
- cargo clippy -- -D warnings

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@crates/scanner/src/engines/sca.rs` around lines 1 - 437, Run cargo fmt and cargo clippy -- -D warnings and fix any issues they report; specifically run `cargo fmt` to apply formatting and then `cargo clippy -- -D warnings` and address lint errors/warnings coming from functions and types in this file (e.g. parse_dependencies, parse_package_lock, parse_requirements_txt, parse_go_sum, map_cvss_severity, query_osv, run) by adjusting code to satisfy clippy (remove unused imports/variables, handle Result unwraps/expectations, simplify patterns, add explicit types or docs as required) until clippy exits cleanly with zero warnings.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@apps/web/src/lib/stores/usage.ts`:
- Around line 6-9: refreshScanCount() currently calls getHistory() which returns
a capped page (latest 50) so scanCount will max out and not reflect real usage;
change refreshScanCount to call a dedicated usage/count endpoint (e.g. create
and use getUsageCount() or fetch '/api/v1/usage' that returns the total scan
count) and set scanCount from that response instead of history.length; update
any callers that rely on refreshScanCount to expect a numeric total and add
minimal error handling for the usage fetch in the refreshScanCount function.
In `@apps/web/src/routes/`+layout.svelte:
- Around line 67-69: Clamp the progress percent and remaining scans before
rendering: compute a safe percent (e.g. use Math.min(100, ( $scanCount /
maxScans )*100) with a guard for maxScans === 0) and a non-negative remaining
count (e.g. Math.max(0, maxScans - $scanCount)), then use those values in the
progress bar div inline style and in the paragraph text instead of using
($scanCount / maxScans) * 100 and maxScans - $scanCount directly; update
references to $scanCount and maxScans in the progress bar div and the "{maxScans
- $scanCount}/{maxScans} scans remaining" paragraph to use the clamped
variables.
In `@apps/web/src/routes/scan/`+page.svelte:
- Around line 111-119: The onerror handler is closing the EventSource during
EventSource.CONNECTING (transient retry) which aborts automatic reconnection;
update the condition in the onerror handler so it only closes when
eventSource.readyState === EventSource.CLOSED (remove EventSource.CONNECTING
from the OR), and keep isScanning false + eventSource.close() only in that
CLOSED branch; reference the eventSource variable and isScanning flag in your
change.
In `@crates/scanner/src/engines/ai_code.rs`:
- Around line 115-117: The two uses of Regex::new(...).unwrap() (the
decorator-detection Regex instances created in ai_code.rs) introduce panics in
library code; replace the .unwrap() calls by propagating the compile error
instead—change the surrounding code to return Result<..., regex::Error> (or
propagate via ?), remove the unwraps, and map/return the Regex::new(...) error
to the caller so regex construction failures are handled instead of panicking;
update the function/closure signatures that construct these Regexes accordingly
(and adjust callers) so errors flow through.
- Around line 205-213: The suppression check is too broad: l.contains("user")
hides valid routes like "/users/{id}"; update the condition inside the rule.name
== "Unauthenticated Route Handler" block to only suppress when "user" appears as
an auth-related token or standalone identifier (not as part of "users"). Replace
l.contains("user") with a stricter match (e.g., a regex word-boundary check or
explicit tokens) such as testing
r"\b(user|current_user|user_id|get_current_user|authenticated_user)\b" or
equivalent so "/users" no longer matches while "{user}" or "current_user" still
suppress. Reference: the if block checking rule.name == "Unauthenticated Route
Handler" and the variable l.
- Around line 154-155: The current Regex::new(...) in
crates/scanner/src/engines/ai_code.rs is too narrow (only matches `url =
"http://..."` and `fetch("http://..."`); broaden the pattern used when
constructing the Regex in the code (the Regex::new call) to match other common
HTTP usages like function calls and attribute accesses (e.g.,
`requests.get("http://...")`, `axios.get("http://...")`, plain quoted
`"http://..."`, and similar call/assignment patterns) while keeping the existing
localhost/127.0.0.1 suppression logic inside the run function intact; update the
Regex::new invocation to a more general case-insensitive pattern that captures
quoted http:// URLs and typical call forms and leave the filtering in run
unchanged.
In `@crates/server/src/cve_sync/mod.rs`:
- Around line 336-340: The code currently treats the OSV severity string in
variable score as a numeric value (used to compute n), but OSV returns CVSS
vectors like "CVSS:3.1/AV:N/..." so .split('/').next() will not yield a numeric
base score; update the parsing where n is computed to detect CVSS vector strings
and extract the numeric base score by using a CVSS parser (add and use a CVSS v3
crate or helper) instead of naive splitting: if score starts_with "CVSS" parse
the vector (e.g., via a cvss crate API such as cvss::v3::Vector::from_str or
equivalent) and read its base score, otherwise fall back to parsing a numeric
string or default to 5.0; replace the current expression that sets n to call
this parser and return the parsed base score as f32. Ensure you reference the
same variable names (score and n) and update error handling to fall back to 5.0
on parse failure.
In `@crates/server/src/main.rs`:
- Around line 401-408: The code currently swallows the result of the SQL update
(the sqlx::query that binds findings.len(),
serde_json::to_value(&severity_counts) and scan_id and executes against
state_task.db) which can make the scan appear successful even when the DB write
fails; change this to handle the Result: check the .await result, log any error
via the existing logging facility and, on failure, update the scan record to a
failed/incomplete state (e.g., set a status column or failure flag on the scans
row referenced by scan_id) so /history and /results reflect the real state;
apply the same treatment to the other background persistence calls around the
553-574 region.
- Around line 323-330: The INSERT query uses serde_json::to_value(...).unwrap()
which can panic; remove the unwrap and handle the serialization error
explicitly: call serde_json::to_value(...) and propagate or map the Result to an
error path (e.g., return Err or call the existing scan-failure/cleanup routine
used elsewhere in this module) before executing sqlx::query. Update the code
around the sqlx::query/execute block (the INSERT for scans) to match error
handling style used elsewhere (check for Result from to_value, log the error
with scan_id/payload_lang, and fail the scan cleanly), and apply the same unwrap
removal/explicit handling for the other occurrences noted (around the other
serde_json::to_value calls in the file ranges mentioned).
- Around line 596-634: The current count-based deduplication using num_cached
and seen_count is wrong; remove the seen_count logic and instead deduplicate by
stable event IDs or sequence numbers: build a HashSet of IDs (from
cached_events) before creating past_stream, then in the live_stream (the
tokio_stream::wrappers::BroadcastStream created from rx) filter incoming events
by checking their event_id/sequence (extract the ID/seq from the incoming event)
against that set, only forwarding events whose ID/seq is not already present and
insert newly-seen IDs into the set; update the map/filter chain that produces
live_stream (and keep past_stream unchanged) to use this ID-based check rather
than the count-based closure so you don’t drop genuine future events.
- Around line 321-339: The pre-created scan row is left with findings_count=0 on
early exits, making it look like a successful scan; update the early-return
paths that follow the initial INSERT (the block using scan_id and the
surrounding scan workflow) to either set a terminal failure state (e.g., run
sqlx::query("UPDATE scans SET status = $1 WHERE id =
$2").bind("Failed").bind(scan_id).execute(&state.db).await) or delete the
placeholder row (sqlx::query("DELETE FROM scans WHERE id =
$1").bind(scan_id).execute(&state.db).await) before returning; factor this into
a small helper like mark_scan_failed_or_delete(scan_id, &state.db) and call it
on every early-exit path (including the other similar blocks you noted) so the
DB never retains a misleading zero-finding successful scan.
---
Outside diff comments:
In `@apps/web/src/routes/scan/`+page.svelte:
- Around line 54-200: The page exceeds the 200-line Svelte limit and duplicates
scan orchestration; extract the scan UI and event handling into smaller
components and reusable helpers: move the form and results display into new
Svelte components (e.g., ScanForm.svelte and ScanResults.svelte) and factor the
SSE/scan orchestration logic into shared functions (e.g., startScan(),
subscribeToScanEvents(scan_id) used by runScan and runWorkspaceScan) so runScan
and runWorkspaceScan only prepare payloads and call the shared helpers; ensure
you preserve symbols scanProgress, scanStatus, findings, isScanning and call
refreshScanCount from the shared completion handler.
- Around line 62-77: Replace the inline fetch POSTs with a typed API client
function: add a function in apps/web/src/lib/api.ts (e.g., submitScan or
createScan) that accepts the same payload fields (code, language, engines,
optional ai_config with provider/api_key/model/endpoint, and optional
min_severity) and performs the POST to /api/v1/scan returning the typed
response; then update apps/web/src/routes/scan/+page.svelte to call that new
client function instead of using fetch (do this for both occurrences around
lines 62-77 and 141-159) so the min_severity contract and all fetches live in
the typed API surface.
In `@crates/scanner/src/engines/sca.rs`:
- Around line 271-321: The use of zip in query_osv can silently truncate results
when OSV returns fewer entries than queries; change the chunk processing to
iterate by index (for i in 0..chunk.len()) and for each dep use
batch_resp.results.get(i) to obtain the corresponding result (or None) and
default to an empty vec when missing, then push (dep.clone(), vulns) for every
index so every dependency in the chunk is always accounted for even if the
response is shorter or out-of-sync.
In `@crates/scanner/tests/integration.rs`:
- Around line 246-254: The current test collects severities into severities
(Vec<&Severity>), clones it into sorted and asserts exact equality
(assert_eq!(severities, sorted)) which is too strict for equal-severity items;
change the assertion to verify non-increasing order instead: iterate adjacent
pairs of severities (e.g., using windows or a simple for loop over indices) and
assert that each earlier severity >= the next, replacing the assert_eq on the
whole vector; update the check near the code that builds severities, sorted, and
calls sorted.sort_by (remove or keep sort_by only as needed for comparison) and
replace the equality assertion with the monotonicity assertion so equal
severities may appear in any relative order.
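A sketch of the monotonicity check, using a stand-in `Severity` enum whose derived ordering follows declaration order:

```rust
// Stand-in for the scanner's Severity; derived Ord follows declaration order.
#[derive(Debug, PartialEq, Eq, PartialOrd, Ord, Clone, Copy)]
enum Severity { Low, Medium, High, Critical }

fn is_sorted_desc(severities: &[Severity]) -> bool {
    // windows(2) visits adjacent pairs; equal neighbors are allowed, so
    // equal-severity findings may appear in any relative order.
    severities.windows(2).all(|w| w[0] >= w[1])
}
```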
---
Nitpick comments:
In `@crates/scanner/src/engines/sca.rs`:
- Around line 1-437: Run cargo fmt and cargo clippy -- -D warnings and fix any
issues they report; specifically run `cargo fmt` to apply formatting and then
`cargo clippy -- -D warnings` and address lint errors/warnings coming from
functions and types in this file (e.g. parse_dependencies, parse_package_lock,
parse_requirements_txt, parse_go_sum, map_cvss_severity, query_osv, run) by
adjusting code to satisfy clippy (remove unused imports/variables, handle Result
unwraps/expectations, simplify patterns, add explicit types or docs as required)
until clippy exits cleanly with zero warnings.
In `@crates/scanner/tests/integration.rs`:
- Around line 140-148: Update the test secrets_detects_github_token to not only
assert findings is non-empty but to assert at least one finding has engine ==
Engine::Secrets and a severity at or above the intended threshold (e.g.,
Severity::High or Severity::Critical); locate the test function
secrets_detects_github_token and the scan(...) call, iterate/scan the returned
findings to check finding.engine == Engine::Secrets and compare finding.severity
against Severity::High (or the correct enum variant) and fail the test if no
such finding exists.
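The stricter assertion could look like this sketch; `Engine` and `Severity` here are stand-ins for the scanner crate's types:

```rust
// Stand-ins for the scanner crate's Engine/Severity; derived PartialOrd on
// Severity follows declaration order (Low < Medium < High < Critical).
#[derive(PartialEq)]
enum Engine { Secrets, Sast }
#[derive(PartialEq, PartialOrd)]
enum Severity { Low, Medium, High, Critical }
struct Finding { engine: Engine, severity: Severity }

fn has_secret_at_or_above_high(findings: &[Finding]) -> bool {
    findings
        .iter()
        .any(|f| f.engine == Engine::Secrets && f.severity >= Severity::High)
}
```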
- Around line 90-97: In the sast_detects_weak_hashing test, the predicate
currently calls f.title.to_lowercase() twice which causes two allocations;
modify the closure to compute let title = f.title.to_lowercase(); once and then
use title.contains("md5") || title.contains("hash") so the lowercase string is
reused. Update the iterator predicate (the closure over f) to cache the
lowercase title before the contains checks to eliminate the duplicate
to_lowercase() call.
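The cached-lowercase predicate, as a small sketch:

```rust
// Single-allocation version of the predicate: lowercase once, check twice.
fn mentions_weak_hashing(title: &str) -> bool {
    let title = title.to_lowercase();
    title.contains("md5") || title.contains("hash")
}
```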
- Around line 151-166: The test secrets_detects_rsa_private_key is brittle
because it assumes findings[0] is the RSA key; instead update the assertion to
check that at least one finding matches the RSA private key criteria (e.g.,
Severity::Critical and the finding kind or name indicates an RSA private key) by
using findings.iter().any(|f| /* predicate */) so the test asserts existence
rather than relying on index 0; locate the test function
secrets_detects_rsa_private_key and replace the assert_eq!(findings[0].severity,
Severity::Critical) with an assert!(findings.iter().any(...)) using a predicate
that checks severity and identifies the RSA-private-key finding.
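A sketch of the existence-based assertion, with stand-in types (the real predicate would use the scanner's `Finding` struct):

```rust
// Stand-ins for the scanner's types; the predicate checks severity plus a
// title marker instead of relying on findings[0].
#[derive(PartialEq)]
enum Severity { Medium, Critical }
struct Finding { title: String, severity: Severity }

fn has_critical_rsa_key(findings: &[Finding]) -> bool {
    findings.iter().any(|f| {
        f.severity == Severity::Critical && f.title.to_lowercase().contains("rsa")
    })
}
```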
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: 39b02cea-e8ef-44f8-a8d6-af64c4048ca2
📒 Files selected for processing (11)
- README.md
- apps/web/src/lib/api.ts
- apps/web/src/lib/stores/usage.ts
- apps/web/src/routes/+layout.svelte
- apps/web/src/routes/history/[id]/+page.svelte
- apps/web/src/routes/scan/+page.svelte
- crates/scanner/src/engines/ai_code.rs
- crates/scanner/src/engines/sca.rs
- crates/scanner/tests/integration.rs
- crates/server/src/cve_sync/mod.rs
- crates/server/src/main.rs
```typescript
export async function refreshScanCount() {
  try {
    const history = await getHistory();
    scanCount.set(history.length);
```
This store is counting a capped history page, not actual usage.
refreshScanCount() uses /api/v1/history, but the server only returns the latest 50 scans. After that point scanCount stops reflecting real usage, so anything deriving quota/billing state from this value will drift. A dedicated usage/count endpoint would be safer here.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/web/src/lib/stores/usage.ts` around lines 6 - 9, refreshScanCount()
currently calls getHistory() which returns a capped page (latest 50) so
scanCount will max out and not reflect real usage; change refreshScanCount to
call a dedicated usage/count endpoint (e.g. create and use getUsageCount() or
fetch '/api/v1/usage' that returns the total scan count) and set scanCount from
that response instead of history.length; update any callers that rely on
refreshScanCount to expect a numeric total and add minimal error handling for
the usage fetch in the refreshScanCount function.
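A sketch of the fallback logic; the `/api/v1/usage` endpoint and its `total` field are the hypothetical additions this prompt suggests:

```typescript
// Hypothetical response shape for a dedicated usage endpoint; `total` is an
// assumed field name, not an existing API contract.
interface UsageResponse {
  total?: number;
}

// Prefer the authoritative total; fall back to the (capped) history page
// length so the sidebar still shows something if the usage fetch fails.
function resolveScanCount(usage: UsageResponse | null, historyLength: number): number {
  return usage?.total ?? historyLength;
}
```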
```diff
 <div class="bg-brand-primary h-full rounded-full shadow-[0_0_8px_rgba(236,72,153,0.5)]" style="width: {($scanCount / maxScans) * 100}%"></div>
 </div>
-<p class="text-[10px] text-zinc-500 mt-2">4/10 scans remaining</p>
+<p class="text-[10px] text-zinc-500 mt-2">{maxScans - $scanCount}/{maxScans} scans remaining</p>
```
Clamp the usage math once the free tier is exhausted.
If $scanCount goes past maxScans, the bar width exceeds 100% and the remaining label becomes negative. Clamp both values so the UI stays sane after the limit is reached.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/web/src/routes/`+layout.svelte around lines 67 - 69, Clamp the progress
percent and remaining scans before rendering: compute a safe percent (e.g. use
Math.min(100, ( $scanCount / maxScans )*100) with a guard for maxScans === 0)
and a non-negative remaining count (e.g. Math.max(0, maxScans - $scanCount)),
then use those values in the progress bar div inline style and in the paragraph
text instead of using ($scanCount / maxScans) * 100 and maxScans - $scanCount
directly; update references to $scanCount and maxScans in the progress bar div
and the "{maxScans - $scanCount}/{maxScans} scans remaining" paragraph to use
the clamped variables.
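The clamped expressions can be factored as small helpers, sketched here:

```typescript
// Clamped versions of the two sidebar expressions.
function usageBarPercent(scanCount: number, maxScans: number): number {
  if (maxScans <= 0) return 0; // guard against division by zero
  return Math.min(100, (scanCount / maxScans) * 100);
}

function scansRemaining(scanCount: number, maxScans: number): number {
  return Math.max(0, maxScans - scanCount);
}
```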
```diff
 eventSource.onerror = () => {
   console.error("SSE connection failed");
-  isScanning = false;
-  eventSource.close();
+  scanStatus = "Connection lost. Reconnecting or finished.";
+  // We don't necessarily want to set isScanning to false here because SSE often reconnects automatically.
+  // But for Zenvra's simple model, we'll close it to avoid zombie states if the scan is actually done.
+  if (eventSource.readyState === EventSource.CLOSED || eventSource.readyState === EventSource.CONNECTING) {
+    isScanning = false;
+    eventSource.close();
+  }
```
🧩 Analysis chain
🌐 Web query:
On the EventSource Web API, what does `readyState === EventSource.CONNECTING` mean during `onerror`, and should the client close the connection or allow the browser to retry automatically?
💡 Result:
readyState === EventSource.CONNECTING during the onerror event means the EventSource connection has encountered an error (e.g., network issue, server error) and the browser has set the state to CONNECTING to automatically attempt reconnection, as per the WHATWG HTML spec. This is the normal reconnection process: the browser will retry after a delay (default ~3s, adjustable via retry: field), keeping readyState as CONNECTING until success (OPEN) or permanent failure (CLOSED). The client should NOT close the connection in onerror when readyState is CONNECTING. Closing it (es.close) sets readyState to CLOSED, permanently stopping retries. Official docs (MDN, WHATWG) do not recommend closing; they advise logging/handling the error and letting the browser retry automatically for transient failures. Only close for fatal/permanent errors (when readyState === CLOSED) or per app logic. Many implementations add manual reconnection logic only if readyState === CLOSED to handle cases where browser stops retrying (e.g., certain HTTP errors like 500 in some browsers).
Citations:
- 1: https://developer.mozilla.org/en-US/docs/Web/API/EventSource/readyState
- 2: https://developer.mozilla.org/en-US/docs/Web/API/EventSource
- 3: https://html.spec.whatwg.org/multipage/server-sent-events.html
- 4: https://stackoverflow.com/questions/24564030/is-an-eventsource-sse-supposed-to-try-to-reconnect-indefinitely
- 5: https://whatpr.org/html/3851/server-sent-events.html
Remove EventSource.CONNECTING from the close condition.
EventSource.CONNECTING indicates the browser is actively retrying after a transient failure. Closing the source while in this state cancels automatic reconnection, causing scans to abort on brief network hiccups. Per the WHATWG spec, clients should only close when readyState === EventSource.CLOSED (permanent failure) or allow the browser to retry automatically. Update the condition to exclude CONNECTING:
Current problematic code

```js
if (eventSource.readyState === EventSource.CLOSED || eventSource.readyState === EventSource.CONNECTING) {
  isScanning = false;
  eventSource.close();
}
```

Only close if readyState === EventSource.CLOSED, or handle retries explicitly for fatal errors, not transient ones.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@apps/web/src/routes/scan/`+page.svelte around lines 111 - 119, The onerror
handler is closing the EventSource during EventSource.CONNECTING (transient
retry) which aborts automatic reconnection; update the condition in the onerror
handler so it only closes when eventSource.readyState === EventSource.CLOSED
(remove EventSource.CONNECTING from the OR), and keep isScanning false +
eventSource.close() only in that CLOSED branch; reference the eventSource
variable and isScanning flag in your change.
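The corrected close condition reduces to a pure predicate on `readyState`, sketched here with the spec's constant values:

```typescript
// readyState constants per the WHATWG EventSource spec.
const CONNECTING = 0;
const OPEN = 1;
const CLOSED = 2;

// CONNECTING means the browser is already retrying on its own; only CLOSED is
// a permanent failure the client should react to by tearing down scan state.
function shouldCloseOnError(readyState: number): boolean {
  return readyState === CLOSED;
}
```

The `onerror` handler would then call `eventSource.close()` and set `isScanning = false` only when `shouldCloseOnError(eventSource.readyState)` is true.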
```rust
// Simplified: detect the decorator. Manual filtering below.
Regex::new(r"(?i)@(app|router)\.(delete|put|patch)\s*\(").unwrap()
})
```
🛠️ Refactor suggestion | 🟠 Major
Avoid introducing new .unwrap() in scanner library code.
Line 116 and Line 155 add new panic paths in library code. Please propagate regex construction errors instead of unwrapping.
Proposed direction

```diff
-fn build_rules() -> Vec<AiCodeRule> {
+fn build_rules() -> Result<Vec<AiCodeRule>> {
     vec![
         AiCodeRule {
             name: "Unauthenticated Route Handler",
-            regex: NO_AUTH_ROUTE_REGEX.get_or_init(|| {
-                Regex::new(r"(?i)@(app|router)\.(delete|put|patch)\s*\(").unwrap()
-            }).clone(),
+            regex: Regex::new(r"(?i)@(app|router)\.(delete|put|patch)\s*\(")?,
             ...
         },
         AiCodeRule {
             name: "Plain HTTP Endpoint (No TLS)",
-            regex: PLAIN_HTTP_REGEX.get_or_init(|| {
-                Regex::new(r#"(?i)(url\s*=\s*['"]http://|fetch\s*\(\s*['"]http://)"#).unwrap()
-            }).clone(),
+            regex: Regex::new(r#"(?i)(url\s*=\s*['"]http://|fetch\s*\(\s*['"]http://)"#)?,
             ...
         },
     ]
 }
 ...
-    let rules = build_rules();
+    let rules = build_rules()?;
```

As per coding guidelines: crates/scanner/**/*.rs: "never use .unwrap() in library code" and crates/{scanner,server}/**/*.rs: "Do not use .unwrap() or .expect() in library and API code".
Also applies to: 154-156
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/scanner/src/engines/ai_code.rs` around lines 115 - 117, The two uses
of Regex::new(...).unwrap() (the decorator-detection Regex instances created in
ai_code.rs) introduce panics in library code; replace the .unwrap() calls by
propagating the compile error instead—change the surrounding code to return
Result<..., regex::Error> (or propagate via ?), remove the unwraps, and
map/return the Regex::new(...) error to the caller so regex construction
failures are handled instead of panicking; update the function/closure
signatures that construct these Regexes accordingly (and adjust callers) so
errors flow through.
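The propagation pattern, in a std-only sketch where `compile` stands in for `regex::Regex::new` (which likewise returns a `Result`):

```rust
// `compile` is a stand-in for regex::Regex::new so the propagation pattern
// can be shown without the regex crate; the error type is simplified too.
fn compile(pattern: &str) -> Result<String, String> {
    if pattern.is_empty() {
        Err("empty pattern".to_string())
    } else {
        Ok(pattern.to_string())
    }
}

// The `?` operator surfaces a bad pattern to the caller instead of panicking.
fn build_rules() -> Result<Vec<String>, String> {
    Ok(vec![
        compile(r"(?i)@(app|router)\.(delete|put|patch)\s*\(")?,
        compile(r#"(?i)(url\s*=\s*['"]http://|fetch\s*\(\s*['"]http://)"#)?,
    ])
}
```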
```rust
// Simplified: detect http:// without localhost/127.0.0.1
Regex::new(r#"(?i)(url\s*=\s*['"]http://|fetch\s*\(\s*['"]http://)"#).unwrap()
```
Plain HTTP regex became too narrow and likely regresses detection coverage.
The new pattern only catches url = "http://..." and fetch("http://..."). It misses common forms like requests.get("http://..."), axios.get("http://..."), and other quoted URLs.
Broader pattern while keeping local-host suppression in run

```diff
-Regex::new(r#"(?i)(url\s*=\s*['"]http://|fetch\s*\(\s*['"]http://)"#).unwrap()
+Regex::new(r#"(?i)['"]http://[^'"]+['"]"#).unwrap()
```

📝 Committable suggestion

‼️ IMPORTANT: Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

```rust
// Simplified: detect http:// without localhost/127.0.0.1
Regex::new(r#"(?i)['"]http://[^'"]+['"]"#).unwrap()
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/scanner/src/engines/ai_code.rs` around lines 154 - 155, The current
Regex::new(...) in crates/scanner/src/engines/ai_code.rs is too narrow (only
matches `url = "http://..."` and `fetch("http://..."`); broaden the pattern used
when constructing the Regex in the code (the Regex::new call) to match other
common HTTP usages like function calls and attribute accesses (e.g.,
`requests.get("http://...")`, `axios.get("http://...")`, plain quoted
`"http://..."`, and similar call/assignment patterns) while keeping the existing
localhost/127.0.0.1 suppression logic inside the run function intact; update the
Regex::new invocation to a more general case-insensitive pattern that captures
quoted http:// URLs and typical call forms and leave the filtering in run
unchanged.
```rust
let n: f32 = score
    .split('/')
    .next()
    .and_then(|s| s.parse().ok())
    .unwrap_or(5.0);
```
🧩 Analysis chain
🌐 Web query:
In the OSV API schema, what is the format of `severity[].score` for CVSS entries? Is it a numeric score or a CVSS vector string such as `CVSS:3.1/...`?
💡 Result:
In the OSV API schema, the format of severity[].score for CVSS entries is a CVSS vector string such as CVSS:3.1/AV:N/AC:H/PR:N/UI:N/S:C/C:H/I:N/A:N, not a numeric score. The OSV schema defines severity as an array of objects with type (e.g., "CVSS_V3") and score as a string containing the vector for CVSS types. API responses follow this schema, returning vector strings in score, as confirmed by schema documentation, examples, and discussions where numeric scores are not provided and must be calculated separately using CVSS libraries.
Citations:
- 1: https://ossf.github.io/osv-schema/
- 2: https://another-rex.github.io/osv-schema/
- 3: How to get an integer severity score from osv Rest API google/osv.dev#2643
- 4: OSV API Response is Inconsistent with Schema google/osv.dev#2337
- 5: https://github.com/ossf/osv-schema/blob/main/proto/vulnerability.proto
Parse CVSS vectors from OSV severity entries to extract numeric base scores.
The OSV API returns CVSS entries as vector strings (e.g., CVSS:3.1/AV:N/AC:H/PR:N/UI:N/S:C/C:H/I:N/A:N), not numeric scores. The current parsing logic with .split('/').next() will fail on the vector format, causing all OSV findings to silently default to a score of 5.0 (mapping to "medium" severity). This skews severity filtering and counts. Extract the numeric base score from the vector string using a proper CVSS parsing approach.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/server/src/cve_sync/mod.rs` around lines 336 - 340, The code currently
treats the OSV severity string in variable score as a numeric value (used to
compute n), but OSV returns CVSS vectors like "CVSS:3.1/AV:N/..." so
.split('/').next() will not yield a numeric base score; update the parsing where
n is computed to detect CVSS vector strings and extract the numeric base score
by using a CVSS parser (add and use a CVSS v3 crate or helper) instead of naive
splitting: if score starts_with "CVSS" parse the vector (e.g., via a cvss crate
API such as cvss::v3::Vector::from_str or equivalent) and read its base score,
otherwise fall back to parsing a numeric string or default to 5.0; replace the
current expression that sets n to call this parser and return the parsed base
score as f32. Ensure you reference the same variable names (score and n) and
update error handling to fall back to 5.0 on parse failure.
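A testable sketch of the dispatch: the actual CVSS base-score computation is delegated to an injected parser (in real code, e.g. the `cvss` crate), since the base score must be computed from the vector, not read out of it:

```rust
// Std-only sketch. `cvss_base` stands in for a real CVSS parser (e.g. the
// `cvss` crate's v3 Base score); only the dispatch/fallback logic is shown.
fn parse_osv_score(score: &str, cvss_base: impl Fn(&str) -> Option<f32>) -> f32 {
    if score.to_ascii_uppercase().starts_with("CVSS:") {
        // OSV returns vector strings like "CVSS:3.1/AV:N/..."; a naive
        // split('/') would never yield a number here.
        cvss_base(score).unwrap_or(5.0)
    } else {
        // Some entries do carry a plain numeric score.
        score.parse().unwrap_or(5.0)
    }
}
```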
```rust
// ─── Phase 1: Initialize Scan Record ─────────────────────────────────────
// We MUST insert the scan record before any findings to avoid FK violations.
if let Err(e) = sqlx::query(
    "INSERT INTO scans (id, language, target_name, findings_count, severity_counts)
     VALUES ($1, $2, $3, 0, $4)",
)
.bind(scan_id)
.bind(&payload_lang)
.bind("Manual Scan")
.bind(serde_json::to_value(std::collections::HashMap::<String, i32>::new()).unwrap())
.execute(&state.db)
.await
{
    tracing::error!("Failed to initialize scan record {}: {}", scan_id, e);
    return Err((
        StatusCode::INTERNAL_SERVER_ERROR,
        "Database initialization error".to_string(),
    ));
}
```
Don't leave pre-created scan rows looking like successful zero-finding scans.
These branches insert into scans before the worker starts, but the non-Complete exits just break and clean up memory. If the worker errors or the channel closes early, history keeps a findings_count = 0 row that looks successful, and the new usage counter will count it as real usage too. Persist a terminal failed state or delete the placeholder row on every early-exit path.
Also applies to: 399-416, 501-519, 566-575
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/server/src/main.rs` around lines 321 - 339, The pre-created scan row
is left with findings_count=0 on early exits, making it look like a successful
scan; update the early-return paths that follow the initial INSERT (the block
using scan_id and the surrounding scan workflow) to either set a terminal
failure state (e.g., run sqlx::query("UPDATE scans SET status = $1 WHERE id =
$2").bind("Failed").bind(scan_id).execute(&state.db).await) or delete the
placeholder row (sqlx::query("DELETE FROM scans WHERE id =
$1").bind(scan_id).execute(&state.db).await) before returning; factor this into
a small helper like mark_scan_failed_or_delete(scan_id, &state.db) and call it
on every early-exit path (including the other similar blocks you noted) so the
DB never retains a misleading zero-finding successful scan.
```rust
if let Err(e) = sqlx::query(
    "INSERT INTO scans (id, language, target_name, findings_count, severity_counts)
     VALUES ($1, $2, $3, 0, $4)",
)
.bind(scan_id)
.bind(&payload_lang)
.bind("Manual Scan")
.bind(serde_json::to_value(std::collections::HashMap::<String, i32>::new()).unwrap())
```
🛠️ Refactor suggestion | 🟠 Major
Remove the new unwrap()s from server request paths.
These serialization calls now panic inside API/background-server code. Please handle the error explicitly and fail the scan cleanly instead.
As per coding guidelines, crates/{scanner,server}/**/*.rs: Do not use .unwrap() or .expect() in library and API code.
Also applies to: 401-405, 502-509, 567-571
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/server/src/main.rs` around lines 323 - 330, The INSERT query uses
serde_json::to_value(...).unwrap() which can panic; remove the unwrap and handle
the serialization error explicitly: call serde_json::to_value(...) and propagate
or map the Result to an error path (e.g., return Err or call the existing
scan-failure/cleanup routine used elsewhere in this module) before executing
sqlx::query. Update the code around the sqlx::query/execute block (the INSERT
for scans) to match error handling style used elsewhere (check for Result from
to_value, log the error with scan_id/payload_lang, and fail the scan cleanly),
and apply the same unwrap removal/explicit handling for the other occurrences
noted (around the other serde_json::to_value calls in the file ranges
mentioned).
```rust
let _ = sqlx::query(
    "UPDATE scans SET findings_count = $1, severity_counts = $2 WHERE id = $3",
)
.bind(findings.len() as i32)
.bind(serde_json::to_value(&severity_counts).unwrap())
.bind(scan_id)
.execute(&state_task.db)
.await;
```
Don't swallow persistence failures in the background worker.
Discarding these awaited writes with `let _ =` lets SSE report a successful scan even when final counts or workspace findings never reach Postgres. That leaves /history and /results inconsistent with what the client saw live. At minimum, log these failures and mark the scan as failed/incomplete.
Also applies to: 553-574
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/server/src/main.rs` around lines 401 - 408, The code currently
swallows the result of the SQL update (the sqlx::query that binds
findings.len(), serde_json::to_value(&severity_counts) and scan_id and executes
against state_task.db) which can make the scan appear successful even when the
DB write fails; change this to handle the Result: check the .await result, log
any error via the existing logging facility and, on failure, update the scan
record to a failed/incomplete state (e.g., set a status column or failure flag
on the scans row referenced by scan_id) so /history and /results reflect the
real state; apply the same treatment to the other background persistence calls
around the 553-574 region.
```rust
// Use a multi-stage stream: catch up from cache, then switch to live.
// We subscribe first to ensure we don't miss anything that happens during the catch-up.
let rx = state.scans.get(&id).map(|tx| tx.subscribe());

let cached_events = state
    .results
    .get(&id)
    .map(|c| c.clone())
    .unwrap_or_default();
let num_cached = cached_events.len();

let past_stream = stream::iter(cached_events).map(|event| -> Result<Event, Infallible> {
    Ok(Event::default()
        .json_data(&event)
        .unwrap_or_else(|_| Event::default()))
});

let stream: BoxedStream = if let Some(rx) = rx {
    // Scan is ongoing
    let mut seen_count = 0;
    let live_stream = tokio_stream::wrappers::BroadcastStream::new(rx)
        .filter_map(|msg| msg.ok())
        .filter(move |_| {
            // Skip the first N events that were already in our cache clone
            // This is a simple heuristic; a real implementation would use event IDs.
            if seen_count < num_cached {
                seen_count += 1;
                false
            } else {
                true
            }
        })
        .map(|event| -> Result<Event, Infallible> {
            Ok(Event::default()
                .json_data(event)
                .unwrap_or_else(|_| Event::default()))
        });

    Box::pin(past_stream.chain(live_stream))
```
🧩 Analysis chain
🌐 Web query:
Does tokio::sync::broadcast::Sender::subscribe() receive messages that were sent before the receiver subscribed, or only messages sent afterwards?
💡 Result:
Only messages sent after the receiver subscribes. tokio::sync::broadcast::Sender::subscribe creates a new Receiver handle that “will receive values sent after this call to subscribe”, and each Receiver receives clones of “all values sent after it has subscribed”. Therefore, a receiver does not get messages that were sent before the call to subscribe (though it may get lagged/overwritten behavior for messages sent after it subscribed).
Citations:
- 1: https://docs.rs/tokio/latest/tokio/sync/broadcast/struct.Sender.html
- 2: https://docs.rs/tokio/1.6.0/tokio/sync/broadcast/struct.Receiver.html
- 3: https://docs.rs/tokio/latest/tokio/sync/broadcast/index.html
- 4: https://docs.rs/tokio/latest/src/tokio/sync/broadcast.rs.html
Remove count-based event deduplication; use stable event IDs or sequence numbers instead.
The cached/live replay heuristic incorrectly drops legitimate events. When broadcast::subscribe() is called, it only receives messages sent after the subscription. Skipping the first num_cached events from the live stream therefore discards genuine future events that have no overlap with the cached batch. This needs event-based deduplication (e.g., event IDs or sequence numbers), not count-based skipping.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@crates/server/src/main.rs` around lines 596 - 634, The current count-based
deduplication using num_cached and seen_count is wrong; remove the seen_count
logic and instead deduplicate by stable event IDs or sequence numbers: build a
HashSet of IDs (from cached_events) before creating past_stream, then in the
live_stream (the tokio_stream::wrappers::BroadcastStream created from rx) filter
incoming events by checking their event_id/sequence (extract the ID/seq from the
incoming event) against that set, only forwarding events whose ID/seq is not
already present and insert newly-seen IDs into the set; update the map/filter
chain that produces live_stream (and keep past_stream unchanged) to use this
ID-based check rather than the count-based closure so you don’t drop genuine
future events.
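The ID-based approach can be sketched over plain event IDs:

```rust
use std::collections::HashSet;

// Sketch of ID-based deduplication: seed the set from the cached events,
// then let live events through only when their ID has not been seen yet.
fn dedupe_live(cached_ids: &[u64], live_ids: Vec<u64>) -> Vec<u64> {
    let mut seen: HashSet<u64> = cached_ids.iter().copied().collect();
    // insert() returns false for duplicates, so each ID passes at most once.
    live_ids.into_iter().filter(|id| seen.insert(*id)).collect()
}
```

Unlike the count-based skip, this never drops a genuinely new event, regardless of how the cache and the broadcast subscription interleave.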