Skip to content

Remove pattern system, add job post source type#63

Merged
scaleborg merged 1 commit intomainfrom
refactor/remove-pattern-system
Mar 19, 2026
Merged

Remove pattern system, add job post source type#63
scaleborg merged 1 commit intomainfrom
refactor/remove-pattern-system

Conversation

@scaleborg
Copy link
Owner

Summary

  • Remove the pattern clustering/curation system entirely (patterns_* tables, UI components, routes, tests)
  • Add job postings as a first-class source type using the existing ingest pipeline
  • Refactor retriever stats to use generic typed_sources grouping by metadata.source_type instead of hardcoded article bucket

Job Post Source Type

  • content_type="job" routes through ingest_article(source_type="job") - same pipeline, no parallel logic
  • normalize_job_text() strips boilerplate sections (About Us, Benefits, EEO) before chunking
  • Source ID: job_{hash}, metadata: source_type="job" via existing ArticleChunkMeta
  • Provided URL stored as-is, absent URL becomes job://{hash}
  • Library filter type=job, source detail, connections all work
  • AddSourceModal job post mode with textarea, optional URL, optional company name

Test plan

  • ruff check backend/ passes
  • npx tsc --noEmit passes
  • Job ingest with URL: source_url = provided URL, source_id = job_*
  • Job ingest without URL: source_url = job://{hash}
  • /library/sources?type=job returns job sources only
  • /library/sources?type=article returns articles only (unchanged)
  • Article source detail still opens correctly
  • No new tables, no new metadata classes, no parallel pipeline

🤖 Generated with Claude Code

Remove the pattern clustering/curation system entirely — patterns_*
tables, PatternCurationPanel, PatternsSection, and all related frontend
routes and views (behavioral-design, bio-augmentation, cognitive-toolkit,
embodied-ai, freelance-strategy).

Add job postings as a first-class source type:
- content_type="job" flows through the existing ingest_article pipeline
- normalize_job_text strips boilerplate (About Us, Benefits, EEO)
- source_id prefix job_, metadata source_type="job"
- generic typed_sources grouping in retriever (no hardcoded buckets)
- library filter type=job, source detail, connections all work
- AddSourceModal job post mode with optional URL and company fields
- Provided URL stored as-is, absent URL becomes job://{hash}
@scaleborg scaleborg merged commit e1b87f3 into main Mar 19, 2026
4 of 6 checks passed
@scaleborg scaleborg deleted the refactor/remove-pattern-system branch March 19, 2026 03:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant