A stateless reverse proxy that intercepts LLM API requests, applies parallel regex and spaCy NER-based PII detection to tokenize sensitive data before forwarding to OpenAI/Anthropic, then de-anonymizes responses using ephemeral in-memory token mappings scoped to the conversation context.
```
Client                       Proxy                          LLM Provider
  |                            |                                |
  | POST /v1/proxy/openai/...  |                                |
  |--------------------------->|                                |
  |                            |                                |
  |                 +----------+----------+                     |
  |                 | Parallel Detection  |                     |
  |                 | Regex | NER (spaCy) |                     |
  |                 +----------+----------+                     |
  |                            |                                |
  |       Tokenize: "John Smith"   -> {{PERSON_1}}              |
  |                 "4532-1234..." -> {{CREDIT_CARD_1}}         |
  |                            |                                |
  |                            | Anonymized request             |
  |                            |------------------------------->|
  |                            |                                |
  |                            |<-------------------------------|
  |                            | Response (may contain tokens)  |
  |                            |                                |
  |       De-anonymize using token mappings                     |
  |                            |                                |
  |<---------------------------|                                |
  | Response with original data|                                |
```
| Method | Implementation | Detects | Execution |
|---|---|---|---|
| Regex | Compiled patterns from `patterns.json` | Credit cards, emails, SSN, phone, IP | Parallel |
| NER | spaCy `en_core_web_sm` | Person names, organizations, locations | Parallel |
Both methods run concurrently via ThreadPoolExecutor. Results are merged with regex taking priority on overlapping spans.
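As an illustration, the parallel detection and merge step might look like the sketch below. The detector functions, span tuples, and merge rule are simplified stand-ins for the real tokenizer in `app/anonymization/tokenizer.py`, not its actual API:

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Hypothetical simplified detectors; the real ones use patterns.json and spaCy.
EMAIL_RE = re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b")

def regex_detect(text):
    # Each hit is (start, end, entity_type, source).
    return [(m.start(), m.end(), "EMAIL", "regex") for m in EMAIL_RE.finditer(text)]

def ner_detect(text):
    # Stand-in for the spaCy pass; pretend it flagged "John Smith".
    idx = text.find("John Smith")
    return [] if idx < 0 else [(idx, idx + len("John Smith"), "PERSON", "ner")]

def detect(text):
    # Run both detectors concurrently, as the proxy does.
    with ThreadPoolExecutor(max_workers=2) as pool:
        regex_future = pool.submit(regex_detect, text)
        ner_future = pool.submit(ner_detect, text)
        spans = regex_future.result() + ner_future.result()
    # Merge: sort regex spans first so they win on overlap, then
    # drop any later span that overlaps one already accepted.
    spans.sort(key=lambda s: (s[3] != "regex", s[0]))
    merged = []
    for span in spans:
        if all(span[1] <= kept[0] or span[0] >= kept[1] for kept in merged):
            merged.append(span)
    return sorted(merged, key=lambda s: s[0])

hits = detect("Contact John Smith at john@example.com")
# Non-overlapping spans from both detectors survive the merge.
```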
| Type | Token Format | Detection Method |
|---|---|---|
| CREDIT_CARD | {{CREDIT_CARD_N}} | Regex |
| EMAIL | {{EMAIL_N}} | Regex |
| SSN | {{SSN_N}} | Regex |
| PHONE | {{PHONE_N}} | Regex |
| IP_ADDRESS | {{IP_ADDRESS_N}} | Regex |
| PERSON | {{PERSON_N}} | NER |
| ORG | {{ORG_N}} | NER |
| LOCATION | {{LOCATION_N}} | NER |
```bash
# Install dependencies
pip install -r requirements.txt

# Download spaCy model for NER
python -m spacy download en_core_web_sm
```

Requirements:
- Python 3.11+
- No external services (Redis, databases)
Create a `.env` file:

```
PORT=8000
ENVIRONMENT=development
LOG_LEVEL=DEBUG
OPENAI_API_KEY=sk-...        # Optional, for testing
ANTHROPIC_API_KEY=sk-ant-... # Optional, for testing
```

Start the server:

```bash
uvicorn app.main:app --port 8000
```

| Endpoint | Method | Description |
|---|---|---|
| `/v1/proxy/openai/chat/completions` | POST | OpenAI proxy with anonymization |
| `/v1/proxy/anthropic/messages` | POST | Anthropic proxy with anonymization |
| `/health` | GET | Health check |
```bash
curl -X POST http://localhost:8000/v1/proxy/openai/chat/completions \
  -H "X-Target-API-Key: YOUR_OPENAI_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4",
    "messages": [
      {"role": "user", "content": "Contact John Smith at john@example.com"}
    ]
  }'
```

The `X-Target-API-Key` header contains the user's LLM provider API key (passed through, never stored).
The same sensitive value receives the same token within a conversation. The proxy scans the entire message array and maintains a value -> token mapping:

```python
# Input messages
[
  {"role": "user", "content": "Card: 4532-1234-5678-9010"},
  {"role": "assistant", "content": "Saved"},
  {"role": "user", "content": "Verify 4532-1234-5678-9010"}
]
# Both instances map to {{CREDIT_CARD_1}}
```

This works statelessly because clients send the full conversation history with each request.
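A minimal sketch of the value -> token mapping and the matching de-anonymization step (the class name and methods here are hypothetical; the real logic lives in `app/anonymization/tokenizer.py`):

```python
class TokenMap:
    """Per-request mapping: each distinct value gets one stable token."""

    def __init__(self):
        self._tokens = {}    # original value -> token
        self._counters = {}  # entity type -> last index used

    def token_for(self, value, entity_type):
        # Repeated values reuse the token minted on first sight.
        if value not in self._tokens:
            n = self._counters.get(entity_type, 0) + 1
            self._counters[entity_type] = n
            self._tokens[value] = "{{%s_%d}}" % (entity_type, n)
        return self._tokens[value]

    def deanonymize(self, text):
        # Restore original values in the provider's response.
        for value, token in self._tokens.items():
            text = text.replace(token, value)
        return text

tokens = TokenMap()
a = tokens.token_for("4532-1234-5678-9010", "CREDIT_CARD")
b = tokens.token_for("4532-1234-5678-9010", "CREDIT_CARD")
# a and b are the same token, so both card mentions anonymize identically.
```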
app/
├── main.py # FastAPI application entry
├── config.py # Environment configuration
├── api/
│ └── proxy.py # Proxy endpoints
├── adapters/
│ ├── openai_adapter.py # OpenAI API integration
│ └── anthropic_adapter.py # Anthropic API integration
└── anonymization/
├── pattern_loader.py # Loads regex patterns from JSON
├── ner_detector.py # spaCy NER wrapper
└── tokenizer.py # Parallel detection + tokenization
Edit `patterns.json` to add regex patterns:

```json
{
  "patterns": [
    {
      "name": "credit_card",
      "type": "CREDIT_CARD",
      "regex": "\\b\\d{4}[-\\s]?\\d{4}[-\\s]?\\d{4}[-\\s]?\\d{4}\\b"
    }
  ]
}
```

Patterns are loaded at startup. Restart required for changes.
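The load-and-compile step can be sketched as follows. The function names are hypothetical stand-ins for `app/anonymization/pattern_loader.py`; the JSON matches the `patterns.json` entry shown above:

```python
import json
import re

# Same content as the patterns.json example above.
raw = r'''
{
  "patterns": [
    {
      "name": "credit_card",
      "type": "CREDIT_CARD",
      "regex": "\\b\\d{4}[-\\s]?\\d{4}[-\\s]?\\d{4}[-\\s]?\\d{4}\\b"
    }
  ]
}
'''

def load_patterns(text):
    """Compile every pattern once; done at startup in the real app."""
    return [(p["type"], re.compile(p["regex"])) for p in json.loads(text)["patterns"]]

def scan(text, compiled):
    """Return (type, matched_value) for every regex hit."""
    return [(ptype, m.group()) for ptype, rx in compiled for m in rx.finditer(text)]

compiled = load_patterns(raw)
found = scan("Card: 4532-1234-5678-9010", compiled)
```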
Edit `app/anonymization/ner_detector.py` to modify entity type mappings:

```python
ENTITY_TYPE_MAP = {
    "PERSON": "PERSON",
    "ORG": "ORG",
    "GPE": "LOCATION",
    "LOC": "LOCATION",
}
```

- Token mappings exist only in memory during request lifecycle
- Mappings are discarded immediately after response is sent
- Sensitive values are never persisted to disk or logged
- LLM providers receive only tokens, never original values
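For illustration, the `ENTITY_TYPE_MAP` label-mapping step above can be exercised without loading spaCy. The helper function and the entity tuples are hypothetical; in the real detector the labels come from spaCy's `doc.ents`:

```python
# Mirrors the mapping configured in app/anonymization/ner_detector.py.
ENTITY_TYPE_MAP = {
    "PERSON": "PERSON",
    "ORG": "ORG",
    "GPE": "LOCATION",
    "LOC": "LOCATION",
}

def map_entities(ents):
    """ents: (text, spacy_label) pairs; unmapped labels (e.g. DATE) are dropped."""
    return [(text, ENTITY_TYPE_MAP[label]) for text, label in ents if label in ENTITY_TYPE_MAP]

mapped = map_entities([("John Smith", "PERSON"), ("Paris", "GPE"), ("today", "DATE")])
# Only mapped labels survive; GPE is normalized to LOCATION.
```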
| Metric | Value |
|---|---|
| Regex detection | ~0.5ms |
| NER detection | ~2-5ms |
| Total overhead | ~3-8ms (parallel execution) |
| Memory (spaCy model) | ~50MB |
| Cold start (model load) | ~1-2s (once at startup) |
Typical LLM API calls take 1-5 seconds; proxy overhead is <1% of total latency.
- Regex patterns may produce false positives on clustered numbers
- NER accuracy depends on spaCy model quality
- No streaming response support (SSE)
- No rate limiting
- No persistent audit logging
```bash
./tests/test_openai.sh
./tests/test_anthropic.sh
./tests/test_ner.sh
./tests/test_conversation_consistency.sh
```

MIT