GovStack API Security Guide

Overview

GovStack API implements a comprehensive security model based on API key authentication with role-based access control. This document outlines security features, best practices, and configuration guidelines.

Authentication System

API Key Authentication

All API endpoints (except public health checks) require authentication via the X-API-Key header:

X-API-Key: your-secure-api-key-here

API Key Structure

API keys are validated against a predefined set of valid keys with associated permissions:

VALID_API_KEYS = {
    "gs-master-key": {
        "name": "master",
        "permissions": ["read", "write", "delete", "admin"],
        "description": "Master API key with full access"
    },
    "gs-admin-key": {
        "name": "admin", 
        "permissions": ["read", "write", "admin"],
        "description": "Admin API key with management access"
    }
}

API Key Types

Master API Key

Environment Variable: GOVSTACK_API_KEY
Permissions: read, write, delete, admin (full access)
Use Case: Full system administration, all operations
Default Development: gs-dev-master-key-12345

Admin API Key

Environment Variable: GOVSTACK_ADMIN_API_KEY
Permissions: read, write, admin (no delete)
Use Case: Content management, audit log access
Default Development: gs-dev-admin-key-67890

Security Dependencies

The security system provides FastAPI dependencies for permission checking:

# Permission-based dependencies
require_read_permission()    # GET operations
require_write_permission()   # POST operations  
require_delete_permission()  # DELETE operations
require_admin_permission()   # Audit logs, admin functions

Permission Levels

Read Permission

Endpoints: All GET endpoints
Operations:
- View documents and metadata
- Access chat history
- Retrieve webpage data
- Get collection statistics
- Check crawl status

Write Permission

Endpoints: POST endpoints
Operations:
- Upload documents
- Create chat messages
- Start web crawls
- Extract text content
- Fetch webpages

Delete Permission

Endpoints: DELETE endpoints
Operations:
- Remove documents
- Delete chat sessions
- Clean up resources

Admin Permission

Endpoints: Admin-specific endpoints
Operations:
- Access audit logs
- View system statistics
- Administrative functions

Audit Trail Integration

Automatic Audit Logging

The security system automatically logs all authenticated actions:

async def log_audit_action(
    user_id: str,
    action: str,
    resource_type: str,
    resource_id: Optional[str] = None,
    details: Optional[dict] = None,
    request: Optional[Request] = None,
    api_key_name: Optional[str] = None
):
    """Log user actions for audit trail."""

Audit Information Captured

For every authenticated request, the system records:

User ID: API key name or user identifier
Action: Operation performed (e.g., 'upload', 'delete', 'chat')
Resource Type: Type of resource affected (e.g., 'document', 'chat')
Resource ID: Specific resource identifier
IP Address: Client IP address
User Agent: Client browser/application information
Timestamp: When the action occurred
Details: Additional context (e.g., file size, collection ID)

Security Dependencies with Audit

# Create audit-enabled dependency
def create_audit_dependency(action: str, resource_type: str):
    """Create dependency that requires permission and logs audit trail."""
    
    async def audit_dependency(
        request: Request,
        api_key_info: APIKeyInfo = Depends(require_permission)
    ):
        # Log the action
        await log_audit_action(
            user_id=api_key_info.name,
            action=action,
            resource_type=resource_type,
            request=request,
            api_key_name=api_key_info.name
        )
        return api_key_info
    
    return audit_dependency

Security Configuration

Environment Variables

Required Security Variables

# Master API key with full permissions
GOVSTACK_API_KEY="your-secure-master-key-here"

# Admin API key with read/write permissions  
GOVSTACK_ADMIN_API_KEY="your-secure-admin-key-here"

Database Security

# Strong database password
POSTGRES_PASSWORD="your-strong-db-password"

# Secure database connection string
DATABASE_URL="postgresql+asyncpg://postgres:password@host:5432/db"

ChromaDB Security

# ChromaDB authentication
CHROMA_USERNAME="your-chroma-username"
CHROMA_PASSWORD="your-secure-chroma-password"
CHROMA_CLIENT_AUTHN_CREDENTIALS="username:password"

MinIO Security

# MinIO access credentials
MINIO_ACCESS_KEY="your-minio-access-key"  
MINIO_SECRET_KEY="your-minio-secret-key"

API Key Best Practices

Key Generation

Use cryptographically secure random generators
Minimum 32 characters length
Include alphanumeric and special characters
Avoid predictable patterns

Example secure key format:

GOVSTACK_API_KEY="gs-prod-$(openssl rand -hex 32)"

Key Management

Rotation: Rotate API keys regularly (every 90 days recommended)
Storage: Store keys in secure environment variables or secret management systems
Scope: Use minimal required permissions for each key
Monitoring: Log and monitor API key usage
Revocation: Have a process to quickly revoke compromised keys

Key Distribution

Never commit API keys to version control
Use secure channels for key distribution
Implement key provisioning workflows
Document key ownership and purpose

Production Security Checklist

Environment Setup

Infrastructure Security

API Security

Monitoring and Auditing

Common Security Configurations

Development Environment

# Development - Less restrictive but still secure
GOVSTACK_API_KEY="gs-dev-master-key-12345"
GOVSTACK_ADMIN_API_KEY="gs-dev-admin-key-67890"
DEV_MODE=true
LOG_LEVEL=DEBUG

Production Environment

# Production - Maximum security
GOVSTACK_API_KEY="gs-prod-$(openssl rand -hex 32)"
GOVSTACK_ADMIN_API_KEY="gs-prod-admin-$(openssl rand -hex 32)"
DEV_MODE=false
LOG_LEVEL=INFO

Staging Environment

# Staging - Production-like security
GOVSTACK_API_KEY="gs-staging-$(openssl rand -hex 24)"
GOVSTACK_ADMIN_API_KEY="gs-staging-admin-$(openssl rand -hex 24)"
DEV_MODE=false
LOG_LEVEL=INFO

Error Handling

Authentication Errors

The API returns specific error codes for authentication issues:

Missing API Key

{
  "detail": "Missing API key",
  "status_code": 401
}

Invalid API Key

{
  "detail": "Invalid API key", 
  "status_code": 401
}

Insufficient Permissions

{
  "detail": "Insufficient permissions for this operation",
  "status_code": 403
}

Security Headers

Recommended security headers for production:

Strict-Transport-Security: max-age=31536000; includeSubDomains
X-Content-Type-Options: nosniff
X-Frame-Options: DENY
X-XSS-Protection: 1; mode=block
Content-Security-Policy: default-src 'self'

Incident Response

Compromised API Key

Immediate: Rotate the compromised key
Investigate: Check access logs for unauthorized usage
Update: Update all systems using the old key
Monitor: Watch for continued unauthorized access attempts
Document: Record the incident and response actions

Suspected Breach

Isolate: Restrict access to affected systems
Assess: Determine scope and impact
Contain: Prevent further unauthorized access
Recover: Restore systems from secure backups if needed
Learn: Update security measures based on lessons learned

Compliance Considerations

Data Protection

Encrypt data in transit and at rest
Implement data retention policies
Provide data access and deletion capabilities
Maintain audit trails

Access Control

Implement principle of least privilege
Regular access reviews
Automated access provisioning/deprovisioning
Multi-factor authentication for administrative access

Monitoring and Logging

Comprehensive audit logging
Real-time security monitoring
Incident response procedures
Regular security assessments

Contact and Reporting

For security issues:

Review this documentation first
Check application logs
Verify API key configuration
Test with appropriate permissions

Remember: Security is a shared responsibility between the API provider and consumers. Follow these guidelines to ensure secure operations.

Security: think-ke/GovBot-Prototype

Security

docs/SECURITY.md