Now I channel that into building: production-grade tools I maintain myself, open-source utilities that scratch real itches, and contributions to projects I actually use. Most of what I ship lives at the intersection of AI, audio/music, and security tooling.
Bootstrapped, solo-dev AI workstation for lyric writing and music production — now a multi-app platform spanning 3 deployments, 3 separate codebases, and 9 proprietary model variants. Grew from a personal tool to daily active users and paying subscribers with zero advertising.
| App | Stack | Status |
|---|---|---|
| vrsa.app | React + Vite + Capacitor 8 | Live |
| studio.vrsa.app | Next.js 16 + TypeScript + Docker | Live |
| generate.vrsa.app | Next.js + React + Zustand + WaveSurfer.js | WIP |
Platform Architecture
```mermaid
graph TB
    subgraph USER [" "]
        direction LR
        Web["🌐 Web"]
        PWA["📱 PWA"]
        Android["🤖 Android"]
    end
    Web & PWA & Android --> A
    subgraph APPS ["Three Deployments"]
        A["<b>vrsa.app</b> React · Vite · Capacitor · 32 routes · 50 components · 9 hooks · Ghostwriter · Canvas · Audio Analyzer · Suno Bridge · Album Art · Rights Mgmt"]
        B["<b>studio.vrsa.app</b> Next.js · TypeScript · Tone.js · Browser DAW — 16 tracks · 64 clips · Piano Roll · Drum Sequencer · Waveform Editor · 1,300 lines of synth presets · MIDI I/O"]
        C["<b>generate.vrsa.app</b> Next.js · TypeScript · Zustand · 11 pages · 8 stores · ~120 components · 7 gen modes · AI Radio · AI DJ · LoRA Training · WaveSurfer.js"]
    end
    Web --> B & C
    subgraph AUTH ["Shared Auth Layer"]
        D[("Supabase · .vrsa.app cookies · RLS · Edge Functions · RPCs · Storage")]
    end
    A & B & C --> D
    subgraph GPU ["Modal Backend — H100 GPU"]
        E["Async Spawn/Poll · 600s Timeout · Persistent Weight Volume"]
        E --> F["<b>6 DiT Models</b> v1.5 · v1.5-stream · v1.5-craft · v1.5-core · v1.5-shift3 · v1.5-shift1"]
        E --> G["<b>3 LM Models</b> pulse-nano 0.6B · pulse 1.7B · pulse-pro 4B"]
    end
    D --> E
    subgraph LLM ["Multi-Provider LLM Routing"]
        H["GPT · Claude · DeepSeek · Gemini · Grok · Kimi K2 · MiniMax · Mistral"]
    end
    A --> H
    subgraph MEDIA ["Media Generation"]
        I["Flux 1.1 Pro · Flux 2 · Sora via Azure OpenAI"]
    end
    A --> I
    subgraph INFRA ["Infrastructure"]
        J["107 API Endpoints · Vercel · Docker · Sentry · Honeybadger · Amplitude · Intercom · Cloudflare Turnstile · FingerprintJS · Crowdin i18n"]
    end
    style USER fill:transparent,stroke:#58A6FF,stroke-width:2px,color:#58A6FF
    style APPS fill:transparent,stroke:#8B5CF6,stroke-width:2px,color:#8B5CF6
    style AUTH fill:transparent,stroke:#3ECF8E,stroke-width:2px,color:#3ECF8E
    style GPU fill:transparent,stroke:#F97316,stroke-width:2px,color:#F97316
    style LLM fill:transparent,stroke:#EC4899,stroke-width:2px,color:#EC4899
    style MEDIA fill:transparent,stroke:#EAB308,stroke-width:2px,color:#EAB308
    style INFRA fill:transparent,stroke:#6B7280,stroke-width:2px,color:#6B7280
```
| VRSA Name | Type | Description |
|---|---|---|
| vrsa v1.5 | DiT (default) | Turbo — balanced speed + quality |
| vrsa v1.5-stream | DiT | Continuous generation mode |
| vrsa v1.5-craft | DiT | SFT — higher fidelity |
| vrsa v1.5-core | DiT | Base model |
| vrsa v1.5-shift3 / shift1 | DiT | Flow-shift variants |
| vrsa-pulse-nano | LM 0.6B | Fast token-level generation |
| vrsa-pulse | LM 1.7B | Balanced LM |
| vrsa-pulse-pro | LM 4B (default) | Full reasoning + CoT metadata |

Generation modes: Text to Music · Music to Music · Cover · Repainting · Extracting · Section by Section Build · Complete

Output formats: mp3 · flac · wav · wav32 · opus · aac

Generation surface: BPM, key/scale, time signature, guidance scale, ODE/SDE sampling, CFG interval, latent shift/rescale, batch size up to 8

| Module | Description |
|---|---|
| Ghostwriter | Chat-based lyric generation with session memory, multi-model A/B mode, take history, and album-context awareness |
| Canvas | Notion-inspired editor with inline AI edits, syllable counter, rhyme heatmap, and MP3 transcription |
| Suno Bridge | Chrome + Firefox extensions that capture auth tokens and route lyrics directly to Suno, with a version picker (v4 → v5) |
| Audio Analyzer | BPM + key/scale detection via custom VM audio engine running Essentia.js WASM |
| Album Art | Flux 1.1 Pro, Flux 2, and Sora via Azure OpenAI deployments |
| Multi-model routing | GPT, Claude, DeepSeek, Gemini, Grok, Kimi K2, MiniMax, Mistral across OpenAI, Bedrock, OpenRouter, and Google APIs |
| Rights Management | Proof-of-creation PDF certificate generator |
| MyMusic / Projects / AlbumWorkspace | Full library, project, and album management |
| AdminPanel | Platform administration; in practice a view-only, CMS-style dashboard for checking stats |
| StudioPass | Subscription and billing management |
| Mobile | Capacitor 8 Android build + full PWA support |
| Auth | Cross-subdomain cookies scoped to .vrsa.app, shared across all three apps |
| Compute | Modal async generation pipeline, H100 GPU, 600s timeout, persistent weights volume |
| API | 107 endpoints mapped and routed |
| DevOps | Crowdin i18n workflow, Docker + docker-compose, multi-project deployment |
App: vrsa.app · Studio: studio.vrsa.app · Generate (WIP): generate.vrsa.app
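
The async spawn/poll pattern with a hard timeout, as listed in the Compute row above, can be sketched roughly as follows. This is a minimal illustration only: it uses a local thread pool as a stand-in for the Modal backend, and `spawn`, `poll`, `wait_for`, and `fake_generate` are hypothetical names, not the platform's actual API.

```python
import time
import uuid
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Any, Callable, Dict

# Assumption: a thread pool stands in for the remote GPU workers.
_pool = ThreadPoolExecutor(max_workers=2)
_jobs: Dict[str, Future] = {}


def spawn(fn: Callable[..., Any], *args: Any) -> str:
    """Start a job in the background and return a job id immediately."""
    job_id = uuid.uuid4().hex
    _jobs[job_id] = _pool.submit(fn, *args)
    return job_id


def poll(job_id: str) -> Dict[str, Any]:
    """Report the job state without blocking the caller."""
    future = _jobs[job_id]
    if not future.done():
        return {"status": "pending"}
    error = future.exception()
    if error is not None:
        return {"status": "failed", "error": str(error)}
    return {"status": "done", "result": future.result()}


def wait_for(job_id: str, timeout_s: float = 600.0, interval_s: float = 0.05) -> Dict[str, Any]:
    """Poll until the job finishes or the deadline passes."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        state = poll(job_id)
        if state["status"] != "pending":
            return state
        time.sleep(interval_s)
    return {"status": "timeout"}


def fake_generate(prompt: str) -> str:
    """Hypothetical placeholder for a long-running generation call."""
    time.sleep(0.1)
    return f"audio for: {prompt}"
```

The client gets a job id back right away and checks in on its own schedule, so a slow generation never holds an HTTP request open; the 600-second default mirrors the timeout quoted above.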
A curated list of 100+ AI-powered audio tools across 16 categories: music creation, voice cloning, stem separation, TTS, transcription, sound detection, and more.
Optimized scripts for running HeartMuLa music generation with maximum performance on NVIDIA GPUs, especially A100.
Full Ollama stack running on Google Colab with Gradio as a UI — spin up a local LLM in your browser with zero local setup.
Plug-and-play Colab notebook for Wan 2.2 — an advanced image-to-video model — stripped down to just work.
Built as a personal proof of concept: an open-source, AI-powered penetration testing agent that automates reconnaissance, vulnerability discovery, and analysis. It executes authorized security tests using installed tools, maintains immutable audit trails, and delivers findings with OWASP mapping and remediation guidance. Fair warning: it's a monorepo.
A comprehensive, unified research archive aggregating public releases related to the Jeffrey Epstein estate and associated investigations.
When the U.S. House Oversight Committee released 20,000+ pages of unstructured documents, the data was technically public but practically inaccessible. I built a forensic analysis tool to change that.
- Search Engine: Local RAG-ready search interface via Gradio for rapid keyword and passage analysis across the full corpus.
- Governance: Established a Responsible Use Framework to prevent misuse while keeping the tool genuinely open for researchers and journalists.
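
As a rough illustration of keyword and passage analysis over a corpus, here is a minimal sketch of term-count ranking. The function name, the scoring rule, and the sample passages are all assumptions made for the example; the tool's actual retrieval logic and its Gradio interface are not shown here.

```python
import re
from typing import Dict, List, Tuple


def search_passages(passages: Dict[str, str], query: str, top_k: int = 3) -> List[Tuple[str, int]]:
    """Rank passages by how often the query terms occur (case-insensitive)."""
    terms = re.findall(r"\w+", query.lower())
    scored = []
    for doc_id, text in passages.items():
        words = re.findall(r"\w+", text.lower())
        score = sum(words.count(term) for term in terms)
        if score > 0:
            scored.append((doc_id, score))
    # Highest score first; ties broken by document id for stable output.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:top_k]


# Hypothetical sample corpus, keyed by passage id.
corpus = {
    "p1": "Hearing schedule and hearing notes.",
    "p2": "Budget summary, page 4.",
    "p3": "Notes from the second hearing.",
}
print(search_passages(corpus, "hearing"))
```

Plain term counting is the simplest possible baseline; a retrieval-augmented setup would typically swap the scoring step for embeddings while keeping the same passage-in, ranked-ids-out shape.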
The original eDEX-UI — a well-known sci-fi terminal emulator — was archived with an unresolved security vulnerability. I forked it, patched the flaw, modernized the codebase, removed all deprecated dependencies, and cut a clean release.
All legacy code has been replaced, a few additional minor vulnerabilities have been remediated, and the dependency tree is fully up to date.
Contributed a helper function to automate efficient SQL datatype selection for pandas DataFrames — reducing manual overhead and ensuring optimal storage across schema generation workflows.
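
To show the general idea of picking the narrowest SQL type that fits a column, here is a minimal sketch. It is not the contributed helper: it works on plain Python column values rather than a DataFrame to stay dependency-free, and the function names and the type thresholds chosen are assumptions for illustration.

```python
from typing import Dict, List, Optional, Sequence


def pick_sql_type(values: Sequence[Optional[object]]) -> str:
    """Choose a compact SQL type for one column, ignoring missing values."""
    present = [v for v in values if v is not None]
    if not present:
        return "TEXT"  # nothing to inspect, fall back to a permissive type
    if all(isinstance(v, bool) for v in present):
        return "BOOLEAN"
    if all(isinstance(v, int) and not isinstance(v, bool) for v in present):
        lo, hi = min(present), max(present)
        if -32768 <= lo and hi <= 32767:
            return "SMALLINT"
        if -2147483648 <= lo and hi <= 2147483647:
            return "INTEGER"
        return "BIGINT"
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in present):
        return "DOUBLE PRECISION"
    # Anything else is stored as text sized to the longest value seen.
    longest = max(len(str(v)) for v in present)
    return f"VARCHAR({longest})"


def pick_schema(columns: Dict[str, List[Optional[object]]]) -> Dict[str, str]:
    """Map each column name to its selected SQL type."""
    return {name: pick_sql_type(vals) for name, vals in columns.items()}
```

Sizing types from the observed data keeps storage tight, at the cost of needing a wider type later if new rows exceed the ranges seen at schema-generation time.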
Local AI chat workbench with multi-provider support (AWS Bedrock, OpenAI, Azure, Gemini), built on a Deno-powered server with a browser-based UI.