alexghayward 10 hours ago

Hi HN,

I'm Alex, co-founder of GoMask.

We built this because every team I've worked with has the same challenge: they use production data for testing because nothing else actually works. Mock data isn't realistic enough to debug with. Snapshots go stale. Manual masking takes weeks and breaks referential integrity.

GoMask does schema-aware masking that preserves relationships and generates synthetic data that behaves like production. Built for CI/CD pipelines.

Technical approach:

- Graph analysis to discover relationships across tables automatically - Format-preserving masking that maintains referential integrity - ML-based synthetic generation trained on schema patterns, not actual data - Processing happens in your environment, we don't require shipping prod data externally - Native integrations with Jenkins, GitHub Actions, GitLab

Main use cases we're seeing: dev environments, staging, QA test data, analytics sandboxes, demo environments.

Handles PostgreSQL, MySQL, SQL Server, Oracle, MongoDB. Working on more connectors.

Free tier available to try it out. Usage-based pricing after that.

Technical questions welcome. Especially interested in edge cases we might have missed or databases we should prioritise.

Built this because we were tired of waiting days for test data ourselves.