On-premise data discovery and classification engine with NLP for secure internal metadata scanning
A Go-based data discovery agent that scans databases, classifies sensitive data using Python/ONNX NLP sidecar, and stores only metadata - no raw data hoarding. Targets enterprises needing GDPR/DSAR compliance without exposing data to SaaS. Architecture includes PostgreSQL state DB, gRPC inter-service comms, and E2E encryption. Unlike cloud data catalog tools, this runs entirely on-prem with outbound-only SaaS sync.
Visit author’s GitHub →mjgomesvix/kero-datafinder