hatchmoment. scored by care · not by stars

kero-datafinder

On-premise data discovery and classification engine with NLP for secure internal metadata scanning

This project was removed, hidden or re-uploaded by its author. The description is kept here as a snapshot of the idea — search for it manually on the author’s page.

A Go-based data discovery agent that scans databases, classifies sensitive data using Python/ONNX NLP sidecar, and stores only metadata - no raw data hoarding. Targets enterprises needing GDPR/DSAR compliance without exposing data to SaaS. Architecture includes PostgreSQL state DB, gRPC inter-service comms, and E2E encryption. Unlike cloud data catalog tools, this runs entirely on-prem with outbound-only SaaS sync.

Visit author’s GitHub →

mjgomesvix/kero-datafinder