Real time PII detection and redaction in structured and unstructured data.
We’re an open source data anonymization and synthetic data platform that companies like Intel, Siemens, C2FO, Alasco and others use to anonymize their sensitive production data and sync it to lower-level environments.
Today, we’re launching a new product designed to detect and anonymize PII data in free-form text.
These are two main use cases:
For example:
The text:
{ text: "Dear Mr. John Chang, your physical therapy for your rotator cuff injury is approved for 12 sessions. Your first appointment with therapist Jake is on 8/1/2024 at 11 AM. Please bring a photo ID. We have your SSN on file as 246-80-1357. Is this correct?"}
Would be transformed to:
Anonymization result: '{"text":"Dear Mr. \u003cREDACTED\u003e, your physical therapy for your rotator cuff injury is approved for 12 sessions. Your first appointment with therapist \u003cREDACTED\u003e is on \u003cREDACTED\u003e at \u003cREDACTED\u003e. Please bring a photo ID. We have your SSN on file as \u003cREDACTED\u003e. Is this correct?"}'
You can also customize this with custom allow/deny lists and even custom recognizers.
We’re already working with companies in Healthtech and Fintech on this and would love to open it up to more companies. If you’re interested in trying it out, shoot me a note at evis@neosync.dev, and I can get you a sandbox and free credits to trial it.