About this Catalog

Alfresco Addons Catalog is a community-driven space where customers, partners, and community members can showcase add-ons, solutions, and real-world use cases built around Alfresco. Browse the catalog to quickly discover extensions and ideas that may inspire or accelerate your projects.


Want to share your own work?

Click “+ Submit Entry” to suggest a new listing, every submission is reviewed by the Hyland Team before it appears here.

Alfresco TEngine PII Redaction

by Angel Borroy

Community

Alfresco Transform Engine that detects and redacts Personally Identifiable Information (PII) in PDF documents using Microsoft Presidio, producing a sanitized PDF or structured PII metadata.

screenshot

Compatibility ACS 25.x

License LGPL-3.0

Keywordstransformer, pdf, pii, redaction, presidio, privacy, compliance

Download

About

Integrates Microsoft Presidio into an Alfresco T-Engine to redact PII from PDFs or extract it as metadata.

Two transform modes:

  • application/pdfapplication/pdf: produces a redacted PDF with configurable label and score threshold
  • application/pdfalfresco-metadata-extract: returns structured JSON with entity counts, scores, and values, mappable to Alfresco content model properties (pii:hasPII, pii:entities, pii:countPerson, etc.)

Configurable via pii_engine_config.json:

  • entities: list of PII types (PERSON, PHONE_NUMBER, EMAIL_ADDRESS, CREDIT_CARD, …)
  • scoreThreshold: confidence threshold (0.0–1.0)
  • label: replacement text for redacted content

Deployable as a local T-Engine (Community: localTransform.pdf-pii.url) or async T-Engine (Enterprise: queue pii-engine-queue).

Update Entry Request Removal