EntityRecognizer |
An EntityRecognizer is an object in Presidio that is responsible for detecting entities in text. An entity recognizer can be rule-based, a machine learning model, or a combination of both. |
Analyzer concepts |
RecognizerResult |
A RecognizerResult holds the type and span of a PII entity. |
Analyzer concepts |
RecognizerRegistry |
The RecognizerRegistry is a class in Presidio that is responsible for holding the various recognizers used by the AnalyzerEngine. |
link |
NlpEngine |
The NlpEngine is an interface that defines the methods for processing text. Presidio provides several implementations of the NlpEngine, such as SpacyNlpEngine, TransformersNlpEngine, and StanzaNlpEngine. |
Analyzer concepts |
AnalyzerEngine |
The AnalyzerEngine is the main class in Presidio that is responsible for orchestrating the PII detection in text. It uses an NlpEngine to process the text and a RecognizerRegistry to hold the different recognizers. |
Analyzer concepts |
BatchAnalyzerEngine |
The BatchAnalyzerEngine is a class in Presidio that is responsible for detecting PII entities in a batch of texts. It uses the AnalyzerEngine to process each text in the batch. |
Batch processing sample |
AnonymizerEngine |
The AnonymizerEngine is the main class in Presidio that is responsible for anonymizing PII entities in text. It uses the results from the AnalyzerEngine to perform the anonymization. |
Anonymizer concepts |
DeanonymizerEngine |
The DeanonymizerEngine is a class in Presidio that is responsible for deanonymizing text that has been anonymized by the AnonymizerEngine, given that the operation is reversible (e.g. encryption). |
Anonymizer concepts |
Operator |
An Operator is an object in Presidio that is responsible for performing the anonymization operation on a PII entity. Presidio provides several built-in operators, such as Replace, Redact, and Encrypt, and allows users to create custom operators. |
Anonymizer concepts |
BatchAnonymizerEngine |
The BatchAnonymizerEngine is a class in Presidio that is responsible for anonymizing PII entities in a batch of texts. It uses the AnonymizerEngine to perform the anonymization on each text in the batch. |
Sample |
ImageRedactorEngine |
The ImageRedactorEngine is a class in Presidio that is responsible for redacting PII entities in images. It leverages the AnalyzerEngine to detect PII entities in the text extracted from the images. |
Image redaction docs |
StructuredEngine |
The StructuredEngine is a class in Presidio that is responsible for detecting PII entities in structured data. It uses the AnalyzerEngine to detect PII entities in the text fields of the structured data. |
Image redaction docs |