
Conversation

@RonShakutai
Collaborator

@RonShakutai RonShakutai commented Dec 10, 2025

Change Description

This PR introduces GPU optimizations for GLiNER, spaCy, Stanza, and Transformers.

Technical Implementation

  • Added DeviceDetector singleton for automatic GPU detection and CUDA initialization
  • Integrated GPU support into all NLP engines (SpacyNlpEngine, TransformersNlpEngine, StanzaNlpEngine, GLiNERRecognizer)
  • Dependency optimization: Using cupy-cuda12x
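
As a rough illustration of the `DeviceDetector` idea described above, here is a minimal sketch; the class name comes from the bullet list, but the method and attribute names are assumptions, not the PR's actual code:

```python
# Hypothetical sketch of a DeviceDetector singleton; only the class
# name comes from the PR description, the rest is illustrative.
import threading


class DeviceDetector:
    """Detect once whether a CUDA GPU is available and cache the result."""

    _instance = None
    _lock = threading.Lock()

    def __new__(cls):
        with cls._lock:
            if cls._instance is None:
                cls._instance = super().__new__(cls)
                cls._instance._device = cls._detect()
            return cls._instance

    @staticmethod
    def _detect() -> str:
        try:
            import torch  # optional dependency; absent on CPU-only installs
            if torch.cuda.is_available():
                return "cuda"
        except ImportError:
            pass
        return "cpu"

    @property
    def device(self) -> str:
        return self._device
```

A singleton keeps detection cost to a single check, so every NLP engine can ask for the device without re-probing CUDA.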

This PR introduces GPU handling improvements for GLiNER, spaCy, Transformers, and Stanza NLP engines, optimizing GPU detection and utilization.

Reproduce Results

On this branch:

```sh
cd presidio-analyzer
poetry run python ../benchmark_presidio.py --engines spacy/transformers/gliner/stanza --sizes 50,500 --json gpu_results.json
```

Then download the script, switch to the main branch, and re-run, making sure to change the JSON results file name in the command (e.g. to main_results.json).

Compare gpu_results.json vs main_results.json.
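
The comparison step could be sketched as follows; this assumes the benchmark JSON maps engine names to a `total_time` field, which is a guess about the script's output format:

```python
# Hypothetical comparison helper; the "total_time" key is an assumption
# about the benchmark script's JSON output format.
import json


def speedups(before: dict, after: dict) -> dict:
    """Return the per-engine speedup factor (before time / after time)."""
    return {
        engine: before[engine]["total_time"] / after[engine]["total_time"]
        for engine in before
    }


def compare_files(before_path: str, after_path: str) -> dict:
    with open(before_path) as f_before, open(after_path) as f_after:
        return speedups(json.load(f_before), json.load(f_after))
```

For example, GLiNER at 500 rows (211.21s down to 31.34s) works out to roughly a 6.7x speedup.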

Results!

GLiNER - Big Improvement

| Rows | Metric | Before Optimization | After Optimization | Improvement |
|------|--------|---------------------|--------------------|-------------|
| 50 | Total Time | 13.37s | 2.73s | 4.9x faster |
| 50 | Throughput | 3.74 texts/sec | 18.31 texts/sec | 4.9x faster |
| 50 | Entities Found | 241 | 245 | +1.7% |
| 500 | Total Time | 211.21s | 31.34s | 6.7x faster |
| 500 | Throughput | 2.37 texts/sec | 15.96 texts/sec | 6.7x faster |
| 500 | Entities Found | 2,435 | 2,469 | +1.4% |

Transformers - Big Improvement

Comparison of Transformers (StanfordAIMI/stanford-deidentifier-base)

| Rows | Metric | Before Optimization | After Optimization | Improvement |
|------|--------|---------------------|--------------------|-------------|
| 50 | Total Time | 3.64s | 0.77s | 4.7x faster |
| 50 | Throughput | 13.73 texts/sec | 64.85 texts/sec | 4.7x faster |
| 50 | Entities Found | 273 | 273 | 0% |
| 500 | Total Time | 76.78s | 7.97s | 9.6x faster |
| 500 | Throughput | 6.51 texts/sec | 62.73 texts/sec | 9.6x faster |
| 500 | Entities Found | 2,746 | 2,746 | 0% |

Stanza - Big Improvement

| Rows | Metric | Before Optimization | After Optimization | Change |
|------|--------|---------------------|--------------------|--------|
| 50 | Total Time | 9.53s | 7.57s | 1.3x faster |
| 50 | Throughput | 5.24 texts/sec | 6.61 texts/sec | 1.3x faster |
| 50 | Entities Found | 253 | 253 | 0% |
| 500 | Total Time | 141.98s | 33.77s | 4.2x faster |
| 500 | Throughput | 3.52 texts/sec | 14.81 texts/sec | 4.2x faster |
| 500 | Entities Found | 2,510 | 2,511 | +0.04% |

spaCy - No Improvement (Slight Slowdown)

Comparison of spaCy (en_core_web_lg) performance before and after GPU handling improvements.

| Rows | Metric | Before Optimization | After Optimization | Change |
|------|--------|---------------------|--------------------|--------|
| 50 | Total Time | 0.36s | 0.54s | 1.5x slower |
| 50 | Throughput | 138.58 texts/sec | 93.25 texts/sec | 1.5x slower |
| 50 | Entities Found | 235 | 235 | 0% |
| 500 | Total Time | 2.97s | 4.62s | 1.6x slower |
| 500 | Throughput | 168.60 texts/sec | 108.17 texts/sec | 1.6x slower |
| 500 | Entities Found | 2,377 | 2,377 | 0% |

Checklist

  • I have reviewed the contribution guidelines
  • I have signed the CLA (if required)
  • My code includes unit tests
  • All unit tests and lint checks pass locally
  • My PR contains documentation updates / additions if required

@RonShakutai RonShakutai self-assigned this Dec 10, 2025
@github-actions

Coverage report (presidio-anonymizer)

This PR does not seem to contain any modification to coverable code.

@github-actions

Coverage report (presidio-structured)

This PR does not seem to contain any modification to coverable code.

@github-actions

Coverage report (presidio-cli)

This PR does not seem to contain any modification to coverable code.

@github-actions

Coverage report (presidio-image-redactor)

This PR does not seem to contain any modification to coverable code.

Collaborator

@omri374 omri374 left a comment


Thanks! Long-overdue proper support for GPU workloads. Left some comments to consider.

```diff
@@ -0,0 +1,606 @@
#!/usr/bin/env python3
```
Collaborator


Consider putting the benchmark result files in a dedicated folder under docs, or omit them from the repo. There's a chance for this to become stale very quickly

Collaborator Author


It's only for the GPU tests. I think we will need something more organized later on.


```python
logger.debug(f"Loading SpaCy and transformers models: {self.models}")

# Configure GPU if available
```
Collaborator


Already called in the super

Collaborator Author


fixed.

```diff
 "phonenumbers (>=8.12,<10.0.0)",
-"pydantic (>=2.0.0,<3.0.0)"
+"pydantic (>=2.0.0,<3.0.0)",
+"cupy-cuda12x>=13.4.1",
```
Collaborator


Would this install CUDA? Will this work for CPU-only machines?

Collaborator Author


Yes, but I believe it should go in a new GPU extras section to give users more flexibility regarding GPU dependencies, and avoid installing what is not in use.

@RonShakutai
Collaborator Author

> Thanks! Long-overdue proper support for GPU workloads. Left some comments to consider.

Hi @omri374,

This PR is not ready for full review yet. I mainly want someone with a different GPU to run it and check whether the GPU performance optimizations improve performance on their setup.

Before finalizing, I'll remove the benchmark script; I kept it only so others can measure GPU improvements on their machines.

I'm also considering adding an extra gpu section in pyproject.toml, so people can either install a common set of GPU dependencies or use their own GPU libraries.
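
A sketch of what that extras section could look like in pyproject.toml; the extra name `gpu` and any dependencies beyond `cupy-cuda12x` are hypothetical, not settled:

```toml
# Hypothetical optional-dependency group; names and pins are illustrative.
[project.optional-dependencies]
gpu = [
    "cupy-cuda12x>=13.4.1",
]
```

Users could then opt in with `pip install "presidio-analyzer[gpu]"`, while CPU-only installs stay unchanged.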

@RonShakutai RonShakutai marked this pull request as ready for review December 15, 2025 11:46
@RonShakutai RonShakutai requested a review from a team as a code owner December 15, 2025 11:46
@github-actions

github-actions bot commented Dec 15, 2025

Coverage report (presidio-analyzer)

Click to see where and how coverage changed

Files with coverage changes:

- presidio-analyzer/presidio_analyzer/nlp_engine/__init__.py
- presidio-analyzer/presidio_analyzer/nlp_engine/device_detector.py
- presidio-analyzer/presidio_analyzer/nlp_engine/spacy_nlp_engine.py
- presidio-analyzer/presidio_analyzer/nlp_engine/stanza_nlp_engine.py (292)
- presidio-analyzer/presidio_analyzer/nlp_engine/transformers_nlp_engine.py
- presidio-analyzer/presidio_analyzer/predefined_recognizers/ner/gliner_recognizer.py (116)

This report was generated by python-coverage-comment-action

@tamirkamara
Contributor

@RonShakutai Is this meant to be supported inside the Docker image without additional changes?

@RonShakutai RonShakutai requested a review from omri374 December 15, 2025 12:29
@RonShakutai
Collaborator Author

RonShakutai commented Dec 15, 2025

> @RonShakutai Is this meant to be supported inside the Docker image without additional changes?

Yes, the device detection is automatic and requires no code changes.

@tamirkamara
Contributor

> > @RonShakutai Is this meant to be supported inside the Docker image without additional changes?
>
> Yes, the device detection is automatic and requires no code changes.

Doesn't GPU require drivers at the OS level? Not sure we install those currently.
I also see you have instructions to do pip install, which makes me think this won't work unless we do something for it. But this can be a separate issue/PR.

@RonShakutai
Collaborator Author

> Doesn't GPU require drivers at the OS level? Not sure we install those currently. I also see you have instructions to do pip install, which makes me think this won't work unless we do something for it. But this can be a separate issue/PR.

GPU execution still requires OS-level drivers (CUDA, the NVIDIA runtime), which are outside the scope of this PR.

This PR focuses on two things:

1. Correct GPU usage in the code paths for Stanza, spaCy, GLiNER, and Transformers once a GPU is available.
2. Providing an optional gpu extra that installs commonly used CUDA-compatible Python dependencies, so users can "plug and go" in most setups.

CUDA versions and drivers are highly GPU-specific and must be installed by the GPU owner, who knows their hardware best. For that reason, we do not bundle or enforce CUDA drivers.

We only add a recommended GPU dependency set via an extra in pyproject.toml. As a result, this change does not affect the Docker image and requires no Docker-level changes.

@omri374
Collaborator

omri374 commented Dec 17, 2025

> We only add a recommended GPU dependency set via an extra in pyproject.toml. As a result, this change does not affect the Docker image and requires no Docker-level changes.

For this to work in Docker, we need two things: (1) code-level adjustments, as done in this PR, and (2) Dockerfile adjustments, since our current Dockerfile doesn't install CUDA/cuDNN packages (e.g. FROM nvidia/cuda:12.1.1-cudnn8-runtime-ubuntu22.04).
I would suggest finalizing this PR to make sure that whoever wants to use GPU via the Python package can, and later continuing with GPU-compatible Dockerfiles/images. WDYT?
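
A GPU-enabled image along those lines could be sketched as follows; the base image tag comes from the comment above, while everything else (the installed packages and the `gpu` extra) is an assumption, not part of this PR:

```dockerfile
# Hypothetical GPU-enabled Dockerfile sketch; not part of this PR.
FROM nvidia/cuda:12.1.1-cudnn8-runtime-ubuntu22.04

RUN apt-get update \
    && apt-get install -y --no-install-recommends python3 python3-pip \
    && rm -rf /var/lib/apt/lists/*

# Install presidio-analyzer with the proposed (hypothetical) GPU extra
RUN pip3 install "presidio-analyzer[gpu]"
```

The container would still need to be launched with the NVIDIA container runtime (e.g. `docker run --gpus all ...`) for the GPU to be visible inside it.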

@RonShakutai
Collaborator Author

> For this to work in Docker, we need two things: (1) code-level adjustments, as done in this PR, and (2) Dockerfile adjustments, since our current Dockerfile doesn't install CUDA/cuDNN packages. I would suggest finalizing this PR to make sure that whoever wants to use GPU via the Python package can, and later continuing with GPU-compatible Dockerfiles/images. WDYT?

Totally agree, @omri374.
I think this PR should stay focused on code-level GPU support (device detection and its correct application in the different components).
GPU drivers and CUDA versions are highly hardware-specific and should be installed by the environment owner.

The Docker question, rightly raised by @tamirkamara, is a separate concern. We can address it later by providing GPU-enabled Docker images, or a solution that allows choosing the appropriate base image at build/run time depending on the available hardware.
