Skip to content

Conversation

@telackey
Copy link

@telackey telackey commented Dec 3, 2025

Change Description

The REST API only supports analyzing one text at a time. This adds support for batch processing.

The implementation is backwards compatible: if text is a string, it returns identical results as before. This simply adds support for text to be supplied as a string[] instead, in which case it returns an array of results.

Issue reference

Fixes #XX

Checklist

  • I have reviewed the contribution guidelines
  • I have signed the CLA (if required)
  • My code includes unit tests
  • All unit tests and lint checks pass locally
  • My PR contains documentation updates / additions if required - (N/A)

@telackey
Copy link
Author

telackey commented Dec 3, 2025

@microsoft-github-policy-service agree company="SolidCore.ai"

@telackey telackey marked this pull request as ready for review December 3, 2025 20:57
@SharonHart
Copy link
Contributor

Great addition @telackey!
Can you please update the openapi documentation, it is currenty not autogenerated from the routes defintion

@telackey
Copy link
Author

telackey commented Dec 9, 2025

Great addition @telackey! Can you please update the openapi documentation, it is currenty not autogenerated from the routes defintion

Can do

@telackey
Copy link
Author

telackey commented Dec 9, 2025

@SharonHart, I think I updated the relevant docs. If I missed anything, just point me in the right direction and I'll update as needed.

@SharonHart
Copy link
Contributor

@SharonHart, I think I updated the relevant docs. If I missed anything, just point me in the right direction and I'll update as needed.

spot on, looks good 🎯

SharonHart
SharonHart previously approved these changes Dec 10, 2025
RonShakutai and others added 2 commits December 11, 2025 19:15
* Refactor LangExtract recognizers to support customizable extraction parameters and improve configuration clarity for Azure OpenAI and Ollama.

* Add tests for parameter configuration and overrides in Ollama and Azure OpenAI recognizers

* Refactor Azure OpenAI and Ollama LangExtract recognizers to improve parameter handling and documentation clarity

* Update max_char_buffer parameter in OllamaLangExtractRecognizer to improve performance

* Update max_char_buffer parameter in Ollama configuration to enhance performance

* Update presidio-analyzer/presidio_analyzer/predefined_recognizers/third_party/langextract_recognizer.py

Co-authored-by: Dor Lugasi-Gal <dorlugasigal@gmail.com>

* lx to top level

---------

Co-authored-by: Omri Mendels <omri374@users.noreply.github.com>
Co-authored-by: Dor Lugasi-Gal <dorlugasigal@gmail.com>
@telackey
Copy link
Author

Thanks!

@SharonHart SharonHart dismissed their stale review December 17, 2025 11:42

The merge-base changed after approval.

@SharonHart
Copy link
Contributor

@telackey
I accidentally made the branch behind main after a force-push :(
can you please update your branch?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants