Add Azure OpenAI service support to marker package #675
base: dev
Conversation
- Create AzureOpenAIService class that implements the BaseService interface
- Fix duplicate function name in test_service_init.py
- Add test case for the Azure OpenAI service
- Update README.md to document the Azure OpenAI service option
- Add sample script for converting PDFs with Azure OpenAI

This implementation allows marker to use Azure OpenAI for LLM-enhanced processing and image descriptions by configuring the azure_endpoint, azure_api_key and deployment_name parameters.
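For orientation, here is a minimal sketch of what such a service might look like, assuming the official `openai` SDK's `AzureOpenAI` client. Only the `azure_endpoint`, `azure_api_key` and `deployment_name` parameters come from the PR description; the method names, import path of `BaseService`, and `api_version` value are illustrative assumptions, not the PR's actual code.

```python
# Illustrative sketch only -- not the PR's implementation. Assumes the
# official `openai` SDK (>= 1.0); api_version and method names are guesses.
from openai import AzureOpenAI


class AzureOpenAIService:  # in the PR this implements marker's BaseService
    def __init__(self, azure_endpoint: str, azure_api_key: str, deployment_name: str):
        self.deployment_name = deployment_name
        self.client = AzureOpenAI(
            azure_endpoint=azure_endpoint,
            api_key=azure_api_key,
            api_version="2024-02-01",  # assumed; use the version your resource supports
        )

    def generate(self, prompt: str) -> str:
        # Azure routes requests by deployment name rather than model name
        response = self.client.chat.completions.create(
            model=self.deployment_name,
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content
```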
CLA Assistant Lite bot: All contributors have signed the CLA ✍️ ✅
I have read the CLA document and I hereby sign the CLA
This is because it yielded cut-off responses, which lead to invalid JSON
… be compatible with other LLM processing units
@VikParuchuri Can you please review this & merge it?
…ions The inference_blocks method in LLMImageDescriptionProcessor had a logic error where it would return an empty list when extract_images=True, effectively disabling all image description processing. This is counterintuitive and contradicts the documented behavior, where extract_images should control whether to keep images in the output, not whether to generate descriptions. This fix ensures the processor always processes image blocks and generates descriptions regardless of the extract_images setting, aligning with the expected behavior described in the CLI documentation.
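A before/after sketch of the change, reconstructed from the diff hunk and review thread below; the `return []` body of the removed guard is inferred from the discussion ("inference_blocks() returns []") rather than shown in the hunk, and the import path is an assumption.

```python
# Reconstructed sketch, not the verbatim diff. The base-class import path
# and the `return []` in the removed guard are inferred from the thread.
from typing import List

from marker.processors.llm import BaseLLMSimpleBlockProcessor  # assumed path


class LLMImageDescriptionProcessorBefore(BaseLLMSimpleBlockProcessor):
    def inference_blocks(self, document) -> List:
        blocks = super().inference_blocks(document)
        if self.extract_images:  # True by default, so this short-circuits
            return []            # and no image descriptions are ever generated
        return blocks


class LLMImageDescriptionProcessorAfter(BaseLLMSimpleBlockProcessor):
    def inference_blocks(self, document) -> List:
        # Always process image blocks; extract_images only controls whether
        # images are kept in the output, per the CLI documentation.
        return super().inference_blocks(document)
```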
Agreed. I would be interested in getting this merged, please. Thanks!
@VikParuchuri: could you have a look at this PR please? Thanks a lot :)
Thanks for the contribution! A couple of questions in the comments
```diff
@@ -41,8 +41,6 @@ class LLMImageDescriptionProcessor(BaseLLMSimpleBlockProcessor):

     def inference_blocks(self, document: Document) -> List[BlockData]:
         blocks = super().inference_blocks(document)
-        if self.extract_images:
```
Why was this removed?
I think self.extract_images defaults to True, so inference_blocks() returns []. We should remove these lines.
Indeed - I found this one during debugging: with extract_images=True (the default), the processor would never process any images, making it completely non-functional. Removing the check ensures the processor actually does its job of generating image descriptions.
We can set extract_images=False in the config, so we don't need to remove these lines.
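A hypothetical sketch of that alternative: keep the guard and override the setting per run. Only the extract_images key comes from this thread; the config-dict constructor shown here is an assumption about marker's processor API, not confirmed by the PR.

```python
# Hypothetical alternative to removing the guard: disable image extraction
# per run so inference_blocks() is not short-circuited. The config-dict
# mechanism is an assumption; only the extract_images key is from the thread.
config = {"extract_images": False}
processor = LLMImageDescriptionProcessor(config=config)
```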
@tuantran23012000 @VikParuchuri I think we're now ready to merge this? I've removed the langchain dependency.
Force-pushed from fe180a0 to 06ad1e6