error after longtime Recognizing Text: · Issue #703 · datalab-to/marker · GitHub
error after longtime Recognizing Text: #703
Open
@jj-a-li

Recognizing Text: 100%|███████████████████████████████████████████████████████████████████████████████████████████████| 20371/20371 [1:00:05<00:00, 5.65it/s]
Traceback (most recent call last):
  File "/workspace/marker/bin/marker_single", line 8, in <module>
  .....
    ocr_builder(document, provider)
  File "/workspace/marker/lib/python3.10/site-packages/marker/builders/ocr.py", line 61, in __call__
    self.ocr_extraction(
  File "/workspace/marker/lib/python3.10/site-packages/marker/builders/ocr.py", line 170, in ocr_extraction
    new_spans = self.spans_from_html_chars(
  File "/workspace/marker/lib/python3.10/site-packages/marker/builders/ocr.py", line 343, in spans_from_html_chars
    if not spans[-1].html:
IndexError: list index out of range
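
For anyone reading the traceback: the failing expression is `spans[-1]`, which raises `IndexError` whenever `spans` is an empty list. The snippet below is only a minimal sketch of that failure mode and of one possible defensive check. The `Span` class and the `last_span_has_html` helper are hypothetical stand-ins written for illustration; they are not marker's actual code, and whether an empty `spans` list should be guarded or treated as a bug upstream is for the maintainers to decide.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Span:
    # Hypothetical stand-in for a span object; only the `html` field matters here.
    html: str = ""


def reproduce_index_error() -> str:
    """Index the last element of an empty list and return the error message."""
    spans: List[Span] = []
    try:
        spans[-1]  # same access pattern as the line shown in the traceback
    except IndexError as exc:
        return str(exc)
    return ""


def last_span_has_html(spans: List[Span]) -> bool:
    """Return True only if a last span exists and its `html` is non-empty."""
    # Check for emptiness first so that `spans[-1]` is never evaluated on [].
    if not spans:
        return False
    return bool(spans[-1].html)


if __name__ == "__main__":
    print(reproduce_index_error())
    print(last_span_has_html([]))
    print(last_span_has_html([Span(html="<b>x</b>")]))
```

The guard simply short-circuits on an empty list, so the check becomes "no spans yet, or the last span has no HTML" without raising.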
