Table caption HTML tag stops parsing (Exception thrown converting HTML file) · Issue #156 · docwire/docwire · GitHub

Table caption HTML tag stops parsing (Exception thrown converting HTML file) #156


Open
efieleke-tausight opened this issue Oct 23, 2024 · 1 comment

Comments

@efieleke-tausight

The attached HTML file causes an exception to be thrown in the 2024.10.15 release, at
plain_text_writer.cpp, line 400:
throw_if (table.empty(), "Cell content inside table without rows");
Dataset Overview _ National Centers for Environmental Information (NCEI)_files.zip

This file opens fine in a browser.

@as-ascii
Contributor

Thank you for the report.

This part is probably the cause of the issue:

<table class="table table-condensed table-bordered multi-table">
                                                <caption class="text-left"> Lineage information for: repository</caption>
                                                <tbody><tr>

Tags like "caption" are rarely used inside HTML tables. Support for this kind of structure should be tested in detail and probably improved.
Fortunately, the fix should be quite easy.
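
To illustrate the suspected failure mode, here is a minimal sketch. It is not DocWire's actual code: the `Tag`, `Event`, and `Table` types and both `build_*` functions are hypothetical, and it assumes the caption text reaches the writer as if it were cell content before any row has been opened. Under that assumption, the naive version hits the same "no rows yet" guard, while the caption-aware version stores the caption separately and keeps the guard for genuine cells.

```cpp
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical, simplified model of a table-to-text writer (not DocWire's API).
enum class Tag { Caption, Row, Cell };

struct Event {
    Tag tag;
    std::string text;  // used by Caption and Cell events; empty for Row
};

struct Table {
    std::string caption;
    std::vector<std::vector<std::string>> rows;
};

// Naive handling: caption text is treated like cell content, so a caption
// that precedes the first row trips the "no rows yet" guard.
Table build_naive(const std::vector<Event>& events) {
    Table table;
    for (const auto& e : events) {
        switch (e.tag) {
        case Tag::Row:
            table.rows.emplace_back();
            break;
        case Tag::Caption:
        case Tag::Cell:
            if (table.rows.empty())
                throw std::runtime_error("Cell content inside table without rows");
            table.rows.back().push_back(e.text);
            break;
        }
    }
    return table;
}

// Caption-aware handling: the caption is stored on the table itself, and the
// guard still rejects real cells that appear outside any row.
Table build_caption_aware(const std::vector<Event>& events) {
    Table table;
    for (const auto& e : events) {
        switch (e.tag) {
        case Tag::Row:
            table.rows.emplace_back();
            break;
        case Tag::Caption:
            table.caption = e.text;
            break;
        case Tag::Cell:
            if (table.rows.empty())
                throw std::runtime_error("Cell content inside table without rows");
            table.rows.back().push_back(e.text);
            break;
        }
    }
    return table;
}
```

Feeding both functions the event sequence caption, row, cell (mirroring the snippet above) makes `build_naive` throw and `build_caption_aware` return a table with the caption set and one row. Whether the real parser emits the caption this way would need to be confirmed against the DocWire sources.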

as-ascii changed the title from "Exception thrown converting HTML file" to "Table caption HTML tag stops parsing (Exception thrown converting HTML file)" on Oct 25, 2024