Open
Description
I'm trying to parse a PDF using the example, but parsing a small 209 kb file requires more than 5 seconds.
using namespace docwire;
std::stringstream out_stream;
std::filesystem::path("D:\\pdf.pdf") | ParseDetectedFormat<OfficeFormatsParserProvider>() | PlainTextExporter() | out_stream;
Compiler MSVC Version 19.39.33523 for x64.
Build type = release.
DocWire version - 2024.04.04.
But when parsing directly using PoDoFo - it's required 70ms...
std::string input = "D:\\pdf.pdf";
PdfMemDocument doc;
doc.Load(input);
auto& pages = doc.GetPages();
std::stringstream ss;
for (unsigned i = 0; i < pages.GetCount(); i++)
{
auto& page = pages.GetPageAt(i);
std::vector<PdfTextEntry> entries;
page.ExtractTextTo(entries);
for (auto& entry : entries)
ss << entry.Text << "\n";
}
Metadata
Metadata
Assignees
Labels
No labels