-
Notifications
You must be signed in to change notification settings - Fork 1
License
docwire/wv2
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This is a patched version of the wv2 library version 0.2.3. This can be compiled with Visual Studio without libgsf and glib2 dependencies. This repository is used for the following project: *************************************************************************************************************************************************** * DocToText - A multifaceted, data extraction software development toolkit that converts all sorts of files to plain text and html. * * Written in C++, this data extraction tool has a parser able to convert PST & OST files along with a brand new API for better file processing. * * To enhance its utility, DocToText, as a data extraction tool, can be integrated with other data mining and data analytics applications. * * It comes equipped with a high grade, scriptable and trainable OCR that has LSTM neural networks based character recognition. * * * * This document parser is able to extract metadata along with annotations and supports a list of formats that include: * * DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), * * PDF, EML, HTML, Outlook (PST, OST), Image (JPG, JPEG, JFIF, BMP, PNM, PNG, TIFF, WEBP) and DICOM (DCM) * * * * Copyright (c) SILVERCODERS Ltd * * http://silvercoders.com * * * * Project homepage: * * http://silvercoders.com/en/products/doctotext * * https://www.docwire.io/
About
No description, website, or topics provided.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published