Does your company handle a lot of documents?
Does it involve manual labour?
DExtr helps you improve your processes, reduce manual work and cut your costs by:
Looking for new data sources?
Searching for intelligent Data Mining Tool?
DExtr provides set of professional tools, which allows you to:
Text reconstruction. Semantic analysis.
Typical document processing is in time interval of under one second. DExtr is built on the Java 8 platform, which has the needed performance and tooling. DExtr engine uses efficiently implemented extraction strategies to boost performance using parallel recognition.
Web Services can be used to expose DExtr functionality to external or internal systems, as well as command line tool or DExtr tool library for direct batch processing. This allows straightforward integration with other systems.
Classification is essential part of data extraction process to achieve better extraction results. DExtr can use machine learning techniques to classify documents based on recognized entities.
Highly expressive domain specific API and scripting language is used to leverage the configurability of extraction and classification process. DExtr provides Document Viewer (DView) as a visual desktop application for better document visualization, extraction configuration and debugging.
Textual and Visual document representation is used simultaneously to extract data, as each bit of information helps recognize important patterns.
State-of-the-art recognition algorithms detect relevant data, enabling DExtr to outperform template based solutions.
Document Viewer (DView) is a desktop application for:
DView is a part of DExtr tools, used to facilitate DExtr in process of document data extraction. DExtr supports standard digital document formats, like PDF, DOC, TXT, XML, HOCR. DExtr can read the output of major OCRs, such as ABBYY and Tesseract.
Document Viewer