Release notes v1.0.0:

What is new?
The workflow is now complete. After inference has been run on a set of pages, based on a second set of example pages from which the user has manually extracted the variables, the results are displayed in two new views. The first view shows the rendered pages with the variables extracted in the last processing run highlighted. The second view lists all extracted variables, consolidated and in lemmatized form. From there, the user can download the variables as a JSON, plain text, or CSV file.
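
To illustrate the three download formats, here is a minimal sketch in Python. It assumes the consolidated variables are available as a simple name-to-value mapping; the function names, the mapping shape, and the column headers are hypothetical and not taken from the tool itself.

```python
import csv
import io
import json


def to_json(variables):
    """Serialize the variables as pretty-printed JSON."""
    return json.dumps(variables, indent=2, ensure_ascii=False)


def to_text(variables):
    """Serialize the variables as plain text, one 'name: value' pair per line."""
    return "\n".join(f"{name}: {value}" for name, value in variables.items())


def to_csv(variables):
    """Serialize the variables as CSV with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["variable", "value"])  # hypothetical column names
    for name, value in variables.items():
        writer.writerow([name, value])
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical example data, not output of the tool.
    example = {"invoice_number": "A-17", "total": "42.00"}
    print(to_json(example))
    print(to_text(example))
    print(to_csv(example))
```

Using the `csv` module instead of joining strings by hand keeps values that contain commas or quotes correctly escaped.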

What has changed?
This version also brings a big refactoring: a page is now treated as a central element of the tool and is implemented as a class built on a state machine with defined states and state transitions. This required a major rework of how background preprocessing and processing operate, and of how pages are selected either as examples or for processing. The benefits are more resilient individual steps, simpler background processing algorithms, and clearly defined page behaviour.
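
The idea of a page as a state machine can be sketched as follows. This is only an illustration under assumptions: the state names, the transition table, and the `Page` interface are hypothetical, since these notes do not list the actual states or transitions used by the tool.

```python
from enum import Enum, auto


class PageState(Enum):
    # Hypothetical states; the real tool may define different ones.
    NEW = auto()
    PREPROCESSING = auto()
    PREPROCESSED = auto()
    PROCESSING = auto()
    PROCESSED = auto()
    FAILED = auto()


# Allowed transitions: current state -> set of states that may follow it.
TRANSITIONS = {
    PageState.NEW: {PageState.PREPROCESSING},
    PageState.PREPROCESSING: {PageState.PREPROCESSED, PageState.FAILED},
    PageState.PREPROCESSED: {PageState.PROCESSING},
    PageState.PROCESSING: {PageState.PROCESSED, PageState.FAILED},
    PageState.PROCESSED: {PageState.PROCESSING},  # allow re-running inference
    PageState.FAILED: {PageState.PREPROCESSING},  # allow a retry
}


class InvalidTransition(Exception):
    """Raised when a page is asked to make a transition that is not allowed."""


class Page:
    def __init__(self, number):
        self.number = number
        self.state = PageState.NEW

    def transition_to(self, new_state):
        """Move to new_state, or raise if the transition is not defined."""
        if new_state not in TRANSITIONS[self.state]:
            raise InvalidTransition(
                f"page {self.number}: {self.state.name} -> {new_state.name}"
            )
        self.state = new_state


if __name__ == "__main__":
    page = Page(1)
    page.transition_to(PageState.PREPROCESSING)
    page.transition_to(PageState.PREPROCESSED)
    print(page.state.name)
```

Keeping every allowed transition in one table is what makes the behaviour of a page clearly defined: a background job that tries to process a page that has not been preprocessed fails immediately instead of leaving the page in an inconsistent state.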
The overview view has been refactored drastically: instead of the whole PDF, only a single page is now rendered. In return, it is now possible to jump to a specific page, to select a page from a page preview, and to "scroll" through the pages as in a typical PDF viewer.
Performance has also improved: preprocessing is now much faster because the page screenshots are smaller, which increases the risk of faulty OCR but saves a lot of time. This trade-off can be improved further in the future. The memory footprint has also been reduced by no longer caching any components.
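
The effect of smaller screenshots can be estimated with simple arithmetic: scaling both sides of an image by a factor reduces the pixel count by the square of that factor. The sketch below assumes a hypothetical page size and scale factor, as these notes do not state the values the tool actually uses.

```python
def scaled_size(width, height, factor):
    """Return the new (width, height) after scaling both sides by factor."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return max(1, round(width * factor)), max(1, round(height * factor))


def pixel_ratio(width, height, factor):
    """Return the fraction of the original pixel count left after scaling."""
    new_width, new_height = scaled_size(width, height, factor)
    return (new_width * new_height) / (width * height)


if __name__ == "__main__":
    # Hypothetical example: a 2480 x 3508 pixel page scaled to half size.
    print(scaled_size(2480, 3508, 0.5))
    print(pixel_ratio(2480, 3508, 0.5))
```

Halving the side lengths leaves a quarter of the pixels to process, which suggests why preprocessing gets faster, while also giving the OCR step less detail to work with.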
Finally, the ExperimentalView has been removed completely, and the Home View now displays some welcoming information. The color palette has also changed: the tool now comes in a bluish tone, making the major upgrade visible as well.