INFORMys is a flexible form reader system designed especially for invoices and similar documents. Like most such systems, INFORMys offers a twofold user interface that assists the user during both form modeling and form reading. These operations are supported by two modules: the Form Modeler (FM), which lets the user build the model (a graph) describing the forms of a specific class, and the Form Reader (FR), which implements the recognition engine. Fig. 1 shows the basic structure of the system.
Figure 1: The basic structure of INFORMys, a flexible form reader system in which users can shape the layout of their own documents within the class of invoice-like documents.
The acceptable documents
The documents accepted by the system contain information fields that cannot be located by relying on their absolute position or on their relationship with form lines. Some fields can be located only after the corresponding instruction fields have been recognized. Typical examples of such forms are those issued by companies to account for their services (see Fig. 10).
INFORMys deals with documents in which four different kinds of ``objects'' can be found: lines, logos, instruction fields, and information fields. A document class is defined by specifying these objects and all the possible ways of relating information fields to instruction fields and other objects.
There are essentially four ways of locating an information field:
The actual structure of the form emerges during the modeling phase from a proper combination of instruction and information fields.
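
To make the idea of a form model as a graph more concrete, the following is a minimal sketch in Python, not the actual INFORMys implementation. It assumes that the four kinds of objects become typed nodes and that relationships become labeled edges. The names `ObjectKind`, `FormModel`, `anchors_of`, and `build_example_model`, the field names `total_label` and `total_value`, and the relation label `right_of` are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum


class ObjectKind(Enum):
    """The four kinds of objects named in the text."""
    LINE = "line"
    LOGO = "logo"
    INSTRUCTION_FIELD = "instruction field"
    INFORMATION_FIELD = "information field"


@dataclass
class FormModel:
    """A form class as a graph: nodes are objects, edges are relationships."""
    nodes: dict = field(default_factory=dict)  # object name -> ObjectKind
    edges: list = field(default_factory=list)  # (source, relation, target)

    def add_object(self, name, kind):
        self.nodes[name] = kind

    def relate(self, source, relation, target):
        # Both endpoints must already be part of the model.
        if source not in self.nodes or target not in self.nodes:
            raise KeyError("unknown object in relationship")
        self.edges.append((source, relation, target))

    def anchors_of(self, info_field):
        """Return (object, relation) pairs that point at the given field."""
        return [(src, rel) for src, rel, dst in self.edges if dst == info_field]


def build_example_model():
    """Build a tiny model with one instruction/information field pair."""
    model = FormModel()
    model.add_object("total_label", ObjectKind.INSTRUCTION_FIELD)
    model.add_object("total_value", ObjectKind.INFORMATION_FIELD)
    # "right_of" is a made-up relation label, not one taken from the paper.
    model.relate("total_label", "right_of", "total_value")
    return model
```

In this sketch, calling `build_example_model().anchors_of("total_value")` lists the objects from which the information field could be located, which mirrors the idea that some fields are found only after the corresponding instruction field has been recognized.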
