Overview

This guide walks through creating and registering a new Document Type — the foundational step for teaching the AI how to extract data from your specific documents.

Time Required

10–15 minutes to create and register your first Document Type.

Step 1: Navigate to the Document Type Manager

  1. From the main dashboard, hover over Data Hub in the top navigation bar.
  2. In the dropdown, under Document Manager, click Document Type.

Navigating to the Document Type section.

This opens the "All Document Type" page listing existing types.

The Document Type listing page.

Step 2: Create and Configure the New Document Type

  1. Click + Create New Document Type.
  2. On the "Upload Document" screen, fill in:
    • Document Type Name: Clear, unique name (e.g., US Drivers License - CA).
    • Layout: Choose the best fit (e.g., Structured).
    • Tags (Optional): Add tags for organization.
  3. In the "UPLOAD FILE" section, drag-and-drop or browse to upload an example document.
  4. Click Next Step.

Configuring the new Document Type and uploading a sample document.

Step 3: Review the Initial (0‑Shot) Extraction

After upload, you land in the annotation interface. The system automatically performs a 0-shot extraction — an initial attempt to identify and extract data with no prior training.

The initial 0-shot extraction result in the annotation interface.

On the right, two tabs present the results:

Taxonomy (Field Names)Document Extraction (Field Values)
The Taxonomy tab lists attribute names the AI believes are present — your starting schema.The Document Extraction tab shows extracted values with confidence scores.
Initial 0-shot taxonomy list.Initial 0-shot extracted values.

Refine this into a clean, reusable taxonomy.

Step 4: Refine the Taxonomy

Editing Standard Attributes

  1. Click an attribute in the list to open its configuration panel.
  2. Define properties:
    • Attribute Name: Use a friendly, unique name (e.g., Date of Birth).
    • Annotation: Adjust the bounding box on the document image as needed.
    • Select Type: Choose the correct datatype (e.g., Date, Free Form Text).
    • Description / Instruction: Provide context for the model and reviewers.
  3. Click Save.
Date attributeFree-form text attribute
Date attribute format.Text attribute settings.

Configuring Table Attributes

  1. For a Table attribute, first set approximate Columns and Rows. Draw a bounding box around the entire table.

Annotating a table and setting its dimensions.

  1. Click Manage under "Configure Columns" to define the internal schema.

Manage a table's column schema.

  1. Configure each column's Header, Alias, Data Type, and guidance. This standardizes your table output.

Step 5: Verify the Final Taxonomy and Extraction

  1. Review the Taxonomy tab for your finalized attribute list.

Completed taxonomy list.

  1. Review the Document Extraction tab for correctness and formatting. Use tags for filtering where useful.

Final extraction results.

Cropped extraction results.

Step 6: Register the Document Type

  1. Click Register in the top-right.
  2. Status changes from Draft to Registered.

All set

Your Document Type is now live and reusable for processing new documents of the same kind.