Platforms to show: All Mac Windows Linux Cross-Platform

TessPageIteratorMBS class

Type Topic Plugin Version macOS Windows Linux iOS Targets
class OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
The iterator to walk over page content.

Class to iterate over tesseract page structure, providing access to all levels of the page hierarchy, without including any tesseract headers or having to handle any tesseract structures.
This is an abstract class. You can't create an instance, but you can get one from various plugin functions.

Block Types

Constant Value Description
kBlockTypeCaptionText 8 Text that belongs to an image.
kBlockTypeEquation 4 Partition belonging to an equation region.
kBlockTypeFlowingImage 9 Image that lives inside a column.
kBlockTypeFlowingText 1 Text that lives inside a column.
kBlockTypeHeadingImage 10 Image that spans more than one column.
kBlockTypeHeadingText 2 Text that spans more than one column.
kBlockTypeHorzLine 12 Horizontal Line.
kBlockTypeInlineEquation 5 Partition has inline equation.
kBlockTypeNoise 14 Lies outside of any column.
kBlockTypePulloutImage 11 Image that is in a cross-column pull-out region.
kBlockTypePulloutText 3 Text that is in a cross-column pull-out region.
kBlockTypeTable 6 Partition belonging to a table region.
kBlockTypeUnknown 0 Type is not yet known. Keep as the first element.
kBlockTypeVerticalText 7 Text-line runs vertically.
kBlockTypeVertLine 13 Vertical Line.

Justifications

Constant Value Description
kJustificationCenter 2 The text lines of the paragraph are centered about a line going down through their middle of the text lines.
kJustificationLeft 1 Each line, except possibly the first, is flush to the same left tab stop.
kJustificationRight 3 Each line, except possibly the first, is flush to the same right tab stop.
kJustificationUnknown 0 Unknown.

Level

Constant Value Description
kLevelBlock 0 Block
kLevelParagraph 1 Paragraph
kLevelSymbol 4 Symbol
kLevelTextline 2 Textline
kLevelWord 3 Word

Orientations

Constant Value Description
kOrientationPageDown 2 Orientation is down.
kOrientationPageLeft 3 Orientation is left.
kOrientationPageRight 1 Orientation is right.
kOrientationPageUp 0 Orientation is up.

Text Line Order

Constant Value Description
kTextlineOrderLefttoRight 0 Mongolian is written in vertical columns top to bottom like Chinese, but the lines order left-to right.
kTextlineOrderRighttoLeft 1 In Chinese, vertical text lines are read right-to-left.
kTextlineOrderToptoBottom 2 In English, the order is top-to-bottom.

Writing Direction

Constant Value Description
kWritingDirectionLefttoRight 0 For English text, the writing direction is left-to-right.
kWritingDirectionRighttoLeft 1 For Arabic the text direction is right to left.
kWritingDirectionToptoBottom 2 For the Chinese text in the above example, the writing direction is top-to-bottom.

Sub classes:

Some methods using this class:


The items on this page are in the following plugins: MBS OCR Plugin.


TesseractResultIteratorMBS   -   TessResultIteratorMBS


The biggest plugin in space...