TessEngineMBS.GetAltoText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns an XML-formatted string with ALTO markup, built from the internal data structures.

TessEngineMBS.GetAvailableLanguages as String()

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the available languages as an array.
Example
Var OCR As TessEngineMBS // your instance of tesseract
Var AvailableLanguages() As String = OCR.GetAvailableLanguages

TessEngineMBS.GetBoolVariable(Name as String, byref value as boolean) as Boolean

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Queries boolean variable value.

Returns true if the parameter was found among Tesseract parameters.
Fills in value with the value of the parameter.
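
The following is a minimal sketch of the found/not-found pattern described above. It assumes the engine has already been initialized elsewhere; the helper name LogBoolVariable is hypothetical, and "tessedit_create_hocr" is used only as an illustrative Tesseract boolean parameter.

```xojo
// Hypothetical helper: logs a boolean Tesseract parameter.
// Assumes OCR is an initialized TessEngineMBS instance.
Sub LogBoolVariable(OCR As TessEngineMBS, Name As String)
  Var Value As Boolean
  // The function result reports whether the parameter exists;
  // Value is filled in through the byref parameter.
  If OCR.GetBoolVariable(Name, Value) Then
    System.DebugLog(Name + " = " + Value.ToString)
  Else
    System.DebugLog(Name + " was not found among the Tesseract parameters")
  End If
End Sub
```

A call could look like LogBoolVariable(OCR, "tessedit_create_hocr").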

TessEngineMBS.GetBoxText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the recognized text as a string in the same format as a box file used in training.

Coordinates refer to the original image, not just the recognition rectangle.
PageNumber is a 0-based page index that will appear in the box file.

TessEngineMBS.GetDoubleVariable(Name as String, byref value as Double) as Boolean

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Queries double variable value.

Returns true if the parameter was found among Tesseract parameters.
Fills in value with the value of the parameter.

TessEngineMBS.GetHOCRText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns an HTML-formatted string with hOCR markup, built from the internal data structures.

PageNumber is 0-based but will appear in the output as 1-based.
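
As a sketch of the page numbering described above, the helper below writes the hOCR markup for the first page to a file. It assumes the engine has already been initialized and has recognized an image; the helper name SaveFirstPageHOCR and its File parameter are hypothetical.

```xojo
// Hypothetical helper: saves the hOCR output of the first page.
// Assumes OCR is an initialized TessEngineMBS instance with an image set.
Sub SaveFirstPageHOCR(OCR As TessEngineMBS, File As FolderItem)
  // 0 is the first page; it appears as page 1 inside the hOCR output.
  Var Html As String = OCR.GetHOCRText(0)

  // TextOutputStream.Create raises an IOException if the file cannot be created.
  Var Output As TextOutputStream = TextOutputStream.Create(File)
  Output.Write(Html)
  Output.Close
End Sub
```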

TessEngineMBS.GetIntVariable(Name as String, byref value as Integer) as Boolean

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Queries integer variable value.

Returns true if the parameter was found among Tesseract parameters.
Fills in value with the value of the parameter.
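
Because the function result only reports whether the parameter exists, a small wrapper with a fallback value can be convenient. This is a sketch, not part of the plugin: IntVariableOrDefault is a hypothetical helper, and "tessedit_pageseg_mode" is used only as an illustrative Tesseract integer parameter.

```xojo
// Hypothetical helper: returns the parameter value, or DefaultValue
// when the parameter is not found among the Tesseract parameters.
Function IntVariableOrDefault(OCR As TessEngineMBS, Name As String, DefaultValue As Integer) As Integer
  Var Value As Integer
  If OCR.GetIntVariable(Name, Value) Then
    Return Value
  End If
  Return DefaultValue
End Function
```

A call could look like IntVariableOrDefault(OCR, "tessedit_pageseg_mode", -1), where -1 is an arbitrary marker for "not found".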

TessEngineMBS.GetLoadedLanguages as String()

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the loaded languages as an array.
Example
Var OCR As TessEngineMBS // your instance of tesseract
Var LoadedLanguages() As String = OCR.GetLoadedLanguages

Includes all languages loaded by the last Init, including those loaded as dependencies of other loaded languages.
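
One possible use of the returned array is to check whether a given language is active after initialization. The sketch below assumes an already initialized engine; IsLanguageLoaded is a hypothetical helper name.

```xojo
// Hypothetical helper: reports whether a language code such as "eng"
// is among the languages loaded by the last Init.
Function IsLanguageLoaded(OCR As TessEngineMBS, Language As String) As Boolean
  Var Loaded() As String = OCR.GetLoadedLanguages
  // IndexOf returns -1 when the value is not in the array.
  Return Loaded.IndexOf(Language) >= 0
End Function
```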

TessEngineMBS.GetLSTMBoxText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns box file text for LSTM training, built from the internal data structures.

Coordinates refer to the original image, not just the recognition rectangle.
PageNumber is a 0-based page index that will appear in the box file.

TessEngineMBS.GetStringVariable(Name as String) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Queries string variable value.

Returns the value of the parameter as a string.
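
Going by the signature above, this method returns the value directly instead of using a byref parameter. The sketch below assumes an initialized engine and uses "tessedit_char_whitelist" only as an illustrative Tesseract string parameter; LogCharWhitelist is a hypothetical helper. What the method returns for an unknown parameter is not stated on this page, so the empty-string check is an assumption.

```xojo
// Hypothetical helper: logs the current character whitelist.
// Assumes OCR is an initialized TessEngineMBS instance.
Sub LogCharWhitelist(OCR As TessEngineMBS)
  Var Whitelist As String = OCR.GetStringVariable("tessedit_char_whitelist")
  If Whitelist.IsEmpty Then
    // Assumption: an empty result means no whitelist is set
    // (or possibly that the parameter was not found).
    System.DebugLog("No character whitelist set")
  Else
    System.DebugLog("Whitelist: " + Whitelist)
  End If
End Sub
```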

TessEngineMBS.GetText as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the recognized text as UTF-8 text.

TessEngineMBS.GetTsvText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns a TSV-formatted string built from the internal data structures.

PageNumber is 0-based but will appear in the output as 1-based.
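
The TSV output can be split into rows and fields with plain string functions. The sketch below collects the recognized words. It assumes the column layout used by Tesseract's TSV output (level, page_num, block_num, par_num, line_num, word_num, left, top, width, height, conf, text), with level 5 marking word rows; verify this against the output of your plugin version. WordsFromTsv is a hypothetical helper name.

```xojo
// Hypothetical helper: extracts the words from text returned by GetTsvText.
Function WordsFromTsv(Tsv As String) As String()
  Var Words() As String
  // Normalize line endings so the split behaves the same on every platform.
  Var Rows() As String = Tsv.ReplaceLineEndings(EndOfLine.LF).Split(EndOfLine.LF)
  For Each Row As String In Rows
    Var Fields() As String = Row.Split(Chr(9)) // fields are tab-separated
    // Assumed layout: field 0 is the level, field 11 is the text.
    // Rows with fewer fields (blank lines, for example) are skipped.
    If Fields.Count >= 12 And Fields(0) = "5" Then
      Words.Add(Fields(11))
    End If
  Next
  Return Words
End Function
```

A call could look like WordsFromTsv(OCR.GetTsvText(0)) for the first page.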

TessEngineMBS.GetUNLVText as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the recognized text as a string in UNLV format: Latin-1 with specific reject and suspect codes.

TessEngineMBS.GetWordStrBoxText(PageNumber as Integer) as String

Type Topic Plugin Version macOS Windows Linux iOS Targets
method OCR MBS OCR Plugin 21.3 ✅ Yes ✅ Yes ✅ Yes ✅ Yes All
Returns the recognized text as a string in the same format as a WordStr box file used in training.

PageNumber is a 0-based page index that will appear in the box file.

The items on this page are in the following plugins: MBS OCR Plugin.