ocropus add-ons

Modules

 ThresholderParams
 

fxn for setting truth_* in WERD_RES


Functions

void tesseract::TessBaseAPI::AdaptToCharacter (const char *unichar_repr, int length, float baseline, float xheight, float descender, float ascender)
PAGE_RES * tesseract::TessBaseAPI::RecognitionPass1 (BLOCK_LIST *block_list)
PAGE_RES * tesseract::TessBaseAPI::RecognitionPass2 (BLOCK_LIST *block_list, PAGE_RES *pass1_result)
void tesseract::TessBaseAPI::DetectParagraphs (int debug_level)
static int tesseract::TessBaseAPI::TesseractExtractResult (char **text, int **lengths, float **costs, int **x0, int **y0, int **x1, int **y1, PAGE_RES *page_res)
const PAGE_RES * tesseract::TessBaseAPI::GetPageRes () const

Variables

Tesseract * tesseract::TessBaseAPI::tesseract_
 The underlying data object.
Tesseract * tesseract::TessBaseAPI::osd_tesseract_
 For orientation & script detection.
EquationDetect * tesseract::TessBaseAPI::equ_detect_
 The equation detector.
ImageThresholder * tesseract::TessBaseAPI::thresholder_
 Image thresholding module.
GenericVector< ParagraphModel * > * tesseract::TessBaseAPI::paragraph_models_
BLOCK_LIST * tesseract::TessBaseAPI::block_list_
 The page layout.
PAGE_RES * tesseract::TessBaseAPI::page_res_
 The page-level data.
STRING * tesseract::TessBaseAPI::input_file_
 Name used by training code.
STRING * tesseract::TessBaseAPI::output_file_
 Name used by debug code.
STRING * tesseract::TessBaseAPI::datapath_
 Current location of tessdata.
STRING * tesseract::TessBaseAPI::language_
 Last initialized language.
OcrEngineMode tesseract::TessBaseAPI::last_oem_requested_
 Last ocr language mode requested.
bool tesseract::TessBaseAPI::recognition_done_
 page_res_ contains recognition data.
TruthCallback * tesseract::TessBaseAPI::truth_cb_

Detailed Description


Function Documentation

void tesseract::TessBaseAPI::AdaptToCharacter ( const char *  unichar_repr,
int  length,
float  baseline,
float  xheight,
float  descender,
float  ascender 
) [protected, inherited]

Adapt to recognize the current image as the given character. The image must be preloaded and be just an image of a single character.
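Because AdaptToCharacter is a protected member, it can only be reached from a derived class. The sketch below illustrates that access pattern and the "image must be preloaded" precondition. It is not Tesseract code: FakeBaseAPI, AdaptingAPI, SetImageLoaded, AdaptTo, image_loaded_ and last_adapted_ are all hypothetical stand-ins, and only the parameter list of AdaptToCharacter is taken from the declaration above.

```cpp
#include <string>

// Stand-in for the API class. This is NOT the real tesseract::TessBaseAPI;
// it only mimics the documented, protected AdaptToCharacter signature.
class FakeBaseAPI {
 public:
  virtual ~FakeBaseAPI() = default;

  // Hypothetical setter that models "an image has been preloaded".
  void SetImageLoaded(bool loaded) { image_loaded_ = loaded; }

 protected:
  // Same parameter list as the documented function; the body is a
  // placeholder that just records which character was requested.
  void AdaptToCharacter(const char* unichar_repr, int length, float baseline,
                        float xheight, float descender, float ascender) {
    last_adapted_.assign(unichar_repr, length);
    (void)baseline;
    (void)xheight;
    (void)descender;
    (void)ascender;
  }

  bool image_loaded_ = false;   // hypothetical state flag
  std::string last_adapted_;    // hypothetical record of the last call
};

// A derived class is needed to reach the protected member.
class AdaptingAPI : public FakeBaseAPI {
 public:
  // Refuses to adapt (returns false) when no image is loaded or the
  // character string is empty; otherwise forwards to the protected call.
  bool AdaptTo(const std::string& utf8_char, float baseline, float xheight,
               float descender, float ascender) {
    if (!image_loaded_ || utf8_char.empty()) return false;
    AdaptToCharacter(utf8_char.data(), static_cast<int>(utf8_char.size()),
                     baseline, xheight, descender, ascender);
    return true;
  }

  const std::string& last_adapted() const { return last_adapted_; }
};
```

With the real library the wrapper would derive from tesseract::TessBaseAPI instead, and the image would be supplied through the public API before the call; the guard in AdaptTo is one possible way to enforce the documented precondition.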

void tesseract::TessBaseAPI::DetectParagraphs ( int  debug_level  )  [protected, inherited]

After text is recognized, break each paragraph into blocks.

const PAGE_RES* tesseract::TessBaseAPI::GetPageRes (  )  const [inline, protected, inherited]

PAGE_RES * tesseract::TessBaseAPI::RecognitionPass1 ( BLOCK_LIST *  block_list  )  [protected, inherited]

Recognize text doing one pass only, using settings for a given pass.

PAGE_RES * tesseract::TessBaseAPI::RecognitionPass2 ( BLOCK_LIST *  block_list,
PAGE_RES *  pass1_result 
) [protected, inherited]

int tesseract::TessBaseAPI::TesseractExtractResult ( char **  text,
int **  lengths,
float **  costs,
int **  x0,
int **  y0,
int **  x1,
int **  y1,
PAGE_RES *  page_res 
) [static, protected, inherited]

Extract the OCR results, costs (penalty points for uncertainty), and the bounding boxes of the characters.
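The signature above returns its results as parallel arrays through output parameters. The following sketch shows one way a caller might combine such arrays into one record per character. It rests on assumptions not stated on this page: that the returned int is the number of characters, and that lengths[i] is the number of bytes of text belonging to character i. CharResult and ZipResults are hypothetical names for illustration and are not part of Tesseract.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical record type for illustration; not a Tesseract type.
struct CharResult {
  std::string text;    // bytes of one recognized character
  float cost;          // penalty points for uncertainty
  int x0, y0, x1, y1;  // bounding box corners
};

// Zips n parallel arrays into one record per character.
// Assumption: lengths[i] is the byte length of character i inside text,
// so characters are laid out back to back in the text buffer.
std::vector<CharResult> ZipResults(int n, const char* text, const int* lengths,
                                   const float* costs, const int* x0,
                                   const int* y0, const int* x1,
                                   const int* y1) {
  std::vector<CharResult> out;
  out.reserve(n > 0 ? static_cast<std::size_t>(n) : 0);
  std::size_t offset = 0;  // running byte offset into text
  for (int i = 0; i < n; ++i) {
    const std::size_t len = static_cast<std::size_t>(lengths[i]);
    out.push_back(
        {std::string(text + offset, len), costs[i], x0[i], y0[i], x1[i], y1[i]});
    offset += len;
  }
  return out;
}
```

Whether the caller must free the arrays afterwards, and with which deallocation function, is not stated here and should be checked against the library source before relying on it.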


Variable Documentation

BLOCK_LIST* tesseract::TessBaseAPI::block_list_ [protected, inherited]

The page layout.

STRING* tesseract::TessBaseAPI::datapath_ [protected, inherited]

Current location of tessdata.

EquationDetect* tesseract::TessBaseAPI::equ_detect_ [protected, inherited]

The equation detector.

STRING* tesseract::TessBaseAPI::input_file_ [protected, inherited]

Name used by training code.

STRING* tesseract::TessBaseAPI::language_ [protected, inherited]

Last initialized language.

OcrEngineMode tesseract::TessBaseAPI::last_oem_requested_ [protected, inherited]

Last ocr language mode requested.

Tesseract* tesseract::TessBaseAPI::osd_tesseract_ [protected, inherited]

For orientation & script detection.

STRING* tesseract::TessBaseAPI::output_file_ [protected, inherited]

Name used by debug code.

PAGE_RES* tesseract::TessBaseAPI::page_res_ [protected, inherited]

The page-level data.

bool tesseract::TessBaseAPI::recognition_done_ [protected, inherited]

page_res_ contains recognition data.

Tesseract* tesseract::TessBaseAPI::tesseract_ [protected, inherited]

The underlying data object.

ImageThresholder* tesseract::TessBaseAPI::thresholder_ [protected, inherited]

Image thresholding module.

TruthCallback* tesseract::TessBaseAPI::truth_cb_ [protected, inherited]
Generated on Thu Feb 2 08:19:23 2012 for Tesseract by  doxygen 1.6.3