Skip to content

Advanced OCR

Advanced OCR gives you full control for high-accuracy text extraction from complex documents, scanned PDFs, and low-quality images. You can choose between Paddle OCR for AI-powered accuracy or Tesseract OCR for multi-language support, then fine-tune settings like confidence level, PDF processing mode, and display options for better results.

It also includes powerful Image Preprocessing tools such as denoising, sharpening, grayscale conversion, thresholding, deskew, and layout preservation. By configuring preprocessing steps and execution order, you can significantly improve text clarity and extraction accuracy -- perfect for professional and advanced OCR workflows.

When to Use Advanced OCR

Use Advanced OCR when Easy OCR doesn't give satisfactory results, especially for:

  • Scanned documents with noise or low resolution
  • Complex layouts with tables, columns, or mixed content
  • Images taken at angles or with poor lighting
  • Documents requiring specific preprocessing steps

Video Tutorial

Step-by-Step Guide

Step 1: Open Advanced OCR

From the Kaizen OCR dashboard, click on the Advanced OCR tile to open the Advanced OCR module.

Advanced OCR Home Screen

Step 2: Add Your File

Click the file browser button or drag & drop your image or PDF into the application.

Open File

Step 3: Select Your File

Choose the document or image you want to process from the file dialog.

Select File

Step 4: Configure Settings and Start OCR

Configure your OCR settings:

  • OCR Engine: Choose between Paddle OCR or Tesseract OCR
  • Confidence Level: Adjust the minimum confidence threshold
  • Preprocessing: Enable tools like denoising, sharpening, grayscale, thresholding, and deskew
  • Execution Order: Arrange preprocessing steps in your preferred order

Once configured, click Start OCR to begin the extraction.

Start OCR

Step 5: View and Export Results

Review the extracted text in the results panel. The output will reflect the improved accuracy from your preprocessing settings.

OCR Results

Preprocessing Tools

Tool Description
Denoising Removes noise and artifacts from scanned images
Sharpening Enhances text edges for better recognition
Grayscale Converts color images to grayscale for consistent processing
Thresholding Converts images to black and white for clearer text
Deskew Straightens tilted or rotated documents
Layout Preservation Maintains the original document structure in output