Mobile OCR

Mobile OCR is a Flutter plugin that delivers fully on-device text detection and recognition on Android and iOS. The two platforms share the same Dart API:

Android (ONNX pipeline) – A faithful port of the PaddleOCR v5 models, executed with ONNX Runtime for high-accuracy OCR without network access.
iOS (Apple Vision) – Uses the system Vision framework, so no model downloads are required and the plugin stays lightweight.

Everything below describes the Android pipeline unless explicitly noted. The iOS implementation returns the same JSON payload so the Dart surface remains identical.

Features

Text detection (DB algorithm) with oriented bounding polygons
Text recognition (SVTR_LCNet + CTC) mirroring PaddleOCR v5 behaviour
Text angle classification and auto-rotation for skewed crops
On-device processing with no network calls
Multi-language character dictionary (Chinese + English)
Shared results structure across Android and iOS

Installation

Add this to your package's pubspec.yaml file:

dependencies:
  mobile_ocr:
    git:
      url: https://github.com/ente-io/mobile_ocr

Usage

Basic Usage

import 'package:mobile_ocr/mobile_ocr_plugin.dart';

// Create plugin instance
final ocrPlugin = MobileOcr();

// Android only: ensure ONNX models are cached locally (downloads on first run).
// No-op on iOS because Vision ships with the OS.
await ocrPlugin.prepareModels();

// Optional quick check if the image contains high-confidence text (runs much faster than actual full text recognition)
final hasText = await ocrPlugin.hasText(
  imagePath: '/path/to/image.png',
);

// Perform OCR by supplying an image path
final textBlocks = await ocrPlugin.detectText(
  imagePath: '/path/to/image.png',
);

for (final block in textBlocks) {
  print('Text: ${block.text}');
  print('Confidence: ${block.confidence}');
  print('Corners: ${block.points}');
  final bounds = block.boundingBox;
  print('Bounds: ${bounds.left}, ${bounds.top} -> ${bounds.right}, ${bounds.bottom}');
}

Detection Output

Each TextBlock mirrors the shape produced by the PaddleOCR detector:

text – recognized string
confidence – recognition probability (0–1)
points – four corner points (clockwise) describing the oriented quadrilateral; the sample app uses these to draw rotated boxes exactly as they appear in the source image
boundingBox – convenience Rect derived from the polygon for quick overlays or cropping

Using with Image Picker

import 'package:image_picker/image_picker.dart';

final ImagePicker picker = ImagePicker();
final XFile? image = await picker.pickImage(source: ImageSource.gallery);

if (image != null) {
  await ocrPlugin.prepareModels(); // Android: ensure models are ready (no-op on iOS)
  final result = await ocrPlugin.detectText(imagePath: image.path);
  // Process results...
}

Example App

The plugin includes a comprehensive example app that demonstrates:

Loading images from camera or gallery
Running OCR on selected images
Displaying detected text regions with colored overlays
Tapping on text regions to view and copy the recognized text
Toggle text overlay visibility

To run the example:

cd example
flutter run

Android Model Assets (ONNX)

The ONNX models (~20 MB total) are not bundled with the plugin. They are hosted at https://models.ente.io/PP-OCRv5/ and downloaded on demand the first time you call prepareModels(). Files are cached under context.filesDir/assets/mobile_ocr/ with SHA-256 verification so subsequent runs work offline. You can call prepareModels() during app launch to show a download progress indicator before triggering OCR.

iOS does not require this step because it relies on the built-in Vision framework.

Platform Support

Currently supports:

✅ Android (API 24+)
✅ iOS 14+

Acknowledgments

This work would not be possible without:

PaddleOCR - The original OCR models and algorithms
OnnxOCR - ONNX implementation and pipeline architecture

License

This plugin is released under the MIT License. The ONNX models are derived from PaddleOCR and follow their licensing terms.

Name		Name	Last commit message	Last commit date
Latest commit History 95 Commits
android		android
docs		docs
example		example
ios		ios
lib		lib
test		test
.gitignore		.gitignore
.metadata		.metadata
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
analysis_options.yaml		analysis_options.yaml
pubspec.yaml		pubspec.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mobile OCR

Features

Installation

Usage

Basic Usage

Detection Output

Using with Image Picker

Example App

Android Model Assets (ONNX)

Platform Support

Acknowledgments

License

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

ente-io/mobile_ocr

Folders and files

Latest commit

History

Repository files navigation

Mobile OCR

Features

Installation

Usage

Basic Usage

Detection Output

Using with Image Picker

Example App

Android Model Assets (ONNX)

Platform Support

Acknowledgments

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages