capacitor-mlkit-doc-scanner

v0.0.4

Published

a year ago

Capacitor plugin for ML Kit Document Scanner (Android Only)

0High
0Medium
0Low

therealabdi27

capacitor plugin native mlkit document scanner doc scanner android

Capacitor ML Kit Document Scanner Plugin

This plugin allows you to use Google's ML Kit Document Scanner in your Capacitor applications. It provides an easy way to add a document scanning feature, allowing users to scan documents and receive them as JPEG images and/or PDF files.

Demo

A demo application showcasing the capabilities of this plugin can be found here: https://github.com/therealabdi2/demo-mlkit-doc-scanner

Installation

npm install capacitor-mlkit-doc-scanner
npx cap sync

Android

Add the ML Kit Document Scanner dependency to your app's build.gradle file (android/app/build.gradle):

dependencies {
    // ...other dependencies
    implementation 'com.google.android.gms:play-services-mlkit-document-scanner:16.0.0-beta1'
}

Ensure your project's minSdkVersion is 21 or higher in android/app/build.gradle. The ML Kit Document Scanner also requires a minimal device total RAM of 1.7GB.

Supported Platforms

[x] Android
[ ] iOS (Not available)
[ ] Web (Not available - native feature)

API

scanDocument(...)

scanDocument(options?: ScanOptions | undefined) => Promise<ScanResult>

Starts the document scanning process.

| Param | Type | Description | | ------------- | --------------------------------------------------- | -------------------------------------- | | options | ScanOptions | Configuration options for the scanner. |

Returns: Promise<ScanResult>

Interfaces

ScanResult

Result of a document scan operation.

| Prop | Type | Description | | ------------------- | ------------------------------------------- | -------------------------------------------------------------------------------------------------------------------- | | scannedImages | string[] | An array of URIs for the scanned image pages (JPEG). Present if 'JPEG' or 'JPEG_PDF' was requested in resultFormats. | | pdf | PdfInfo | Information about the generated PDF. Present if 'PDF' or 'JPEG_PDF' was requested in resultFormats. |

PdfInfo

Information about a generated PDF document.

| Prop | Type | Description | | --------------- | ------------------- | ---------------------------------- | | uri | string | The URI of the generated PDF file. | | pageCount | number | The number of pages in the PDF. |

ScanOptions

Options for the document scanner.

| Prop | Type | Description | Default | | -------------------------- | --------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------- | | galleryImportAllowed | boolean | Whether to allow importing from the photo gallery. | false | | pageLimit | number | The maximum number of pages that can be scanned. | 10 | | resultFormats | 'JPEG' | 'PDF' | 'JPEG_PDF' | The desired result formats. Can be 'JPEG', 'PDF', or 'JPEG_PDF'. | 'JPEG_PDF' | | scannerMode | 'FULL' | 'BASE' | 'BASE_WITH_FILTER' | The scanner mode. BASE: Basic editing capabilities (crop, rotate, reorder pages, etc.). BASE_WITH_FILTER: Adds image filters (grayscale, auto image enhancement, etc.) to the BASE mode. FULL: Adds ML-enabled image cleaning capabilities (erase stains, fingers, etc.) to the BASE_WITH_FILTER mode. This mode will also allow future major features to be automatically added along with Google Play services updates, while the other two modes will maintain their current feature sets and only receive minor refinements. | "FULL" |

Usage Example

import { MlkitDocScanner } from 'capacitor-mlkit-doc-scanner';

async function startScan() {
  try {
    const result = await MlkitDocScanner.scanDocument({
      galleryImportAllowed: true,
      pageLimit: 5,
      resultFormats: 'JPEG_PDF',
      scannerMode: 'FULL',
    });

    console.log('Scan successful:', result);

    if (result.scannedImages && result.scannedImages.length > 0) {
      console.log('First image URI:', result.scannedImages[0]);
      // Handle image URIs (e.g., display them)
    }

    if (result.pdf) {
      console.log('PDF URI:', result.pdf.uri);
      console.log('PDF Page Count:', result.pdf.pageCount);
      // Handle PDF URI (e.g., open or upload it)
    }
  } catch (error) {
    console.error('Scan failed:', error);
  }
}

Important Notes

The ML Kit Document Scanner models, scanning logic, and UI flow are dynamically downloaded by Google Play services. Users might have to wait for these to download before the first use.
This API requires Android API level 21 or above.
It also requires a minimal device total RAM of 1.7GB. If lower, it returns an MlKitException with error code UNSUPPORTED when calling the API (this plugin will reject the promise).
Consider that generating document files takes time and requires processing power, so only request the output formats (JPEG, or PDF, or both) you actually need via the resultFormats option.

Contributing

Contributions are welcome! Please feel free to submit issues and pull requests to https://github.com/therealabdi2/capacitor-mlkit-doc-scanner-plugin.

License

This plugin is licensed under the MIT License.