capacitor-mlkit-doc-scanner
v0.0.4
Published
Capacitor plugin for ML Kit Document Scanner (Android Only)
Maintainers
Readme
Capacitor ML Kit Document Scanner Plugin
This plugin allows you to use Google's ML Kit Document Scanner in your Capacitor applications. It provides an easy way to add a document scanning feature, allowing users to scan documents and receive them as JPEG images and/or PDF files.
Demo
A demo application showcasing the capabilities of this plugin can be found here: https://github.com/therealabdi2/demo-mlkit-doc-scanner
Installation
npm install capacitor-mlkit-doc-scanner
npx cap syncAndroid
Add the ML Kit Document Scanner dependency to your app's
build.gradlefile (android/app/build.gradle):dependencies { // ...other dependencies implementation 'com.google.android.gms:play-services-mlkit-document-scanner:16.0.0-beta1' }Ensure your project's
minSdkVersionis 21 or higher inandroid/app/build.gradle. The ML Kit Document Scanner also requires a minimal device total RAM of 1.7GB.
Supported Platforms
- [x] Android
- [ ] iOS (Not available)
- [ ] Web (Not available - native feature)
API
scanDocument(...)
scanDocument(options?: ScanOptions | undefined) => Promise<ScanResult>Starts the document scanning process.
| Param | Type | Description |
| ------------- | --------------------------------------------------- | -------------------------------------- |
| options | ScanOptions | Configuration options for the scanner. |
Returns: Promise<ScanResult>
Interfaces
ScanResult
Result of a document scan operation.
| Prop | Type | Description |
| ------------------- | ------------------------------------------- | -------------------------------------------------------------------------------------------------------------------- |
| scannedImages | string[] | An array of URIs for the scanned image pages (JPEG). Present if 'JPEG' or 'JPEG_PDF' was requested in resultFormats. |
| pdf | PdfInfo | Information about the generated PDF. Present if 'PDF' or 'JPEG_PDF' was requested in resultFormats. |
PdfInfo
Information about a generated PDF document.
| Prop | Type | Description |
| --------------- | ------------------- | ---------------------------------- |
| uri | string | The URI of the generated PDF file. |
| pageCount | number | The number of pages in the PDF. |
ScanOptions
Options for the document scanner.
| Prop | Type | Description | Default |
| -------------------------- | --------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------- |
| galleryImportAllowed | boolean | Whether to allow importing from the photo gallery. | false |
| pageLimit | number | The maximum number of pages that can be scanned. | 10 |
| resultFormats | 'JPEG' | 'PDF' | 'JPEG_PDF' | The desired result formats. Can be 'JPEG', 'PDF', or 'JPEG_PDF'. | 'JPEG_PDF' |
| scannerMode | 'FULL' | 'BASE' | 'BASE_WITH_FILTER' | The scanner mode. BASE: Basic editing capabilities (crop, rotate, reorder pages, etc.). BASE_WITH_FILTER: Adds image filters (grayscale, auto image enhancement, etc.) to the BASE mode. FULL: Adds ML-enabled image cleaning capabilities (erase stains, fingers, etc.) to the BASE_WITH_FILTER mode. This mode will also allow future major features to be automatically added along with Google Play services updates, while the other two modes will maintain their current feature sets and only receive minor refinements. | "FULL" |
Usage Example
import { MlkitDocScanner } from 'capacitor-mlkit-doc-scanner';
async function startScan() {
try {
const result = await MlkitDocScanner.scanDocument({
galleryImportAllowed: true,
pageLimit: 5,
resultFormats: 'JPEG_PDF',
scannerMode: 'FULL',
});
console.log('Scan successful:', result);
if (result.scannedImages && result.scannedImages.length > 0) {
console.log('First image URI:', result.scannedImages[0]);
// Handle image URIs (e.g., display them)
}
if (result.pdf) {
console.log('PDF URI:', result.pdf.uri);
console.log('PDF Page Count:', result.pdf.pageCount);
// Handle PDF URI (e.g., open or upload it)
}
} catch (error) {
console.error('Scan failed:', error);
}
}Important Notes
- The ML Kit Document Scanner models, scanning logic, and UI flow are dynamically downloaded by Google Play services. Users might have to wait for these to download before the first use.
- This API requires Android API level 21 or above.
- It also requires a minimal device total RAM of 1.7GB. If lower, it returns an
MlKitExceptionwith error codeUNSUPPORTEDwhen calling the API (this plugin will reject the promise). - Consider that generating document files takes time and requires processing power, so only request the output formats (JPEG, or PDF, or both) you actually need via the
resultFormatsoption.
Contributing
Contributions are welcome! Please feel free to submit issues and pull requests to https://github.com/therealabdi2/capacitor-mlkit-doc-scanner-plugin.
License
This plugin is licensed under the MIT License.
