REST API for programmatic PDF extraction. Send a document, get structured data back. Write directly to Google Sheets or retrieve JSON for your own pipeline.
The Lido REST API accepts PDF documents as Base64-encoded input and returns structured JSON with extracted fields, confidence scores, and table data. You can write results directly to a Google Sheet or consume the JSON response in your own application.
The API uses the same layout-agnostic AI as the web interface — no templates, no training data, no per-document configuration. Send any PDF and get structured output on the first request.
Custom automation pipelines. Your application receives PDFs (from email, upload, or another API) and needs to extract data programmatically. The API handles extraction and returns structured data your code can process, transform, and route to downstream systems.
Integration with internal tools. You're building a custom portal, dashboard, or workflow tool that needs PDF extraction as a feature. The API provides extraction without you building or maintaining your own OCR/AI infrastructure.
High-volume processing. Batch process hundreds or thousands of PDFs programmatically. Send documents in parallel and collect results. The API handles queuing and processing automatically.
Google Sheets as a data destination. Use the API to extract PDF data and write it to specific Google Sheets, tabs, and cell ranges programmatically. Useful when you need more control over where data lands than the email or upload workflows provide.
The API returns structured JSON with:
Header fields — key-value pairs for document-level data (invoice number, date, vendor name, total).
Table data — rows and columns for tabular content (line items, transaction lists), with column headers auto-detected from the document.
Confidence scores — a 0.0-1.0 score for each extracted field, enabling automated quality control in your pipeline.
Raw text — the full OCR text of the document, useful for search indexing or custom parsing logic.
Direct API calls. Standard REST API with JSON request/response. Works with any language — Python, Node.js, Java, Go, Ruby, or curl.
Power Automate connector. For no-code automation in Microsoft 365 environments. Trigger extraction from email, SharePoint, or any Power Automate-supported source and write to Google Sheets.
Google Apps Script. Call the API from within Google Sheets using Apps Script. Upload a PDF from Drive, extract data, and write results to the current sheet — all without leaving the Google ecosystem.
API access is included in the Scale ($7,000/year) and Enterprise plans. Start with a free trial to test extraction accuracy on your documents, then contact sales for API credentials and rate limits.
Upload your documents and verify extraction quality. 50 free pages, no credit card required.
50 free pages. All features included. No credit card required.