OCR API – Create Searchable PDFs from Images and Scans

The MaraDocs OCR API creates searchable PDFs with an invisible text overlay. Your document stays intact, and its text becomes selectable and searchable. You get more than just extracted text.

February 16, 2026 • Martin Kurtz

API, OCR, PDF, Text Recognition, Developer

Scanned documents and photos contain text that isn't selectable or searchable. Many OCR APIs return only extracted text, not your original document with an invisible text layer. You want the same PDF – same layout and appearance – but with selectable, searchable content. That's what a proper OCR API for searchable PDFs should deliver.

Why Building a Searchable PDF OCR Solution Yourself Takes Weeks

If you try to build this yourself, you'll quickly find that Tesseract, EasyOCR, and cloud OCR services primarily return plain text and bounding boxes. To build a searchable PDF, you must overlay that text invisibly on the original image or PDF page, which means reconciling coordinate systems, fonts, text encoding, and PDF structure. Different languages, fonts, and layouts add further complexity. A robust OCR pipeline that keeps your document intact takes significant engineering.
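
To illustrate just the coordinate part of that work, here is a minimal Python sketch. It assumes the OCR engine reports word boxes in image pixels with the origin at the top-left corner, while PDF user space is measured in points (1/72 inch) from the bottom-left corner. The names PixelBox and pixel_box_to_pdf_rect are hypothetical helpers for this example and are not part of MaraDocs or any OCR library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PixelBox:
    """A word box in image pixels, origin at the top-left corner (assumed OCR output)."""
    left: float
    top: float
    width: float
    height: float


def pixel_box_to_pdf_rect(
    box: PixelBox, image_height_px: float, dpi: float
) -> tuple[float, float, float, float]:
    """Return (x0, y0, x1, y1) in PDF points, origin at the bottom-left corner."""

    def to_points(px: float) -> float:
        # 1 PDF point is 1/72 inch; dpi is the scan resolution in pixels per inch.
        return px * 72.0 / dpi

    x0 = to_points(box.left)
    x1 = to_points(box.left + box.width)
    # Flip the vertical axis: image y grows downward, PDF y grows upward.
    y1 = to_points(image_height_px - box.top)
    y0 = to_points(image_height_px - box.top - box.height)
    return (x0, y0, x1, y1)


# A 300 dpi scan that is 3300 px tall (11 inches), with one word box.
word = PixelBox(left=300, top=300, width=600, height=150)
print(pixel_box_to_pdf_rect(word, image_height_px=3300, dpi=300))
# (72.0, 684.0, 216.0, 720.0)
```

This covers only placement. A real overlay still has to scale a font to fit each rectangle, encode the characters correctly, and draw the text with an invisible rendering mode so it does not cover the scan.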

How the MaraDocs OCR API Solves This in Minutes

The MaraDocs API performs OCR and outputs a PDF with the text invisibly overlaid. You get your original document – layout, images, appearance – with selectable and searchable text. Not a separate text file. Not a stripped-down version. The same document, enhanced.

OCR Workflow: Validate, OCR, Optimize

For images: validate, then img.ocrToPdf. For PDFs: validate, then pdf.ocrToPdf (optionally after pdf.orientation to fix rotated pages first). The high-level flow.ocrImg and flow.ocrPdf combine orientation, OCR, and optimization in one call. Output is always a PDF handle – the same document with an invisible text layer – that you can download or pass to composition, compression, or email workflows. The pipeline stays server-side; no re-upload between steps.
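
The step order described above can be sketched as a small planning function. This is only an illustration: it does not call the API, the function name plan_ocr_steps is hypothetical, and the step labels are simply the operation names used in this article.

```python
def plan_ocr_steps(kind: str, fix_orientation: bool = False) -> list[str]:
    """Return the low-level operations to run, in order, for an image or a PDF.

    Illustrative only: the labels mirror the operation names in the text
    and nothing here talks to the MaraDocs API.
    """
    if kind == "image":
        return ["img.validate", "img.ocrToPdf"]
    if kind == "pdf":
        steps = ["pdf.validate"]
        if fix_orientation:
            # Optional: straighten rotated pages before running OCR.
            steps.append("pdf.orientation")
        steps.append("pdf.ocrToPdf")
        return steps
    raise ValueError(f"unsupported kind: {kind!r} (expected 'image' or 'pdf')")


print(plan_ocr_steps("image"))
print(plan_ocr_steps("pdf", fix_orientation=True))
```

Whichever path is taken, the last step yields a PDF handle, so the same download or follow-up call works for both inputs.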

Get your API key in under a minute

Register for a free account and get your API key in under a minute. Your account comes with some developer credits.


Why MaraDocs is Different: Workspaces, Webview, and German Data Privacy

Most document APIs force you to upload, process, download, then re-upload for the next step. With MaraDocs, OCR runs in your workspace. Chain with document extraction, composition, or compression – pass the PDF handle directly to the next operation. No re-upload, fewer round-trips.

When OCR results need manual correction – misread characters, complex layouts, or low-quality scans – open app.maradocs.io for manual review and editing. Your users get full manual control when automation hits an edge case.

All processing runs in Germany (Maramia GmbH), with data encrypted at rest and in transit. Workspaces expire after 7 days. No data leaves the EU. For GDPR-sensitive OCR workloads, this matters.

TypeScript Code for Creating Searchable PDFs with OCR

API reference: data/upload, img/validate, pdf/validate, img/ocr/to/pdf, pdf/ocr/pdf, data/download/pdf

import { MaraDocsClient } from "@maramia/maradocs-sdk-ts";
import { okImg } from "@maramia/maradocs-sdk-ts/models/img";
import { okPdf } from "@maramia/maradocs-sdk-ts/models/pdf";

// Assumes workspace_secret, imageFile, and pdfFile are already defined in the surrounding code.
const client = new MaraDocsClient({ workspaceSecret: workspace_secret });

// High-level: upload, validate, full pipeline, download
const pdfHandle = await client.flow.ocrImg(imageFile);
const blob = await client.data.downloadPdf({ pdf_handle: pdfHandle });

// Low-level: image – upload, validate, OCR, download
const uploaded = await client.data.upload(imageFile);
const validated = await client.img.validate({ unvalidated_file_handle: uploaded.unvalidated_file_handle });
const imgHandle = okImg(validated);
const ocrPdf = await client.img.ocrToPdf({
  img_handle: imgHandle,
  options: { embed_in_blank_page: { size: { width: 210, height: 297 }, position: "center" } },
});
const blob2 = await client.data.downloadPdf({ pdf_handle: ocrPdf.pdf_handle });

// PDF: upload, validate, ocrToPdf, download
const pdfUploaded = await client.data.upload(pdfFile);
const pdfValidated = await client.pdf.validate({ unvalidated_file_handle: pdfUploaded.unvalidated_file_handle });
const pdfOcr = await client.pdf.ocrToPdf({ pdf_handle: okPdf(pdfValidated) });
const blob3 = await client.data.downloadPdf({ pdf_handle: pdfOcr.pdf_handle });

Python Code for OCR to Searchable PDF

API reference: data/upload, img/validate, img/ocr/to/pdf, pdf/ocr/pdf, data/download/pdf

# pip install python-decouple requests
"""OCR an image to a searchable PDF using MaraDocs. Set ACCOUNT_SECRET in .env or environment."""

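import mimetypes
import sys
import time
from pathlib import Path

import requests
from decouple import config

API_URL = "https://api.maradocs.io/v1"


def create_workspace() -> dict:
    """Create a workspace and return auth headers for it."""
    r = requests.post(
        f"{API_URL}/workspace",
        headers={"Authorization": f"Bearer {config('ACCOUNT_SECRET')}"},
        json={"subaccount": None},
    )
    r.raise_for_status()  # fail early on a missing or invalid account secret
    ws = r.json()
    return {"Authorization": f"Bearer {ws['workspace_secret']}"}


def upload_file(path: Path, auth: dict) -> dict:
    """Upload a file via two-step flow (signed URL + S3 POST). Returns unvalidated_file_handle."""
    data = path.read_bytes()
    # Guess the MIME type from the file name instead of assuming PNG.
    content_type = mimetypes.guess_type(path.name)[0] or "application/octet-stream"
    # Step 1: announce the upload and receive a signed POST target.
    meta = requests.post(
        f"{API_URL}/data/upload",
        headers={**auth, "Content-Type": "application/json"},
        json={"name": path.name, "size": len(data)},
    )
    meta.raise_for_status()
    resp = meta.json()
    # Step 2: send the bytes to the signed URL together with the returned form fields.
    stored = requests.post(
        resp["post_url"],
        data=resp.get("post_header", {}),
        files={"file": (path.name, data, content_type)},
    )
    stored.raise_for_status()
    return resp["unvalidated_file_handle"]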


def run_job(path: str, payload: dict, auth: dict, timeout: int = 60) -> dict:
    """Start a job and poll until it completes. Returns the final poll response as JSON."""
    url = f"{API_URL}/{path}"
    start_r = requests.post(url, headers=auth, json=payload)
    start_r.raise_for_status()
    job_id = start_r.json()["job_id"]
    start = time.time()
    while time.time() - start < timeout:
        poll_r = requests.get(f"{url}/{job_id}", headers=auth)
        if poll_r.status_code == 200:
            return poll_r.json()
        time.sleep(0.5)  # pause between polls instead of hammering the API
    raise TimeoutError(f"Job {path} timed out")


def download_pdf(pdf_handle: str, auth: dict) -> bytes:
    """Request signed URL and fetch PDF bytes."""
    r = requests.post(
        f"{API_URL}/data/download/pdf",
        headers=auth,
        json={"pdf_handle": pdf_handle},
    )
    info = r.json()
    dl = requests.get(info["url"], headers=info.get("headers", {}))
    return dl.content


def main() -> None:
    img_path = Path(sys.argv[1]) if len(sys.argv) > 1 else Path("photo.png")
    auth = create_workspace()
    handle = upload_file(img_path, auth)
    validate = run_job("img/validate", {"unvalidated_file_handle": handle}, auth)
    img_handle = validate["response"]["img_handle"]
    ocr_result = run_job("img/ocr/to/pdf", {"img_handle": img_handle}, auth)
    pdf_handle = ocr_result.get("pdf_handle") or ocr_result.get("response", {}).get("pdf_handle")
    Path("searchable.pdf").write_bytes(download_pdf(pdf_handle, auth))
    print("searchable.pdf created")


if __name__ == "__main__":
    main()
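
For longer-running jobs, the fixed polling loop in run_job could be swapped for one with exponential backoff. The following is a minimal sketch of that idea, not MaraDocs code: poll_until is a hypothetical helper, and it assumes the caller supplies a fetch function that returns None while the job is still running and the result once it is done.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def poll_until(
    fetch: Callable[[], Optional[T]],
    timeout: float = 60.0,
    initial_delay: float = 0.25,
    max_delay: float = 4.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> T:
    """Call fetch() until it returns a non-None result, doubling the wait each time."""
    deadline = clock() + timeout
    delay = initial_delay
    while True:
        result = fetch()
        if result is not None:
            return result
        if clock() + delay > deadline:
            raise TimeoutError(f"no result within {timeout} seconds")
        sleep(delay)
        delay = min(delay * 2, max_delay)  # back off, but cap the wait


# Usage with a stand-in job that finishes on the third poll.
attempts = {"count": 0}


def fake_fetch() -> Optional[dict]:
    attempts["count"] += 1
    return {"pdf_handle": "example"} if attempts["count"] >= 3 else None


print(poll_until(fake_fetch, initial_delay=0.01))
```

In the script above, fetch would wrap the requests.get call on the job URL and return poll_r.json() only when the status code is 200. The sleep and clock parameters exist so the helper can be tested without real waiting.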

Summary and Next Steps

The MaraDocs OCR API creates searchable PDFs: your original document with an invisible text overlay. It keeps the original layout and adds selectable, searchable text. See Document Scanner, PDF Handling, and Image on Blank Page for more.

Useful links

Try it: MaraDocs API | TypeScript SDK
