Extract text from scanned PDFs using optical character recognition. All processing runs locally in your browser.
Drop a PDF here or click to browse
Extract text from scanned documents
Optical Character Recognition (OCR) turns scanned documents and image-based PDFs into searchable, selectable text. If you've ever scanned a paper document and ended up with a PDF you can't search or copy text from, this tool solves that problem. It analyzes each page image, recognizes the characters, and produces a new PDF with a searchable text layer.
Searchable Text Layer
Adds invisible text over each page so you can search, select, and copy text from the resulting PDF.
Multi-Language Support
Select from multiple languages to improve recognition accuracy for non-English documents.
Powered by Tesseract.js
Uses Tesseract.js, the leading open-source OCR engine, running entirely in your browser.
No Server Upload
Your scanned documents stay private. The OCR engine runs locally in JavaScript.
Merge PDFs
Combine multiple PDF files into one document
Split PDF
Split a PDF into separate files at chosen pages
Delete PDF Pages
Remove specific pages from a PDF
Reorder PDF Pages
Drag & drop pages into any order
Extract PDF Pages
Pull out specific pages as a new PDF
Rotate PDF
Rotate PDF pages 90°, 180°, or 270°