PDF Document Management Software, Services & Support

Google WILL Index Your Scanned PDFs!

Thursday, November 20, 2008

by Duff Johnson

The lords of search over at Google recently announced an interesting new feature for PDFs created from scanned pages.

Searchable PDF files are nothing new, and neither are searchable PDF files produced from scanned pages. Simply run OCR and voilà: your scanned PDFs are now searchable.

But let's say you didn't OCR your files. Maybe you didn't want to take the time, maybe it's impractical, or maybe you didn't even WANT your files to be searchable (my legal friends should take note here).

Too bad!

Post those PDFs on a publicly accessible site and now Google will OCR and index them for you, no extra charge.

I'm sure there are some limits here. Google isn't saying, but I'm guessing it won't download a 500 MB PDF just to discover that there's no text to index.

I'm also unsure about the quality of the OCR. I'd have to believe that it's super-quick and therefore less than super-accurate, but then again, Google has computing resources that defy my paltry imagination, so no bets there either.

I'll be running some tests before long, but I'm curious to know what you think.

Do you WANT your scanned PDFs indexed by Google? Are you tempted to post oceans of scanned content online? Or is this a big yawn, something you thought Google was doing all along, so what's the big deal?

Originally posted on Duff Johnson's PDF Perspective blog for acrobatusers.com

