Scanned PDFs will now be searchable

by Cooee on November 10, 2008

 

Google has taken another step in its effort to shed light on the so-called “Dark Web” with an announcement that its engine can now search scanned documents in Adobe Systems’ PDF format.

Using Optical Character Recognition technology, Google’s search engine now can convert scanned PDF documents into text that can be searched and indexed.

  This is part on an ongoing effort by Google to shed more light on the Deep, or Dark Web, in which there is a massive amount of information that can be accessed but not indexed by a search engine because it is behind databases or in a format — such as PDF — that can’t be easily searched.

computerworld.co.nz

{ 0 comments… add one now }

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>