3.0

Table Of Contents

pushes an XML feed with the recognized text to the Google Search Appliance for indexing. When this process is

complete, the documents become available for searching.

IFilter for Microsoft Office SharePoint Server and Windows Desktop

Search for information is a vital part of any office workflow. As the organization grows, documents scatter across

departments, file folders and ECM system, and search takes more and more valuable time. Microsoft® offers effective

instruments to organize enterprise search on different levels, from centralized content storages to individual desktops.

Microsoft Office SharePoint Server has powerful capabilities to search in SharePoint libraries and folders; Windows

Desktop Search is helpful in finding files on desktop computers.

However, SharePoint Server and Windows Desktop Search index contents of files in certain document formats only, like

HTML, RTF, DOC, XLS. Information contained in image files, such as JPEGs, or highly popular PDFs and TIFFs, remains

uncovered. This means that content of scanned documents, faxes, letters, contracts, is invisible to the server, and those

documents may not be displayed in search results.

ABBYY Recognition Server with its IFilter component extends Microsoft search capabilities over image documents. It

“unlocks” the content of image files by means of OCR and makes it available for indexing by SharePoint Server and

Windows Desktop Search. With ABBYY Recognition Server IFilter, the document search in the organization becomes

truly encompassing.

ABBYY Recognition Server is integrated with Microsoft Office SharePoint Server and Windows Desktop Search as

described below:

1. ABBYY Recognition Server is installed on a server computer (which may be separate from the SharePoint

hosting machine). A special component, ABBYY Recognition Server IFilter, is installed on top of the SharePoint

Server and/or user desktops to provide communication between the search system and the Recognition Server.

The IFilter is a light component which consumes almost no computer resources to make sure it host’s

performance remains on high level.

2. Each ABBYY IFilter receives image documents from the corresponding SharePoint or Windows Desktop search

crawler and passes them to the Recognition Server.

3. ABBYY Recognition Server in the background performs highquality OCR on the images and sends the

recognized text back to the IFilter.

4. Microsoft search engine accepts the document contents from ABBYY IFilter and builds an index. The image

then becomes discoverable via fulltext search.