3.0

Table Of Contents
pushes an XML feed with the recognized text to the Google Search Appliance for indexing. When this process is
complete, the documents become available for searching.
IFilter for Microsoft Office SharePoint Server and Windows Desktop
Search
Search for information is a vital part of any office workflow. As the organization grows, documents scatter across
departments, file folders and ECM system, and search takes more and more valuable time. Microsoft® offers effective
instruments to organize enterprise search on different levels, from centralized content storages to individual desktops.
Microsoft Office SharePoint Server has powerful capabilities to search in SharePoint libraries and folders; Windows
Desktop Search is helpful in finding files on desktop computers.
However, SharePoint Server and Windows Desktop Search index contents of files in certain document formats only, like
HTML, RTF, DOC, XLS. Information contained in image files, such as JPEGs, or highly popular PDFs and TIFFs, remains
uncovered. This means that content of scanned documents, faxes, letters, contracts, is invisible to the server, and those
documents may not be displayed in search results.
ABBYY Recognition Server with its IFilter component extends Microsoft search capabilities over image documents. It
“unlocks” the content of image files by means of OCR and makes it available for indexing by SharePoint Server and
Windows Desktop Search. With ABBYY Recognition Server IFilter, the document search in the organization becomes
truly encompassing.
ABBYY Recognition Server is integrated with Microsoft Office SharePoint Server and Windows Desktop Search as
described below:
1. ABBYY Recognition Server is installed on a server computer (which may be separate from the SharePoint
hosting machine). A special component, ABBYY Recognition Server IFilter, is installed on top of the SharePoint
Server and/or user desktops to provide communication between the search system and the Recognition Server.
The IFilter is a light component which consumes almost no computer resources to make sure it host’s
performance remains on high level.
2. Each ABBYY IFilter receives image documents from the corresponding SharePoint or Windows Desktop search
crawler and passes them to the Recognition Server.
3. ABBYY Recognition Server in the background performs highquality OCR on the images and sends the
recognized text back to the IFilter.
4. Microsoft search engine accepts the document contents from ABBYY IFilter and builds an index. The image
then becomes discoverable via fulltext search.