Google Search Appliance Connectors Deploying the Connector for File Systems Google Search Appliance Connector for File Systems software version 4.0.2 Google Search Appliance software version 7.
Table of Contents About this Guide Overview of the GSA Connector for File Systems Supported file system protocols Distributed File System support Supported operating systems Before you deploy the Connector for File Systems Windows account permissions Download the connector software Deploy the Connector for File Systems Step 1 Configure the search appliance Add the URL Add the IP address Step 2 Install the Connector for File Systems Step 3 Configure adaptor-config.
About this Guide This guide is intended for anyone who needs to deploy the Google Search Appliance Connector 4.0.2 for File Systems. The guide assumes that you are familiar with Windows operating systems, file systems, and configuring the Google Search Appliance by using the Admin Console. See the Google Search Appliance Connectors Administration Guide 4.0.2 for general information about the connectors, including: ● What’s new in Connectors 4.
Overview of the GSA Connector for File Systems The Connector for File Systems enables the Google Search Appliance to crawl and index content from Windows shares. A single connector instance can support a single Windows share. The share can be a UNC path or a mapped drive. The following diagram provides an overview of how the search appliance gets content from the repository through the Connector for File Systems. For explanations of the numbers in the process, see the steps following the diagram. 1.
9. The connector sends the URLs to the search appliance. For a folder, the connector sends an HTML listing of the contents of the folder to the search appliance. 10. The search appliance continues to crawls the repository. After the initial process completes, the connector periodically sends files or folders that have been modified, added, or deleted to the search appliance, according to the value set in the connector configuration. The default interval value is 15 minutes.
Before you deploy the Connector for File Systems Before you deploy the Connector for File Systems, ensure that your environment has all of the following required components: ● GSA software version 7.2.0.G.90 or higher To download GSA software, visit the Google Enterprise Support Portal (password required) ● Java JRE 1.7 update 6 or higher installed on computer that runs the connector ● Connector for File Systems 4.0.
● Server Operators Note: It is not sufficient for the user to be member of one of these groups at the domain level. The user must be a member of one of these groups on the local machine that exports the Windows share. Download the connector software The Connector for File Systems must be installed on a host machine. This connector version does not support installing the connector on the Google Search Appliance. To download the software for Connector for File Systems: 1. Visit https://code.google.
2. Add the IP address of the computer that hosts the connector to the list of Trusted IP addresses so that the search appliance will accept feeds from this address. Add the URL To add the URLs provided by the connector to the search appliance’s crawl configuration follow patterns: 1. In the search appliance Admin Console, click Content Sources > Web Crawl > Start and Block URLs. 2.
3. Copy the File System 4.0 JAR executable to the directory. 4. Create an ASCII or UTF-8 file named adaptor-config.properties in the directory that contains the connector binary. 5. Provide the following configuration (replacing bolded items with your real configuration) within the file: gsa.hostname=yourgsa.hostname.com filesystemadaptor.src=\\\\host\\share Notes: Backslashes are entered as double backslashes.To represent a single '\' you need to enter '\\'. DFS links can be given as filesystemadaptor.
supportedAccounts will be included in ACLs regardless if they are builtin or not. BUILTIN\\Guest,NT AUTHORITY\\INTERACTIVE, NT AUTHORITY\\Authenticated Users filesystemadaptor. builtinGroupPrefix Builtin accounts are excluded from the ACLs that are pushed to the GSA. An account that starts with this prefix is considered a builtin account and will be excluded from the ACLs. BUILTIN\\ filesystemadaptor.
format, YYYY-MM-DD. Setting filesystemadaptor.last AccessedDate to 2010- 01-01 would only crawl content that has been accessed since the beginning of 2010. Only one of filesystemadaptor. lastAccessedDate or filesystemadaptor.last AccessedDays may be specified. filesystemadaptor. lastAccessedDays Disables crawling of files disabled that have not been accessed within the specified number of days. Unlike the absolute cutoff date used by filesystemadaptor.
whose time of last access is earlier than a specific date. The cut-off date is specified in ISO8601 date format, YYYY-MM-DD. Setting filesystemadaptor.last ModifiedDate to 2010- 01-01 would only crawl content that has been modified since the beginning of 2010. Only one of filesystemadaptor.last ModifiedDate or filesystemadaptor.last ModifiedDays may be specified. filesystemadaptor. lastModifiedDays Disables crawling of files disabled that have not been modified within the specified number of days.
ModifiedDays may be specified. adaptor.incrementalPoll PeriodSecs Time between incremental crawls. 300 seconds adaptor.namespace Namespace used for ACLs sent to GSA Default server.port Port from which 5678 documents are served. GSA crawls this port. Each instance of a Connector on same machine requires a unique port. Step 4 Run the Connector for File Systems After you install the Connector for File Systems, you can run it on host machine by using a command like the following example: java -Djava.