Google Search Appliance Planning for Search Appliance Installation Google Search Appliance software version 7.
Google, Inc. 1600 Amphitheatre Parkway Mountain View, CA 94043 www.google.com October 2012 © Copyright 2012 Google, Inc. All rights reserved. Google and the Google logo are registered trademarks or service marks of Google, Inc. All other trademarks are the property of their respective owners. Use of any Google solution is governed by the license agreement included in your original contract.
Contents Planning for Search Appliance Installation ........................................................
What Tasks Do I Need to Perform Before I Install? Required Tasks Optional Tasks Electrical and Other Technical Requirements Google Search Appliance: Planning for Search Appliance Installation 19 19 21 21 4
Planning for Search Appliance Installation This document provides the information you need to plan a Google Search Appliance installation. The guide contains an overview and checklists of the values you must determine and decisions you must make before you set up your network and the content files and perform the Google Search Appliance installation process. After you complete the installation process, the search appliance can crawl and index the content files.
How Does the Search Appliance Work? The Google Search Appliance is a one-stop search and index solution for businesses of all sizes. Using a search appliance, you can quickly deploy search within an enterprise. By default, a search appliance can index and serve content located on a file system or a web server.
• Start URLs, which control where the crawl begins. All content must be reachable by following links from one or more start URLs. • Follow and Crawl URLs, which set the patterns of URLs that are crawled. Use follow and crawl URLs to define the paths to pages and files you want crawled. If a URL in a crawled document links to a document whose URL does not match a pattern defined as a follow and crawl URL, that document is not crawled.
Serving Users submit search requests to the Google Search Appliance a web page similar to the search page at Google.com. A user types a search term into the search box and the request is transmitted to the serving software. The search appliance locates results in the index. The search appliance then returns the results to the user’s browser as a series of links. When the user clicks a link in the results, the content file is displayed.
7. Implement any authorization and authentication requirements. 8. Deploy the content files to your web site or intranet. 9. Complete the tasks and collect the information and values described in the tables in “What Values Do I Need for the Installation Process?” on page 17 and “What Tasks Do I Need to Perform Before I Install?” on page 19. For Sites or Locations with Existing Content 1. Determine the physical location of the search appliance.
• A web browser, which must be a current version of Internet Explorer, Firefox, or Chrome. The Guide to Software Release 7.0 lists the supported browsers. • An uninterruptible power supply to provide electricity to the search appliance if there is a power failure. The index data cannot be backed up, and the crawl, index, and serve functions will be interrupted if there is a power failure.
The Google Search Appliance cannot index text contained in graphic file formats, such a JPEG, GIF, or TIFF. When a file in a graphic format is submitted for indexing, text embedded in the graphic is not indexed. However, the file name is indexed. If any metadata is associated with the graphic in an HTML meta tag that metadata is indexed. Certain file formats are excluded from the crawl by default on the search appliance Admin Console.
How Many URLs Can Be Crawled? The number of URLs that your search appliance can crawl depends on the model and license limit. The follow table lists the maximum number of URLs matching the crawl patterns you define that the search appliance can crawl. Search Appliance Model Maximum License Limit Maximum Number of URLs that Match Crawl Patterns GB-7007 10 million ~ 13.
Can the Search Appliance Use a Dedicated Network Interface Card for Administration? You can optionally configure your search appliance to use a dedicated network interface card (NIC) for administrative functions. Use this option when you have two or more search appliances that are typically accessed through a load balancer. Search appliances in this configuration are reached at the same IP address and there is no way to connect to a specific search appliance in the configuration.
Outbound Ports Function 7885 Used for exporting logs for the preinstalled connector manager at the URL https:// search_appliance_location:7885/connector-manager/getConnectorLogs/ALL 8080 Default port for requests to the connector manager when a connector manager is installed on an external host. Configurable when a connector manager is installed. The following table lists the inbound search appliance ports.
Additional Ports Used When the Dedicated Administrative Network Interface Card is Enabled When the dedicated network interface card is enabled on a search appliance, the ports in the following table are visible only on the dedicated interface card. All other port usage remains as detailed in the section “Ports Used at All Times” on page 13.
• Under Local Authentication, administrators and managers are authenticated using credentials you enter directly on the Admin Console. • Under LDAP Authentication, administrators and managers are authenticated against an LDAP server. To use this option, you must initially connect to the Admin Console using the admin account and the password you assign the account during configuration, then provide settings for the LDAP administrator group and the LDAP server itself.
What Values Do I Need for the Installation Process? The following tables describe the values you need before you install the Google Search Appliance. If you are indexing a content repository, refer to the connector documentation for more information on values you need before installing the connector manager and a connector. Required Values Before you install and configure the Google Search Appliance, obtain the following required values and write them in the column labeled Your Value.
Value Definition Your Value The IP address of one or more domain name system (DNS) servers These IP addresses identify DNS servers used to resolve host names. Identifying DNS servers enables the use of host names, rather than IP addresses, in crawl URLs when the search appliance crawls an intranet. The search appliance will not operate correctly without functioning DNS servers.
Value Definition For More Information, Contact To use a dedicated administrative network interface card, an additional IP address and subnet mask The IP address and subnet mask identify the dedicated network interface card IP address to be used by IPMI Intelligent Platform Management Interface (IPMI) is available on T2 and U1 series search appliances only. This is the IP address you use to connect to IPMI on the search appliance’s primary network interface card.
Task Description For More Information Mount the search appliance on a rack or otherwise place it in the desired location. You can mount the search appliance on a rack in a data center or keep it on a flat surface in your office. If you keep it in your office, choose a location that has good sound isolation from work areas. Consult your hardware administrator Create an account with Google Enterprise Technical Support. A Support account enables you to receive technical support.
Optional Tasks The following table describes optional tasks you can complete before installing the search appliance. Task Description Accomplished? Configure proxy servers. If the search appliance must access content through a proxy server, set up the proxies. If you plan to enable remote access using secure shell (SSH) for Google Support, arrange for network port 22 to be opened.
Requirement Google Search Appliance GB7007 (T2 Series) Google Search Appliance GB7007 (T3 Series) Google Search Appliance GB9009 Processing Unit (U1) Google Search Appliance GB9009 Storage Unit Google Search Appliance GB9009 Processing Unit (U2) Google Search Appliance GB9009 Storage Unit (U2) Operating Relative Humidity Range 20% to 80% (noncondensi ng), with a maximum humidity gradation of 10% per hour. 20% to 80% (noncondensi ng), with a maximum humidity gradation of 10% per hour.
The following table shows requirements for the G100 and G500. Requirement Google Search Appliance G100 (T4) Google Search Appliance G500 (U3) Typical Thermal Dissipation 836 BTU/hr 1504 BTU/hr Operating Temperature Range 10° C to 35° C (50° F to 90° F) with a maximum temperature gradation of 10°C per hour Note: For altitudes above 2950 feet, the maximum operating temperature is derated 1°F/550 ft.