Next:
List of Figures
Up:
Harvest User's Manual
Previous:
Copyright
Contents
Acknowledgements
Copyright
List of Figures
1 Introduction to Harvest
2 Subsystem Overview
Distributing the Gathering and Brokering Processes
3 Getting and Running the Harvest Software
3.1 Choosing an appropriate server machine
Supported platforms; software needed to run Harvest
3.2 Binary distribution
3.3 Source distribution
3.4 Upgrading versions of the Harvest software
3.4.1 Upgrading from version 1.1 to version 1.2
3.4.2 Upgrading to version 1.1 from version 1.0 or older
3.5 Starting up the system: RunHarvest and related commands
3.6 Support policy and Harvest team contact information
3.7 User-contributed software
4 The Gatherer
4.1 Overview
4.2 Basic setup
4.3 RootNode specifications
Example RootNode configuration
Using extreme values -- ``robots''
Gatherer enumeration vs. candidate selection
4.4 Extracting data for indexing: The Essence summarizing subsystem
4.4.1 Default actions of ``stock'' summarizers
4.4.2 Summarizing SGML data
Location of support files
The SGML-to-SOIF table
Errors and warnings from the SGML Parser
Creating a summarizer for a new SGML-tagged data type
The SGML-based HTML summarizer
Other examples
4.4.3 Summarizer components distribution
Using ``Rainbow'' to summarize MIF and RTF documents
The translation table
4.4.4 Customizing the type recognition, candidate selection, presentation unnesting, and summarizing steps
Customizing the type recognition step
Customizing the candidate selection step
Customizing the presentation unnesting step
Customizing the summarizing step
4.5 Setting variables in the Gatherer configuration file
4.6 Incorporating manually generated information into a Gatherer
4.7 Controlling access to the Gatherer's database
4.8 Periodic gathering and realtime updates
4.9 Local file system gathering for reduced CPU load
4.10 Troubleshooting
5 The Broker
5.1 Overview
5.2 Basic setup
5.3 Querying a Broker
Example queries
Query options selected by menus or buttons
Result set presentation
Regular expressions
Default query settings
5.4 Customizing the Broker's Query Result Set
5.4.1 The BrokerQuery.cf configuration file
Defined Variables
List of Definitions
5.4.2 Example BrokerQuery.cf customization file
5.4.3 Integrating your customized configuration file
5.5 Administrating a Broker
Deleting unwanted Broker objects
5.6 Tuning Glimpse indexing in the Broker
The glimpseserver program
5.7 Using different index/search engines with the Broker
Using WAIS as an indexer
Integrating a new index/search back-end into the Broker
5.8 Query Manager interface description
5.9 Collector interface description
5.10 World Wide Web interface description
HTML files for graphical user interface
CGI programs
Help files for the user
5.11 Troubleshooting
6 The Object Cache
6.1 Overview
6.2 Basic setup
6.3 Setting up WWW clients to use the Cache
6.4 Using the Cache as an httpd accelerator
6.5 Running a Cache hierarchy
6.6 Using multiple disks with the Cache
6.7 Using the Cache's remote instrumentation interface
6.8 Details of Cache operation
6.8.1 Cache access protocols
6.8.2 Cacheable objects
6.8.3 Unique object naming
6.8.4 Cache consistency
6.8.5 Negative caching and DNS caching
6.8.6 Security and privacy implications
6.8.7 Summary: object caching ``flow chart''
6.9 Meanings of log files
6.10 Troubleshooting
7 The Replicator
7.1 Overview
7.2 Basic setup
CreateReplica usage line
7.3 Customizations
7.4 Distributing the load among replicas
7.5 Troubleshooting
References
A Programs and layout of the installed Harvest software
A.1 $HARVEST_HOME
A.2 $HARVEST_HOME/bin
A.3 $HARVEST_HOME/brokers
A.4 $HARVEST_HOME/cgi-bin
A.5 $HARVEST_HOME/gatherers
A.6 $HARVEST_HOME/lib
A.7 $HARVEST_HOME/lib/broker
A.8 $HARVEST_HOME/lib/cache
A.9 $HARVEST_HOME/lib/gatherer
B The Summary Object Interchange Format (SOIF)
B.1 Formal description of SOIF
B.2 List of common SOIF attribute names
B.3 Using the SOIF processing software
SOIF library written in C
SOIF library written in Perl
C Gatherer Examples
C.1 Example 1 - A simple Gatherer
C.2 Example 2 - Incorporating manually generated information
C.3 Example 3 - Customizing type recognition and candidate selection
C.4 Example 4 - Customizing type recognition and summarizing
Using regular expressions to summarize a format
Using programs to summarize a format
Running the example
Index
About this document ...
Darren Hardy
Mon Apr 3 15:22:37 MDT 1995