For complete documentation on ht://Dig, see the ht://Dig.org web site.
The ht://Dig search engine is meant to cover the search needs of a single company, campus, or even a particular subsection of a web site. Unlike some WAIS-based or web-server-based search engines, ht://Dig can span several web servers at a site. Documents outside the sfsu.edu domain are excluded from the index.
Words shorter than three characters are not indexed. These words are ignored in searches but still appear in the excerpt shown on the search results page.
Word Matches: If a search word is found in the first 500 characters of a document, an excerpt of the document is displayed when using the LONG search format. The first matched word appears in bold in the middle of the excerpt so that it is shown with some surrounding context.
Indexing is limited by the number of hops (clicks) that separate a document from the starting URL. The limit is currently seven hops from the server home page, so the index includes the neighborhood of documents referenced on subsequent linked pages within that distance.
Documents are weighted and assigned stars by an algorithm that uses heading, keyword, and title factors as well as the content of the HTML document. Search words that appear in more heavily weighted parts of a document move it higher in the search results and earn it more stars. More stars signify that the document is more relevant to the search request.
The heading factors are
Keywords are also used as a weighting factor. To add keywords to your documents, place a keywords META tag in the <HEAD> section of your HTML document.
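As a minimal sketch, assuming ht://Dig is reading the standard keywords META tag (its default behavior), the tag might look like the following. The keyword values are placeholders, not taken from any SFSU page:

```html
<head>
  <!-- Placeholder keywords: replace with terms that describe your page. -->
  <!-- "advising" is deliberately repeated across several entries. -->
  <meta name="keywords" content="advising, advising hours, academic advising, registration">
</head>
```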
Repeating a word in the keywords list ranks your page higher for that word. For complete documentation on ht://Dig, see the http://htdig.org web site.
The document title is also a weighting factor, and it is the text used to link to the document in the search results.
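Because the title doubles as the link text in the results, a short, descriptive title helps. A minimal sketch, with a made-up title as a placeholder:

```html
<head>
  <!-- Hypothetical title: this text becomes the link shown in search results. -->
  <title>Example Department: Advising Hours</title>
</head>
```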
The following code will allow you to add your own search box to your page.
<form method="post" action="http://www.sfsu.edu/cgi-bin/htsearch">
<p align="left">Search Term(s):
<input type="hidden" name="method" value="and">
<input type="hidden" name="format" value="builtin-short">
<input type="hidden" name="config" value="htdig">
<input type="hidden" name="restrict" value="http://www.sfsu.edu/~accountname">
<input type="text" name="words" value="" size="50">
<input type="submit" value="Search" alt="Search">
</p>
</form>
Departments with accounts on www.sfsu.edu must replace accountname with the name of their account:
<input type="hidden" name="restrict" value="http://www.sfsu.edu/~accountname">
Faculty using the online server should change the server name to online.sfsu.edu and replace accountname with the name of their account:
<input type="hidden" name="restrict" value="http://online.sfsu.edu/~accountname">
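Excerpts with the matched word in bold appear only in the long results format. As a sketch, assuming the server's htsearch configuration keeps ht://Dig's standard builtin-long template available, the format field in the form could be changed to:

```html
<!-- Assumes the standard builtin-long template is enabled on the server. -->
<input type="hidden" name="format" value="builtin-long">
```

The rest of the form stays the same. Only the value of the format field changes.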
1600 Holloway Avenue, San Francisco, CA 94132 (415) 338-1111
Last modified February 11, 2008 by the Web Team