summaryrefslogtreecommitdiffstats
path: root/debian/htdig/htdig-3.2.0b6/ChangeLog.0
diff options
context:
space:
mode:
Diffstat (limited to 'debian/htdig/htdig-3.2.0b6/ChangeLog.0')
-rw-r--r--debian/htdig/htdig-3.2.0b6/ChangeLog.04359
1 files changed, 4359 insertions, 0 deletions
diff --git a/debian/htdig/htdig-3.2.0b6/ChangeLog.0 b/debian/htdig/htdig-3.2.0b6/ChangeLog.0
new file mode 100644
index 00000000..825f3ddf
--- /dev/null
+++ b/debian/htdig/htdig-3.2.0b6/ChangeLog.0
@@ -0,0 +1,4359 @@
+Thu Jan 31 17:32:33 2002 Geoff Hutchison <[email protected]>
+
+ * Release of 3.1.6.
+
+ * htdoc/confindex.html, htdoc/htsearch.html, htdoc/index.html,
+ htdoc/mailarchive.html: Remove CSS link, not needed in these
+ frameset pages.
+
+ * htdoc/howto-mirror.html: Update with Jesse's latest version.
+
+Thu Jan 31 15:13:07 2002 Gilles Detillieux <[email protected]>
+
+ * Makefile.in: Fixed install-strip target to properly handle relative
+ paths in INSTALL_PROGRAM when passing it to subdirectories.
+
+Thu Jan 31 11:41:39 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: Updated questions 4.8 & 4.9 to emphasize use of
+ doc2html over parse_doc.pl. Further clarified question 2.1.
+
+Thu Jan 31 10:14:23 2002 Gilles Detillieux <[email protected]>
+
+ * contrib/parse_doc.pl: Added comments explaining why you should
+ not be using this script.
+
+Wed Jan 30 17:20:51 2002 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html: Updated to mention 3.1.6 as the newest version
+ and --with-rx as a fix for regex problems on BSDI.
+
+Wed Jan 30 17:15:49 2002 Gilles Detillieux <[email protected]>
+
+ * installdir/synonyms: Updated with the version contributed by
+ David Adams, with minor changes. Kept old one as synonyms.original.
+
+ * installdir/english.0: Changed lots more dubious uses of suffixes to
+ get more appropriate and correct fuzzy endings expansions.
+
+Wed Jan 30 12:30:16 2002 Geoff Hutchison <[email protected]>
+
+ * htlib/Connection.cc (connect): Fixed bug with allow_EINTR and
+ add support for looping when the connection returns EAGAIN (no
+ more free local ports). Thanks to Ahmon Dancy <[email protected]>
+ for pointing out the EAGAIN issue.
+
+Tue Jan 29 09:59:58 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: Updated with today's changes to maindocs FAQ.
+
+Mon Jan 28 16:54:15 2002 Gilles Detillieux <[email protected]>
+
+ * contrib/README: Added mentions of examples & xmlsearch, fixed typo.
+
+Sun Jan 27 23:13:11 2002 Geoff Hutchison <[email protected]>
+
+ * htdoc/*.html: Final batch of documentation updates.
+
+Sat Jan 26 23:28:25 2002 Geoff Hutchison <[email protected]>
+
+ * htdoc/*: More documentation updates from merging with the
+ current maindocs CVS.
+
+Fri Jan 25 21:36:21 2002 Geoff Hutchison <[email protected]>
+
+ * acconfig.h, include/htconfig.h.in: Add USE_RX to potential
+ configure #include macros.
+
+ * htlib/gregex.h: Rename regex.h to prevent conflicts with system
+ version.
+
+ * htlib/regex.c, htlib/HtRegex.h: Ditto.
+
+ * htfuzzy/EndingsDB.cc: Use same tests as HtRegex.h for rxposix.h,
+ gregex.h or regex.h depending on configure results.
+
+ * configure.in: Implement more flexible test for rx/regex, which
+ will check for rxposix.h if --with-rx is supplied, will "fall
+ back" to regex test if rxposix.h isn't available and will only use
+ the htlib/ code and header for regex compile.
+
+ * configure: Update using autoconf.
+
+Fri Jan 25 12:14:26 2002 Gilles Detillieux <[email protected]>
+
+ * contrib/whatsnew/README, contrib/whatsnew/whatsnew.html: Added
+ an example of how to get a what's new listing from the new features
+ in htsearch.
+
+Thu Jan 24 22:43:28 2002 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Add ignore_dead_servers attribute to
+ control whether indexing will continue to try to contact a dead
+ server.
+
+ * htdig/Retriever.cc: Only mark a server as dead if the
+ ignore_dead_servers attribute is set.
+
+ * htdoc/cf_byname.html, htdoc/cf_byprog.html, htdoc/attrs.html:
+ Documentation updates.
+
+Thu Jan 24 15:32:59 2002 Geoff Hutchison <[email protected]>
+
+ * configure, configure.in: Add --with-rx option to switch to
+ system rx code (e.g. on BSDI). Needs some touchups still,
+ including checking that rxposix.h exists and if --without-rx was
+ supplied for some reason.
+
+ * htlib/HtRegex.h: Add conditional <rxposix.h> header for systems
+ where rx is better than regex.
+
+ * htlib/Makefile.in: Make sure regex.o is only compiled if it
+ works on a given system via LIBOBJS as supplied by the configure
+ script.
+
+Mon Jan 21 22:33:30 2002 Geoff Hutchison <[email protected]>
+
+ * htdoc/RELEASE.html: Add first shot at the release notes for
+ 3.1.6. Still need to finish some of the htdoc/ merges, including
+ the SF icons and such.
+
+ * htdoc/*.html: First stab at many of the htdoc/merges including
+ the new Copyright line. (It is 2002, after all.)
+
+Fri Jan 18 18:17:34 2002 Geoff Hutchison <[email protected]>
+
+ * htmerge/docs.cc: Add a test if the DB database has no URLs
+ before proceeding.
+
+ * htmerge/words.cc: Add a slightly more user-friendly error
+ message if the word list file doesn't exist. Remove exit()
+ statements since reportError does this for us.
+
+Fri Jan 18 16:47:50 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Rewrote description of prefix_match_character
+ to make it more clear, with crosslinks to related attributes, and
+ described new wildcard matching feature. Added more explanations
+ for relative days & months in startday et al. to make it clearer.
+ Added more notes about to-strings in the url_part_aliases description
+ and explained the example even more, as well as adding crosslinks
+ to the new *_rewrite_rules.
+
+Fri Jan 18 15:56:11 2002 Gilles Detillieux <[email protected]>
+
+ * htsearch/htsearch.cc (setupWords), htsearch/parser.cc (perform_push):
+ Added support for a wildcard word of "*" (or prefix_match_character
+ if set and not empty) which returns all documents.
+
+Wed Jan 16 17:21:26 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html, htdoc/hts_form.html: Described how to use
+ relative dates for startyear et al.
+
+Wed Jan 16 16:58:05 2002 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (buildMatchList): Fixed startday et al. to
+ allow relative days, month & years if values are negative.
+
+Fri Jan 11 20:57:51 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Updated descriptions for translate_* attributes
+ to match the new default behavior.
+
+Fri Jan 11 17:48:54 2002 Gilles Detillieux <[email protected]>
+
+ * htdig/SGMLEntities.cc (translateAndUpdate): Added support for
+ translate_latin1 attribute, to turn off ISO-8859-1-specific entities.
+ * htcommon/defaults.cc: Added translate_latin1 attribute.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Fri Jan 11 17:14:54 2002 Gilles Detillieux <[email protected]>
+
+ * contrib/xmlsearch.{README,tar.gz}: Removed older xmlsearch package.
+
+Fri Jan 11 17:06:09 2002 Gilles Detillieux <[email protected]>
+
+ * contrib/xmlsearch/*: Added files contributed by Nathan Hand and
+ me to implement XML output from htsearch, including DTD, templates
+ and config file.
+
+Wed Jan 9 22:08:21 2002 Gilles Detillieux <[email protected]>
+
+ * CONFIG.in: Fixed to allow setting BIN_DIR by configure option.
+ * contrib/htdig-3.1.6.spec: Fixed to make use of new ./configure
+ options for pathnames, do away with patch file. Used variables for
+ many pathnames to allow easy changes.
+
+Wed Jan 9 16:22:32 2002 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc (parse): Added support for max_keywords
+ attribute.
+
+Wed Jan 9 16:10:44 2002 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc (HTML, do_tag), htdig/ExternalParser.cc (parse):
+ Added support for description_meta_tag_names attribute.
+ Ensure external parser interface accepts META descriptions even if
+ 'description' is added to the keyword list.
+ * htcommon/defaults.cc: Added description_meta_tag_names attribute.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Tue Jan 8 17:39:24 2002 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc (parse): Added support for use_doc_date
+ attribute.
+
+Thu Jan 3 17:10:50 2002 Gilles Detillieux <[email protected]>
+
+ * htlib/Makefile.in, htlib/lib.h: Removed references to timegm,
+ mytimegm and strptime functions. Removed C source for these.
+
+Thu Jan 3 16:43:31 2002 Gilles Detillieux <[email protected]>
+
+ * htdoc/htmerge.html: Added extra description for -m option to clear
+ up common points of confusion, added note about LC_COLLATE environment
+ variable.
+
+Fri Dec 21 18:52:32 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc: Added parsedcdate function, used by got_time,
+ to parse DC date meta tags without requiring strptime or timegm.
+
+Thu Dec 20 12:25:47 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc: Added parsedate function, used by getdate, to
+ parse date headers without requiring strptime or timegm, which have
+ caused problems on some systems.
+
+Thu Dec 20 11:51:26 CET 2001 Gabriele Bartolini <[email protected]>
+
+ * configure.in: reviewed directory settings
+ * Makefile.in: ditto (for 'make install' of htdig.conf and rundig)
+
+Wed Dec 19 23:05:09 2001 Geoff Hutchison <[email protected]>
+
+ * configure.in: Add tests for ostream.h and iostream.h.
+
+ * htlib/htString.h: Add HAVE_OSTREAM_H and HAVE_IOSTREAM_H
+ preprocessor statements to deal with portability issues around the
+ C++ header files.
+
+Wed Dec 19 13:33:55 2001 Gabriele Bartolini <[email protected]>
+
+ * configure.in: fixed bug in customisation of configure paramters
+ * CONFIG.in: ditto
+ * configure: re-generated with autoconf
+
+Tue Dec 18 16:12:17 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (displayMatch): Fixed to clear out old values
+ of ANCHOR template variable for each result.
+
+Thu Dec 6 13:14:22 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/examples/rundig.sh: Fixed to make use of DBDIR variable.
+
+Wed Nov 21 12:54:42 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/rundig.html: Added note about effect of changing database_base.
+
+ * htmerge/docs.cc (convertDocs): Changed confusing message about
+ total doc db size in stats.
+
+Wed Nov 21 11:37:52 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/TemplateList.cc (createFromString), htdoc/attrs.html:
+ Treat template_map as a _quoted_ string list. Change <i> tags to
+ the HTML-4.0 compliant <em> tags in builtin-long template.
+
+Tue Nov 20 17:13:27 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/String.cc (String, append, sub): Added checks for negative
+ lengths or start position to make code more fault-tolerant.
+
+Tue Nov 20 16:37:26 2001 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Synonym.cc (createDB): Check for lines with less than
+ 2 words, to avoid segfault caused by calling Database::Put() with
+ negative length for data field.
+
+Sat Nov 3 23:55:00 2001 Geoff Hutchison <[email protected]>
+
+ * htlib/htString.h: Add #include for ostream.h to solve compile
+ problems with gcc3.
+
+ * htlib/Connection.h, htlib/Connection.cc: Backport Connection
+ class from 3.2 code--installs alarm() call to timeout connections
+ and will retry connections a few times before giving up.
+
+Fri Nov 2 12:28:35 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc, htdoc/attrs.html: Added support for dc.date,
+ dc.date.created and dc.date.modified to use_doc_date handling.
+
+Fri Nov 2 12:12:59 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/xmlsearch.README, contrib/xmlsearch.tar.gz: Added files
+ contributed by Nathan Hand and me to implement XML output from
+ htsearch, including DTD, templates and config file.
+
+Fri Nov 2 12:05:49 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc (do_tag), htcommon/defaults.cc: Added ignore_alt_text
+ attribute to avoid indexing alt text in img tags.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Thu Nov 1 14:43:13 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/htsearch.cc (main): Fixed to only show file names in
+ error messages when REQUEST_METHOD not set and -v option given,
+ for security.
+
+Thu Nov 1 10:19:27 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc, htsearch/Display.h: Added a localized
+ method for outputing HTTP headers, added support for a new
+ search_results_contenttype attribute to control that header.
+ * htcommon/defaults.cc: Added default for it.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Wed Oct 31 13:31:18 2001 Gilles Detillieux <[email protected]>
+
+ * installdir/english.0: Changed lots of dubious uses of suffixes to
+ get more appropriate and correct fuzzy endings expansions.
+
+Tue Oct 23 14:06:37 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc (RetrievedDocument): Fixed handling of null
+ return from getParsable(), to avoid segfault problem introduced
+ by text/css conditional added Jul 25.
+
+Fri Oct 19 17:24:19 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (hilight): Added Stefan Nehlsen's idea for
+ anchor_target attribute.
+ * htcommon/defaults.cc: Added default for it.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Sun Oct 14 22:05:30 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (external_parsers): Documented external converter
+ chaining to same content-type, e.g. text/html->text/html-internal.
+
+Sun Oct 14 21:54:24 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html, htdoc/cf_byprog.html, htdoc/cf_byname.html,
+ htcommon/defaults.cc: Documented and declared startyear, etc.
+ attributes used by htsearch.
+
+Sun Oct 14 21:16:19 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/htdump.html, htdoc/htload.html, htdoc/attrs.html,
+ htdoc/cf_byprog.html, htdoc/contents.html: Documented htdump and
+ htload, indicating which attributes are used by them.
+
+Fri Oct 12 14:58:15 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/URL.cc (removeIndex): Fixed to make sure the matched file
+ name is at the end of the URL.
+
+Tue Oct 2 09:34:43 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (start_url): Added a reference and link to
+ limit_urls_to, explaining how the two are tied together.
+
+Fri Sep 28 17:19:45 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/htdig-3.1.6.spec: Fixed %install to make symlinks for
+ htdump & htload, added these to %files list.
+
+Fri Sep 28 15:38:00 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (displayMatch): Save rewritten URL in DocumentRef
+ so it'll be used for star_patterns and template_patterns matching.
+
+Fri Sep 28 14:25:29 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (buildMatchList, displayMatch),
+ htsearch/htsearch.cc (main): Added calls to pass search_rewrite_rules
+ to HtURLRewriter class and use it to rewrite URLs in results.
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htcommon/defaults.cc: Added search_rewrite_rules attribute.
+
+Thu Sep 27 16:34:51 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/Makefile.in, htlib/HtRegex.cc, htlib/HtRegex.h,
+ htlib/HtRegexReplace.cc, htlib/HtRegexReplace.h,
+ htlib/HtRegexReplaceList.cc, htlib/HtRegexReplaceList.h,
+ htlib/HtURLRewriter.cc, htlib/HtURLRewriter.h: Added new classes to
+ support regular expressions and implement url_rewrite_rules attribute,
+ using Geoff's variation of Andy Armstrong's implementation of this.
+
+ * htlib/URL.h, htlib/URL.cc: Added URL::rewrite() method.
+
+ * htlib/htString.h: Added Nth() method for HtRegex class.
+
+ * htdig/Retriever.cc (got_href, got_redirect): Added calls to
+ url.rewrite(), and debugging output for this.
+
+ * htdig/htdig.cc (main): Added calls to make instance of
+ HtURLRewriter class.
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htcommon/defaults.cc: Added url_rewrite_rules attribute.
+
+Mon Sep 17 16:52:07 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/running.html: New documentation on how to run after configuring.
+ * htdoc/rundig.html: New manual page for rundig script.
+ * htdoc/install.html: Added link to running.html.
+ * htdoc/contents.html: Added link to running.html, rundig.html, related
+ projects. Updated links to contrib and developer site. Got rid of
+ link to web site stats.
+
+Fri Sep 14 09:18:38 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc (RetrieveHTTP): Add port to Host: header when
+ port is not default, as per RFC2616(14.23). Fixes bug #459969.
+
+Sat Sep 8 22:04:47 2001 Geoff Hutchison <[email protected]>
+
+ * acconfig.h, include/htconfig.h.in: Add undef for
+ ALLOW_INSECURE_CGI_CONFIG, which if defined does about what you'd
+ expect. (This is for any wrapper authors who don't want to rewrite
+ but are willing to run insecure.)
+
+ * htsearch/htsearch.cc: Only allow the -c flag to work when
+ REQUEST_METHOD is undefined. Fixes PR#458013.
+
+Fri Aug 31 16:00:37 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/URL.cc (URL): Fixed to call normalizePath() even if URL
+ is relative but with absolute path. Should fix bug #408586.
+
+Fri Aug 31 15:21:49 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.h, htdig/HTML.cc (HTML, parse, do_tag): Fixed buggy
+ handling of nested tags that independently turn off indexing, so
+ </script> doesn't cancel <meta name=robots ...> tag. Add handling
+ of <noindex follow> tag.
+
+Fri Aug 31 14:33:41 2001 Gilles Detillieux <[email protected]>
+
+ [ Backport some 3.2.0b4 HTML parser changes. ]
+ * htdig/HTML.cc (do_tag): Rewrite using Configuration class to
+ separate tag attributes. Parse <object> tags properly, looking
+ for data= attribute rather than src=. Add support for TITLE
+ attributes in anchor and related tags. Treat <script></script>
+ tags as noindex tags, much like <style></style> as suggested
+ by Torsten.
+ * htdig/HTML.cc(parse): Fix to prevent closing ">" from being passed
+ to do_tag().
+
+Wed Aug 29 10:20:55 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (allow_in_form, build_select_lists,
+ limit_normalized, server_aliases, server_max_docs, server_wait_time,
+ url_part_aliases): Added clarifications to allow_in_form,
+ server_aliases and url_part_aliases descriptions. Changed word
+ "directive" to "attribute" where appropriate. Added cross-link to
+ server_aliases from limit_normalized, and to allow_in_form from
+ build_select_lists.
+
+Mon Aug 27 17:22:56 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc (do_tag): Improve handling of whitespace in META
+ refresh handling. Fixes bug #406244.
+
+Mon Aug 27 16:38:43 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc (parse): Fixed delete [] text (was missing []), added
+ simple optimizations for comment & noindex_start skipping, handle
+ decoded &lt; entity correctly.
+
+Mon Aug 27 15:31:01 2001 Gilles Detillieux <[email protected]>
+
+ [ Backport 3.2.0b4 config files. ]
+ * installdir/htdig.conf: Added .css to bad_extensions default,
+ added missing closing ">", added mentions of accents & substring,
+ fixed a couple typos in comments.
+ * installdir/search.html: Add DTD tag for HTML 4 compliance.
+ * installdir/{long, syntax, header, footer, wrapper, nomatch}.html:
+ Add DTD tags, ALT attributes and remove bogus </select> tags to
+ fix invalid HTML pointed out in PR#901. Change all <b> and <i> tags
+ to the HTML-4.0 compliant <strong> and <em> tags.
+ * htdoc/config.html: Updated with sample of latest htdig.conf and
+ installdir/*.html, added blurb on wrapper.html.
+
+Thu Jul 26 15:05:29 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc, htsearch/parser.cc (perform_or),
+ htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Added new attribute
+ multimatch_method and used it to boost score on 'or' method with
+ multiple matches.
+
+Thu Jul 26 14:25:01 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc, htsearch/parser.cc, htdoc/attrs.html,
+ htdoc/cf_by{name,prog}.html: Added new attribute boolean_syntax_errors
+ and used it to generate syntax error messages for boolean method.
+
+Wed Jul 25 23:39:00 2001 Gilles Detillieux <[email protected]>
+
+ * htnotify/htnotify.cc: Changed calls to EmailNotification class
+ to avoid compiler warnings.
+
+Wed Jul 25 23:15:24 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc, htsearch/htsearch.cc, htdoc/attrs.html,
+ htdoc/cf_by{name,prog}.html: Added new attribute boolean_keywords
+ and used it to make LOGICAL_WORDS and parse "words" using boolean
+ method.
+
+Wed Jul 25 22:31:19 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/Dictionary.cc (Remove): Fixed so it doesn't clobber rest of
+ chain when removing an entry, as suggested by Yariv Tal.
+
+Wed Jul 25 22:06:08 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Add new attributes htnotify_replyto,
+ htnotify_webmaster, htnotify_prefix_file, htnotify_suffix_file.
+
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Document them.
+
+ * htnotify/htnotify.cc, htnotify/EmailNotification.{h,cc},
+ htnotify/Makefile.in: Added in code from Richard Beton
+ <[email protected]> to collect multiple URLs per e-mail
+ address and allow customization of notification messages by
+ reading in header/footer text as designated by the new attributes
+ above.
+
+ * htdoc/THANKS.html: Credit where due.
+
+Wed Jul 25 21:38:21 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Added .css to bad_extensions, for consistency
+ with 3.2.
+
+ * htdoc/attrs.html: Ditto for default value. Also set examples for
+ translate_* and modification_time_is_now to false so the example is
+ different than default.
+
+Wed Jul 25 17:26:07 2001 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc (getParsable): Add conditional to catch
+ text/css files to prevent these from being parsed as Plaintext.
+
+ * htdig/htdig.cc: Quick fix to make the logging -l flag the
+ default behavior. (Set to Retriever_logUrl from the start.)
+
+ * htcommon/defaults.cc: Set modification_time_is_now to default to
+ true (now that it works correctly). Also set translate_*
+ attributes to true.
+
+ * htdoc/htdig.html: Remove documentation for -l flag--now no
+ longer used.
+
+ * htdoc/attrs.html: Correct new default values for
+ modification_time_is_now and translate_* attributes.
+
+Tue Jul 24 16:12:45 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Added reference to maximum_page_buttons in the
+ section on maximum_pages.
+
+Tue Jul 24 15:38:39 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (generateStars): Add NSTARS variable for
+ template output as suggested by Caleb Crome
+ <[email protected]> (except here precision is 0). Fixes
+ feature request #405787.
+
+ * htdoc/hts_templates.html: Add description of NSTARS variable
+ above. (Actually copied hts_templates.html from 3.2.0b4.)
+
+Tue Jul 24 14:21:53 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (expandVariables, outputVariable),
+ htdoc/hts_templates.html: Add support for $=(var) template variable
+ references, as suggested by Quim Sanmarti.
+
+Tue Jul 24 14:12:06 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (readFile): Added missing fclose() call, and
+ debugging message for when file can't be opened.
+
+ * htsearch/Display.cc (displayParsedFile): Added debugging message
+ for when file can't be opened.
+
+Tue Jul 24 14:03:12 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (setVariables), htcommon/defaults.cc: Added
+ maximum_page_buttons attribute, to limit buttons to less than
+ maximum_pages. Fixes PR#731 & PR#781.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Documented it.
+
+Tue Jul 24 13:42:56 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/hts_templates.html, htsearch/Display.cc (displayMatch):
+ Add METADESCRIPTION variable.
+
+Tue Jul 24 13:20:24 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/DocumentDB.{h,cc}: Added FindCoded() method to lookup
+ docdb record with URL that's still encoded.
+
+ * htsearch/Display.cc (display, displayMatch, buildMatchList):
+ Use new method to avoid problems with URLs that are decoded and
+ reencoded with another, more ambiguous url_part_aliases setting.
+ Also fixed a problem with date range checking looking at ref before
+ checking if it's null.
+
+Thu Jul 12 11:45:05 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/conv_doc.pl, contrib/parse_doc.pl: Fixed EOF handling in
+ dehyphenation, fixed to handle %xx codes in title made from URL.
+
+ * contrib/doc2html/doc2html.pl, contrib/doc2html/pdf2html.pl,
+ contrib/doc2html/swf2html.pl: Fixed to handle %xx codes in URL title.
+
+Thu Jul 5 11:23:40 2001 Geoff Hutchison <[email protected]>
+
+ * db/dist/config.guess: Update with more recent GNU version that
+ recognizes various flavors of Mac OS X automatically.
+
+ * htlib/DB2_db.cc: Only #include <malloc.h> if we have it. Fixes
+ compilation problems on Mac OS X.
+
+ * htlib/String.cc: Include <iostream.h> instead of depreciated
+ <stream.h>. Fixes compilation problems with Mac OS X.
+
+ * htlib/Configuration.cc: Make sure we never try to operate on
+ strings of no length--accessing string[-1] is a bug--exposed on
+ Mac OS X.
+
+Fri Jun 29 11:56:25 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc (got_redirect): Allow the redirect to accept
+ relative redirects instead of just full URLs.
+
+Fri Jun 22 16:25:21 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/THANKS.html: Credit Marc Pohl and Robert Marchand.
+
+ * htsearch/Display.cc (buildMatchList): Fix date_factor calculation
+ to avoid 32-bit int overflow after multiplication by 1000, and avoid
+ repetitive time(0) call, as contributed by Marc Pohl. Also move the
+ localtime() call up before gmtime() call, to avoid clobbering gmtime's
+ returned static structure (my thinko).
+
+Tue Jun 19 17:07:01 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (setVariables): Fixed handling of
+ build_select_lists attribute, to deal with new restrict & exclude
+ attributes.
+
+Fri Jun 15 17:45:40 2001 Gilles Detillieux <[email protected]>
+
+ * htdoc/require.html: Added mentions of accents, prefix & substring,
+ taken from 3.2.0b4.
+ * htdoc/htfuzzy: Added blurb on accents algorithm, taken from 3.2.0b4.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Added entry for
+ accents_db attribute for htfuzzy and htsearch. Mentioned accents
+ algorithm in description of search_algorithm. Noted effect of
+ locale setting on floating point numbers in search_algorithm
+ and locale descriptions.
+
+Fri Jun 15 16:47:09 2001 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Accents.{h,cc}, htfuzzy/Fuzzy.c (getFuzzyByName),
+ htfuzzy/htfuzzy.cc (main, usage), htfuzzy/Makefile.in: Added
+ latest version of Robert Marchand's accents fuzzy match algorithm.
+ * htcommon/defaults.cc: Added accents_db attribute for this.
+ * htsearch/htsearch.cc: Fixed parsing of search_algorithm not to
+ use comma as separator, because it may be needed as decimal point
+ in some locales.
+
+Fri Jun 15 16:30:19 2001 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Endings.cc (getWords): Undid change introduced in 3.1.3,
+ in part. It now gets permutations of word whether or not it has
+ a root, but it also gets permutations of one or more roots that
+ the word has, based on a suggestion by Alexander Lebedev.
+ * htfuzzy/EndingsDB.cc (createRoot): Fixed to handle words that have
+ more than one root.
+ * installdir/english.0: Removed P flag from wit, like and high, so
+ they're not treated as roots of witness, likeness and highness, which
+ are already in the dictionary.
+
+Thu Jun 7 17:09:46 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Add new attribute use_doc_date to use
+ document meta information for the DocTime() field.
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Document it.
+ * htdig/HTML.cc(do_tag): Call Retriever::got_time if use_doc_date
+ is set and we run across a META date tag.
+ * htdig/Retriever.h, htdig/Retriver.cc: Add new got_date
+ function. When called, sets the DocTime field of the DocumentRef
+ after parsing is completed. Currently assumes ISO 8601 format for
+ the date tag.
+
+Thu Jun 7 16:48:13 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Add new attribute any_keywords to allow
+ ORing of keywords input parameter.
+ * htsearch/htsearch.cc (addRequiredWords): Use it. Fix handling
+ of empty search word list.
+ * htsearch/Display.cc (excerpt, highlight): Fix handling of case
+ where "words" is empty but "keywords" isn't.
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Document any_keywords.
+
+Thu Jun 7 16:34:41 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Add new attribute plural_suffix to set the
+ language-dependent suffix for PLURAL_MATCHES contributed by Jesse.
+ * htsearch/Display.cc (setVariables): Use it.
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Document it.
+
+Thu Jun 7 16:03:17 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.{h,cc}, htcommon/defaults.cc: Added multi-excerpt
+ feature and max_excerpts attribute, as contributed by Jim Cole.
+ * htdoc/THANKS.html, htdoc/attrs.html, htdoc/cf_byname.html,
+ htdoc/cf_byprog.html: Credit where due, and document attribute.
+
+Thu Jun 7 15:27:33 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc: Backported from 3.2.0b3, fixing these
+ problems: no longer confused by "; charset=..." in Content-Type,
+ avoids security problems with popen() and shell parsing untrusted URL
+ (PR#542, PR#951), avoids predictable temporary file name if mkstemp()
+ exists, binary output from external converter no longer mangled,
+ less ambiguous error messages, opens temp. file in binary mode on
+ non-Unix systems.
+
+Thu Jun 7 15:10:14 2001 Gilles Detillieux <[email protected]>
+
+ * htcommon/DocumentDB.{h,cc}: Replace CreateSearchDB() with DumpDB(),
+ add LoadDB(), both backported from 3.2.0b3.
+ * htdig/htdig.cc (main, usage), htdig/Makefile.in, htdoc/htdig.html:
+ Add handling of -m (minimal) option, file input for URLs, and arg 0
+ handling for htdump & htload.
+ * htdig/HTML.cc (do_tag): Change all white space to blanks in meta
+ description tag, for proper ASCII record dumps by htdump, and to fix
+ bug #405771.
+ * htlib/String.cc (= operator), htlib/htString.cc: change handling
+ of 0 length strings. Add readLine() for htload support.
+
+Thu Jun 7 14:41:42 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc (got_href): Fix hop count mishandling.
+
+Thu Jun 7 14:23:47 2001 Gilles Detillieux <[email protected]>
+
+ * htmerge/db.cc (mergeDB), htmerge/words.cc (mergeWords),
+ installdir/rundig: Fix various htmerge bugs. Quotes the temp.
+ directory name and word_list name (PR#872). Correctly handles
+ words beginning with +, - and ! when in extra_word_characters
+ (PR#952). Corrects problems with bad wordlists generated by
+ htmerge -m causing it to lose entries in words.db and problems
+ with the sort program using non-ASCII collating having a similar
+ effect.
+
+Thu Jun 7 14:13:56 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/htsearch.cc (main), htsearch/Display.cc (setVariables,
+ createURL, buildMatchList), htdoc/THANKS.html, htdoc/hts_form.html,
+ htdoc/hts_templates.html: Add Mike Grommet's date range search
+ feature.
+
+Thu Jun 7 13:57:06 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc (GetLocal, GetLocalUser): Fix to allow compiling
+ on AIX & other non-GNU compilers.
+
+Thu Jun 7 13:52:20 2001 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (setVariables): Extend the handling of
+ build_select_lists to handle select multiple, radio buttons and
+ checkboxes.
+ * htdoc/attrs.html, htdoc/hts_selectors.html: Describe this.
+
+Thu Jun 7 13:40:13 2001 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Exact.cc (Exact), htfuzzy/Prefix.cc (Prefix): Set the
+ name field to the class name, as suggested by Jesse.
+
+Thu Jun 7 13:27:35 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/htdig-3.1.6.spec, contrib/htdig-3.1.6-conf.patch,
+ htdoc/where.html, .version, README: Bump to version 3.1.6.
+
+Thu Jun 7 11:58:28 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/multidig/*: Backport from 3.2.0b3, including fixes below.
+
+ * contrib/multidig/Makefile, gen-collect, db.conf, multidig.conf:
+ Add missing trailing newlines as pointed out by Doug Moran
+
+ * contrib/multidig/Makefile (install): Make sure scripts have a+x
+ permissions. Pointed out by Doug Moran.
+
+ * contrib/multidig/new-collect: Fix typo to ensure MULTIDIG_CONF
+ is set correctly.
+
+Thu Jun 7 11:37:52 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/README: Add in descriptions for web site contrib directory,
+ acroconv.pl & conv_doc.pl.
+ * contrib/examples/rundig.sh: Update to most recent version for 3.1.x.
+ * contrib/htparsedoc/htparsedoc: Add in contributed bug fixes from
+ Andrew Bishop to work on SunOS 4.x machines.
+ * contrib/acroconv.pl: Added external converter script to convert
+ PDFs with acroread.
+
+Thu Jun 7 10:41:05 2001 Gilles Detillieux <[email protected]>
+
+ * htlib/ParsedString.cc (get), htsearch/Display.cc (expandVariables):
+ Use isalnum() instead of isalpha() to allow digits in attribute and
+ variable names, allow '-' in variable names too for consistency.
+
+Wed Jun 6 17:13:49 2001 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc (do_tag): Make parsing of meta robots tag case
+ insensitive.
+
+Wed Jun 6 15:31:00 2001 Gilles Detillieux <[email protected]>
+
+ * contrib/doc2html/DETAILS, contrib/doc2html/README,
+ contrib/doc2html/doc2html.cfg, contrib/doc2html/doc2html.sty,
+ contrib/doc2html/doc2html.pl, contrib/doc2html/pdf2html.pl,
+ contrib/doc2html/swf2html.pl: Added version 3.0 of doc2html,
+ contributed by David Adams <[email protected]>.
+
+Mon Jun 4 10:31:45 CEST 2001 Gabriele Bartolini <[email protected]>
+
+ * htdoc/cf_byname.html: I forgot to insert the 'restrict' attribute.
+
+Wed May 30 11:30:43 2001 Gabriele Bartolini <[email protected]>
+
+ * htsearch/htsearch.cc: two new attributes, used by htsearch, have
+ been added: restrict and exclude. They can now give more control
+ to template customisation through configuration files, allowing
+ to restrict or exclude URLs from search without passing
+ any CGI variables (although this specification overrides the
+ configuration one).
+ * htcommon/defaults.cc: ditto
+ * htdoc/attrs.html: ditto
+ * htdoc/cf_byname.html: ditto
+ * htdoc/cf_byprog.html: ditto
+ * htdoc/hts_form.html: ditto
+
+Sat May 5 21:43:32 2001 Geoff Hutchison <[email protected]>
+
+ * configure.in, configure: Add tests for wait.h, sys/wait.h,
+ mkstemp() and malloc.h.
+
+ * acconfig.h, include/htconfig.h.in: Update with autoheader for
+ new tests.
+
+ * htlib/regex.[h,c]: Update with backports from 3.2.0b4 development.
+
+Tue Feb 29 23:04:04 2000 Geoff Hutchison <[email protected]>
+
+ * htlib/DB2_db.cc (Error): Simply fprint the error message on
+ stderr. This is not a method since the db.h interface expects a C
+ function.
+ (db_init): Don't set db_errfile, instead set errcall to point to
+ the new Error function.
+
+Fri Feb 25 10:11:50 2000 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (maximum_pages): Describe new bahaviour (as of
+ 3.1.4), where this limits total matches shown.
+
+Thu Feb 24 20:24:24 2000 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html: Update to refer to 3.1.5 and edit comments about 3.2.
+
+Thu Feb 24 15:20:08 2000 Gilles Detillieux <[email protected]>
+
+ * htdoc/RELEASE.html, htdoc/main.html: Updated notes for 3.1.5 release.
+
+Thu Feb 24 10:37:45 2000 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (external_parsers): Add references to FAQ 4.8 & 4.9.
+ (local_default_doc): Give an expanded example.
+ (logging): Explain log entry format.
+ (star_blank): Fix some old typos (incorrect references to other attrs.)
+
+Wed Feb 23 13:58:24 2000 Gilles Detillieux <[email protected]>
+
+ * htcommon/cgi.cc(init): Fixed bug: array must be free by
+ delete [] buf, not just delete buf; (from Vadim).
+ * installdir/syntax.html: Fixed a $(WORDS) I'd missed earlier.
+
+Tue Feb 22 12:40:22 2000 Gilles Detillieux <[email protected]>
+
+ * htdoc/RELEASE.html, htdoc/main.html: Updated notes for 3.1.5 release.
+ * htlib/URL.cc (URL, normalizePath): Fix PR#779, to handle relative
+ URLs correctly when there's a trailing ".." or leading "//".
+
+Thu Feb 17 15:58:53 2000 Gilles Detillieux <[email protected]>
+
+ * htdoc/RELEASE.html, htdoc/main.html: Add notes for 3.1.5 release.
+ * htdoc/TODO.html, htdoc/author.html, htdoc/bugs.html,
+ htdoc/cf_general.html, htdoc/cf_types.html, htdoc/cf_variables.html,
+ htdoc/config.html, htdoc/howitworks.html, htdoc/htdig.html,
+ htdoc/htfuzzy.html, htdoc/htmerge.html, htdoc/htnotify.html,
+ htdoc/hts_form.html, htdoc/hts_general.html, htdoc/hts_method.html,
+ htdoc/install.html, htdoc/isp.html, htdoc/mailing.html,
+ htdoc/meta.html, htdoc/notification.html, htdoc/require.html,
+ htdoc/uses.html, htdoc/where.html: Update copyright date and fix
+ last modified date for automatic CVS update.
+
+Thu Feb 17 14:37:18 2000 Gilles Detillieux <[email protected]>
+
+ * installdir/htdig.conf: quote all HTML tag parameters.
+ * htsearch/TemplateList.cc (createFromString), installdir/long.html,
+ installdir/short.html: Use $&(URL) in templates.
+
+Thu Feb 17 14:01:34 2000 Gilles Detillieux <[email protected]>
+
+ * contrib/htdig-3.1.5.spec: Fix silly typos in %post script,
+ make cron script a %config file.
+
+Thu Feb 17 10:34:05 2000 Gilles Detillieux <[email protected]>
+
+ [ Improve htsearch's HTML 4.0 compliance ]
+ * htsearch/TemplateList.cc (createFromString): Use file name rather
+ than internal name to select builtin-* templates, use $&(TITLE) in
+ templates and quote HTML tag parameters.
+ * installdir/long.html, installdir/short.html: Use $&(TITLE) in
+ templates and quote HTML tag parameters.
+ * htsearch/Display.cc (setVariables): quote all HTML tag parameters
+ in generated select lists.
+ * installdir/footer.html, installdir/header.html,
+ installdir/nomatch.html, installdir/search.html,
+ installdir/syntax.html, installdir/wrapper.html:
+ Use $&(var) where appropriate, and quote HTML tag parameters.
+
+Thu Feb 17 10:00:26 2000 Gilles Detillieux <[email protected]>
+
+ * contrib/htdig-3.1.5.spec: Fix %post script to add more descriptive
+ htdig.conf entries.
+
+Wed Feb 16 16:26:05 2000 Gilles Detillieux <[email protected]>
+
+ * contrib/htdig-3.1.5.spec, contrib/htdig-3.1.5-conf.patch,
+ htdoc/where.html, .version, README: Bump to version 3.1.5.
+ * htdoc/THANKS.html: Added new contributors.
+ * htdoc/FAQ.html, htdoc/main.html: Updated to versions from web site.
+
+Wed Feb 16 15:49:28 2000 Gilles Detillieux <[email protected]>
+
+ * htlib/Configuration.h, htlib/Configuration.cc: split Add() method
+ into Add() and AddParsed(), so that only config attributes get parsed.
+ Use AddParsed() only in Read() and Defaults().
+
+Wed Feb 16 15:02:47 2000 Gilles Detillieux <[email protected]>
+
+ * htlib/URL.h (encodeURL): Change list of valid characters to
+ include only unreserved ones.
+ * htlib/cgi.cc (init): Allow "&" and ";" as input parameter separators.
+ * htsearch/Display.cc (createURL): Encode each parameter separately,
+ using new unreserved list, before piecing together query string, to
+ allow characters like "?=&" within parameters to be encoded.
+
+Wed Feb 16 14:42:02 2000 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (encodeSGML, excerpt): Add encoding for
+ characters that could pose problems in HTML output.
+ * htsearch/Display.cc (expandVariables, outputVariables): Add support
+ for $&(var) and $%(var) template variable references. This should
+ fix PR#750, once we use this in common/*.html.
+
+Tue Feb 15 17:21:08 2000 Gilles Detillieux <[email protected]>
+
+ [ Applied a whole collection of patches and fixes from the archives ]
+ * htdig/Server.cc (robotstxt): apply more rigorous parsing of
+ multiple user-agent fields, and use only the first one.
+
+ * htdig/Retriever.cc(GetLocal, GetLocalUser): Add URL-decoding
+ enhancements to local_urls, local_default_urls & local_default_doc,
+ to allow hex encoding of special characters.
+ * htdoc/attrs.html: Document these.
+
+ * htdig/Retriever.cc (IsValidURL): Fix problem with
+ valid_extensions when an "extension" would include part of a
+ directory path or server name, as contributed by Warren Jones.
+ Also fix problem with valid_extensions matching failure when URL
+ parameters follow extension, as reported by [email protected].
+
+ * htdig/Document.cc (RetrieveLocal), htdig/Document.h,
+ htdig/Retriever.cc(Initial, parse_url, GetLocal, GetLocalUser,
+ IsLocalURL, got_href, got_redirect), htdig/Retriever.h,
+ htdig/Server.cc(Server), htdig/Server.h: Apply Paul B. Henson's
+ enhancements to local_urls, local_user_urls & local_default_doc.
+ * htdoc/attrs.html: Document these.
+
+ * htsearch/htsearch.cc (setupWords): Fix problem reported by
+ D.J. Adams, in which bad_words removal failed on upper-case
+ search words.
+
+ * htsearch/Display.cc(setVariables), htcommon/defaults.cc: Added
+ build_select_lists attribute, to generate selector menus in forms.
+ * htdoc/hts_selectors.html: Added this page to explain this new
+ feature, plus other details on select lists in general.
+ * htdoc/hts_templates.html: Added relevant links to related attributes
+ and selectors documentation.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Added relevant
+ explanations and links to selectors documentation.
+
+ * htlib/QuotedStringList.cc (Create): fix PR#743, where quoted string
+ lists didn't allow embedded quotes of opposite sort in strings
+ (e.g. "'" or '"'), and fix to avoid overrunning end of string
+ if it ends with backslash.
+
+ * htcommon/WordList.cc (valid_word): Applied Marc Pohl's fix to make
+ this 8-bit clean on Solaris.
+
+ * contrib/conv_doc.pl, contrib/parse_doc.pl: Applied Warren Jones's
+ changes to these scripts.
+
+ * htdig/PDF.cc (parseNonTextLine): Fix bogus escape sequences
+ around Title parsing. (Fixes PR#740)
+
+ * htsearch/Display.cc (display, displaySyntaxError),
+ htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htcommon/defaults.cc: Add new attribute "nph" to send out
+ non-parsed headers for servers that do not supply HTTP headers on
+ CGI output (e.g. IIS). If nph is set, send out HTTP OK header,
+ as suggested by Matthew Daniel <[email protected]> (PR#727)
+
+ * htdig/Document.cc (getdate): avoid strftime() altogether on
+ filled-in tm structure, to avoid recurring segfault problems. (PR#734)
+
+ * htlib/strptime.cc (mystrptime): Use Warren Jones's fix to deal
+ with a web server that returns dates with a two digit year field.
+ (Fixes PR#770)
+
+ * htdig/HTML.cc (HTML, parse, do_tag), htcommon/defaults.cc,
+ htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Add max_keywords attribute to limit meta keyword spamming.
+
+Wed Dec 8 18:19:32 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html, htdoc/bugs.html: Update to refer to latest versions.
+ (Update for 3.1.4 release.)
+
+Wed Dec 8 18:10:27 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/QuotedStringList.cc (Create): Make sure that an empty
+ token isn't ignored.
+
+Tue Dec 7 10:26:58 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc (setVariables): Fix a compilation error by
+ making a statment with '?' an explicit if-else statment.
+
+ * htdoc/RELEASE.html: Change case_sensitive fix to a bug-fix,
+ update release date for 12/9/99. (We certainly didn't release yesterday!)
+
+Mon Dec 6 22:17:21 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(Display): Add missing call to setupTemplates(),
+ for handling template_patterns. Oops!
+ * htdoc/attrs.html: Fixed a couple typos in new attributes.
+ * htdoc/ChangeLog: Update to latest version.
+
+Mon Dec 6 16:41:04 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/main.html: Update news with latest version.
+ * htdig/htdig.cc(main), htdig/Document.cc(Document),
+ htcommon/defaults.cc, htdoc/attrs.html, htdoc/cf_byname.html,
+ htdoc/cf_byprog.html: Add authorization attribute, settable by
+ htdig -u. Also fixes PR#490, by setting authentication before
+ robots.txt fetched.
+ * htdoc/RELEASE.html: Update with latest fix.
+
+Fri Dec 3 17:31:47 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/DocumentRef.cc(Clear): Set docHopCount & docSig to 0,
+ and clear docEmail, docNotification & docSubject strings to have
+ a clean slate for Deserialize(), which assume 0/empty for these.
+ Fixes problem with hop counts getting clobbered.
+ * htdoc/RELEASE.html: Update with latest fix.
+ * htdoc/ChangeLog: Update to latest version.
+
+Fri Dec 3 12:12:19 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc: removed vestiges of internal Postscript
+ support that never worked, and removed test for application/msword,
+ which is handled only by external parser.
+ * htdig/Makefile.in: removed Postscript.o from list.
+ * htdig/Retriever.cc(parse_url): Fix compilation error;
+ (Initial, got_href, got_redirect): Try to get the local filename
+ for a server's robots.txt file and pass it along to the newly
+ generated server.
+ * htdig/Server.cc(Server): Retrieve the robots.txt file from the
+ filesystem when possible; fix compilation error.
+ * htdig/Server.h(Server): Add local_robots_file parameter to Server().
+ * htlib/HtWordType.h, htlib/HtWordType.cc: fix compilation errors.
+
+Fri Dec 3 10:52:57 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc(parse, do_tag): Add handling of <img alt=...> text,
+ fix parsing of words in meta tags, disable indexing of meta tags
+ when "noindex" state in effect, fix calculations of word positions
+ to more accurately reflect relative positions.
+ * htlib/HtWordType.h, htlib/HtWordType.cc: Add HtWordToken() function,
+ to replace strtok() in HTML parser.
+ * htdoc/RELEASE.html: Update with latest fixes.
+
+Fri Dec 3 09:02:55 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/Configuration(Add): handle strings in single quotes, as in
+ parm='value'.
+
+Thu Dec 2 16:14:28 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Add Tom Metro's suggested revisions for pdf_parser
+ and external_parsers.
+
+Thu Dec 2 15:15:03 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/mailing.html: Updated to version from htdig.org web site.
+ * htcommon/defaults.cc: Add missing no_page_number_text and
+ page_number_text attribute definitions.
+ * htdoc/attrs.html(modification_time_is_now): Make the description
+ a bit clearer as to how it may cut down on reindexing.
+
+Thu Dec 2 13:46:11 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(parse_url), htdig/Server.cc(Server),
+ htcommon/defaults.cc, htdoc/attrs.html, htdoc/cf_byname.html,
+ htdoc/cf_byprog.html: Add support for local_urls_only attribute.
+ * htdoc/RELEASE.html: Update with latest feature.
+
+Thu Dec 2 11:02:07 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/URL.cc(ServerAlias): Fix server_aliases processing to prevent
+ infinite loop (as for local_urls in PR#688).
+
+Wed Dec 1 17:23:24 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(parse_url), htdig/Server.h: add IsDead() methods
+ to query and set server status, use them in Retriever to avoid repeated
+ HTTP request to a dead server. (Needed for persistent local stuff.)
+
+Wed Dec 1 16:56:28 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(GetLocal): Fix error in GetLocalUser() return
+ value check, as suggested by Vadim.
+ * contrib/conv_doc.pl: Added a sample external converter script.
+ * htdoc/THANKS.html: A couple more additions.
+
+Tue Nov 30 15:02:25 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(IsValidURL): Fix compilation error in
+ valid_extensions list handling.
+ * contrib/htdig-3.1.4.spec, contrib/htdig-3.1.4-conf.patch:
+ Added sample RPM spec file and config patch for it.
+
+Tue Nov 30 14:01:51 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/where.html: Bump to version 3.1.4.
+ * htdoc/THANKS.html: Added new contributors.
+ * htdoc/isp.html, htdoc/uses.html, htdoc/main.html, htdoc/mailing.html:
+ Updated to versions from htdig.org web site.
+
+Tue Nov 30 13:01:20 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/RELEASE.html: Add release notes for 3.1.4 release.
+ * .version, README: Bump for 3.1.4.
+
+Tue Nov 30 11:03:34 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html(backlink_factor): Added Geoff's clarification of
+ what this attribute does.
+
+Tue Nov 30 09:47:05 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc(RetrieveLocal): Handle common extensions for
+ text/plain, application/pdf & application/postscript.
+ * htdig/Retriever.cc(IsValidURL): Add valid_extensions list handling,
+ make it and bad_extensions case insensitive.
+ * htcommon/defaults.cc: Add config attribute valid_extensions,
+ with default as empty.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: Document it.
+
+Tue Nov 30 09:02:02 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(got_href & got_redirect): remove all of Patrick's
+ case insensitive server code, to replace it with Geoff's fix to URL.cc
+ * htlib/URL.cc(normalizePath, path): If not case_sensitive,
+ lowercase the URL. Should ensure that all URLs are appropriately
+ lowercased, regardless of where they're generated.
+
+Mon Nov 29 20:25:01 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc, htdig/Retriever.h, htdig/Server.cc(push),
+ htdig/Server.h: added Alexis's patch for persistent local digging
+ even if HTTP server is down. Also made new GetLocal() method
+ call GetLocalUser() itself, to simplify its use, and made it
+ non-private, for eventual use by Server code.
+
+Mon Nov 29 19:18:20 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(got_href & got_redirect): corrections to case
+ insensitive server fix, to handle redirects, to make more thorough
+ use of mapped URL, and to update it after normalization.
+
+Fri Nov 26 17:14:46 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc(RetrieveHTTP): always c.close() the connection
+ when returning.
+ * htdig/HTML.cc(HTML & do_tag): add code to turn off indexing between
+ <style> and </style> tags.
+
+Fri Nov 26 16:31:06 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/Configuration.cc(Read): fixed to allow final line without
+ terminating newline character, rather than ignoring it.
+ * htlib/String.cc(write): added Alexis Mikhailov's fix to bump up
+ pointer after writing a block.
+ * htsearch/Display.cc(setVariables): added Alexis Mikhailov's fix
+ to check the number of pages against maximum_pages at the right time.
+ (Put it even earlier, to make sure nPages is at least 1.)
+ * htsearch/Display.cc(generateStars): Remove extra newline after
+ STARSRIGHT and STARSLEFT variables, noted by Torsten Neuer
+
+Wed Nov 24 20:33:13 1999 Gilles Detillieux <[email protected]>
+
+ * installdir/htdig.conf: Add bad_extensions to make it
+ more obvious to users how to exclude certain document types.
+ Fix the comments for search_algorithm to refer to all the current
+ possibilities. Add example of no_excerpt_show_top attribute in
+ line with most user's expectations. (Geoff's changes)
+
+Wed Nov 24 20:02:32 1999 Gilles Detillieux <[email protected]>
+
+ * installdir/search.html (Match): Add Boolean to default search
+ form, as suggested by PR#561.
+
+Tue Nov 23 23:03:45 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(setupTemplates), htsearch/Display.h: fixed a
+ couple of compilation errors in template_patterns code.
+
+Tue Nov 23 22:16:31 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(got_href): Applied Patrick's case insensitive
+ server fix, to lowercase all URLs if case_sensitive is false.
+
+Tue Nov 23 22:08:22 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/StringList.cc(Join): Applied Loic's patch to fix memory leak.
+
+Tue Nov 23 21:52:18 1999 Gilles Detillieux <[email protected]>
+
+ [Applied patch from Hanno Mueller <[email protected]>, which includes...]
+ * contrib/README: Add scriptname directory.
+ * contrib/scriptname/*: An example of using htsearch within
+ dynamic SSI pages
+ * htcommon/defaults.cc: Add script_name attribute to override
+ SCRIPT_NAME CGI environment variable.
+ * htdoc/FAQ.html: Update question 4.7 based on including htsearch
+ as a CGI in SSI markup.
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htdoc/hts_templates.html: Update based on behavior of script_name
+ attribute.
+ * htsearch/Display.cc: Set SCRIPT_NAME variable to attribute
+ script_name if set and CGI environment variable if undefined.
+
+Tue Nov 23 21:29:03 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: Added the past few month's updates to the FAQ.
+
+Tue Nov 23 21:20:35 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc, htsearch/Display.h, htsearch/Display.cc,
+ htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htdoc/hts_templates.html: add template_patterns attribute, to select
+ result templates based on URL patterns.
+
+Tue Nov 23 20:52:38 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/cgi.h, htlib/cgi.cc(cgi & init), htsearch/htsearch.cc
+ (main & usage): allow a query string to be passed as an argument.
+
+Tue Nov 23 20:35:05 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(setVariables & createURL),
+ htsearch/htsearch.cc(main), htdoc/hts_templates.html: handle keywords
+ input parameter like others, and make it propagate to followups.
+
+Tue Nov 23 20:25:45 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: removed vestigial references to MAX_MATCHES
+ template variables in search_results_{header,footer}.
+
+ * htdoc/hts_form.html: add disclaimer about keywords parameter not
+ being limited to meta keywords.
+
+ * htdoc/meta.html: add description of "keywords" meta tag property.
+ add links to keywords_factor & meta_description_factor attributes.
+
+Tue Nov 23 20:07:20 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(setVariables & hilight): added Sergey's idea
+ for start_highlight, end_highlight & page_number_separator attributes.
+ * htcommon/defaults.cc: added defaults for these.
+ * htdoc/attrs.html, htdoc/cf_by{name,prog}.html: documented them.
+
+Tue Nov 23 19:58:28 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc: added support for external converters
+ as extension to external_parsers attribute.
+ * htdoc/attrs.html: Updated external_parsers with new description
+ and examples of external converters.
+
+Tue Nov 23 19:52:27 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc(transSGML), htdig/SGMLEntities.cc(translateAndUpdate):
+ Fix the infamous problem in htdig 3.1.3 of mangling URL parameters that
+ contain bare ampersands (&), and not converting &amp; entities in URLs.
+
+ * htdig/Retriever.cc(IsLocal & IsLocalUser): Fix PR#688, where
+ htdig goes into an infinite loop if an entry in local_urls
+ (or local_user_urls) is missing a '=' (or a ',').
+
+ * htcommon/cgi.cc(cgi): Fix bug in reading long queries via POST
+ method (PR#668).
+
+ * htnotify/htnotify.cc(send_notification): apply Jason Haar's fix
+ to quote the sender name "ht://Dig Notification Service".
+
+Wed Sep 22 11:12:38 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/ChangeLog, htdoc/isp.html, htdoc/FAQ.html,
+ htdoc/RELEASE.html, htdoc/THANKS.html, htdoc/attrs.html,
+ htdoc/bugs.html, htdoc/contents.html, htdoc/main.html,
+ htdoc/require.html, htdoc/uses.html, htdoc/where.html: Update for
+ 3.1.3 release and synch with latest versions from the website.
+
+Wed Sep 15 17:54:31 1999 Alexander Bergolth <[email protected]>
+
+ A few changes to satisfy the AIX xlC compiler:
+
+ * htdig/htdig.cc: Moved variable declaration out of case block.
+
+ * configure.in, htconfig.in: Add check for sys/select.h.
+ Add "long unsigned int" to the possible getpeername_length types.
+
+ * htlib/Connection.cc: Include sys/select.h.
+
+Sun Sep 12 15:02:19 1999 Geoff Hutchison <[email protected]>
+
+ * .version: Bump for 3.1.3.
+
+ * README: Bump first line for 3.1.3 release, remove mention of rx
+ directory.
+
+ * htdoc/ChangeLog: Update with latest version.
+
+ * htdoc/RELEASE.html: Add release notes for 3.1.3 release.
+
+Thu Sep 9 14:52:19 1999 Gilles Detillieux <[email protected]>
+
+ * contrib/parse_doc.pl: fix bug in pdf title extraction.
+
+Wed Sep 1 15:58:14 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc(got_word): add code to check for compound words
+ and add their component parts to the word database.
+
+ * htdig/PDF.cc(parseString), htdig/Plaintext.cc(parse): Don't strip
+ punctuation or lowercase the word before calling got_word. That
+ should be left up to got_word & Word methods.
+
+ * htlib/StringMatch.h, htlib/StringMatch.cc(Pattern, IgnoreCase):
+ Add an IgnorePunct() method, which allows matches to skip over valid
+ punctuation, change Pattern() and IgnoreCase() to accomodate this.
+
+ * htsearch/htsearch.cc(main, createLogicalWords): use IgnorePunct()
+ to highlight matching words in excerpts regardless of punctuation,
+ toss out old origPattern, and don't add short or bad words to
+ logicalPattern.
+
+ * htlib/HtWordType.h, htlib/HtWordType.cc(Initialize): set up and
+ use a lookup table to speed up HtIsWordChar() and HtIsStrictWordChar().
+
+Wed Sep 1 15:48:13 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/PDF.cc(parse), htcommon/defaults.cc, htdoc/attrs.html:
+ Fix PDF.cc to handle acroread in Acrobat 4, which has a bug with
+ the -pairs option. It turns out that even without the -pairs
+ option, acroread 4 is still prone to segmentation violations when
+ generating PostScript, so acroread 3 is a better choice anyway.
+
+ * htdoc/FAQ.html: Added the past few month's updates to the FAQ.
+
+ * contrib/parse_doc.pl: Updated to latest version, adapted for
+ xpdf 0.90.
+
+Wed Sep 1 15:39:41 1999 Gilles Detillieux <[email protected]>
+
+ Applied "bugfixes" patch collection, which I had posted to
+ [email protected] mailing list in August. Changes include...
+
+ * htsearch/Display.cc(expandVariables): Fix problem with $(VAR)
+ at end of template string not being expanded.
+
+ * htlib/URL.cc(URL): Fix PR#566 by setting the correct length of the
+ string being matched. 'http://' is 7 characters. Submitted by
+
+ * htdig/HTML.h, htdig/HTML.cc(do_tag, transSGML): Fix the HTML parser
+ to decode SGML entities within tag attributes.
+
+ * htlib/URL.cc(ServerAlias): Fix server_aliases entries so port
+ defaults to 80 if omitted.
+
+ * htlib/URL.cc(removeIndex): Fix the infamous problem with files
+ like left_index.html not getting indexed. PR#543 & PR#585.
+
+ * htdig/PDF.cc(parseNonTextLine): Fixed a bug in the PDF parser:
+ when the Title header was just the temporary file name, it
+ wouldn't be used, but it also wouldn't be cleared from the
+ _parsedString variable, so it ended up polluting the document
+ excerpt.
+
+ * htdig/Document.cc(RetrieveHTTP): Added error messages for unknown
+ hosts.
+
+ * htlib/cgi.cc(cgi): Fix PR#572, where htsearch crashed if
+ CONTENT_LENGTH was not set but REQUEST_METHOD was.
+
+ * htdig/HTML.cc(do_tag): Fix <meta> robots parsing to allow
+ multiple directives to work correctly. Fixes PR#578, as provided
+ by Chris Liddiard <[email protected]>.
+
+ * htsearch/htsearch.cc(main): Allow multiple keywords input
+ parameters in search forms.
+
+ * htdig/Document.cc(Reset, readHeader): Fix the bug in the handling
+ of modification_time_is_now.
+
+ * htfuzzy/Fuzzy.cc(getWords), htfuzzy/Metaphone.cc(vscode,generateKey):
+ Should fix PR#514 in the bug database. It's Geoff's first attempt,
+ with a minor correction, plus an added test in the vscode macro,
+ which is where the problem seemed to be happening. This won't
+ map accented vowels to their unaccented counterparts, but
+ it should hopefully put an end to the segmentation faults.
+
+ * include/htconfig.h.in, htcommon/WordReference.h,
+ htcommon/WordList.cc(Word, Flush, BadWordFile),
+ htcommon/DocumentRef.cc(AddDescription), htcommon/defaults.cc,
+ htsearch/parser.cc(perform_push), htdoc/attrs.html,
+ htdoc/cf_byname.html, htdoc/cf_byprog.html: Change the maximum word
+ length into a run-time option, rather than compile-time.
+
+ * htsearch/Display.cc(displayMatch): Applied Torsten Neuer's
+ <[email protected]> fix for PR#554.
+
+ * htdig/HTML.cc(HTML, do_tag): Added support for <embed>, <object>
+ and <link> tags.
+
+ * htdig/htdig.cc(main): Applied Geoff's patch to hide the
+ username/password in the command line arguments.
+
+ * htdig/Document.cc(readHeader): Fixed a few problems with header
+ parsing, including PR#535 & PR#557.
+
+ * htdig/Document.cc(getdate): This should help with PR#81 & PR#472,
+ where strftime() would crash on some systems. Idea submitted
+
+ * COPYING, htdoc/COPYING, Makefile.in: Updated the FSF address
+ in COPYING & Makefile.in. PR#595.
+
+ * htdig/Retriever.cc(IsValidURL): Fix PR#493, to avoid rejecting
+ a valid URL with ".." in it.
+
+ * htlib/URL.cc(parse): Fix PR#348, to make sure a missing
+ or invalid port number will get set correctly.
+
+ * htsearch/Display.h, htsearch/Display.cc(excerpt): Fix declaration
+ to refer to "first" as reference--ensures ANCHOR is properly set.
+ Fixes PR#541 as suggested by <[email protected]>.
+
+ * htdig/ExternalParser.cc(parse): Quote the filename before passing
+ it to the command-line to prevent shell escapes. Fixes PR#542.
+ Also make error messages more useful.
+
+ * htfuzzy/Endings.cc(getWords): Suffix-handling improvement (PR#560),
+ to prevent inappropriate suffix stripping in endings fuzzy matches.
+
+ * htlib/URLTrans.cc(encodeURL): Fix encoding so all non-ascii
+ characters get hex-encoded. I think this is what PR#339 was all about.
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Added descriptions for attributes that were missing, added
+ a few clarifications, and corrected a few defaults and typos.
+ Covers PR#558, PR#626, and then some.
+
+ * configure.in, configure, include/htconfig.h.in, htlib/regex.c:
+ Fix PR#545, to test for presence of alloca.h
+
+Wed Apr 21 22:45:16 1999 Geoff Hutchison <[email protected]>
+
+ * .version: Bump for final 3.1.2 release.
+
+ * htdoc/where.html, htdoc/FAQ.html: Update to mention the new release.
+
+Tue Apr 20 13:34:22 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/RELEASE.html: Fixed a few typos, updated modification date.
+
+Tue Apr 20 10:54:59 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/RELEASE.html: Add notes on changes in the 3.1.2 release.
+
+ * htdoc/contents.html, htdoc/mailarchive.html, htdoc/where.html,
+ htdoc/uses.html: Update with versions from maindocs.
+
+ * installdir/htdig.conf: Add example max_doc_size attribute to cut
+ down on FAQ, also add comment on including a file for start_url.
+
+Mon Apr 19 15:40:24 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/WordList.cc(valid_word): fixed to avoid having the new
+ HtIsStrictWordChar() test circumvent the allow_numbers option by
+ allowing numbers all the time. Also fixed to allow HtIsStrictWordChar()
+ to override iscntrl(), so extra_word_characters can define characters
+ that a broken locale would define as control characters.
+
+Mon Apr 19 15:17:12 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/WordList.cc(valid_word): fixed bug introduced Jan 9,
+ where it stopped scanning for control characters prematurely.
+ Now also use iscntrl() to detect all control characters.
+
+Fri Apr 16 10:30:42 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: fixed typo - use_meta_description was plural.
+
+Wed Apr 14 20:22:31 1999 Alexander Bergolth <[email protected]>
+
+ * htlib/regex.h: fixed compile problem with AIX xlc compiler
+
+Tue Apr 13 13:01:04 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(generateStars): Set status to -1 if
+ URLimage.hasPattern() fails, to avoid empty URLimageList.
+ (Fix to Mar 31 change.)
+
+Tue Apr 13 11:27:45 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.h(class Display): move enum SortType up to public
+ section, to avoid problem compiling on IBM AIX C++ compiler.
+
+Mon Apr 12 17:36:20 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: added sections on indexing docs in other languages,
+ practical & theoretical limits of ht://Dig.
+
+Fri Apr 9 16:47:34 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: Fixed a few typos.
+
+Fri Apr 9 16:24:21 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc(RetrieveHTTP): Show "Unable to build connection"
+ message at lower debug level.
+
+Fri Apr 9 15:17:53 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/FAQ.html: Added changes in maindocs from Mar 18, a few
+ clarifications, and four new questions.
+
+Wed Apr 7 19:41:12 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc (usage): Remove bogus -w flag.
+
+Thu Apr 1 11:58:20 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/htsearch.cc(main): Apply Gabriele's patch to avoid using an
+ invalid matchesperpage CGI input variable.
+
+ * htsearch/Display.cc(display) & (setVariables): Correct any invalid
+ values for matches_per_page attribute to avoid div. by 0 error.
+
+Wed Mar 31 18:21:21 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/htdig.cc: Undo March 30 change.
+
+ * htdig/Retriever.cc: Use excludes.hasPattern before using the
+ exclude list. (More elegant solution to problem, as pointed out by
+ Gilles.)
+
+ * htsearch/Display.cc: Remove code setting URLimage to a bogus
+ pattern. Instead, check that URLimage.hasPattern() before using
+ it.
+
+Wed Mar 31 15:16:36 1999 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Synonym.cc: Fix previous fix of minor memory leak.
+ (db pointer wasn't properly set)
+
+Tue Mar 30 20:08:18 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/htdig.cc: If exclude_urls attribute is set to empty, set
+ it to something that will never match a URL to ensure nothing is
+ excluded.
+
+ * Makefile.config.in: Fix typo leading to HTLIBS referring to itself.
+
+Mon Mar 29 16:47:48 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(excerpt): Added patch from Gabriele to
+ improve display of excerpts--show top of description always,
+ otherwise try to find the excerpt.
+
+Mon Mar 29 15:57:06 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/htdig.cc: Rename main.cc for consistency with other
+ directories.
+
+ * htdig/Makefile.in: Use it.
+
+Mon Mar 29 12:53:17 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/HtWordType.h (HtIsWordChar): Avoid matching 0 when using
+ strchr.
+ (HtIsStrictWordChar): Ditto. (Patch from Hans-Peter Nilsson)
+
+Mon Mar 29 10:51:54 1999 Geoff Hutchison <[email protected]>
+
+ * htlib/regex.h, htlib/regex.c: Include glibc versions of the
+ regex functions to override possibly buggy system versions.
+
+ * htlib/Makefile.in: Use them.
+
+ * htfuzzy/EndingsDB.cc: Use glibc regex functions instead of rx
+ for massive speedups on non-English affix files.
+
+ * configure, configure.in: Use the system timegm function if present.
+ Don't configure rx since we don't use it any more. Don't worry
+ about tsort since that was only needed for rx.
+
+ * Makefile.in, Makefile.config.in: Ignore the rx directory if present.
+
+Thu Mar 25 12:24:18 1999 Gilles Detillieux <[email protected]>
+
+ * installdir/long.html, installdir/short.html: Remove backslashes
+ before quotes in HTML versions of the builtin templates.
+
+ * Makefile.in: Add long.html & short.html to COMMONHTML list, so
+ they get installed in common_dir.
+
+Thu Mar 25 11:45:59 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(displayMatch), htcommon/defaults.cc,
+ htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Add date_format attribute suggested by Marc Pohl.
+
+Thu Mar 25 09:49:33 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(displayMatch): Avoid segfault when DocAnchors
+ list has too few entries for current anchor number.
+
+Wed Mar 24 12:20:02 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/main.cc (main): Call HtWordType::Initialize. (Missed this
+ one yesterday. Oops!)
+
+Tue Mar 23 17:11:46 1999 Gilles Detillieux <[email protected]>
+
+ * backport Hans-Peter Nilsson's suite of changes for HtWordType
+ and extra_word_characters support, to 3.1.2...
+
+ * htlib/HtWordType.h (class HtWordType): New.
+ * htlib/HtWordType.cc: New.
+ * htlib/Makefile.in (OBJS): Add HtWordType.o
+
+ * htdoc/attrs.html: Document attribute extra_word_characters.
+ * htdoc/cf_byprog.html: Ditto.
+ * htdoc/cf_byname.html: Ditto.
+
+ * htcommon/defaults.cc (defaults): Add extra_word_characters.
+
+ * htsearch/htsearch.h: Lose spurious extern declaration of unused
+ variable valid_punctuation.
+ * htsearch/htsearch.cc (main): Call HtWordType::Initialize.
+ (setupWords): Use HtIsWordChar, HtIsStrictWordChar and
+ HtStripPunctuation. Do not read valid_punctuation.
+
+ * htsearch/Display.cc (excerpt): Use HtIsStrictWordChar.
+
+ * htlib/StringMatch.cc (FindFirstWord): Ditto.
+ (CompareWord): Ditto.
+
+ * htdig/Retriever.h (class Retriever): Lose member
+ valid_punctuation.
+ * htdig/Retriever.cc (Retriever): Lose its initialization.
+
+ * htdig/Postscript.h (class Postscript): Lose member
+ valid_punctuation.
+ * htdig/Postscript.cc (Postscript): Lose its initialization.
+ (flush_word): Use HtStripPunctuation.
+ (parse_string): Use HtIsWordChar,
+ HtIsStrictWordChar and HtStripPunctuation.
+
+ * htdig/Parsable.h (class Parsable): Lose member
+ valid_punctuation.
+ * htdig/Parsable.cc (Parsable): Lose its initilization.
+
+ * htcommon/WordList.cc (valid_word): Use HtIsStrictWordChar.
+ (BadWordFile): Use HtStripPunctuation. Do not read
+ valid_punctuation.
+
+ * htcommon/DocumentRef.cc (AddDescription): Use HtIsWordChar,
+ HtIsStrictWordChar and HtStripPunctuation. Do not read
+ valid_punctuation.
+
+ * htdig/PDF.cc (parseString): Similar..
+
+ * htdig/HTML.cc (parse): Similar.
+
+ * htdig/Plaintext.cc (parse): Similar.
+
+Tue Mar 23 15:52:33 1999 Gilles Detillieux <[email protected]>
+
+ * .version: Bump to 3.1.2-dev.
+
+Tue Mar 23 14:50:37 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/String.cc: Fix up code to be cleaner with memory
+ allocation, inline next_power_of_2, fix some memory leaks.
+ (Geoff's changes of Feb 22-25)
+
+Tue Mar 23 14:35:37 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/HtWordCodec.cc(HtWordCodec): Fix bug with constructing from
+ uninitialized variables!
+
+ * htlib/HtURLCodec.cc (~HtURLCodec): Add missing deletion of
+ myWordCodec.
+
+Tue Mar 23 14:18:16 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/PDF.cc(parseString): Use minimum_word_length instead of
+ hardcoded constant.
+
+Tue Mar 23 12:02:00 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(generateStars): Add in support for use_star_image
+ which was lost when template support was put in way back when.
+
+Tue Mar 23 11:47:52 1999 Gilles Detillieux <[email protected]>
+
+ * Makefile.in: add missing ';' in for loops, between fi & done
+
+Mon Mar 22 19:26:56 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/DocumentRef.cc(AddDescription): Check to see that
+ description isn't a null string or contains only whitespace before
+ doing anything.
+
+Mon Mar 22 19:21:16 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/DocumentRef.h, htcommon/DocumentRef.cc: Fix #ifdef
+ problems with zlib.
+
+Mon Mar 22 19:14:40 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html (template_name): Typo; used by htsearch, not htdig.
+
+Mon Mar 22 19:10:56 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Retriever.cc (got_href): Check if the ref is for the
+ current document before adding it to the db. (From H-P Nilsson, Mar 8)
+
+Mon Mar 22 19:03:23 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Rephrase and clarify entry for url_part_aliases.
+ (From Hans-Peter Nilsson, Mar 2)
+
+Mon Mar 22 18:48:10 1999 Gilles Detillieux <[email protected]>
+
+ * htfuzzy/Synonym.cc: Fix minor memory leak.
+
+ * htlib/Dictionary.h, htlib/Dictionary.cc(hashCode): Check if key
+ can be converted to an integer using strtol. If so, use the
+ integer as the hash code. (Geoff's patch)
+
+Mon Mar 22 18:23:11 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/List.cc(Nth): Check for out-of-bounds requests before
+ doing anything.
+
+Mon Mar 22 17:50:47 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc(display): Free DocumentRef memory after
+ displaying them.
+ (displayMatch): Fix memory leak when documents did not have anchors,
+ fix problems when documents did not have descriptions.
+
+Mon Mar 22 17:32:14 1999 Gilles Detillieux <[email protected]>
+
+ * htmerge/docs.cc(convertDocs): Replace previous verbose patch
+ with H-P Nilsson's.
+
+Mon Mar 22 17:13:35 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Plaintext.cc, htmerge/words.cc: removed Log lines.
+
+Mon Mar 22 16:11:31 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/htsearch.cc: Add patch from Jerome Alet <[email protected]>
+ to allow '.' in config field but NOT './' for security reasons.
+
+Mon Mar 22 15:56:55 1999 Geoff Hutchison <[email protected]>
+
+ * installdir/long.html, installdir/short.html: Write out HTML
+ versions of the builtin templates. (committed to 3.1.2 by Gilles)
+
+ * installdir/htdig.conf: Add commented-out template_map and
+ template_name attributes to use the on-disk versions.
+
+Mon Mar 22 15:13:33 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc, htdoc/attrs.html: Change default locale
+ to "C", as H-P Nilsson recommended.
+
+ * htlib/Configuration.cc(Add): Fix small memory leak in locale code,
+ as Geoff discovered.
+
+Mon Mar 22 15:03:10 1999 Gilles Detillieux <[email protected]>
+
+ * contrib/parse_doc.pl: uses pdftotext to handle PDF files,
+ generates a head record with punctuation intact, extra checks
+ for file "wrappers" & check for MS Word signature (no longer
+ defaults to catdoc), strip extra punct. from start & end of words,
+ rehyphenate text from PDFs, fix handling of minimum word length.
+
+Mon Mar 22 14:38:01 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Plaintext.cc(parse): Use minimum_word_length instead of
+ hardcoded constant.
+
+Mon Mar 22 14:33:45 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/Configuration.cc(Add): Fix function to avoid infinite loop
+ on some systems, which don't allow all the letters in isalnum() that
+ isalpha() does, e.g. accented ones.
+
+ * htdig/HTML.cc: Fix three reported bugs about inconsistent
+ handling of space and punctuation in title, href description & head.
+ Now makes destinction between tags that cause word breaks and those
+ that don't, and which of the latter add space.
+
+Mon Mar 22 14:25:34 1999 Gilles Detillieux <[email protected]>
+
+ * htmerge/docs.cc: Make htmerge -vv report reasons for deleting docs.
+
+ * htmerge/words.cc(mergeWords): Fix to prevent description text
+ words from clobbering anchor number of merged anchor text words.
+
+Fri Mar 19 17:09:21 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc: Fix bug where noindex_start was empty, allow case
+ insensitive matching of noindex_start & noindex_end.
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Fix inconsistencies in documentation for noindex_start & noindex_end.
+
+Fri Mar 19 17:05:16 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc: Add check for <a href=...> tag that is missing a
+ closing </a> tag, terminating it at next href.
+
+Fri Mar 19 17:00:18 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Document.cc: Fix check of Content-type header in readHeader(),
+ correcting bug introduced Jan 10 (for PR#91), and check against
+ allowed external parsers.
+
+ * htdig/HTML.cc: More lenient comment parsing, allows extra dashes.
+
+Fri Mar 19 16:52:51 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc: Check for presence of more than one <title> tag.
+
+ * htlib/mytimegm.cc: Fix Y2K problems.
+
+Fri Mar 19 16:43:28 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc: Add patch from Gabriele to ensure META
+ descriptions are parsed, even if 'description' is added to the
+ keyword list.
+
+Fri Mar 19 16:37:08 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/parser.h, htsearch/parser.cc: Clean up patch made for
+ error messages, made on Feb 16.
+
+Tue Feb 16 23:48:09 1999 Geoff Hutchison <[email protected]>
+
+ * configure.in, configure: Default to 'int' when we cannot
+ establish type used by getpeername.
+
+ * htdoc/RELEASE.html: Additional notes on everything fixed in 3.1.1.
+
+Tue Feb 16 23:45:26 1999 Gilles Detillieux <[email protected]>
+
+ * contrib/parse_doc.pl: Add replacement for less-capable (and
+ buggy) parse_word_doc.pl script. Handles Word, PS, RTF, and
+ WordPerfect files, with appropriate file->text converters.
+
+ * htsearch/parser.cc, htsearch/parser.h: Add more error messages
+ when the boolean expression is invalid.
+
+Mon Feb 15 21:02:24 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc(RetrieveLocal): Fix to ensure we report
+ reading only max_doc_size bytes, even when the document is larger.
+
+ * configure.in, configure: Add 'socklen_t' to getpeername check to
+ prevent problems configuring on Solaris 7.
+
+ * htdoc/RELEASE.html: Minor changes for 3.1.1 release.
+
+Sun Feb 14 16:29:48 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc(retrieveHTTP, retrieveLocal): Fix document
+ size when the document is larger than max_doc_size. Size should be
+ that sent by the server or as given by stat().
+
+ * htdoc/*.html: More cleanups from Marjolein.
+
+Sat Feb 13 20:53:34 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc(got_word): Ensure heading is in a normal range.
+
+ * htdoc/RELEASE.html: Added information on the bugs fixed in 3.1.1.
+
+ * htdoc/attrs.html: Added info on the changed syntax of the pdf_parser
+ attribute in 3.1.0 and later.
+
+Sat Feb 13 20:29:26 1999 Marjolein Katsma <[email protected]>
+
+ * htdoc/*.html: Cleaned up HTML, fixed typos, added appropriate
+ HTML 4.0 syntax, added DTDs to files, other minor fixed.
+
+Fri Feb 12 19:58:28 1999 Geoff Hutchison <[email protected]>
+
+ * .version: Bump for version 3.1.1.
+
+ * configure.in, configure: Fix problems determining getpeername
+ syntax under IRIX.
+
+ * db/os/os_map.c: Fixed problems on AlphaLinux pointed out by Paul
+ J. Meyer.
+
+Fri Feb 12 12:00:25 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc: Fix crashes noted by Frank Richter.
+
+ * contrib/htparsedoc/parse_word_doc.pl: Use updated version (with
+ fixed line breaks).
+
+ * htnotify/htnotify.cc: Add patch mentioned in Feb 8 documentation
+ change.
+
+Thu Feb 11 00:29:42 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/DocumentRef.cc (NUM_ASSIGN): Expand from unsigned types.
+ (getnum): Use temporary for "unsigned short", and memcpy data into
+ it instead of assignment.
+
+Tue Feb 9 19:21:55 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html, htdoc/where.html: Update for 3.1.0 release.
+
+ * htdoc/uses.html: Added remaining backlog.
+
+ * htdoc/RELEASE.html: Finish up release notes for 3.1.0.
+
+Tue Feb 9 19:19:13 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc: Ensure we remove the temporary file.
+
+Mon Feb 8 20:28:07 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/ma_menu: Change relative URLs to absolute URLs to
+ www.htdig.org to reflect the changing mail archive.
+
+ * htdoc/install.html: Add notes on new configure flags to set
+ CONFIG variables.
+
+ * htdoc/*.html: Ensure Last Modifed date stamps are up-to-date.
+
+Mon Feb 8 20:26:40 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/meta.html, htdoc/notification.html: Add info on date
+ formats for the htnotify-date tag, esp. in relation to ISO 8601.
+
+Sat Feb 6 23:24:19 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.cc: Fixed compile problem when zlib is disabled.
+
+ * htdoc/cf_byname, htdoc/cf_byprog.html, htdoc/attrs.html: Added
+ entries for url_log, compression_level, noindex_start, noindex_end,
+ allow_in_form, bad_querystr, no_title_text.
+
+ * htdoc/THANKS.html: Added Gabriele Bartolini.
+
+ * htdoc/uses.html, htdoc/FAQ.html, htdoc/bugs.html: Synch with the
+ latest versions from the website tree.
+
+Fri Feb 5 19:57:39 1999 Gilles Detillieux <[email protected]>
+
+ * htnotify/htnotify.cc: Add function parse_date() to parse date
+ strings from htnotify-date tags. It tries to be as flexible as
+ possible about formatting and will report invalid dates. Based in
+ part from code contributed by Gabriele Bartolini.
+
+Fri Feb 5 19:28:24 1999 Geoff Hutchison <[email protected]>
+
+ * configure, configure.in: Add a test to ensure the zlib.h header
+ file exists.
+
+ * include/htconfig.h.in: Added definition for HAVE_ZLIB_H.
+
+ * htcommon/DocumentRef.h, htcommon/DocumentRef.cc: Add checks for
+ HAVE_ZLIB_H in addition to HAVE_LIBZ. Ensures the library is
+ actually accessible, not just present.
+
+ * htfuzzy/Soundex.cc: Fix typo.
+
+Thu Feb 4 22:51:37 1999 Gilles Detillieux <[email protected]>
+
+ * Makefile.in: Clean up previous patch and tidy up HTML and
+ dictionary installation.
+
+Thu Feb 4 22:31:35 1999 Ric Klaren <[email protected]>
+
+ * Makefile.in, */Makefile.in: Add support for
+ $INSTALL_ROOT, making it easier to build packages (e.g. RPMs) into
+ directories for later processing.
+
+ * htsearch/Display.cc: Tiny patch to silence a compiler warning.
+
+Thu Feb 4 13:03:44 1999 Geoff Hutchison <[email protected]>
+
+ * htfuzzy/Soundex.cc(generateKey): Skip initial non-alphabetic
+ characters and explicitly skip characters without values.
+
+ * htfuzzy/Metaphone.cc(generateKey): General bug-fixing, fixing a
+ bug that corrupted the string to be processed, fixing typos, and
+ ensuring keys generated fit the metaphone algorithm.
+
+ * htfuzzy/Fuzzy.cc(getWords): Add debugging output of the fuzzy
+ key used.
+
+ * contrib/doclist/doclist.pl, contrib/doclist/listafter.pl,
+ contrib/whatsnew/whatsnew.pl, contribu/urlindex.pl: Change to
+ support additions to ht://Dig database format.
+
+Thu Feb 4 02:09:22 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc: Add debugging information on words
+ returned from fuzzy matching.
+
+ * htfuzzy/Metaphone.cc(addWord): Fix bug where only one word would be
+ stored per key in the database.
+
+ * htfuzzy/Soundex.cc(addWord): Ditto.
+ (generateKey): Rewrite to generate keys correctly.
+
+Wed Feb 3 19:24:36 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/htdig.html: Added documentation on the -l log and restart
+ feature.
+
+ * htdoc/htmerge.html: Added documentation on the -m merge database
+ feature.
+
+ * htdig/main.cc: Added documentation on the -l flag to the usage
+ message.
+
+ * .version: Bump to 3.1.0.
+
+Wed Feb 3 19:09:31 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc: Add check for URLs with no / in the
+ no_title code.
+
+ * htdig/Document.cc: Fix problems with dates returned from servers
+ with incorrect formats. Those simply missing the day of week are
+ parsed correctly, otherwise output an error, use the current date,
+ and keep going.
+
+Wed Feb 3 09:57:14 1999 Geoff Hutchison <[email protected]>
+
+ * installdir/nomatch.html: Fix small typo.
+
+ * htdoc/RELEASE.html: Finish up 3.1.0 release notes.
+
+ * htdoc/TODO.html: Update with status and new directions.
+
+Wed Feb 3 14:22:11 1999 Alexander Bergolth <[email protected]>
+
+ * htsearch/Display.cc(setVariables): Removed some of yesterdays
+ changes. Thanks to Gilles!
+
+Tue Feb 2 17:26:06 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/PDF.h, htdig/PDF.cc: Fix problems with PDFs generated by
+ CorelDraw.
+
+ * htdoc/attrs.html: Fixed small typo.
+
+Tue Feb 2 21:02:25 1999 Alexander Bergolth <[email protected]>
+
+ * htsearch/Display.cc(setVariables,createURL): As pointed out by
+ Gilles, append allow_in_form variables to the query strings only
+ if they are given as input parameters.
+
+Tue Feb 2 10:29:09 1999 Geoff Hutchison <[email protected]>
+
+ * configure, configure.in: Rewrite getpeername_length_t detection
+ to use prototypes to eliminate type conversion.
+
+ * htsearch/Display.cc(buildMatchList): Ensure scores are always
+ positive or zero.
+
+Mon Feb 1 22:54:02 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdoc/attrs.html: Correct "default" for "nothing_found_file".
+
+Mon Feb 1 14:44:32 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc(displayMatch): Remove compiler warnings.
+
+ * */Makefile.in: Define INSTALL_PROGRAM from configure script.
+
+Mon Feb 1 14:04:18 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/ExternalParser.cc: Add checks to prevent wayward parsers
+ from bringing down the dig.
+
+Sun Jan 31 23:15:36 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/WeightWord.cc(set): Ensure word is lowercased for
+ accurate fuzzy comparisons.
+
+ * htfuzzy/Fuzzy.cc(openIndex): Destroy the database reference if
+ we cannot open the database. Fixes a coredump in classes that
+ inherit this method.
+
+ * Makefile.config.in: Remove bogus definitions of INSTALL.
+
+ * Makefile.in: Define INSTALL, INSTALL_PROGRAM, INSTALL_SCRIPT,
+ and INSSTALL_DATA as defined by configure. Use them.
+
+ * htdoc/RELEASE.html: Started release notes for version 3.1.0.
+
+Mon Feb 1 04:36:29 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htsearch/Display.cc (displayMatch): Fix leaking user of
+ String(String *).
+ * htfuzzy/Prefix.cc (getWords): Ditto.
+
+ * htlib/htString.h, htlib/String.cc (String(const String &)): New.
+ * htlib/htString.h, htlib/String.cc (String(const String &, int)):
+ No default argument.
+ * htlib/htString.cc, htlib/String.cc (String(String *)): Removed.
+
+Sun Jan 31 21:46:52 1999 Alexander Bergolth <[email protected]>
+
+ * htlib/Connection.cc: Include sys/time.h needed by select, fixes
+ PR #322.
+
+Sun Jan 31 20:50:38 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdig/Retriever.cc (Initial, GetRef, Need2Get, IsValidURL,
+ got_href, got_redirect): Do not lowercase URLs.
+
+ * htlib/HtURLCodec.h (class HtURLCodec): Fake a friend function.
+
+Sat Jan 30 22:29:50 1999 Geoff Hutchison <[email protected]>
+
+ * configure, configure.in: Add support for program name
+ transformations.
+
+ * */Makefile.in: Do it.
+
+Sat Jan 30 21:16:50 1999 Geoff Hutchison <[email protected]>
+
+ * htmerge/docs.cc: Added translation of Dutch comment for us ignorant
+ Americans. ;-)
+
+ * installdir/rundig: As mentioned by Gilles, use sed with ls -t
+ test. Add more comments for FAQs.
+
+ * configure.in, configure: Add --disable-zlib to turn off compiling
+ compression entirely. Add --with-cgi-bin-dir,
+ --with-image-dir and --with-search-dir flags to set CONFIG
+ variables.
+
+ * CONFIG.in: Use them.
+
+Sat Jan 30 21:05:35 1999 Randy Winch <[email protected]>
+
+ * htcommon/DocumentRef.h: If using compressed document databases,
+ declare compress and decompress functions and the current state of
+ the head (excerpt).
+
+ * htcommon/DocumentRef.cc: Change document compression to only
+ compress the DocHead field and only decompress when necessary.
+
+Sat Jan 30 03:49:21 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/DocumentRef.h: Add #ifdef around declaration of
+ c_buffer.
+
+ * htcommon/DocumentRef.cc: Remove spurious extra "static" from
+ c_buffer definition. Add #ifdef HAVE_LIBZ around it.
+
+Fri Jan 29 13:30:11 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc: Construct the StringMatch used for finding
+ excerpts in two pieces--user input and post-fuzzy matching. Fixes
+ problems with matching searches with punctuation.
+
+ * htlib/StringMatch.cc(IgnoreCase): Fix small memory leak pointed
+ out by Gilles.
+
+Thu Jan 28 21:36:03 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/*.html: Changed copyright information to mention the
+ ht://Dig group, removing Andrew's name.
+
+ * README, configure.in, Makefile.in: Ditto.
+
+ * configure: Change mention of libg++ -> libstdc++.
+
+Thu Jan 28 12:53:40 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Document new remove_default_doc attribute.
+
+ * Makefile.in: Make sure we put the wrapper file in the right place.
+ Make sure dictionaries are installed with the correct permissions.
+
+ * installdir/rundig: Use a portable test for testing the endings
+ and synonym databases. Also enhanced support for flags (-a, -s,
+ -vvv, -c config).
+
+ * htsearch/Display.cc: Fix bug when sorting results would cause a
+ coredump.
+
+Wed Jan 27 20:00:40 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/HTML.cc, htdig/SGMLEntities.cc, htdig/ExternalParser.cc,
+ htcommon/WordList.cc, htcommon/DocumentRef.cc: Speedup by
+ converting many config lookups into static variables.
+
+ * htdoc/attrs.html, htdoc/hts_templates.cc, htdoc/cf_byname.html,
+ htdoc/cf_byprog.html: Various minor fixes.
+
+ * htsearch/Display.cc: Fix problems with star_patterns attribute.
+
+Wed Jan 27 13:02:39 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/SGMLEntities.cc: Use StringMatch class for matching
+ &quot; &amp; &lt; and &gt; as defined by config options. Should
+ speed up translation.
+
+ * htdoc/THANKS.html: Minor updates for contributions towards 3.1.0.
+
+Tue Jan 26 19:29:08 1999 Geoff Hutchison <[email protected]>
+
+ * include/htconfig.h.in: Define TRUE and FALSE if not
+ defined. Change default of NO_WORD_COUNT (now undefined) for
+ compatibility.
+
+ * htdig/htdig.h: Remove definition of TRUE and FALSE (for consistency).
+
+ * htcommon/DocumentDB.cc(Add, Delete, Exists, []): Do not
+ lowercase the URL before storing it. URLs can be case-sensitive.
+
+Tue Jan 26 19:07:03 1999 Gilles Detillieux <[email protected]>
+
+ * htcommon/defaults.cc: Define remove_default_doc as option of
+ default document to strip off URLs (e.g. /index.html -> /).
+
+ * htlib/URL.cc(removeIndex): Use it.
+ (normalizePath): Fix bug with stripping double slashes and the
+ like from a query string.
+
+ * htdig/Document.h, htdig/Document.cc: Add new variable
+ contentLength and consider content-length headers when reading in
+ documents.
+
+ * htdig/PDF.cc: Fix broken code calling acroread.
+
+ * htsearch/Display.cc: Allow braces in wrapper file.
+
+ * htdoc/hts_general.html, htdoc/hts_templates.html: Add info on
+ the wrapper alternative to separate header and footer files.
+
+ * htdoc/config.html, installdir/header.html,
+ installdir/nomatch.html, installdir/wrapper.html,
+ installdir/search.html: Change sort option to be more grammatically
+ correct.
+
+Tue Jan 26 21:19:02 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htmerge/docs.cc (convertDocs): Use HtURLCodec to encode URLs
+ going into the doc_index database.
+
+ * htsearch/Display.cc (buildMatchList): Use HtURLCodec to decode
+ URLs from docIndex.
+
+ * htcommon/defaults.cc (defaults): Fix typo with "case_sensitive".
+
+Tue Jan 26 18:08:19 1999 Alexander Bergolth <[email protected]>
+
+ * include/htconfig.h.in: Added HAVE_STRINGS_H. (I forgot that when
+ added the configure check.)
+ * htdig/Retriever.h: Fix small compiler error. Removed Log-lines.
+
+Tue Jan 26 02:22:45 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdig/main.cc (main): Fix typo "uncoded_db_compatbile".
+
+Mon Jan 25 19:38:31 1999 Geoff Hutchison <[email protected]>
+
+ * htlib/Configuration(Find): Make error message for missing
+ entries conditional to DEBUG symbol. Removes odd error messages
+ under normal use.
+
+Sun Jan 24 23:55:57 1999 Geoff Hutchison <[email protected]>
+
+ * htmerge/db.cc, htmerge/docs.cc: Fix compiler errors.
+ * htnotify/htnotify.cc: Similar.
+
+Sun Jan 24 14:13:37 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/WordRecord.h (struct WordRecord): Remove member count
+ if NO_WORD_COUNT defined.
+ * htmerge/db.cc (mergeDB): Remove handling.
+ * htmerge/words.cc (mergeWords): Similar.
+
+ * include/htconfig.h.in: Define NO_WORD_COUNT by default.
+
+Sun Jan 24 14:13:37 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc(logSearch): Added fix from Gilles in case
+ REMOTE_ADDR is NULL as well.
+
+ * htnotify/htnotify.cc: Fix compiler warnings.
+
+ * htlib/String.cc(indexOf): Use autoconf check for strstr, fix
+ compiler warnings.
+
+ * htlib/Configuration.cc(Find): Complain when option is not in the
+ list.
+
+ * htdig/HTML.cc(parse): Move declarations out of the loop.
+ (parse): Don't add non-word characters to the excerpt if they're
+ in the title. Fixes PR #80.
+
+Mon Jan 25 02:17:58 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/defaults.cc (defaults): New option
+ "uncoded_db_compatible", default true.
+
+ * htcommon/DocumentDB.h (DocumentDB::SetCompatibility): New
+ function.
+ (DocumentDB::myTryUncoded): New member.
+
+ * htcommon/DocumentDB.cc (Constructor, Add(), operator[],
+ Exists(), Delete()): Handle uncoded URL in database if
+ myTryUncoded.
+
+ * htdig/main.cc (main): Call (DocumentDB::)SetCompatibility() with
+ option "uncoded_db_compatible".
+ * htsearch/Display.cc (Display): Likewise.
+ * htnotify/htnotify.cc (main): Likewise.
+ * htmerge/docs.cc (convertDocs): Likewise.
+ * htmerge/db.cc (mergeDB): Likewise.
+
+ * htdoc/attrs.html, htdoc/cf_byname.html, htdoc/cf_byprog.html:
+ Document option "uncoded_db_compatible".
+
+Sun Jan 24 15:21:02 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htlib/HtWordCodec.cc (HtWordCodec(StringList &, etc)): Check
+ limits separately for "to" and "from". Do not calculate
+ string-lengths separately for limit-checking; use methods Count()
+ and length() on data near the final result.
+
+ * htlib/HtWordCodec.cc (HtWordCodec constructors): Do not
+ explicitly add '\0' to the pattern strings.
+
+ * htlib/HtWordCodec.cc (code): Check for zero-length replacement
+ list.
+
+Sat Jan 23 22:18:18 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc(parse_url): If a server ignores the
+ If-Modified-Since request, still compare the retrieved date to the
+ stored date to see if it has been modified.
+
+Sat Jan 23 13:09:03 1999 Geoff Hutchison <[email protected]>
+
+ * htmerge/htmerge.cc: Unlink the db.docs.index file before we
+ build it again. This ensures we have a clean copy and don't
+ duplicate URLs.
+
+Fri Jan 22 23:12:12 1999 Geoff Hutchison <[email protected]>
+
+ * include/htconfig.h.in: Cleaned up preprocessor definitions.
+
+ * configure.in, configure: Fix NEED_PROTO_GETHOSTNAME check and
+ make check for GETPEERNAME_LENGTH_T more flexible.
+
+ * htlib/Connection.cc: Change __sun__ to NEED_PROTO_GETHOSTNAME
+ since we prefer feature tests.
+
+Sat Jan 23 02:38:08 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htsearch/Display.cc (logSearch): Fix simple typo in last change.
+
+Sat Jan 23 01:18:05 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htlib/String.cc (operator =): Add const modifier: const String &.
+ * htlib/htString.h (String::operator=(const String &)): Ditto.
+
+ * htlib/DB2_db.h (class DB2_db): Make Put(), Get(), Exists() and
+ Delete() use const modifiers on appropriate parameters.
+ * htlib/DB2_db.cc: Ditto.
+ * htlib/GDBM_db.h (class GDBM_db): Ditto.
+ * htlib/GDBM_db.cc: Ditto.
+ * htlib/Database.h (class Database): Ditto.
+ * htlib/Database.cc (Put): Similar.
+
+ * htlib/BTree.h (class BTree): Make Put(), Get() and Exists() use
+ const modifiers on appropriate parameters.
+ * htlib/BTree.cc: Ditto.
+
+ * htcommon/DocumentDB.cc (Add, operator[], Exists, Delete): Remove
+ needless temporary String.
+ * htcommon/DocumentRef.cc (Deserialize): Ditto.
+
+Fri Jan 22 21:10:12 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/Configuration.cc: Add support for keyword "include" to
+ include other config files.
+
+ * htdoc/cf_general.html: Document it.
+
+Thu Jan 21 23:25:37 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc(logSearch): Check if HTTP_REFERER is NULL,
+ if so, use a dash. (Otherwise we'll kill some syslog() services).
+
+Thu Jan 21 05:30:40 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htlib/HtURLCodec.h, htlib/HtURLCodec.cc, htlib/HtWordCodec.cc,
+ htlib/HtWordCodec.h, htlib/HtCodec.cc, htlib/HtCodec.h: New files.
+
+ * htlib/Makefile.in (OBJS): Add the corresponding *.o files
+
+ * htcommon/DocumentDB.cc (Open, Read, Add, operator[], Exists,
+ Delete, CreateSearchDB, URLs): Use HtURLCodec; ::encode() and
+ ::decode() the URL used as a key.
+
+ * htcommon/DocumentRef.cc (Serialize): Encode the URL using
+ HtURLCodec.
+ (Deserialize): Decode it.
+
+ * htmerge/htmerge.h: #include <HtURLCodec.h>
+ * htmerge/htmerge.cc (main): Check HtURLCodec for errors.
+ * htnotify/htnotify.cc (main): Ditto.
+ * htsearch/htsearch.cc (main): Ditto.
+ * htdig/main.cc (main): Ditto.
+
+ * htcommon/defaults.cc (defaults): Add common_url_parts and
+ url_part_aliases.
+
+ * htdoc/cf_byprog.html, htdoc/cf_byname.html,
+ htdoc/attrs.html: Document url_part_aliases and
+ common_url_parts.
+
+ * htlib/StringMatch.h (StringMatch::Pattern): Add default
+ parameter sep = '|'.
+
+ * htlib/StringMatch.cc (Pattern): Similar.
+
+Wed Jan 20 20:20:35 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc(logSearch): Use REMOTE_ADDR when REMOTE_HOST
+ is unavailable (otherwise we silently dump core). Fixes PR #138.
+
+ * htcommon/WordList.cc(valid_word): Words cannot be valid if
+ they're shorter than minimum_word_length! Fixes PR #139.
+
+ * htsearch/Display.cc(expandVariables): Allow variables of the
+ form ${VAR}, fixes PR #121.
+
+Wed Jan 20 17:21:33 1999 Gilles Detillieux <[email protected]>
+
+ * htmerge/docs.cc: Fix logic to remove documents--missing else
+ statements allow some "deleted" documents to not be removed.
+
+Wed Jan 20 11:52:18 1999 Geoff Hutchison <[email protected]>
+
+ * htlib/good_strtok.h, htlib/good_strtok.cc: Added fixes and speed
+ improvements contributed by Andrew Bishop.
+
+ * htdig/ExternalParser.cc, htdig/Server.cc, htlib/cgi.cc,
+ htmerge/db.cc, htmerge/words.cc: Call good_strtok with appropriate
+ parameters (explicitly include NULL first parameter, second param
+ is char, not char *).
+
+ * htcommon/WordList.cc(Word): Added check for adding words with
+ weight zero.
+
+ * htsearch/Display.h, htsearch/Display.cc: Revised setting ANCHOR
+ variable: it will be empty if there is no excerpt which matches
+ the search formula. Fixes problems with META descriptions. Based
+ on a patch contributed by Marjolein.
+
+Wed Jan 20 00:30:12 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/SGMLEntities.cc: Declare extern config, since we now use
+ config options.
+
+ * htsearch/Display.cc: Fix typo causing compile problems.
+
+Tue Jan 19 23:51:38 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Added options translate_amp, _lt_gt, _quot as
+ suggested by Marjolein to control SGML translation of these
+ entities.
+
+ * htdig/SGMLEntities.cc: Use them as contributed by Marjolein.
+
+Tue Jan 19 12:55:36 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htlib/StringMatch.cc (Pattern): Always set PreviousState before
+ checking PreviousValue.
+
+ * htlib/StringMatch.cc (FindFirst): Be "greedy"; match longest.
+ (Compare): Ditto.
+
+ * htcommon/DocumentRef.cc (MEMCPY_ASSIGN, NUM_ASSIGN): New macros
+ for assigning portably to some possibly-enum numeric type.
+ (getnum): Use them.
+
+ * htlib/StringMatch.cc (FINAL): Remove.
+ (MATCH_INDEX_MASK): Include highest bit.
+ (Pattern, FindFirst, Compare, FindFirstWord, CompareWord): Do not
+ use FINAL.
+ (FindFirst, Compare, FindFirstWord, CompareWord): When shifting by
+ INDEX_SHIFT, cast to unsigned.
+
+Mon Jan 18 17:43:29 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Added no_title_text option to allow
+ configuration of the text when no title is available. Default is
+ the filename.
+
+ * htsearch/Display.cc: Use no_title_text to set the title
+ appropriately, as contributed by Marjolein.
+
+ * htsearch/Display.cc: Ensure PERCENT variable has a minimum of 1.
+
+Mon Jan 18 17:41:44 1999 Gilles Detillieux <[email protected]>
+
+ * htdig/Server.cc: Use max_doc_size when retrieving robots.txt
+ files instead of a hard-coded 10k limit.
+
+ * htdig/Document.cc: When reading chunks of document, if a chunk
+ puts us over the max_doc_size limit, take everything up to that
+ limit (rather than discarding the entire chunk).
+
+ * htcommon/DocumentRef.cc: Fix thinko with compression_level.
+
+Sun Jan 17 21:48:05 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/(attrs.html, cf_byname.html, cf_byprog.html, config.html,
+ hts_form.html, hts_templates.html): Add documentation for "sort"
+ config and form input.
+
+ * htcommon/defaults.cc: Added options "sort" and "sort_names" to
+ pick result sorting order and text names for sort options.
+
+ * htsearch/Display.cc: Added variable SORT to render a form menu
+ for sort options, based on "sort" and "sort_names" options.
+
+ * installdir/(wrapper.html, header.html, nomatch.html,
+ footer.html, search.html, syntax.html): Add in sort option to form.
+
+Sun Jan 17 14:03:54 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/TemplateList.h
+ htsearch/TemplateList.cc(createFromString): Ensure
+ template_map config has three members for each template we add,
+ contributed by Gabriele Bartolini <[email protected]>.
+
+ * htsearch/Display.cc(Display): Take advantage of createFromString
+ returning an error value to bail out of poorly-constructed
+ template_maps, based on code contributed by <[email protected]>.
+
+ * htdig/PDF.cc: Add debugging output of URLs causing
+ problems. Also, switch system call to make it easier to call xpdf
+ instead of acroread.
+
+ * htcommon/defaults.cc: Change default pdf_parser attribute to
+ include acrobat-specific flags. Fix mismatched naming of
+ compression_level (was compression_factor).
+
+ * htdig/Retriever.cc: Fix compiler warnings.
+
+ * contrib/examples/updatedig: Added contributed rundig-type script
+ from David Robley <[email protected]>.
+
+Sun Jan 17 13:42:43 1999 didier Gautheron <[email protected]>
+
+ * htcommon/defaults.cc: add url_log parameter for save and restart
+ function.
+
+ * htdig/Retriever.cc, htdig/Retriever.h: Add save and restart
+ function.
+
+ * htdig/main.cc: Add option -l for save and restart
+ function.
+
+ * htdig/PDF.cc: Check to see if we have acroread before copying
+ the pdf into TMPDIR!
+
+Fri Jan 15 07:23:30 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/DocumentRef.cc(Serialize): Save
+ space when lengths can fit in an unsigned char or unsigned short.
+
+ * htcommon/DocumentRef.cc(Deserialize): Handle expansion.
+
+Thu Jan 14 23:37:29 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Added options noindex_start and
+ noindex_end to enable NOT indexing some sections of
+ HTML. Contributed by Marjolein.
+
+ * htdig/HTML.cc: Use them.
+
+ * contrib/examples/rundig.sh: Add rundig example from Colin
+ Viebrock with a few modifications for using less disk space.
+
+Thu Jan 14 23:27:24 1999 Gilles Detillieux <[email protected]>
+
+ * htlib/URL.cc: Fix parent path logic to ignore slashes in query
+ string. Noted by Adam Coyne <[email protected]>.
+
+Thu Jan 14 00:04:03 1999 Geoff Hutchison <[email protected]>
+
+ * README: Fix for upcoming 3.1.0 release.
+
+ * htcommon/defaults.cc: Set compression_factor to 0 for default
+ (no compression).
+
+Thu Jan 14 03:16:15 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdig/ExternalParser.cc (parse): Added support for 'm': meta element.
+
+ * htdoc/attrs.html: Document it.
+
+Wed Jan 13 21:31:38 1999 Geoff Hutchison <[email protected]>
+
+ * Makefile.in(install): Add wrapper.html to the common directory
+ when installing.
+
+ * contrib/examples: Added directory for example common files
+ (e.g. badwords, dictionaries, templates, etc.)
+
+ * contrib/examples/badwords: Added example bad_words file by Marjolein.
+
+ * .version: Bump to 3.1.0dev.
+
+ * htdig/HTML.cc(parse): Added slight fixes to the comment parsing
+ code, contributed by Marjolein.
+
+Wed Jan 13 20:11:26 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Fix typo with META example.
+
+ * htdig/Document.cc: Use new StringList::Join function for
+ http_proxy_exclude.
+
+ * htnotify/htnotify.cc: Bring latest security patch from 3.1.0b4
+ onto the mainline source.
+
+ * installdir/wrapper.html: New file to merge header and footer files.
+
+ * htcommon/defaults.cc: Added search_results_wrapper for the
+ location of the wrapper file, if used. (The default is empty,
+ which uses header.html and footer.html)
+
+ * htsearch/Display.cc: Added support for using the wrapper instead
+ of header and footer if search_results_wrapper is set.
+
+ * htsearch/htsearch.cc: Added check for sort config.
+
+ * htsearch/Display.cc, htsearch/Display.h: Added support for
+ sorting and reverse sorting by date, time, and score.
+
+Wed Jan 13 18:45:17 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Removed use_document_compression
+ (redundant) and fixed problem with missing comma. Setting
+ compression_factor to 0 is the equivalent of turning off
+ use_document_compression.
+
+ * htcommon/DocumentRef.cc(Serialize, Deserialize): Update from
+ Randy Winch to eliminate use_document_compression and fix
+ compilation problems noted by Hans-Peter.
+
+ * htmerge/db.cc: Fixed problem with db.NextDocID() being set
+ incorrectly, reported by Roman Dimov <[email protected]>.
+
+ * htcommon/DocumentDB.h: Added IncNextDocID to allow big changes
+ in db.NextDocID(), such as those above.
+
+ * htdoc/THANKS.html: Added Akos Domotor.
+
+Wed Jan 13 07:07:35 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htsearch/htsearch.cc (setupWords): Remove parsedWords parameter
+ with accociated processing of original words - deletion of
+ bad_words, spacing and on-the-fly modifiers.
+ (main): Create originalWords from input, not via setupWords().
+
+Tue Jan 12 09:16:49 1999 didier Gautheron <[email protected]>
+
+ * htcommon/WordList.cc, htmerge/words.cc: Changed field order
+ in db.wordlist. With the old order, words from HTML body and words
+ from links to that url weren't merged sometimes.
+
+ * htdig/Document.cc, htmerge/words.cc: Small speed improvements.
+
+ * htdig/HTML.cc: Fixed small memory leak with bogus HTML and small
+ speedups.
+
+ * htdig/Retriever.cc(got_href) : if ref exists we have to call
+ AddDescription even if max_hop_count is reached. It's important
+ for wwwoffle (urls in the cache are restricted by max_hop_count)
+
+ * htcommon/DocumentDB.cc, htcommon/DocumentDB.h, htdig/Retriever.cc,
+ htlib/Dictionary.cc, htlib/Dictionary.h, htlib/Object.cc,
+ htlib/Object.h, htlib/String.cc, htlib/htString.h,
+ htcommon/WordList.cc: Speedups after gprof data.
+
+Tue Jan 12 07:23:35 1999 didier Gautheron <[email protected]>
+
+ * htlib/Configuration.cc: Fixed time format to standard to avoid
+ sending If-Modified-Since http headers in native format (which
+ would be incorrect behavior). Use C locale.
+
+ * htlib/Dictionary.h, htlib/Dictionary.cc: Add new method
+ GetNextElement to directly return next object when iterating.
+
+Tue Jan 12 12:56:26 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.h, htcommon/DocumentRef.cc(serialize,
+ deserialize): Added support for compressing data using zlib if
+ available, contributed by Randy Winch <[email protected]>.
+
+ * htcommon/defaults.cc: Added config options
+ use_document_compression and compression_factor for zlib support.
+
+ * configure.in, include/htconfig.h.in: Added autoconf check for
+ libz and deflate function.
+
+ * configure: Generated from above change.
+
+Mon Jan 11 22:48:17 1999 Geoff Hutchison <[email protected]>
+
+ * htmerge/db.cc: Fixed thinko with setting the docIDs of new words
+ in the destination wordlist.
+
+ * htdoc/FAQ.html, htdoc/THANKS.html, htdoc/contents.html: Minor
+ cleanups.
+
+ * htdoc/RELEASE.html: Added release info from 3.1.0b4.
+
+ * htdoc/uses.html: Alphabetized, added a form for requests, and
+ added in lots of new sites.
+
+Mon Jan 11 02:42:51 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htsearch/htsearch.cc (setupWords): Do not skip words if
+ "boolean" search.
+
+Mon Jan 11 00:42:51 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdoc/hts_method.html: Add explanation of operator "not".
+
+ * installdir/syntax.html: Added examples of correct logical
+ expressions.
+
+Mon Jan 11 00:23:58 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/attrs.html(search_algorithm): Added prefix and substring
+ matching--somehow slipped through the cracks!
+
+ * htdoc/THANKS.html: Update to be more accurate as far as recent
+ contributions.
+
+Sun Jan 10 00:06:59 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc(readHeader): Added check for header status
+ when considering content-types. Fixed PR #91.
+
+Sat Jan 9 20:52:49 1999 didier Gautheron <[email protected]>
+
+ * htcommon/WordList.cc(valid_word): Break out of looping once
+ we're sure the word is invalid.
+
+ * htlib/Dictionary.cc(Remove, Exists): Remember special case of an
+ empty dictionary.
+
+Sat Jan 9 20:16:25 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc(parse): Don't capitalize headers--this creates
+ problems with non-ASCII values, since String::uppercase doesn't
+ know how to capitalize them. Fixes PR #100.
+
+Sat Jan 9 14:47:17 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc(getdate): Strip off weekday before calling
+ strptime since some servers return invalid weekdays. Fixes PR #79.
+
+ * htmerge/htmerge.h: Declare new mergeDB code.
+
+ * htmerge/htmerge.cc: Set up merge_config file and add options for
+ mergeDB code.
+
+ * htmerge/db.cc: New file. Implements merging of two database sets
+ specified by the merge_config and config variables.
+
+ * htmerge/Makefile.in: Add db.o as an object to be compiled.
+
+Fri Jan 8 20:11:56 1999 Alexander Bergolth <[email protected]>
+
+ * htdig/Plaintext.cc: fixed bug that inhibited compressing of
+ whitespace
+
+ * htlib/URL.cc: fixed problem in stripping anchors from URLs
+
+Thu Jan 7 23:29:32 1999 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc(parse): Corrected problems with parsing comments,
+ as contributed by Marjolein Katsma <[email protected]> and
+ Gilles.
+
+ * htsearch/Display.cc, htsearch/Display.h: Implement
+ add_anchors_to_excerpt option and new variable ANCHOR as
+ contributed by Marjolein.
+
+ * htdoc/THANKS.html: Added new contributors.
+
+ * README: Update for 1999 copyright, version, etc.
+
+Thu Jan 7 17:29:52 1999 Geoff Hutchison <[email protected]>
+
+ * htdoc/(attrs.html, cf_byname.html, cf_byprog.html): Fix typo
+ noted by Joe Jah: keyword_factor -> keywords_factor.
+
+Thu Jan 7 14:32:34 1999 Gilles Detillieux <[email protected]>
+
+ * htsearch/Display.cc (display): The start template, if provided,
+ should come out after the header, not before.
+
+ * htcommon/defaults.cc, installdir/footer.html: Use the
+ no_page_list_header stuff.
+
+Thu Jan 7 11:09:08 1999 Geoff Hutchison <[email protected]>
+
+ * installdir/*.png: Add PNG versions of the default GIF graphics.
+
+Wed Jan 6 22:03:54 1999 Geoff Hutchison <[email protected]>
+
+ * htfuzzy/Synonym.cc, htfuzzy/htfuzzy.cc, htmerge/docs.cc,
+ htmerge/words.cc, htdig/SGMLEntities.cc: Fix minor memory leaks.
+
+ * htcommon/defaults.cc: Add .bin, .tgz, .rpm, .mov, .mpg, .avi to
+ bad_extensions.
+
+ * htdoc/attrs.html: Update documentation on default.
+
+ * installdir/rundig: Removed check for age of synonym and endings
+ DB. Nice feature, but it broke under too many shells.
+
+ * htlib/DB2_db.cc: Change allocation of database cursors to match
+ API in new version.
+
+ * htdig/Retriever.cc(got_word): Skip changing to lowercase, we do
+ it in WordList::Word.
+
+Wed Jan 6 14:49:47 1999 Gilles Detillieux <[email protected]>
+
+ * htdoc/attrs.html: Added four new attributes, fixed defaults & typos.
+
+ * htdoc/cf_byname.html: Added four new attributes.
+
+ * htdoc/cf_byprog.html: Added four new attributes.
+
+Wed Jan 6 14:37:06 1999 Geoff Hutchison <[email protected]>
+
+ * configure.in: Changed to require Autoconf 2.13 to eliminate bugs
+ obeserved by users with older autoconf versions.
+
+ * configure: Regenerated using Autoconf 2.13.
+
+Wed Jan 6 13:08:26 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.cc: Applied fix from Dave Alden
+ <[email protected]> to compile under SunPRO compilers
+ by eliminating trailing comma in enum.
+
+Wed Jan 6 17:50:55 1999 Alexander Bergolth <[email protected]>
+
+ * {.,htcommon,htdig,htfuzzy,htlib,htmerge,htnotify,htsearch}/
+ Makefile.in, Makefile.config.in: fixed relative path problem if
+ install-sh is used.
+
+Wed Jan 6 17:12:04 1999 Alexander Bergolth <[email protected]>
+
+ * htlib/StringList.cc: fixed bug in StringList::Join (oops!)
+
+Wed Jan 6 10:34:45 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.cc(AddDescription): Remove delete
+ instruction that fouls up everything (it was removing descriptions
+ as we add them!).
+
+Wed Jan 6 14:52:11 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htlib/String.cc (allocate_space): Add missing [] to delete.
+
+Wed Jan 6 05:53:02 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htcommon/DocumentRef.cc(AddDescription): Do not add non-word
+ characters to the wordlist.
+
+Wed Jan 6 00:28:19 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdoc/cf_byname.html: Fixed html syntax "<br" and "/a>".
+
+Tue Jan 5 22:40:58 1999 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Check if we need to do backlink and date
+ factoring (e.g. we don't if they're zero!), from a patch by Gilles.
+
+Tue Jan 5 20:57:02 1999 Alexander Bergolth <[email protected]>
+
+ * configure.in, htlib/Connection.cc: Check for strings.h for those
+ platforms that don't have it.
+
+Tue Jan 5 14:24:52 1999 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.h: Added comments on the members (fields)
+ of DocumentRef objects.
+
+ * htcommon/defaults.cc: Added new option max_descriptions for
+ limit on the number of descriptions to store (default 5, matches
+ behavior pre 3.1.0b3).
+
+ * htcommon/DocumentRef.cc: Support restriction of max_descriptions.
+
+ * .version: Bump to 3.1.0b5dev.
+
+Tue Jan 5 20:07:05 1999 Alexander Bergolth <[email protected]>
+
+ * htdig/Retriever.cc: fixed bug in bad_querystring detection
+
+Sat Jan 2 16:39:34 1999 Alexander Bergolth <[email protected]>
+
+ * htdig/main.cc, htlib/Configuration.cc: Added warning message if
+ the locale selection was not successful. (e.g. because the locale
+ definition is not installed) config["locale"] is now set to the
+ return string of setlocale.
+
+ * {.,htcommon,htdig,htfuzzy,htlib,htmerge,htnotify,htsearch}/
+ Makefile.in, Makefile.config.in, configure.in: Changed to allow
+ compiling in seperate build directories.
+
+Fri Jan 1 05:49:19 1999 Hans-Peter Nilsson <[email protected]>
+
+ * htdoc/attrs.html: Describe more thoroughly how "pdf_parser"
+ is used.
+
+ * htdoc/attrs.html: Fix typo for anchor/attribute
+ "allow_virtual_hosts".
+
+ * htdoc/attrs.html: Correct and add more verbose description of
+ external parser program parameters and fields.
+
+Sun Dec 27 14:52:45 1998 Alexander Bergolth <[email protected]>
+
+ * htlib/URL.cc: Small change in URL::removeIndex so that URLs are not
+ stripped if a query string ends with /index.html
+
+ * htsearch/Display.cc, htnotify/htnotify.cc: Added patches from
+ Gilles Detillieux <[email protected]> to fix memory leaks.
+
+Sat Dec 19 17:53:44 1998 Alexander Bergolth <[email protected]>
+
+ * htdig/main.cc, htdig/htdig.h, htdig/Retriever.cc: Added new option
+ bad_querystr. Allows exclusion when digging CGI-Scripts.
+
+ * htsearch/htsearch.cc, htsearch/Display.cc: Added new option
+ allow_in_form. Does currently not work with some special variable
+ names!
+
+ * htcommon/defaults.cc: Added the two new options.
+
+Sat Dec 19 11:21:38 1998 Geoff Hutchison <[email protected]>
+
+ * contrib/htparsedoc/parse_word_doc.pl: Update from Jesse.
+
+ * .version: Bump for 3.1.0b4.
+
+ * README: Ditto.
+
+ * Makefile.in: Remove references to version number.
+
+ * htnotify/htnotify.cc: Fix nasty security hole found by Werner
+
+Sat Dec 19 15:22:38 1998 Alexander Bergolth <[email protected]>
+
+ * htlib/StringList.cc, htlib/StringList.h: Added StringList::Join
+ to simplify the creation of patterns for StringMatch.
+
+ * htlib/String.cc: lastIndexOf(char ch) added
+
+ * htlib/URL.cc: Changed URL::removeIndex to use local_default_doc.
+ (index.html was hardcoded) local_default_doc can be a list.
+
+ * htdig/main.cc, htlib/URL.cc: Use StringList::Join.
+
+Sun Dec 13 23:06:35 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Fix potential coredump when calculating
+ date_factor and backlink_factor on docs that aren't in the
+ database.
+
+Sat Dec 12 23:17:56 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/cf_byname.html, htdoc/cf_byprog.html, htdoc/attrs.html:
+ Added docs for new options since version 3.1.0b2.
+
+ * htdoc/RELEASE.html: Added notes on changes since 3.1.0b2 (we
+ should keep this up rather than all-at-once).
+
+ * htdoc/hts_templates: Include documentation on using CGI
+ environment variables in templates with this version.
+
+ * htdig/Retriever.cc(got_href): Added check to prevent
+ currenthopcount from becoming -1.
+
+ * htcommon/WordList.cc: Change undefined minimumWordLength to
+ config("minimum_word_length").
+
+Sat Dec 12 12:01:55 1998 Geoff Hutchison <[email protected]>
+
+ * Makefile.in, Makefile.config.in, */Makefile.in: Added target
+ mostlyclean to clean up, but leave compile-intensive targets
+ (e.g. db, rx code). General cleanup too.
+
+ * htdoc/where.html: Updated for eventual 3.1.0b3 release.
+
+ * htcommon/WordList.cc: Added additional cleanups for the words in
+ the bad word file, in case they have invalid punctuation, etc.
+
+Sat Dec 12 18:41:29 1998 Alexander Bergolth <[email protected]>
+
+ * htmerge/words.cc: Fix last update so that it compiles on AIX.
+
+Fri Dec 11 10:40:48 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc: Added additional debugging info on the
+ reason for excluding a URL, based on a patch by Benoit Majeau
+
+ * htmerge/words.cc: Fixed a bug where pointer, rather than strings
+ were assigned. Silly references...
+
+ * htsearch/Display.cc, htsearch/Display.h: Added patch from Gilles
+ to allow CGI environment variables in templates.
+
+ * htdig/HTML.cc: Fix core dump when META refresh tags don't have
+ content portions.
+
+Thu Dec 10 22:28:44 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc, htdig/Server.cc, htdig/Server.h:
+ Changed support for server_wait_time to use delay() method in
+ Server. Delay is from beginning of last connection to this
+ one. Currently this also delays local digging, which may not be ideal.
+
+ * htcommon/defaults.cc: Added option for server_max_docs as a
+ limit on the number of docs returned from a server.
+
+ * contrib/htparsedoc/parse_word_doc.pl: New version from
+ Jesse. New code speedups and better matching of punctuation.
+
+ * htdig/Document.cc: Check http_proxy_exclude to see if it's
+ empty. If so, use the proxy.
+
+Mon Dec 7 21:46:34 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc: Fix thinko with multiple excludes and
+ restricts. Pointed out by Gilles.
+
+ * htcommon/defaults.cc: Add new option server_wait_time for the
+ number of seconds to wait between requests.
+
+ * htdig/Retriever.cc: Use server_wait_time to call sleep() before
+ requests. Should help prevent server abuse. :-)
+
+ * htcommon/WordList.cc(valid_word): Remove unnecessary code.
+
+ * htcommon/DocumentRef.cc: Fix typo that added description text
+ that contained punctuation or was too short.
+
+Sun Dec 6 13:12:55 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/parser.cc: Check for empty boolean searches and report
+ an error. Fixes bug reported by Chuck O'Donnell <[email protected]>.
+
+ * install-sh, mkinstalldirs: Import latest version from autoconf.
+
+ * htcommon/DocumentRef.cc: Add the text of descriptions to the
+ word database with weight description_factor.
+
+ * htcommon/WordList.cc: Ensure duplicate words have minimum
+ location and anchor attributes.
+
+ * htcommon/WordRecord.h: Ensure blank WordRecords have a default
+ count of 1 since a word has to exist to have a WordRecord!
+
+ * htdig/ExternalParser.cc, htdig/PDF.cc, htfuzzy/EndingsDB.cc:
+ Ensure temporary files are placed in TMPDIR if it's set.
+
+ * htdig/Retriever.cc: Don't add the text of descriptions to the
+ word db here, it's better to do it in the DocumentRef itself.
+
+ * htmerge/words.cc: Check for word entries that are essentially
+ duplicates and compact them.
+
+Sat Dec 5 01:10:46 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/THANKS.html: Updated for recent submissions.
+
+ * htdoc/FAQ.html: Cleaned up title.
+
+ * htdoc/uses.html: Added more sites and cleaned up the HTML.
+
+Fri Dec 4 20:15:41 1998 Geoff Hutchison <[email protected]>
+
+ * db/os/os_fsync.c, db/mutex/mutex.c: Patch from Klaus Mueller
+ <[email protected]> to compile under CygWinB20.
+
+ * htdig/HTML.cc: Fix mistake in last update--file was included
+ twice.
+
+ * htdig/Retriever.cc: Do a check for blank URLs before adding them
+ to the list to be retrieved.
+
+Fri Dec 4 19:21:17 1998 Didier Gautheron <[email protected]>
+
+ * htdig/HTML.cc: Fix parser bug with &lt; becoming a tag.
+
+ * htlib/Dictionary.cc: Added check for empty dictionaries.
+
+ * htlib/URL.cc: Allow server_aliases to work under virtual hosts.
+
+ * htmerge/htmerge.cc: Remove previous db.words.db file before
+ doing a word merging. Fixes bug with deleted documents keeping
+ entries.
+
+ * htdig/main.cc, htdig/Retriever.h, htdig/Retriever.cc: Added
+ parameter to Initial function to prevent URLs from being checked
+ twice during an update dig.
+
+ * htcommon/WordList.cc, htmerge/words.cc: Don't store c:1 and a:0
+ entries in db.wordlist to save space.
+
+Fri Dec 4 19:08:28 1998 Geoff Hutchison <[email protected]>
+
+ * configure.in, Makefile.in, Makefile.config.in: Remove DB_DIR and
+ RX_DIR.
+
+ * configure: Regenerated for configure.in changes.
+
+ * htsearch/htsearch.cc: Added usage message for the command line.
+
+Fri Dec 4 18:52:55 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html: Added question about phrase matching.
+
+Fri Dec 4 21:21:00 1998 Alexander Bergolth <[email protected]>
+
+ * configure.in: Check if the third argument of getpeername is a
+ size_t* or an unsigned int*.
+
+ * include/htconfig.h.in: Define GETPEERNAME_LENGTH_T.
+
+ * htlib/Connection.cc: Use GETPEERNAME_LENGTH_T as the type of the
+ third getpeername argument. Included strings.h which is needed for
+ FD_ZERO on AIX.
+
+Thu Dec 3 23:03:15 1998 Geoff Hutchison <[email protected]>
+
+ * configure.in: Check for getopt.h for those platforms that don't
+ have it. Fix checks for db and rx dirs since these names won't
+ change.
+
+ * include/htconfig.h.in: Define HAVE_GETOPT_H.
+
+ * configure: Generate from configure.in with latest autoconf
+ (2.12.2).
+
+ * htdig/Plaintext.cc: Removed compiler warnings.
+
+ * htdig/main.cc, htfuzzy/htfuzzy.cc, htmerge/htmerge.cc,
+ htnotify/htnotify.cc, htsearch/htsearch.cc: Use configure check to
+ only include getopt.h when it exists.
+
+ * htcommon/defaults.cc: Add new option http_proxy_exclude for
+ servers that shouldn't use the proxy, from a patch by Gilles
+ Detillieux.
+
+ * htdig/Document.h, htdig/Document.cc: Use it, from a patch by Gilles.
+
+Tue Dec 1 21:36:37 1998 Geoff Hutchison <[email protected]>
+
+ * Makefile.in: Fixed bug with "make depend," noted by Morgan Davis
+
+ * htdig/main.cc, htfuzzy/htfuzzy.cc, htmerge/htmerge.cc,
+ htnotify/htnotify.cc, htsearch/htsearch.cc: Add include <getopt.h>
+ to help compiling under Win32 with CygWinB20.
+
+ * htdig/Retriever.cc: Update hopcount correctly by taking the
+ shortest paths to documents.
+
+ * htlib/DB2_db.cc: Added fix from Alexander Bergolth for Berkeley
+ DB under AIX.
+
+ * htlib/StringMatch.cc: Added fix from Christian Schneider
+ <[email protected]>, discovered from behavior with limit_urls_to.
+
+Tue Dec 1 18:06:33 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/hts_form.html: Explained why config fields reject periods.
+
+ * htdoc/FAQ.html: Added information about Internal Server Errors.
+
+ * htdoc/uses.html: Updated with more sites, change e-mail to Geoff.
+
+Sun Nov 29 21:26:56 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc: Fix last update so it compiles (oops!).
+
+ * htdig/Document.cc: As above!
+
+Sun Nov 29 20:06:58 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/htsearch.cc: Improved support for multiple restrict and
+ exclude patterns, based on code from Gilles Detillieux
+ and William Rhee <[email protected]>.
+
+ * htdig/Document.cc, htdig/PDF.cc: Fixed problems under FreeBSD
+ where <sys/types.h> needed to be before <sys/stat.h>, noted by
+ Gilles.
+
+ * htdig/Server.cc: Fixed bug with robots.txt files containing
+ tabs, based on patch from Christian Schneider <[email protected]>.
+
+ * htdig/Document.cc: Fixed core dumps caused by mystrptime
+ returning NULL. Instead, we'll use the current timestamp. Noted by
+ Michael Hauber <[email protected]> and
+
+Fri Nov 27 19:09:33 1998 Geoff Hutchison <[email protected]>
+
+ * db/*: Import of Sleepycat's Berkely DB 2.5.9
+
+ * rx/*: Import of FSF rx 1.5
+
+ * configure, configure.in: Updated to deal with changes in db, rx
+ directories.
+
+ * Attic/db-2.4.14.tar.gz: Removed old db package for update.
+
+ * htsearch/parser.cc: Removed bogus code with "%01" -> "|"
+
+ * htlib/URL.cc: Considers URLs with "%7E" to be equivalent to "~"
+
+ * htlib/String.cc: Changed MinimumAllocationSize to cut down on
+ memory usage on small strings.
+
+ * htdig/Retriever.h, htdig/Retriever.cc, htdig/HTML.cc: Changed
+ Retriever::got_word to check for small words, valid_punctuation to
+ remove bugs in HTML.cc.
+
+ * htcommon/defaults.cc: Changed backlink_factor to 1000,
+ description_factor to 150, match_method to and, and
+ meta_description factor to 50. Should produce more accurate search
+ results.
+
+ * htcommon/WordList.cc: Fixed bug with bad_words and
+ MAX_WORD_LENGTH, noted by Jeff Breidenbach <[email protected]>.
+
+ * README: Updated to reflect bug-tracking system.
+
+Tue Nov 24 15:57:28 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc: Added patch to use local_default doc with
+ local_user_urls from Gilles Detillieux
+
+Mon Nov 23 18:57:16 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/RELEASE.html, htdoc/bugs.html, htdoc/contents.html,
+ htdoc/where.html: Updated for new bug reporting system.
+
+ * htdoc/TODO.html: Updated To Do w/ current status.
+
+Sun Nov 22 14:03:06 1998 Geoff Hutchison <[email protected]>
+
+ * installdir/rundig: Added checks for synonym databases older than
+ the synonym files.
+
+ * htcommon/defaults.cc: New config options "description_factor"
+ for weighting words added as link descriptions, and
+ "no_excerpt_show_top" to show the top of an excerpt instead of the
+ "no_excerpt_text".
+
+ * htdig/Retriever.cc: Use "description_factor" to weight link
+ descriptions with the documents at the end of the link.
+
+ * htsearch/Display.cc: Adjust date_factor and backlink_factor
+ rankings to produce better results.
+
+ * htsearch/Display.cc: Use "no_excerpt_show_top."
+
+ * htsearch/htsearch.cc: Don't remove boolean operators from
+ boolean search strings!
+
+Thu Nov 19 01:31:37 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html: Update for -ldb problem on Digital UNIX.
+
+Wed Nov 18 05:14:53 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/FAQ.html: Update FAQ w/ new questions, better responses.
+
+ * htdoc/mailing.html: Mention additional archive at
+ www.mail-archive.com.
+
+ * htdoc/require.html: Update requirements (libstc++ instead of libg++).
+
+Tue Nov 17 23:13:04 1998 Geoff Hutchison <[email protected]>
+
+ * contrib/wordfreq/wordfreq.pl: Added changes by Isoif.
+
+ * htsearch/Display.cc: Added HTTP_REFERER to htsearch logging
+
+ * htdig/Document.cc: Fixed memory leak as a result of thinko.
+
+ * htcommon/DocumentRef.cc: Removed limit on number of link
+ descriptions.
+
+Mon Nov 16 22:30:07 1998 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Declare new config options backlink_factor
+ and date_factor for counting document backlink counts and modifed
+ dates in rankings.
+
+ * htsearch/Display.cc: Use above factors.
+
+ * htsearch/ResultMatch.cc: Clarify getScore() comments.
+
+ * htlib/mktime.c: Import new version.
+
+ * installdir/htdig.conf: Add max_doc_size example (to help w/FAQ).
+
+Mon Nov 16 10:46:15 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/ExternalParser.cc: Add checks for null tokens, adapted
+ from patch by Vadim Checkan.
+
+ * htdig/Retriever.cc: Count docBackLinks accurately (previously
+ all docs had count of 2!).
+
+Sun Nov 15 17:04:34 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc(do_tag): Fix for refresh tags w/o URLs.
+
+ * htmerge/docs.cc, htmerge/words.cc: Change \r to \n, as mentioned
+ by Andrew Bishop.
+
+ * htcommon/DocumentRef.h, htcommon/DocumentRef.cc: Define new fields
+ docBackLinks (backlink count) and docSig (document signature).
+
+ * htdig/Retriever.cc: Keep track of docBackLinks.
+
+ * htsearch/Display.cc: Add variable BACKLINKS to display the count.
+
+Sat Nov 14 20:30:18 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc(parse, do_tag): Ensure links respect META robot
+ settings. Patch contributed by Michael Spann
+
+ * htdig/HTML.cc(do_tag): Eliminate bug that ignores "?" in URLs
+
+ * htdig/HTML.cc(do_tag): Add support for META refresh tags as
+ "redirects", submitted by Aidas Kasparas
+
+Thu Nov 12 04:13:26 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/contents.html: Added link to jitterbug bug db.
+
+Sun Nov 8 21:10:19 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/ChangeLog, htdoc/RELEASE.html, htdoc/THANKS.html:
+ Correct spelling error with Rene' Seindal's name.
+
+ * htdoc/hts_templates.html: Update to improve clarity.
+
+Sun Nov 8 20:33:22 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Document.cc: Changed reset to keep proxy settings--fixes
+ bug noted by Didier Gautheron <[email protected]>
+
+Fri Nov 6 17:07:00 1998 Geoff Hutchison <[email protected]>
+
+ * contrib/wordfreq/wordfreq.pl: Updated with patch from Isoif
+ Fettich <[email protected]> to use Berkeley DB.
+
+ * contrib/whatsnew/whatsnew.pl: Fixed mistake from Oct 26 change.
+
+ * contrib/htparsedoc/parse_word_doc.pl: Added file contributed by
+ Jesse.
+
+ * contrib/README: Updated to include short descriptions of the scripts.
+
+ * contrib/multidig/*: New scripts to make working with multiple DB
+ a little easier.
+
+ * configure, configure.in: Added changes to support snapshots.
+
+ * .version: Resurrected to automate snapshot versions.
+
+Wed Nov 4 20:13:10 1998 Geoff Hutchison <[email protected]>
+
+ * htdoc/contents.html: Added "Contributors" for THANKS.html
+
+ * htdoc/THANKS.html: Added acknowledgement to contributors.
+
+Wed Nov 4 15:02:43 1998 Geoff Hutchison <[email protected]>
+
+ * htnotify/htnotify.cc: Fixed buglet with -F flag to sendmail.
+
+ * htdig/Plaintext.cc: Added patch from Vadim Chekan to change char
+ to unsigned char to fix reading Cyrillic plaintext files.
+
+Mon Nov 2 15:34:53 1998 Geoff Hutchison <[email protected]>
+
+ * htnotify/htnotify.cc, Makefile.config.in, README:
+ Changed "HTDig" to "ht://Dig."
+
+Sun Nov 1 20:34:14 1998 Geoff Hutchison <[email protected]>
+
+ * Makefile.in: Fixed buglet with dist target.
+
+ * htdig/Makefile.in: Fixed buglet with distclean target.
+
+ * htdoc/FAQ.html, htdoc/RELEASE.html, htdoc/attrs.html
+ htdoc/cf_byname.html, htdoc/cf_byprog.html, htdoc/htdig.html
+ htdoc/hts_templates.html: Updated documentation for new features,
+ bug-fixes in ht://Dig 3.1.0b2.
+
+ * htlib/Makefile.in, htlib/lib.h: Call mytimegm.cc instead of timegm.c.
+
+ * Attic/makedp: Remove file generated by configure
+
+ * htdig/Document.cc: Remove const from *ext to fix compiler warning.
+
+Sun Nov 1 00:17:08 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Added template var DESCRIPTION as first
+ item in DESCRIPTIONS, as requested by Ryan Scott
+
+ * htlib/mytimegm.cc: Resurrected mytimegm() until problems with
+ glibc version can be solved.
+
+ * htdig/Document.cc, htdig/Retriever.cc, htfuzzy/Prefix.cc,
+ htsearch/WeightWord.cc, htsearch/htsearch.cc: Replaced system
+ calls with htlib/my* functions.
+
+Sat Oct 31 23:58:22 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/URL.cc: Fixed compiler warning.
+
+ * rx-1.5/Attic/Makefile, rx-1.5/Attic/config.log:
+ Removed useless Makefile and config.log file.
+
+Tue Oct 27 22:53:03 1998 Andrew Scherpbier <[email protected]>
+
+ * */Makefile.in (depend): Fixed so that 'make depend' works
+ again. (Not sure exactly how long it was broken!)
+
+Tue Oct 27 20:00:16 1998 Geoff Hutchison <[email protected]>
+
+ * Makefile.in: Fix buglet with distclean target
+
+ * configure configure.in: Added check for LOCALTIME_R, removed
+ test for timegm replacement, changed compiler for most tests to
+ $CC.
+
+ * include/htconfig.in: Added option for LOCALTIME_R.
+
+ * htlib/timegm.c, htlib/mktime.c: Fixed some compilation problems.
+
+ * htlib/Makefile.in: Remove mktime.o since source is included in
+ timegm.o.
+
+Tue Oct 27 13:31:25 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/mktime.c: Imported new version from glibc-2.0.99.
+
+ * htcommon/DocumentDB.cc: Fixed bug noted by Vadim Chekan with
+ CreateSearchDB.
+
+Mon Oct 26 15:27:28 1998 Geoff Hutchison <[email protected]>
+
+ * Makefile.config.in, configure.in, configure: Fixed problem with
+ -ldb, -lrx, etc. not being declared in $LIBS
+
+ * htdoc/install.html: Added remarks about using ./configure
+ --prefix=
+
+ * README: Cleaned up for new URLs, version numbers, etc.
+
+ * htsearch/htsearch.cc: Added patch by Esa Ahola fixing bug with
+ not ingoring bad_words properly.
+
+ * contrib/whatsnew/whatsnew.pl: Added fix from Jacques Reynes
+ <[email protected]> to get whatsnew to work with Berkeley DB.
+
+ * htdig/Retriever.cc, htdig/Document.cc: Fixed bug introduced by
+ Oct 18 change. Authorization will not be cleared.
+
+ * htlib/URL.cc: Fixed new -Wall warnings.
+
+Wed Oct 21 13:30:05 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/timegm.c: Corrected Oct 17 change. Should now work. :-)
+
+ * htcommon/defaults.cc: Added defaults for new directives
+ server_aliases and limit_normalized.
+
+ * htdig/HTML.cc: Cleaned up HTML parsing based on patch by Rene'
+ Seindal.
+
+Wed Oct 21 18:31:00 1998 Alexander Bergolth <[email protected]>
+
+ * htlib/URL.cc, htlib/URL.h: Added patch to support translation of
+ server names. (Configuration directive: server_aliases)
+
+ * htdig/Retriever.cc, htdig/htdig.h, htdig/main.cc:
+ Additional limiting after normalization of the URL.
+ (Configuration directive: limit_normalized)
+
+Sun Oct 18 17:19:51 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/Connection.h, htlib/Connection.cc: Define new function
+ timeout() as adapted from a patch by Rene' Seindal.
+
+ * htdig/Document.cc: Use it as adapted from a patch by Rene' Seindal.
+
+Sun Oct 18 16:33:58 1998 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentDB.cc: Changed deserialize function to
+ explicitly delete DocumentRef.
+
+ * htcommon/DocumentRef.cc: Added trap for DOC_STRING value.
+
+ * htdig/Retriever.cc: Delete and reallocate Document variable
+ before retrieving. (Fixes database corruption bug) Removed code to
+ add a "/" to every URL with a 404--servers should send a redirect
+ in this case.
+
+Sat Oct 17 20:15:44 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/timegm.c: Declare __gmtime_r if not defined
+
+Sat Oct 17 10:15:57 1998 Geoff Hutchison <[email protected]>
+
+ * configure.in: Fixed problem with configuring DB_DIR introduced
+ by Oct 11 change.
+
+ * configure: Regenerated by autoconf for above fix.
+
+ * htlib/Connection.h, htlib/Connection.cc: Included fixes sent by
+ Paul J. Meyer <[email protected]> to fix connections on
+ Dec Alpha environments.
+
+ * htsearch/Display.cc, htsearch/Display.h,
+ htdoc/hts_templates.html: Added variable CURRENT as the number of
+ the current match, adapted from a patch by Rene' Seindal
+
+ * htcommon/defaults.cc: Changed htdig.sdsu.edu to www.htdig.org in
+ start_urls
+
+Wed Oct 14 03:43:22 1998 turtle <turtle@kiwi>
+
+ * installdir/htdig.conf: fixed broken link pointed out by
+ [email protected], moved maintainer stuff up in the file
+
+Sun Oct 11 22:16:27 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/DB2_db.cc: Added fix suggested by Domotor Akos
+ <[email protected]> with (char *)NULL cast.
+
+ * htlib/Attic/mytimegm.cc: Removed old mytimegm function.
+
+ * installdir/syntax.html: Improved boolean method error
+ message. It now gives examples of boolean expressions.
+
+ * htcommon/defaults.cc, htsearch/Display.cc, htsearch/Display.h,
+ htsearch/parser.cc: Added htsearch logging patch from Alexander
+ Bergolth.
+
+ * */Makefile.in, include/htconfig.h.in, htdig/Document.cc,
+ htdig/Images.cc, Attic/.version, Makefile.config.in, Makefile.in,
+ configure, configure.in, mkinstalldirs: Updated Makefiles and
+ configure variables.
+
+ * htfuzzy/Endings.cc, htfuzzy/Fuzzy.cc, htfuzzy/Prefix.cc,
+ htfuzzy/htfuzzy.cc, htlib/DB2_db.cc, htcommon/DocumentDB.cc:
+ Removed more -Wall warnings.
+
+Fri Oct 9 00:29:18 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc: Fixed typo with "meta_desription_factor".
+
+ * htdig/Images.cc: Use user_agent config in GET request.
+
+Thu Oct 8 09:05:41 1998 Geoff Hutchison <[email protected]>
+
+ * installdir/syntax.html: Improved Boolean search description.
+
+Mon Oct 5 11:30:16 1998 Geoff Hutchison <[email protected]>
+
+ * contrib/ewswrap/ewswrap.cgi, contrib/ewswrap/htwrap.cgi,
+ contrib/ewswrap/README: New scripts, contributed by John Grohol
+
+Fri Oct 2 13:11:24 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc: Added check for docs removed with
+ noindex. Now words in these docs should be ignored for the word
+ db.
+
+Fri Oct 2 13:09:04 1998 Geoff Hutchison <[email protected]>
+
+ * CONFIG Makefile.config.in Makefile.in */Makefile.in,
+ htcommon/defaults.cc htdig/main.cc, htfuzzy/htfuzzy.cc,
+ htmerge/htmerge.cc, htnotify/htnotify.cc include/htconfig.h.in:
+ More configure improvements--use top_srcdir instead of
+ HTDIG_TOP, use PACKAGE, VERSION, etc.
+
+Fri Oct 2 11:32:59 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/StringList.cc: Added patch by Alexander Bergolth for bug
+ with multiple delimeter characters
+
+Fri Oct 2 15:22:06 1998 Geoff Hutchison <[email protected]>
+
+ * installdir/rundig, configure.in, CONFIG, CONFIG.in, aclocal.m4,
+ configure: Improvements in configure.in, notably using --prefix=
+ and --exec-prefix=
+
+Tue Sep 29 19:26:11 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc: Added patch from Tim Frost <[email protected]> for
+ single quotes around URLs.
+
+ * htfuzzy/Prefix.cc: Added patch from Esa to fix Prefix matching
+ for capitalization.
+
+ * htcommon/defaults.cc: Added modification_time_is_now config
+
+ * htdig/Document.cc:, htdig/Retriever.cc: Added patch from Andrew
+ Bishop <[email protected]> for above to use modification
+ times when servers do not supply them.
+
+ * htsearch/htsearch.cc: Added patch from Andrew Bishop for -c switch.
+
+Wed Sep 23 14:46:34 1998 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc, htdig/Server.cc: Added case_sensitive
+ attribute to work on case insensitive servers.
+
+Wed Sep 23 11:58:22 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: re-fixed bug noted by Alexander Bergolth
+
+ * htlib/Attic/timegm.cc, htlib/Makefile.in, htlib/mktime.c,
+ htlib/mytimegm.cc, htlib/timegm.c: Switched to using glibc timegm
+ replacement.
+
+ * configure, configure.in, Makefile.config.in: Add configure
+ searches for acroread and sendmail programs.
+
+ * htnotify/Makefile.in, htnotify/htnotify.cc,
+ htcommon/Makefile.in, htcommon/defaults.cc: Use them.
+
+ * htdig/HTML.cc: Fix thinko in META robots tag.
+
+ * htcommon/defaults.cc: Define iso_8601 date formatting option
+
+ * htsearch/Display.cc, htnotify/htnotify.cc: Use it as suggested
+ by Knut A. Syed <[email protected]>
+
+Fri Sep 18 14:35:02 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Fixed bug noted by Alexander Bergolth
+ <[email protected]> in exclude logic
+
+ * htdig/HTML.cc: Fixed bug in comma-separated keywords noted by
+
+ * installdir/synonyms: New version contributed by John Banbury
+
+Fri Sep 18 00:38:09 1998 Geoff Hutchison <[email protected]>
+
+ * .version: Bump to 3.1.0b2
+
+ * htsearch/Makefile.in, htdig/Makefile.in, htfuzzy/Makefile.in,
+ htlib/Makefile.in, htmerge/Makefile.in,
+ htnotify/Makefile.in, htcommon/Makefile.in: Remove include
+ .sniffdir directive.
+
+ * htdig/HTML.cc: Fix horrible META description coding.
+
+ * htfuzzy/EndingsDB.cc, htfuzzy/Fuzzy.cc htfuzzy/Synonym.cc,
+ htfuzzy/htfuzzy.cc: Change "\r" to "\n" in statistics on
+ suggestion of Andrew M. Bishop <[email protected]>
+
+ * Makefile.config.in: Remove -ggdb from LDFLAGS.
+
+Tue Sep 15 22:31:48 1998 turtle <turtle@kiwi>
+
+ * Makefile.in: add substitution for @DATABASE_DIR@
+
+Thu Sep 10 00:06:58 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/HTML.cc: Change debug level of META tags.
+
+ * htsearch/TemplateList.cc, htsearch/htsearch.cc, htsearch/Display.cc,
+ htsearch/Display.h: Backed out builtin-long default from Monday, now
+ use error handler
+
+Mon Sep 7 23:19:12 1998 Geoff Hutchison <[email protected]>
+
+ * contrib/htparsedoc: Added contributed external parser for MS
+ Word documents by Richard Jones <[email protected]>.
+
+ * htdig/Document.cc: Added fix to use htparsedoc.
+
+ * htdoc/*.html: Merged in new documentation for htdig-3.1.0b1.
+
+ * htdig/HTML.cc: Extended "noindex" behavior in previous patch.
+
+ * htcommon/defaults.cc: Added user_agent config option.
+
+ * htdig/Document.cc: Use it.
+
+Mon Sep 7 00:34:19 1998 Geoff Hutchison <[email protected]>
+
+ * htcommon/DocumentRef.h: Added DocState for documents marked as
+ "noindex".
+
+ * htdig/HTML.cc, htdig/Retriever.h, htdig/Retriever.cc,
+ htmerge/docs.cc: Use it to remove them.
+
+ * htsearch/TemplateList.cc: Add default template of builtin-long
+ to slot 0 in case of an error.
+
+ * htsearch/Display.cc: Use it.
+
+Sun Sep 6 21:36:16 1998 Geoff Hutchison <[email protected]>
+
+ * htcommon/defaults.cc: Sorted the current list of defaults, added
+ "pdf_parser" for the program to use in PDF.cc.
+
+ * htdig/PDF.cc: Use it, checking for the file before calling
+ system to fail gracefully.
+
+ * htlib/URL.cc: Bug fix for http:/ v. http://
+
+Sat Sep 5 23:11:48 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/String.cc: Added patch by Zvi Har'El
+ <[email protected]> to indexOf function to prevent "false
+ positive" matches.
+
+ * installdir/nomatch.html, installdir/syntax.html: Fixed reference
+ to ht://Dig 3.0.
+
+ * htdig/Document.cc: Use robotstxt_name as user-agent as a more
+ consistent approach.
+
+ * htsearch/parser.cc: Convert "%01" to "|" to support <SELECT
+ ... MULTIPLE> tags.
+
+Thu Sep 3 20:53:51 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Makefile.in: Remove reference to -lgdbm
+
+ * htsearch/Display.cc: Send Content-type header after all variable
+ expansion is completed.
+
+ * htcommon/WordList.cc: Removed warning under egcs-1.1
+
+Tue Aug 11 08:58:34 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc, htdig/Retriever.h,
+ htdig/Retriever.cc, htdig/Parsable.h, htdig/Parsable.cc,
+ htdig/HTML.h, htdig/HTML.cc, htcommon/defaults.cc,
+ htcommon/DocumentRef.h, htcommon/DocumentRef.cc,
+ htcommon/DocumentDB.cc:
+ Second patch for META description tags. New field in DocDB for the
+ desc., space in word DB w/ proper factor.
+
+ * htmerge/docs.cc: Added statistic for total size of docs in DB.
+
+Thu Aug 6 10:15:22 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Retriever.cc: Added "local_dir_doc" config option,
+ the default filename in a directory.
+
+ * htcommon/defaults.cc: Fixed "elipses" spelling mistake,
+ local_dir_doc as above
+
+Tue Aug 4 11:34:46 1998 Geoff Hutchison <[email protected]>
+
+ * htlib/Configuration.cc: Added fix by Philippe Rochat
+ <[email protected]> to remove whitespace after config
+ options.
+
+ * htdig/HTML.cc, htdig/HTML.h: Added support for META robots tags.
+
+Mon Aug 3 16:50:46 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/ResultList.cc, htnotify/htnotify.cc,
+ htmerge/htmerge.cc, htmerge/docs.cc, htlib/String.cc,
+ htlib/ParsedString.cc, htfuzzy/Substring.cc,
+ htfuzzy/Prefix.cc, htfuzzy/Exact.cc,
+ htdig/SGMLEntities.cc, htdig/Retriever.cc, htdig/PDF.cc,
+ htdig/HTML.cc, htdig/Document.cc:
+ Fixed compiler warnings under -Wall
+
+Mon Aug 3 05:56:23 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Spelling correction for "ellipses"
+
+Thu Jul 23 12:14:34 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/PDF.cc, htdig/PDF.h, htdig/Document.cc: Added files (and
+ patch) from Sylvain Wallez for PDF parsing. Incorporates fix for
+ non-Adobe PDFs.
+
+ * htcommon/defaults.cc: Removed .pdf extension from bad_extensions.
+
+Wed Jul 22 10:04:31 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Added patch from Sylvain Wallez
+ <[email protected]> to use the filename if no title is found.
+
+ * htnotify/htnotify.cc: Added patch from Chris Jason
+ Richards <[email protected]> to fix problems with sendmail.
+
+Tue Jul 21 09:56:58 1998 Geoff Hutchison <[email protected]>
+
+ * htsearch/Display.cc: Added patch by Rob Stone
+ <[email protected]> to create new environment variables to
+ htsearch: SELECTED_FORMAT and SELECTED_METHOD.
+
+Sun Jul 19 09:51:47 1998 Andrew Scherpbier <[email protected]>
+
+ * configure.in (berkeley db stuff): Added the berkeley db .tar.gz
+ to the distribution and modified configure.in to extract it if it
+ needs to.
+
+Thu Jul 9 09:39:01 1998 Geoff Hutchison <[email protected]>
+
+ * htdig/Server.cc, htdig/Retriever.h, htdig/Retriever.cc,
+ htdig/Document.h, htdig/Document.cc, htcommon/defaults.cc: Added
+ support for local file digging using patches by Pasi Eronen
+ <[email protected]>. Patches include support for local user (~username)
+ digging.
+
+ * htdig/HTML.h, htdig/HTML.cc, htcommon/defaults.cc:
+ Added support for META name=description tags. Uses new config-file
+ option "use_meta_description" which is off by default.
+
+Mon Jun 22 05:02:01 1998 turtle <turtle@kiwi>
+
+ * configure.in:
+ Added test to make sure that the berkeley db library is present
+
+ * .cvsignore: Ignore the berkeley db library
+
+ * configure: changed
+
+ * Makefile.config.in: Removed GDBM references
+
+ * Makefile.in: Removed GDMB references
+
+ * .version: updated version to 3.1.0b1
+
+ * README: Updated version # and website location
+
+ * htdig/HTML.cc: Applied patch that prevented SGML entities that
+ translate to valid_punctuation characters from becoming part of
+ words
+
+ * configure.in: Removed references to GDBM
+
+ * htcommon/defaults.cc: Got rid of my email address as the default
+ maintainer
+
+ * htdig/htdig.conf: simple config file for development
+
+ * htlib/String.cc, htlib/Attic/SDSU.h, htlib/Attic/SDSU.cc,
+ htlib/DB2_db.cc, htlib/Connection.cc, htlib/Configuration.cc,
+ htlib/BTree.cc: New Berkeley database stuff
+
+ * htlib/.sniffdir/ofiles.incl: removed SDSU.*
+
+ * installdir/syntax.html, installdir/search.html,
+ installdir/rundig, installdir/nomatch.html, installdir/htdig.conf,
+ installdir/footer.html: Changed to use the new
+ http://www.htdig.org/ instead of the sdsu site
+
+Sun Jun 21 23:20:14 1998 turtle <turtle@kiwi>
+
+ * rx-1.5/rx/Attic/config.log, htsearch/htsearch.cc,
+ htsearch/Attic/display.cc, htsearch/Display.cc, htmerge/docs.cc,
+ htlib/.sniffdir/ofiles.incl, htlib/Database.h, htlib/DB2_db.cc,
+ htlib/DB2_db.h, htlib/Database.cc, htfuzzy/.sniffdir/ofiles.incl,
+ htfuzzy/Prefix.cc, htfuzzy/Prefix.h, htfuzzy/Makefile.in,
+ htfuzzy/Fuzzy.cc, htcommon/defaults.cc, configure.in, Makefile.in,
+ Makefile.config.in: patches by Esa and Jesse to add BerkeleyDB and
+ Prefix searching
+
+Mon Jun 15 18:15:50 1998 turtle <turtle@kiwi>
+
+ * htdig/HTML.cc: Added suggestion by Chris Liddiard to add ',' to
+ the list of separator characters for meta keyword parsing
+
+Tue May 26 03:58:14 1998 turtle <turtle@kiwi>
+
+ * rx-1.5/rx/Attic/config.log, htlib/htString.h, htlib/cgi.cc,
+ htlib/URL.cc, htlib/String.cc, htlib/ParsedString.cc,
+ htlib/Database.cc, htlib/Connection.cc: Got rid of compiler
+ warnings.
+
+ * rx-1.5/rx/.cvsignore: added config.log
+
+Fri Apr 3 17:10:44 1998 turtle <turtle@kiwi>
+
+ * htsearch/Display.cc: Patch to make excludes work
+
+Tue Mar 10 16:02:32 1998 turtle <turtle@kiwi>
+
+ * htlib/strcasecmp.cc: Applied patch by Bernhard Griener to add
+ arguments checks in the mystrncasecmp() function
+
+Sun Feb 22 17:43:49 1998 turtle <turtle@kiwi>
+
+ * htdoc/mailing.html: New mailing list archive location
+
+Tue Feb 17 18:05:40 1998 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added new one
+
+Thu Feb 12 22:22:15 1998 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added more sites
+
+Mon Jan 5 06:14:11 1998 turtle <turtle@kiwi>
+
+ * configure, configure.in: Added check for fstream.h to get rid of
+ the annoying emails about ht://Dig not compiling...
+
+ * Makefile.config.in: Added include of the GDBM library back
+
+ * .version: Now at version 3.0.9
+
+ * include/htconfig.h.in: Changed refs to time related stuff
+
+ * htmerge/htmerge.cc, htmerge/docs.cc: format changes
+
+ * htdig/Document.cc: Changed tm from pointer to real structure
+
+ * htlib/.sniffdir/ofiles.incl, htlib/timegm.cc: Our own timegm
+ function
+
+ * rx-1.5/rx/.cvsignore, rx-1.5/rx/Attic/Makefile: cvs cleanup
+
+ * htmerge/docs.cc: Fixed memory leak
+
+ * htlib/lib.h: Added own replacement of timegm()
+
+ * htlib/Dictionary.cc: Fixed memory leaks
+
+ * htlib/Connection.cc: Fix by Pontus Borg for AIX. Changed
+ 'size_t' to 'unsigned long' for the length parameter for
+ getpeername()
+
+ * htfuzzy/Metaphone.cc: formatting changes
+
+ * htdig/Retriever.cc: fixed memory leak
+
+ * htdig/Document.cc: * Alarm was not cancelled if readHeader
+ returned anything but OK * Use our own timegm() replacement if
+ necessary
+
+ * htcommon/DocumentRef.h, htcommon/DocumentRef.cc: format changes
+
+ * htcommon/DocumentDB.h: reformatting
+
+ * htcommon/DocumentDB.cc: Fixed major memory leak
+
+ * include/.cvsignore, include/Attic/htconfig.h, rx-1.5/.cvsignore,
+ rx-1.5/Attic/config.cache, rx-1.5/Attic/config.status,
+ rx-1.5/rx/.cvsignore, rx-1.5/rx/Attic/config.status,
+ htlib/Attic/htlib.proj, htmerge/.cvsignore,
+ htmerge/Attic/htmerge.proj, htnotify/.cvsignore,
+ htnotify/Attic/htnotify.proj, htsearch/.cvsignore,
+ htsearch/Attic/htsearch.proj, Attic/config.cache,
+ htcommon/Attic/htcommon.proj, htfuzzy/.cvsignore,
+ htfuzzy/Attic/htfuzzy.proj, lookfor: General cleanup of archived
+ stuff
+
+ * .cvsignore: config.cache added
+
+ * htdig/.cvsignore: Added htdig
+
+Tue Dec 16 15:57:22 1997 turtle <turtle@kiwi>
+
+ * htdig/Document.cc: Added little patch by Tobias Oetiker
+ <[email protected]> that should fix problems with timeouts.
+
+Thu Dec 11 00:28:59 1997 turtle <turtle@kiwi>
+
+ * htlib/URL.h, htlib/URL.cc: Added double slash removal code.
+ These were causing loops.
+
+Thu Oct 23 18:01:10 1997 turtle <turtle@kiwi>
+
+ * htlib/Connection.cc: Fix by Pontus Borg for AIX. Changed
+ 'size_t' to 'unsigned long' for the length parameter for
+ getpeername()
+
+Mon Oct 13 02:13:52 1997 turtle <turtle@kiwi>
+
+ * htdig/Attic/Makefile, htdig/Attic/htdig.proj: remove files that
+ shouldn't be in the repository
+
+ * htdig/.cvsignore: Ignore Makefile
+
+ * htdoc/cf_byname.html, htdoc/cf_byprog.html, htdoc/attrs.html,
+ htdoc/ChangeLog: Added documentation for the external_parsers
+ attribute.
+
+Mon Jul 14 15:32:22 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added cambridge
+
+Wed Jul 9 15:57:30 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added the rhodos project
+
+Mon Jul 7 22:15:45 1997 turtle <turtle@kiwi>
+
+ * htdig/Document.cc: Removed old getdate() code that replaced '-'
+ with ' '.
+
+ * htlib/URL.cc: Sequences of "/./" are now replaced with "/" to
+ reduce the chance of infinite loops
+
+ * htdig/Document.cc: Added better date parsing. Now also supports
+ the old RFC 850 format
+
+Thu Jul 3 17:44:39 1997 turtle <turtle@kiwi>
+
+ * htdoc/cf_byname.html, htdoc/cf_byprog.html,
+ htcommon/defaults.cc, htdig/htdig.h, htdoc/attrs.html,
+ htlib/Configuration.h, htlib/URL.cc, htdig/Attic/Makefile,
+ htdig/Document.cc: Added support for virtual hosts
+
+Mon Jun 30 17:07:49 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added Depaul university
+
+Tue Jun 24 14:59:45 1997 turtle <turtle@kiwi>
+
+ * Makefile.in: Fixed syntax error in the installation target.
+
+Mon Jun 23 17:33:14 1997 turtle <turtle@kiwi>
+
+ * htdig/Attic/teamball.conf, htdig/Attic/tsdsu.conf,
+ htdig/Attic/rohan.conf, htdig/Attic/sdsu.conf, htdig/Attic/t.conf,
+ htdig/Attic/nsdsu.conf, htdig/Attic/daztec.conf,
+ htdig/Attic/max.conf, htdig/htdig.conf, htdig/Attic/Makefile,
+ htdig/Attic/catalog.conf: Removed old config files
+
+ * htdoc/FAQ.html: FAQ initial
+
+ * htdoc/contents.html: Added link to the new FAQ
+
+ * htdoc/FAQ.html: *** empty log message ***
+
+ * htnotify/htnotify.cc: Added version info to the usage output
+
+ * htfuzzy/htfuzzy.cc: Added version info the usage output
+
+ * htmerge/htmerge.cc: Added version info to usage message
+
+ * htdig/main.cc: Added version info to the usage message
+
+Mon Jun 16 15:35:56 1997 turtle <turtle@kiwi>
+
+ * installdir/footer.html: Changed the hardcoded version number to
+ the new VERSION variable
+
+ * htdoc/hts_templates.html: Added docs for the VERSION and PERCENT
+ variables
+
+ * htsearch/Display.cc: Added PERCENT and VERSION variables for the
+ output templates
+
+Sat Jun 14 18:52:42 1997 turtle <turtle@kiwi>
+
+ * htdig/Document.cc: Made redirect detection code more general
+
+Fri Jun 13 05:31:17 1997 turtle <turtle@kiwi>
+
+ * htdoc/cf_general.html: Fixed typo
+
+Thu Jun 5 15:00:53 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added VG Gas Analysis Systems
+
+Tue Jun 3 17:49:05 1997 turtle <turtle@kiwi>
+
+ * installdir/english.0.original, installdir/english.0: Added new
+ english dictionary for the endings algorithm
+
+Thu May 29 14:56:40 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added Indiana University Computer Security
+ Office
+
+Wed May 28 14:47:25 1997 turtle <turtle@kiwi>
+
+ * htdoc/main.html: Fixed typo
+
+Mon May 19 15:23:18 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added daily californian online
+
+Tue May 13 19:28:32 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added The Reohr Group
+
+ * htdoc/uses.html: Added the Linux Documentation Project
+
+Sun May 11 17:52:05 1997 turtle <turtle@kiwi>
+
+ * htdoc/index.html: Made the contents frame a little wider so that
+ text doesn't wrap
+
+ * htdoc/uses.html: Added NOVA and Gajo & Associati
+
+Fri May 2 23:35:56 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added www.bajan.org
+
+Wed Apr 30 22:28:28 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: Added Caldera, Inc.
+
+Sun Apr 27 14:43:31 1997 turtle <turtle@kiwi>
+
+ * htsearch/parser.cc, htsearch/parser.h, include/Attic/htconfig.h,
+ htdoc/RELEASE.html, htdoc/uses.html, htdoc/where.html,
+ htlib/URL.cc, htlib/strcasecmp.cc, htsearch/htsearch.cc, .version,
+ README, htdig/Attic/Makefile, htdoc/ChangeLog: changes
+
+Mon Apr 21 15:44:39 1997 turtle <turtle@kiwi>
+
+ * htsearch/htsearch.cc: Added code to check the search words
+ against the minimum_word_length attribute
+
+Sun Apr 20 15:27:37 1997 turtle <turtle@kiwi>
+
+ * CONFIG: Made paths more generic
+
+ * htdig/Document.cc: Added include for ctype.h
+
+ * htdig/Plaintext.cc: Fixed bug
+
+Tue Apr 1 17:56:57 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: added ukc
+
+Sun Mar 30 01:18:16 1997 turtle <turtle@kiwi>
+
+ * htdig/Attic/Makefile, htdoc/uses.html, Attic/Makefile.config,
+ Attic/config.log, Attic/config.status, .cvsignore, Attic/Makefile,
+ htsearch/Attic/Makefile, htsearch/.cvsignore,
+ htnotify/Attic/Makefile, htnotify/.cvsignore, htmerge/.cvsignore,
+ htmerge/Attic/Makefile, htlib/.cvsignore, htlib/Attic/Makefile,
+ htfuzzy/.cvsignore, htfuzzy/Attic/Makefile, htcommon/.cvsignore,
+ htcommon/Attic/Makefile: update
+
+Thu Mar 27 00:06:05 1997 turtle <turtle@kiwi>
+
+ * htdig/Plaintext.cc: Applied patch supplied by Peter Enderborg
+ <[email protected]> to fix a problem with a pointer running off the end
+ of a string.
+
+Mon Mar 24 04:33:26 1997 turtle <turtle@kiwi>
+
+ * rx-1.5/rx/Attic/config.log, rx-1.5/rx/Attic/config.status,
+ htsearch/htsearch.h, htsearch/parser.h, include/Attic/htconfig.h,
+ rx-1.5/Attic/config.status, htsearch/Attic/Makefile,
+ htsearch/ResultList.cc, htsearch/ResultMatch.h,
+ htsearch/Template.h, htsearch/WeightWord.h, htlib/cgi.cc,
+ htlib/htString.h, htlib/io.cc, htmerge/Attic/Makefile,
+ htmerge/htmerge.h, htnotify/Attic/Makefile, htlib/StringList.cc,
+ htlib/StringList.h, htlib/String_fmt.cc, htlib/URL.h,
+ htlib/URLTrans.cc, htlib/Attic/SDSU.cc, htlib/Attic/String.h,
+ htlib/ParsedString.h, htlib/String.cc, htfuzzy/htfuzzy.cc,
+ htlib/Attic/Makefile, htlib/Configuration.cc, htlib/Connection.cc,
+ htlib/Database.h, htdig/URLRef.h, htfuzzy/Attic/Makefile,
+ htfuzzy/Exact.cc, htfuzzy/Fuzzy.h, htfuzzy/Substring.cc,
+ htfuzzy/SuffixEntry.h, htdig/Plaintext.cc, htdig/Postscript.cc,
+ htdig/SGMLEntities.cc, htdig/Server.cc, htdig/Server.h,
+ htdig/Attic/Makefile, htdig/ExternalParser.cc,
+ htdig/ExternalParser.h, htdig/Parsable.h, htcommon/Attic/Makefile,
+ htcommon/DocumentRef.h, htcommon/WordList.cc, htcommon/WordList.h,
+ htcommon/WordReference.h, htdig/Document.h, Attic/config.status,
+ configure, configure.in, Attic/Makefile, Attic/Makefile.config,
+ Attic/config.cache, Attic/config.log, Makefile.config.in: Renamed
+ the String.h file to htString.h to help compiling under win32
+
+ * Makefile.in: Updated "make dist" to remove CVS stuff
+
+Fri Mar 14 17:15:32 1997 turtle <turtle@kiwi>
+
+ * htcommon/defaults.cc: Changed default value for remove_bad_urls
+ to true
+
+Thu Mar 13 18:37:50 1997 turtle <turtle@kiwi>
+
+ * htnotify/htnotify.cc, Attic/Makefile.config,
+ htdig/SGMLEntities.cc, htdoc/uses.html: Changes
+
+Thu Feb 27 00:52:52 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: new uses
+
+Mon Feb 24 17:52:55 1997 turtle <turtle@kiwi>
+
+ * htsearch/htsearch.cc, htnotify/Attic/Makefile,
+ htsearch/Attic/Makefile, htlib/strcasecmp.cc,
+ htmerge/Attic/Makefile, htlib/Attic/Makefile, htlib/String.cc,
+ htlib/StringMatch.cc, htdig/SGMLEntities.cc,
+ htfuzzy/Attic/Makefile, htdig/Attic/Makefile,
+ htcommon/Attic/Makefile, htcommon/WordList.cc: Applied patches
+ supplied by "Jan P. Sorensen" <[email protected]> to make
+ ht://Dig run on 8-bit text without the global unsigned-char option
+ to gcc.
+
+Sun Feb 23 17:29:38 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: *** empty log message ***
+
+Tue Feb 18 15:03:03 1997 turtle <turtle@kiwi>
+
+ * htdoc/uses.html: New uses of ht://Dig
+
+Tue Feb 11 00:38:48 1997 turtle <turtle@kiwi>
+
+ * htsearch/htsearch.cc: Renamed the very bad wordlist variable to
+ badWords
+
+Mon Feb 10 17:32:47 1997 turtle <turtle@kiwi>
+
+ * htlib/Connection.cc, htdig/Document.h, htdig/Document.cc,
+ htcommon/DocumentRef.cc, htcommon/DocumentRef.h: Applied AIX
+ specific patches supplied by Lars-Owe Ivarsson
+
+Fri Feb 7 18:04:13 1997 turtle <turtle@kiwi>
+
+ * htlib/URL.cc: Fixed problem with anchors without a URL
+
+Mon Feb 3 17:37:59 1997 turtle <turtle@kiwi>
+
+ * .version, README: updated stuff to 3.0.8
+
+ * Many files: Initial CVS
+
+Local Variables:
+add-log-time-format: current-time-string
+End: