- close() - Method in class is.landsbokasafn.deduplicator.CrawlDataIterator
-
Close any resources held open to read the crawl data.
- close() - Method in class is.landsbokasafn.deduplicator.CrawlLogIterator
-
Closes the crawl.log file.
- close(boolean) - Method in class is.landsbokasafn.deduplicator.DigestIndexer
-
Close the index.
- CommandLineParser - Class in is.landsbokasafn.deduplicator
-
Print DigestIndexer command-line usage message.
- CommandLineParser(String[], PrintWriter) - Constructor for class is.landsbokasafn.deduplicator.CommandLineParser
-
Constructor.
- CommandLineParser.DigestHelpFormatter - Class in is.landsbokasafn.deduplicator
-
Overridden so that the usage output can be customized.
- CommandLineParser.DigestHelpFormatter() - Constructor for class is.landsbokasafn.deduplicator.CommandLineParser.DigestHelpFormatter
-
- CONTENT_CHANGED - Static variable in interface is.landsbokasafn.deduplicator.DedupAttributeConstants
-
URI content has changed between the two latest, successfully completed
fetches.
- CONTENT_UNCHANGED - Static variable in interface is.landsbokasafn.deduplicator.DedupAttributeConstants
-
URI content has not changed between the two latest, successfully
completed fetches.
- CONTENT_UNKNOWN - Static variable in interface is.landsbokasafn.deduplicator.DedupAttributeConstants
-
No knowledge of URI content.
- contentDigest - Variable in class is.landsbokasafn.deduplicator.CrawlDataItem
-
- CrawlDataItem - Class in is.landsbokasafn.deduplicator
-
A base class for individual items of crawl data that should be added to the
index.
- CrawlDataItem() - Constructor for class is.landsbokasafn.deduplicator.CrawlDataItem
-
Constructor.
- CrawlDataItem(String, String, String, String, String, String, boolean, long) - Constructor for class is.landsbokasafn.deduplicator.CrawlDataItem
-
Constructor.
- crawlDataItemFormat - Variable in class is.landsbokasafn.deduplicator.CrawlLogIterator
-
The date format specified by the CrawlDataItem for dates entered into it (and
eventually into the index).
- CrawlDataIterator - Class in is.landsbokasafn.deduplicator
-
An abstract base class for implementations of iterators that iterate over
different sets of crawl data.
- CrawlDataIterator(String) - Constructor for class is.landsbokasafn.deduplicator.CrawlDataIterator
-
Constructor.
- crawlDateFormat - Variable in class is.landsbokasafn.deduplicator.CrawlLogIterator
-
The date format used in crawl.log files.
- CrawlLogIterator - Class in is.landsbokasafn.deduplicator
-
An implementation of is.hi.bok.deduplicator.CrawlDataIterator capable of
iterating over a Heritrix-style crawl.log.
- CrawlLogIterator(String) - Constructor for class is.landsbokasafn.deduplicator.CrawlLogIterator
-
Create a new CrawlLogIterator that reads items from a Heritrix crawl.log.