public class DeDupFetchHTTP
extends org.archive.modules.fetcher.FetchHTTP
org.archive.crawler.fetcher.FetchHTTP
processor for downloading HTTP documents. This extension adds a check after
the content header has been downloaded that compares the 'last-modified' and
or 'last-etag' values from the header against information stored in an
appropriate index.is.hi.bok.deduplicator.DigestIndexer
,
org.archive.crawler.fetcher.FetchHTTP
Constructor and Description |
---|
DeDupFetchHTTP() |
addResponseContent, checkMidfetchAbort, cleanupHttp, configureHttp, configureHttp, configureMethod, doAbort, getAcceptCompression, getAcceptHeaders, getAttributeEither, getAuthScheme, getCookieStorage, getCredentialStore, getDefaultCharset, getDefaultEncoding, getDigestAlgorithm, getDigestContent, getHttp, getHttpBindAddress, getHttpProxyHost, getHttpProxyPassword, getHttpProxyPort, getHttpProxyUser, getIgnoreCookies, getMaxFetchKBSec, getMaxLengthBytes, getSendConnectionClose, getSendIfModifiedSince, getSendIfNoneMatch, getSendRange, getSendReferer, getServerCache, getShouldFetchBodyRule, getSoTimeoutMs, getSslTrustLevel, getTimeoutSeconds, getUseHTTP11, getUserAgentProvider, handle401, innerProcess, isRunning, process, report, setAcceptCompression, setAcceptHeaders, setConditionalGetHeader, setCookieStorage, setCredentialStore, setDefaultEncoding, setDigestAlgorithm, setDigestContent, setHttpBindAddress, setHttpProxyHost, setHttpProxyPassword, setHttpProxyPort, setHttpProxyUser, setIgnoreCookies, setMaxFetchKBSec, setMaxLengthBytes, setSendConnectionClose, setSendIfModifiedSince, setSendIfNoneMatch, setSendRange, setSendReferer, setServerCache, setShouldFetchBodyRule, setSizes, setSoTimeoutMs, setSslTrustLevel, setTimeoutSeconds, setUseHTTP11, setUserAgentProvider, shouldProcess, start, stop
doCheckpoint, finishCheckpoint, flattenVia, fromCheckpointJson, getBeanName, getEnabled, getKeyedProperties, getRecordedSize, getShouldProcessRule, getURICount, hasHttpAuthenticationCredential, innerProcessResult, innerRejectProcess, isSuccess, setBeanName, setEnabled, setRecoveryCheckpoint, setShouldProcessRule, startCheckpoint, toCheckpointJson
Copyright © 2014 National and University Library of Iceland. All Rights Reserved.