Lucene index of TREC Disks 4 & 5 (minus Congressional Records), used in the TREC 2004 Robust Track.
This index was built on 2024/08/03 at Anserini commit 36f7e3 on orca with the following command:
nohup bin/run.sh io.anserini.index.IndexCollection \
-collection TrecCollection \
-input /store/collections/newswire/disk45 \
-generator DefaultLuceneDocumentGenerator \
-index indexes/lucene-inverted.disk45/ \
-threads 16 -storePositions -storeDocvectors -storeRaw -optimize \
>& logs/log.disk45 &
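
For readers unfamiliar with the options, the sketch below rebuilds the same argument list one option at a time and prints it as a dry run instead of launching the indexer. The per-option comments reflect a general understanding of Anserini's IndexCollection and are not taken from this README, so treat them as assumptions and check them against the Anserini documentation for the commit above.

```shell
# Dry run: assemble the indexing command piece by piece, then print it.
# Nothing is indexed here; the comments are explanatory assumptions.

# Launcher script and the indexing entry point.
set -- bin/run.sh io.anserini.index.IndexCollection

# Collection class that parses the TREC-formatted source files.
set -- "$@" -collection TrecCollection

# Directory holding the raw Disks 4 & 5 documents.
set -- "$@" -input /store/collections/newswire/disk45

# Generator class that turns each parsed document into a Lucene document.
set -- "$@" -generator DefaultLuceneDocumentGenerator

# Output directory for the index.
set -- "$@" -index indexes/lucene-inverted.disk45/

# Number of indexing threads.
set -- "$@" -threads 16

# Keep term positions, document vectors, and the raw document text.
set -- "$@" -storePositions -storeDocvectors -storeRaw

# Merge the finished index down to a single segment.
set -- "$@" -optimize

# Print the assembled command on one line.
printf '%s ' "$@"
printf '\n'
```

To run the command for real rather than print it, replace the two printf lines with `nohup "$@" > logs/log.disk45 2>&1 &`, which matches the redirection used in the original command.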