Skip to content

Latest commit

 

History

History
14 lines (12 loc) · 611 Bytes

lucene-index.miracl-v1.0.20221004.2b2856.README.md

File metadata and controls

14 lines (12 loc) · 611 Bytes

miracl-v1.0

Lucene index for MIRACL v1.0 (All languages)

This index was generated on 2022/10/04 at Anserini commit b5ecc5 on orca with the following command:

lang=ar # or: bn en fi fr hi id ja ko fa ru es sw te th zh
target/appassembler/bin/IndexCollection \
    -collection MrTyDiCollection \
    -input MIRACL/miracl-corpus-v1.0-$lang \
    -index lucene-index.miracl-v1.0-$lang \
    -generator DefaultLuceneDocumentGenerator \
    -threads 16 -storePositions -storeDocvectors -storeRaw -language $lang