A C++ implementation of the Single-Pass In-Memory Indexing (SPIMI) algorithm to build an inverted index from a collection of documents.
- Processes all text files in a specified folder.
- Generates sub-inverted indices based on a maximum term limit.
- Merges sub-indices using a multiway merging algorithm.
- Outputs a final inverted index file (
invIndex.csv
) in CSV format.
- C++17 or newer compiler (e.g.,
g++
,clang++
). - Standard Library (no external dependencies required).