资 源 简 介
Introduction
FM-index++ is a C++ implementation of FM-index(1).
FM-index is a compressed data structure to search for text data, and has been used in many applications such as information retrieval and bioinformatics.
FM-index++ supports three types of similarity searches: exact search, Hamming distance search and Edit distance search.
As building blocks, FM-index++ uses SAIS(2) for suffix array constructions and wat-array(3) for wavelet tree constructions.
The size of FM-index is O(nlgs), where n is the length of a text and s is the number of words.
Quick Start
tar xvjf fmindex-x.x.x.tar.bz2cd fmindex-x.x.x/src./waf configure./waf sudo ./waf install
To create an index file from a text, please use the construct command as following.
./fmconstruct -percent 25 ../dat/article.txt index
Index is an index file made by the program. The option -percent (0<=value<=100) indicates the percentage of sa