In addition to implementing the LSA computations described in the introduction, Gensim contains a general library for topic modeling, document indexing and similarity retrieval for large corpora of texts, as well as the Porter stemming algorithm
The PTStemmer is a toolkit that provides Java, Python, and .NET C# implementations of several Portuguese language stemming algorithms
(Orengo, Porter, and Savoy).
Lancaster, (2010) Lancaster, The Lancaster Stemming Algorithm
, other stemmers, http:// www.comp.
 experimental results show that the suggested hybrid method of statistical and light stemmers is the most suitable stemming algorithm
for Arabic language and gives generalization accuracy of about 98%.
(4) Bacchin, M., Ferro, N., Melucci, M., (2002) "Experiments to evaluate a statistical stemming algorithm
", Proceedings of the international Cross-Language Evaluation Forum Workshop, University of Padua at CLEF 2002.
Porter, best known as the author of the Porter Stemming Algorithm
, worked in information retrieval research at Cambridge University for over 9 years.
We refer to the group of words that result in a common root after applying a specific stemming algorithm
as a conflation or equivalence class.
For example, a stemming algorithm
reduces the words "fishing", "fished", "fish", and "fisher" to the root word, "fish".
Popular stemming algorithm
, Porter's stemming algorithm
has been used .
Porter's stemming algorithm
 was applied to remove prefixes and suffixes to obtain word stems.
After traversing the whole document, identifying its words, omitting its stop-list terms and phrases, and identifying its nouns and verbs, a stemming algorithm
is applied to each word, in the second phase.