-
The original authors of LexRank use TF - IDF as a vectorization method
-
The paper for LSA, where they tested all the vectorization methods, found out that the binary format performed the best in terms of the quality of summaries. Therefore, I used the binary matrix for LSA
-
LexRank creates more concise and clear summaries
-
LSA creates more detailed summaries
-
Both perform better than TextRank implemented by some library installed from npm => Better to implement TextRank with word2vec and co-reference resolution
-
LSA performed the best in a super long text (3280 words) from my observations