Follow the Diagonals: Finding String Matches through Matrix Operations

Faculty Sponsor: Professor Pavel Oleinikov

Abstract: Pairwise comparison of two sequences produces diagonal lines of matches, a technique originally used in the study of homology. This method is extended to text similarity detection in programming languages as well as in large collections of texts. Tokenizing the texts allows dynamic programming methods such as Global Alignment to detect similar texts while running …
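
To make the idea concrete, here is a minimal sketch of the two techniques the abstract names: a pairwise match matrix whose diagonal runs mark shared token sequences, and a global alignment score computed by dynamic programming. It is an illustration only, not the authors' implementation. The function names, the whitespace tokenization, the `min_len` threshold, and the scoring values (match +1, mismatch -1, gap -1, in the style of Needleman-Wunsch) are all assumptions chosen for this example.

```python
def match_matrix(a, b):
    """Binary matrix m where m[i][j] == 1 iff tokens a[i] and b[j] are equal."""
    return [[1 if x == y else 0 for y in b] for x in a]


def diagonal_runs(m, min_len=2):
    """Return (start_i, start_j, length) for each maximal diagonal run of 1s.

    min_len is a hypothetical threshold that filters out chance
    single-token matches.
    """
    rows = len(m)
    cols = len(m[0]) if rows else 0
    runs = []
    for i in range(rows):
        for j in range(cols):
            if not m[i][j]:
                continue
            # Skip cells that continue a run that started further up-left.
            if i > 0 and j > 0 and m[i - 1][j - 1]:
                continue
            length = 0
            while (i + length < rows and j + length < cols
                   and m[i + length][j + length]):
                length += 1
            if length >= min_len:
                runs.append((i, j, length))
    return runs


def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Global alignment score via dynamic programming (assumed scoring values).

    Only the previous row of the score table is kept, so memory is O(len(b)).
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


# Whitespace tokenization is an assumption; real code or prose would need
# a tokenizer suited to the language being compared.
text_a = "the quick brown fox".split()
text_b = "a quick brown dog".split()

runs = diagonal_runs(match_matrix(text_a, text_b))
score = global_alignment_score(text_a, text_b)
print(runs, score)
```

For these two token lists the only diagonal run is the shared phrase "quick brown", reported as `(1, 1, 2)`: it starts at index 1 in both texts and is two tokens long. Longer runs, or many runs lying on nearby diagonals, would suggest that larger passages are shared.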