Exact Pattern Matching Algorithms and Analysis

Author: Pallavi Garg

Algorithms

Brute force
Knuth-Morris-Pratt (KMP)
Boyer-Moore-Horspool (BMH)
Bitap
Rabin Karp

Test Setup

Random Strings Testing
English Strings and Words Testing

To run the algorithms

./project "Text" "Pattern" "algo"

Observation

Random strings: KMP algorithm performs the best

English Strings and Words: KMP algorithm performs the best

Conclusion

As per the observations, in both test settings KMP performs the best in both these test setups. The BMH algorithm performed poorly in this setup. The reason for this observation is that:

The KMP uses the information about the previous comparisons which makes it fast for these scenarios. Because of the large degree of similarity among portions of pattern and text in English words setup, the BMH algorithm makes a lot of comparisons before it is able to find a match.
In both the test setups, text contains a large number of repeating characters and BMH algorithm relies on the “bad character shift” to determine how far to move the pattern when mismatch occurs. Which makes it shift the pattern by a small amount only, resulting in greater number of comparisons. On the other hand, the KMP algorithm uses the previous match information, which makes it much faster.

In general, it is noticeable from both the graphs, that brute force algorithm performance is better than both Rabin Karp and BMH algorithms. This shows that depending on the problem, brute force algorithm might be a better choice.

Overall, the performance of these algorithms depends on the characteristics of the specific problem being solved. It is important to consider the properties of the text and pattern when selecting an algorithm for string matching.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.vscode		.vscode
inputs		inputs
outputs		outputs
report		report
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
a.out		a.out
bitap.cpp		bitap.cpp
bmh.cpp		bmh.cpp
brute.cpp		brute.cpp
filereader.cpp		filereader.cpp
kmp.cpp		kmp.cpp
main.cpp		main.cpp
plot_graph.py		plot_graph.py
project		project
project.h		project.h
randomtext.cpp		randomtext.cpp
reader.h		reader.h
requirements.txt		requirements.txt
rk.cpp		rk.cpp
text.txt		text.txt
words.txt		words.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Exact Pattern Matching Algorithms and Analysis

Author: Pallavi Garg

Algorithms

Test Setup

To run the algorithms

Observation

Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Languages

pallavi-garg/PatternMatching

Folders and files

Latest commit

History

Repository files navigation

Exact Pattern Matching Algorithms and Analysis

Author: Pallavi Garg

Algorithms

Test Setup

To run the algorithms

Observation

Conclusion

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages