About: Two-way string-matching algorithm

Property	Value
dbo:abstract	In computer science, the two-way string-matching algorithm is an string-searching algorithm, discovered by Maxime Crochemore and Dominique Perrin in 1991. It takes a pattern of size m, called a “needle”, preprocesses it in linear time O(m), producing information that can then be used to search for the needle in any “haystack” string, taking only linear time O(n) with n the haystack's length. The two-way algorithm can be viewed as a combination of the forward-going Knuth–Morris–Pratt algorithm (KMP) and the backward-running Boyer–Moore string-search algorithm (BM).Like those two, the 2-way algorithm preprocesses the pattern to find partially repeating periods and computes “shifts” based on them, indicating what offset to “jump” to in the haystack when a given character is encountered. Unlike BM and KMP, it uses only O(log m) additional space to store information about those partial repeats: the search pattern is split into two halves (its critical factorization), represented only by the position of that split. Being a number lesser than m, it can be represented in ⌈log₂ m⌉ bits. In most practical settings, this can be taken to be O(1), as the needle's size is limited by the size of addressable memory.The actual matching operation performs at most 2n − m comparisons. Breslauer later published two improved variants performing fewer comparisons, at the cost of storing additional data about the preprocessed needle: * The first one performs at most n + ⌊(n − m)/2⌋ comparisons, ⌈(n − m)/2⌉ fewer than the original. It must however store ⌈log m⌉ additional offsets in the needle, using O(log2 m) space. * The second adapts it to only store a constant number of such offsets, denoted c, but must perform n + ⌊(1⁄2 + ε) * (n − m)⌋ comparisons, with ε = 1⁄2(Fc+2 − 1)−1 = O(−c) going to zero exponentially quickly as c increases. The algorithm is considered fairly efficient in practice, being cache-friendly and using several operations that can be implemented in well-optimized subroutines. It is used by the C standard libraries glibc, newlib, and musl, to implement the memmem and strstr family of substring functions. As with most advanced string-search algorithms, the naïve implementation may be more efficient on small-enough instances; this is especially so if the needle isn't searched in multiple haystacks, which would amortize the preprocessing cost. (en)
dbo:wikiPageID	62412427 (xsd:integer)
dbo:wikiPageLength	9212 (xsd:nonNegativeInteger)
dbo:wikiPageRevisionID	1097829028 (xsd:integer)
dbo:wikiPageWikiLink	dbr:C_standard_libraries dbr:C_string_handling dbr:Boyer–Moore_string-search_algorithm dbr:Glibc dbr:Golden_ratio dbr:Musl dbr:Computer_science dbr:String_(computer_science) dbr:Address_space dbc:String_matching_algorithms dbr:Fibonacci_number dbr:Knuth–Morris–Pratt_algorithm dbr:Landau_notation dbr:Newlib dbr:Lyndon_word dbr:String-searching_algorithm
dbp:bestTime	O (en)
dbp:class	dbr:String-searching_algorithm
dbp:data	Any string with an ordered alphabet (en)
dbp:space	⌈log₂ m⌉ (en)
dbp:time	O (en)
dbp:wikiPageUsesTemplate	dbt:! dbt:Frac dbt:Math dbt:Missing_information dbt:Reflist dbt:Infobox_algorithm
dcterms:subject	dbc:String_matching_algorithms
rdfs:comment	In computer science, the two-way string-matching algorithm is an string-searching algorithm, discovered by Maxime Crochemore and Dominique Perrin in 1991. It takes a pattern of size m, called a “needle”, preprocesses it in linear time O(m), producing information that can then be used to search for the needle in any “haystack” string, taking only linear time O(n) with n the haystack's length. Breslauer later published two improved variants performing fewer comparisons, at the cost of storing additional data about the preprocessed needle: (en)
rdfs:label	Two-way string-matching algorithm (en)
owl:sameAs	wikidata:Two-way string-matching algorithm https://global.dbpedia.org/id/C1q5D
prov:wasDerivedFrom	wikipedia-en:Two-way_string-matching_algorithm?oldid=1097829028&ns=0
foaf:isPrimaryTopicOf	wikipedia-en:Two-way_string-matching_algorithm
is dbo:wikiPageWikiLink of	dbr:String-searching_algorithm
is foaf:primaryTopic of	wikipedia-en:Two-way_string-matching_algorithm