在ruby中计算子字符串列表出现次数的最快方法

kognpnkq 于 2022-11-04 发布在 Ruby

关注(0)|答案(3)|浏览(189)

我的问题很简单，我有一个子字符串列表，我必须计算一个特定字符串中包含多少个子字符串。下面是我的代码：

string = "..."
substrings = ["hello", "foo", "bar", "brol"]
count = 0
substrings.each do |sub|
    count += 1 if string.include?(sub)
end

在这个例子中，我们运行了整个字符串4次，这是相当耗时的。你将如何优化这个过程？

ruby

来源：https://stackoverflow.com/questions/23427299/fastest-way-to-count-occurences-of-a-substrings-list-in-ruby

3条答案

按热度按时间

iq0todco1#

它使用 Regexp.union 只运行字符串一次：

string = 'hello there! this is foobar!'
substrings = ["hello", "foo", "bar", "brol"]

string.scan(Regexp.union(substrings)).count

# => 3

中的每一个
虽然这个解决方案在小输入时明显较慢，但它具有较低的复杂度 - 对于长度为 n 的字符串和长度为 m 的子字符串，原始解决方案的复杂度为 O(m*n) ，而这个解决方案的复杂度为 O(m+n) 。

- 更新 * *

在再次阅读了这个问题和我的答案之后，我得出了这样的结论：这不仅是一个过早的优化（正如@Max 所指出的），而且我的答案在语义上与 OP * 不同 * 。
让我解释一下 - OP 代码计算字符串中有多少个 substrings * 至少出现一次 * ，而我的解决方案计算 * 有多少个 substrings * 出现在 * 任何 * 字符串中 * ：

op_solution('hello hello there', ["hello", "foo", "bar", "brol"])

# => 1

uri_solution('hello hello there', ["hello", "foo", "bar", "brol"])

# => 2

格式
这也解释了为什么我的解决方案如此缓慢，即使对于长字符串也是如此 - - 尽管它只对输入字符串进行了一次传递，但它必须传递 * 所有 * 字符串，而原始代码在一个单词的第一次出现时就停止了。
我的结论是 - - 采用奥雅纳的解决方案。它不会比你的更快，只是更简洁，但我想不出更好的办法了：）

赞(0）回复(0）举报 2022-11-09

y0u0uwnf2#

书写为：-

substrings.count { |sub| string.include?(sub) }

赞(0）回复(0）举报 2022-11-09

ars1skjm3#

subtrings.collect { |i| string.scan(i).count }.sum
很优雅。

赞(0）回复(0）举报 2022-11-09

我来回答

在ruby中计算子字符串列表出现次数的最快方法

3条答案

相关问题

热门标签

最新问答