为什么Ruby中的迭代需要这么长时间?

问题描述 投票:2回答:1

嘿,我是Ruby的新手,我遇到了问题。我的文件Wordlist有超过100,000个单词,如果我的hashcode等于我的文件Wordlist中的一个单词,我想用方法test_password检查,但是当我检查我的文件的最后一个单词时,需要花费很多时间来迭代它,可以请某人帮助我如何让它更快?

File.open("Wordlist.txt", "r") do |fi|
  fi.each_line do |words|
    text_word << words.chomp
  end
end

text_word.each do |words|
  if test_password(words,ARGV[0])
    puts "FOUND: " + words
    break
  end
end
ruby file iteration hashcode
1个回答
3
投票

您可以使用[hash_code(word), word]对创建一次哈希,并将结果写为JSON,YAML或数据库(例如SQLite)。如果计算此哈希需要很长时间,这是可以的,因为您只需要执行一次。接下来,您只需要读取保存的哈希值,这应该很快。

现在,检查哈希中是否有单词或哈希码应该非常快。

以下是TODO为您留下的一个小例子:

require 'json'
require 'digest/md5'

hashcodes = {}

def my_hashcode(word)
  Digest::MD5.hexdigest word
end

# This part is slow, that's okay because it can be saved once and for all and doesn't depend on your input
File.open('/usr/share/dict/american-english') do |wordlist|
  wordlist.each do |word| 
    word.chomp!
    hashcodes[my_hashcode(word)] = word
  end
end

#TODO: Write hashcodes to JSON file
#TODO: Read hashcode from JSON file

# This part depends on your input but is very fast:
some_hashcode = my_hashcode("test")

p hashcodes[some_hashcode]
# => "test"

p hashcodes["S0MEWEIRDH4SH"]
# => nil
© www.soinside.com 2019 - 2024. All rights reserved.