布隆算法算法
先来看几个比较常见的例子数组
这几个例子有一个共同的特色: 如何判断一个元素是否存在一个集合中?网络
对于低内存的字典,方法以下:spa
1
import jieba 2 def check(s): 3 huangfan_path = 'path/to/dict.txt' 4 jieba.load_userdict(huangfan_path) 5 huangfan_words_dict = set() 6 with open(huangfan_path, 'rb') as fr: 7 for line in fr.readlines(): 8 huangfan_words_dict.add(line.strip().decode('utf-8')) 9 return set(jieba.lcut(s)) & self.huangfan_words_dict