52ky posted on 2022-4-25 17:20:43

emlog title similarity query plugin

Computes the similarity between all post titles and reports the ones whose similarity is greater than a set value. It is mainly meant for checking for duplicate titles after scraping content.
I have not tested it on data sets in the tens of thousands; a query over 500 posts took 1.5 seconds.
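For anyone curious what this kind of check looks like, here is a minimal sketch in Python. It is not the plugin's code: the post does not say which similarity measure the plugin uses, so the function name `similar_title_pairs`, the default threshold of 0.8, and the use of the standard library's `difflib.SequenceMatcher` are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_title_pairs(titles, threshold=0.8):
    """Return (i, j, ratio) for each pair of titles whose ratio is greater than threshold.

    Hypothetical helper: the similarity measure here (difflib's ratio, 0.0 to 1.0)
    is an assumption, not necessarily what the plugin computes.
    """
    matches = []
    # Compare every title with every later title exactly once.
    for (i, a), (j, b) in combinations(enumerate(titles), 2):
        ratio = SequenceMatcher(None, a, b).ratio()
        if ratio > threshold:
            matches.append((i, j, ratio))
    return matches


if __name__ == "__main__":
    sample = [
        "How to install emlog",
        "How to install emlog 6",
        "Completely different",
    ]
    for i, j, ratio in similar_title_pairs(sample):
        print(f"{sample[i]!r} ~ {sample[j]!r}: {ratio:.2f}")
```

Comparing every pair this way grows quadratically: 500 titles means 124,750 comparisons, and 10,000 titles would mean 49,995,000, so run time on larger sites will rise much faster than the number of posts.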




