代码之家 › 专栏 › 技术社区 › Hypothetical Ninja

Python中的Levenshtein距离循环

word-frequency levenshtein-distance for-loop function python

Hypothetical Ninja · 技术社区 · 11 年前

我有一组参考单词(拼写正确),我需要使用用户输入的单词。使用Levenstein距离将输入单词与参考列表进行比较,我需要从成本最低的参考列表中返回单词。此外,该参考列表按频率排序,因此较高的频率出现在顶部。如果两个单词的距离相同,则返回频率较高的单词。“NWORDS”是我根据频率排序的参考列表。“候选”是用户输入的单词。

代码:

for word in NWORDS: #iterate over all words in ref
    i = jf.levenshtein_distance(candidate,word) #compute distance for each word with user input

        #dont know what to do here
    return word #function returns word from ref list with lowest dist and highest frequency of occurrence.

1 回复 | 直到 11 年前

jonrsharpe 11 年前

您可以按如下方式处理:

match = None # best match word so far
dist = None # best match distance so far
for word in NWORDS: #iterate over all words in ref
    i = jf.levenshtein_distance(candidate, word) #compute distance for each word with user input
    if dist is None or i < dist: # or <= if lowest freq. first in NWORDS
        match, dist = word, i
return match #function returns word from ref list with lowest dist and highest frequency of occurrence

推荐文章

Dave · 如何在for循环中修改列表值

11 月前

Haru Hoshizora · 为什么一个整数的位置没有改变,但值却不同

1 年前

BlurKid · R中for循环时结果的奇怪差异

1 年前

Rudraksh_pd · 取炭。通过char。c中创建字符串的输入

1 年前

Mtullis · 在我的表单值中循环遍历数组[重复]

1 年前

puboot · 我的for循环没有运行,我不知道为什么。它甚至不会在控制台上打印任何内容

1 年前

Justin Hawkins · 在一个数组中返回两个数组,其中包含带特定字母的名称和不带指定字母的名称

1 年前

leiseg · 使用扩展变量的for循环中PowerShell CLI中缺少终止符

1 年前

xhamsterIT · 循环VBA Microsoft Excel

1 年前

AlexC · 如何循环dplyr group_by并总结变量列表的语句

1 年前