Edit Distance In C++
2015-10-18 12:48
387 查看
Refference: programcreek.com/2013/12/edit-distance-in-java/
From Wiki:
In computer science, edit distance is a way of quantifying how dissimilar two strings (e.g., words) are to one another by counting the minimum number of operations required to transform one string into the other.
There are three operations permitted on a word: replace, delete, insert. For example, the edit distance between "a" and "b" is 1, the edit distance between "abc" and "def" is 3. This post analyzes how to calculate edit distance by using dynamic
programming.
Key Analysis
Let dp[i][j] stands for the edit distance between two strings with length i and j,
i.e., word1[0,...,i-1] and word2[0,...,j-1].
There is a relation between dp[i][j] and dp[i-1][j-1]. Let's say we transform from one string to another. The first string has length i and
it's last character is "x"; the second string has length jand its last character
is "y". The following diagram shows the relation
if x == y, then dp[i][j] == dp[i-1][j-1]
if x != y, and we insert y for word1, then dp[i][j] = dp[i][j-1] + 1
if x != y, and we delete x for word1, then dp[i][j] = dp[i-1][j] + 1
if x != y, and we replace x with y for word1, then dp[i][j] = dp[i-1][j-1] + 1
When x!=y, dp[i][j] is the min of the three situations.
Initial condition:
dp[i][0] = i, dp[0][j] = j
From Wiki:
In computer science, edit distance is a way of quantifying how dissimilar two strings (e.g., words) are to one another by counting the minimum number of operations required to transform one string into the other.
There are three operations permitted on a word: replace, delete, insert. For example, the edit distance between "a" and "b" is 1, the edit distance between "abc" and "def" is 3. This post analyzes how to calculate edit distance by using dynamic
programming.
Key Analysis
Let dp[i][j] stands for the edit distance between two strings with length i and j,
i.e., word1[0,...,i-1] and word2[0,...,j-1].
There is a relation between dp[i][j] and dp[i-1][j-1]. Let's say we transform from one string to another. The first string has length i and
it's last character is "x"; the second string has length jand its last character
is "y". The following diagram shows the relation
if x == y, then dp[i][j] == dp[i-1][j-1]
if x != y, and we insert y for word1, then dp[i][j] = dp[i][j-1] + 1
if x != y, and we delete x for word1, then dp[i][j] = dp[i-1][j] + 1
if x != y, and we replace x with y for word1, then dp[i][j] = dp[i-1][j-1] + 1
When x!=y, dp[i][j] is the min of the three situations.
Initial condition:
dp[i][0] = i, dp[0][j] = j
int inDistance(String word1, String word2) { int len1 = word1.length(); int len2 = word2.length(); // len1+1, len2+1, because finally return dp[len1][len2] int[][] dp = new int[len1 + 1][len2 + 1]; for (int i = 0; i <= len1; i++) { dp[i][0] = i; } for (int j = 0; j <= len2; j++) { dp[0][j] = j; } //iterate though, and check last char for (int i = 0; i < len1; i++) { char c1 = word1.charAt(i); for (int j = 0; j < len2; j++) { char c2 = word2.charAt(j); //if last two chars equal if (c1 == c2) { //update dp value for +1 length dp[i + 1][j + 1] = dp[i][j]; } else { int replace = dp[i][j] + 1; int insert = dp[i][j + 1] + 1; int delete = dp[i + 1][j] + 1; int min = replace > insert ? insert : replace; min = delete > min ? min : delete; dp[i + 1][j + 1] = min; } } } return dp[len1][len2]; }
相关文章推荐
- Vusual C++连接Mysql和从MySql中取出数据的API介绍
- visualC/C++连接MySql数据库
- 比赛
- C语言输出数组的三种方法
- 深入了解C++中间mutablekeyword
- 【黑马训练营】IOS C语言函数
- C++操作符的优先级
- [C++基金会]位计算 游戏开发中的应用
- 剑指offer第十九题【顺时针打印矩阵】c++实现
- C++实现二叉树及其线索化和遍历
- c语言和c++的“&”的语言
- mfc c++常用函数
- 【黑马训练营】IOS C语言循环语句
- 用C++实现STL容器stack
- C++设计模式--享元模式【转载】
- C/C++经典图书推荐
- Merge Two Sorted Lists(C++)
- C语言获取文件大小相关操作
- [黑马训练营]IOS C语言分支语句
- c字符串的处理