您的位置:首页 > 编程语言 > C语言/C++

Edit Distance In C++

2015-10-18 12:48 387 查看
Refference: programcreek.com/2013/12/edit-distance-in-java/

From Wiki:

In computer science, edit distance is a way of quantifying how dissimilar two strings (e.g., words) are to one another by counting the minimum number of operations required to transform one string into the other.

There are three operations permitted on a word: replace, delete, insert. For example, the edit distance between "a" and "b" is 1, the edit distance between "abc" and "def" is 3. This post analyzes how to calculate edit distance by using dynamic
programming.

Key Analysis

Let dp[i][j] stands for the edit distance between two strings with length i and j,
i.e., word1[0,...,i-1] and word2[0,...,j-1].

There is a relation between dp[i][j] and dp[i-1][j-1]. Let's say we transform from one string to another. The first string has length i and
it's last character is "x"; the second string has length jand its last character
is "y". The following diagram shows the relation



if x == y, then dp[i][j] == dp[i-1][j-1]
if x != y, and we insert y for word1, then dp[i][j] = dp[i][j-1] + 1
if x != y, and we delete x for word1, then dp[i][j] = dp[i-1][j] + 1
if x != y, and we replace x with y for word1, then dp[i][j] = dp[i-1][j-1] + 1
When x!=y, dp[i][j] is the min of the three situations.

Initial condition:

dp[i][0] = i, dp[0][j] = j

int inDistance(String word1, String word2) {
int len1 = word1.length();
int len2 = word2.length();

// len1+1, len2+1, because finally return dp[len1][len2]
int[][] dp = new int[len1 + 1][len2 + 1];

for (int i = 0; i <= len1; i++) {
dp[i][0] = i;
}

for (int j = 0; j <= len2; j++) {
dp[0][j] = j;
}

//iterate though, and check last char
for (int i = 0; i < len1; i++) {
char c1 = word1.charAt(i);
for (int j = 0; j < len2; j++) {
char c2 = word2.charAt(j);

//if last two chars equal
if (c1 == c2) {
//update dp value for +1 length
dp[i + 1][j + 1] = dp[i][j];
} else {
int replace = dp[i][j] + 1;
int insert = dp[i][j + 1] + 1;
int delete = dp[i + 1][j] + 1;

int min = replace > insert ? insert : replace;
min = delete > min ? min : delete;
dp[i + 1][j + 1] = min;
}
}
}

return dp[len1][len2];
}
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: