Fast parallel algorithms for the unit cost editing distance between trees

D Shasha, K Zhang - Proceedings of the first annual ACM symposium on …, 1989 - dl.acm.org
Proceedings of the first annual ACM symposium on Parallel algorithms and …, 1989dl.acm.org
1. Problem Ordered labeled trees are trees whose nodes are labeled and in which the left-to-
right order among siblings is significant. We consider the distance between two trees to be
the minimum number of edit operations (insert, delete, and modify) necessary to transform
one tree to another. We present three algorithms to find the distance. The first algorithm is a
simple dynamic programming algorithm based on a postorder traversal whose complexity
improves upon the best previously published algorithm due to Tai (T79 in JACM). The …
1. Problem Ordered labeled trees are trees whose nodes are labeled and in which the left-to-right order among siblings is significant. We consider the distance between two trees to be the minimum number of edit operations (insert, delete, and modify) necessary to transform one tree to another.
We present three algorithms to find the distance. The first algorithm is a simple dynamic programming algorithm based on a postorder traversal whose complexity improves upon the best previously published algorithm due to Tai (T79 in JACM). The second and third algorithms are parallel algorithms based on the application of suffix trees to the comparison problem. The cost of executing these algorithms is a monotonic increasing function of the distance between the two trees. Results Let trees TI and T2 have numbers of levels L i and L 2 respectively. Let k be the actual distance between T 1 and T2. Let N be rain (IT11, IT2]). The asymptotic running times (assuming a concurrentread concurrent-write parallel random access machine) are:
ACM Digital Library