.. This file is part of the OpenDSA eTextbook project. See .. http://opendsa.org for more details. .. Copyright (c) 2012-2020 by the OpenDSA Project Contributors, and .. distributed under an MIT open source license. .. avmetadata:: :title: Reductions :author: Cliff Shaffer :institution: Virginia Tech :topic: Reductions :keyword: Reduction :naturallanguage: en :programminglanguage: N/A :description: Introduces the concept of a reduction. Demonstrates the two-sided relationship between two problems: One side relates upper bounds, the other relates lower bounds. Presents a visual representation for the reduction process. This is important prepatory material for later introducing NP-completeness theory. Reductions ========== Reductions ---------- This module introduces an important concept for understanding the relationships between problems, called :term:`reduction`. Reduction allows us to solve one problem in terms of another. Equally importantly, when we wish to understand the difficulty of a problem, reduction allows us to make relative statements about upper and lower bounds on the cost of a problem (as opposed to an algorithm or program). Because the concept of a problem is discussed extensively in this chapter, we want notation to clarify problem descriptions. Throughout this chapter, a problem will be defined in terms of a mapping between inputs and outputs, and the name of the problem will be given in all capital letters. Thus, a complete definition of the sorting problem could appear in our notation as follows: .. topic:: SORTING **Input:** A sequence of integers :math:`x_0, x_1, x_2, \ldots, x_{n-1}`. **Output:** A permutation :math:`y_0, y_1, y_2, \ldots, y_{n-1}` of the sequence such that :math:`y_i \leq y_j` whenever :math:`i < j`. .. inlineav:: SortingProblemCON ss :long_name: Sorting Problem :links: AV/NP/SortingProblemCON.css :scripts: AV/NP/SortingProblemCON.js :output: show :keyword: Reductions Example: The Pairing Problem ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ When you buy or write a program to solve one problem, such as sorting, you might be able to use it to help solve a different problem. This is known in software engineering as :term:`software reuse`. To illustrate this, let us consider another problem. .. topic:: PAIRING **Input:** Two sequences of integers :math:`X = (x_0, x_1, ..., x_{n-1})` and :math:`Y =(y_0, y_1, ..., y_{n-1})`. **Output:** A pairing of the elements in the two sequences such that the least value in :math:`X` is paired with the least value in :math:`Y`, the next least value in :math:`X` is paired with the next least value in :math:`Y`, and so on. .. _Pair: .. inlineav:: pairingCON dgm :long_name: Pairing problem illustration :links: AV/NP/pairingCON.css :scripts: AV/NP/pairingCON.js :output: show :keyword: Reductions An illustration of **PAIRING**. The two lists of numbers are paired up so that the least values from each list make a pair, the next smallest values from each list make a pair, and so on. Figure :num:`Figure #Pair` illustrates **PAIRING**. One way to solve **PAIRING** is to use an existing sorting program to sort each of the two sequences, and then pair off items based on their position in sorted order. Technically we say that in this solution, **PAIRING** is :term:`reduced ` to **SORTING**, because **SORTING** is used to solve **PAIRING**. The next slideshow shows this process in detail, so that you can see the necessary steps in any reduction. .. inlineav:: pairToSortCON ss :long_name: Pair to Sorting Reduction Slideshow :links: AV/NP/pairToSortCON.css :scripts: AV/NP/pairToSortCON.js :output: show :keyword: Reductions Notice that reduction is a three-step process. The first step is to convert an instance of **PAIRING** into two instances of **SORTING**. The conversion step in this example is not very interesting; it simply takes each sequence and assigns it to an array to be passed to **SORTING**. The second step is to sort the two arrays (i.e., apply **SORTING** to each array). The third step is to convert the output of **SORTING** to the output for **PAIRING**. This is done by pairing the first elements in the sorted arrays, the second elements, and so on. A reduction of **PAIRING** to **SORTING** helps to establish an upper bound on the cost of **PAIRING**. In terms of asymptotic notation, assuming that we can find one method to convert the inputs to **PAIRING** into inputs to **SORTING** "fast enough", and a second method to convert the result of **SORTING** back to the correct result for **PAIRING** "fast enough", then the asymptotic cost of **PAIRING** cannot be more than the cost of **SORTING**. In this case, there is little work to be done to convert from **PAIRING** to **SORTING**, or to convert the answer from **SORTING** back to the answer for **PAIRING**, so the dominant cost of this solution is performing the sort operation. Thus, an upper bound for **PAIRING** is in :math:`O(n \log n)`. Reduction and Finding a Lower Bound ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ It is important to note that the pairing problem does **not** require that elements of the two sequences be sorted. This is merely one possible way to solve the problem. **PAIRING** only requires that the elements of the sequences be paired correctly. Perhaps there is another way to do it? Certainly if we use sorting to solve **PAIRING**, the algorithms will require :math:`\Omega(n \log n)` time. But, another approach might conceivably be faster. There is another use of reductions aside from applying an old algorithm to solve a new problem (and thereby establishing an upper bound for the new problem). That is to prove a lower bound on the cost of a new problem by showing that it could be used as a solution for an old problem with a known lower bound. This can be a hard concept to wrap your head around. But it is absolutely critical for understanding NP-Completeness later on. Assume we can go the other way and convert **SORTING** to **PAIRING** "fast enough". What does this say about the minimum cost of **PAIRING**? We know that the :ref:`lower bound ` for **SORTING** in the worst and average cases is in :math:`\Omega(n \log n)`. In other words, the best possible algorithm for sorting requires at least :math:`n \log n` time. Assume that **PAIRING** could be done in :math:`O(n)` time. Then, one way to create a sorting algorithm would be to convert **SORTING** into **PAIRING**, run the algorithm for **PAIRING**, and finally convert the answer back to the answer for **SORTING**. Provided that we can convert **SORTING** to/from **PAIRING** "fast enough", this process would yield an :math:`O(n)` algorithm for sorting! Because this contradicts what we know about the lower bound for **SORTING**, and the only flaw in the reasoning is the initial assumption that **PAIRING** can be done in :math:`O(n)` time, we can conclude that there is no :math:`O(n)` time algorithm for **PAIRING**. This reduction process tells us that **PAIRING** must be at least as expensive as **SORTING** and so must itself have a lower bound in :math:`\Omega(n \log n)`. To complete this proof regarding the lower bound for **PAIRING**, we need now to find a way to reduce **SORTING** to **PAIRING**. This is easily done. Take an instance of **SORTING** (i.e., an array :math:`A` of :math:`n` elements). A second array :math:`B` is generated that simply stores :math:`i` in position :math:`i` for :math:`0 \leq i < n`. Pass the two arrays to **PAIRING**. Take the resulting set of pairs, and use the value from the :math:`B` half of the pair to tell which position in the sorted array the :math:`A` half should take; that is, we can now reorder the records in the :math:`A` array using the corresponding value in the :math:`B` array as the sort key and running a simple :math:`\Theta(n)` :ref:`Binsort `. The conversion of **SORTING** to **PAIRING** can be done in :math:`O(n)` time, and likewise the conversion of the output of **PAIRING** can be converted to the correct output for **SORTING** in :math:`O(n)` time. Thus, the cost of this "sorting algorithm" is dominated by the cost for **PAIRING**. The Reduction Template ~~~~~~~~~~~~~~~~~~~~~~ Before showing the example in detail for the reduction of **SORTING** to **PAIRING**, let's step back for a moment and make sure that we understand the general process defining it in terms of the Reduction Template. Consider any two problems for which a suitable reduction from one to the other can be found. The first problem takes an arbitrary instance of its input, which we will call **I**, and transforms **I** to a solution, which we will call **SLN**. The second problem takes an arbitrary instance of its input, which we will call **I'**, and transforms **I'** to a solution, which we will call **SLN'**. We can define reduction more formally as a three-step process: #. Transform an arbitrary instance of the first problem to an instance of the second problem. In other words, there must be a transformation from any instance **I** of the first problem to an instance **I'** of the second problem. #. Apply an algorithm for the second problem to the instance **I'**, yielding a solution **SLN'**. #. Transform **SLN'** to the solution of **I**, known as **SLN**. Note that **SLN** must in fact be the correct solution for **I** for the reduction to be acceptable. .. _BlackBox: .. inlineav:: blackBoxCON dgm :long_name: General blackbox reduction diagram :links: AV/NP/blackBoxCON.css :scripts: AV/NP/blackBoxCON.js :output: show :keyword: Reductions A graphical representation of the general reduction process, showing the role of the two problems, and the two transformations. Next is a slideshow that shows the steps for the reduction of **SORTING** to **PAIRING**. .. inlineav:: sortToPairCON ss :long_name: Sorting to Pairing Reduction Slideshow :links: AV/NP/sortToPairCON.css :scripts: AV/NP/sortToPairCON.js :output: show :keyword: Reductions It is important to note that the reduction process does not give us an algorithm for solving either problem by itself. It merely gives us a method for solving the first problem given that we already have a solution to the second. More importantly for the topics to be discussed in the remainder of this chapter, reduction gives us a way to understand the bounds of one problem in terms of another. Specifically, given efficient transformations, the upper bound of the first problem is at most the upper bound of the second. Conversely, the lower bound of the second problem is at least the lower bound of the first. None of this involves knowing the actual solution to either problem. This is especially helpful since we often use reductions to prove that something cannot exist (or at least, cannot run fast). Two Multiplication Examples ~~~~~~~~~~~~~~~~~~~~~~~~~~~ As a second example of reduction, consider the simple problem of multiplying two :math:`n`-digit numbers. The standard long-hand method for multiplication is to multiply the last digit of the first number by the second number (taking :math:`\Theta(n)` time), multiply the second digit of the first number by the second number (again taking :math:`\Theta(n)` time), and so on for each of the :math:`n` digits of the first number. Finally, the intermediate results are added together. Note that adding two numbers of length :math:`M` and :math:`N` can easily be done in :math:`\Theta(M + N)` time. Because each digit of the first number is multiplied against each digit of the second, this algorithm requires :math:`\Theta(n^2)` time. Asymptotically faster (but more complicated) algorithms are known, but none is so fast as to be in :math:`O(n)`. Next we ask the question: Is squaring an :math:`n`-digit number as difficult as multiplying two :math:`n`-digit numbers? We might hope that something about this special case will allow for a faster algorithm than is required by the more general multiplication problem. However, a simple reduction proof serves to show that squaring is "as hard" as multiplying. The key to the reduction is the following formula: .. math:: X \times Y = \frac{(X + Y)^2 - (X - Y)^2}{4}. The significance of this formula is that it allows us to convert an arbitrary instance of multiplication to a series of operations involving three addition/subtractions (each of which can be done in linear time), two squarings, and a division by 4. This is because .. math:: (X + Y)^2 - (X - Y)^2 = X^2 + 2XY + Y^2 - (X^2 - 2XY + Y^2) = 4XY. Note that the division by 4 can be done in linear time (simply convert to binary, shift right by two digits, and convert back). This reduction shows that if a linear time algorithm for squaring can be found, it can be used to construct a linear time algorithm for multiplication. Our next example of reduction concerns the multiplication of two :math:`n \times n` matrices. For this problem, we will assume that the values stored in the matrices are simple integers and that multiplying two simple integers takes constant time (because multiplication of two ``int`` variables takes a fixed number of machine instructions). The standard algorithm for multiplying two matrices is to multiply each element of the first matrix's first row by the corresponding element of the second matrix's first column, then adding the numbers. This takes :math:`\Theta(n)` time. Each of the :math:`n^2` elements of the solution are computed in similar fashion, requiring a total of :math:`\Theta(n^3)` time. Faster algorithms are known (see :ref:`Strassen's algorithm `), but none are so fast as to be in :math:`O(n^2)`. Now, consider the case of multiplying two :term:`symmetric matrices `. A symmetric matrix is one in which entry :math:`ij` is equal to entry :math:`ji`; that is, the upper-right triangle of the matrix is a mirror image of the lower-left triangle. Is there something about this restricted case that allows us to multiply two symmetric matrices faster than in the general case? The answer is no, as can be seen by the following reduction. Assume that we have been given two :math:`n \times n` matrices :math:`A` and :math:`B`. We can construct a :math:`2n \times 2n` symmetric matrix from an arbitrary matrix :math:`A` as follows: .. math:: \left[ \begin{array}{cc} 0 &A\\ A^{\rm T}& 0 \end{array} \right]. Here 0 stands for an :math:`n \times n` matrix composed of zero values, :math:`A` is the original matrix, and :math:`A^{\rm T}` stands for the transpose of matrix :math:`A`. (The transpose operation takes position :math:`ij` of the original matrix and places it in position :math:`ji` of the transpose matrix. This can easily be done in :math:`n^2` time for an :math:`n \times n` matrix.) Note that the resulting matrix is now symmetric. We can convert matrix :math:`B` to a symmetric matrix in a similar manner. If symmetric matrices could be multiplied "quickly" (in particular, if they could be multiplied together in :math:`\Theta(n^2)` time), then we could find the result of multiplying two arbitrary :math:`n \times n` matrices in :math:`\Theta(n^2)` time by taking advantage of the following observation: .. math:: \left[ \begin{array}{cc} 0&A\\ A^{\rm T}&0 \end{array} \right] \left[ \begin{array}{cc} 0&B^{\rm T}\\ B&0 \end{array} \right] = \left[ \begin{array}{cc} AB&0\\ 0&A^{\rm T}B^{\rm T} \end{array} \right]. In the above formula, :math:`AB` is the result of multiplying matrices :math:`A` and :math:`B` together. The following slideshow illustrates this reduction process. .. inlineav:: matrixMulCON ss :long_name: Matrix Multiplication :links: AV/NP/matrixMulCON.css :scripts: AV/NP/matrixMulCON.js :output: show :keyword: Reductions Bounds Theorems ~~~~~~~~~~~~~~~ The notation :math:`\leq_{O(g(n))}` means that a reduction can be done with transformations that cost :math:`O(g(n))`. **Lower Bound Theorem}**: If :math:`P_1 \leq_{O(g(n))} P_2`, then there is a lower bound of :math:`\Omega(h(n))` on the time complexity of :math:`P_1`, and :math:`g(n) = o(h(n))`, then there is a lower bound of :math:`\Omega(h(n))` on :math:`P_2`. (Notice little-oh, not big-Oh. Meaning that the one thing is really smaller than the other.) Example: **SORTING** :math:`\leq_{O(n)}` **PAIRING**, because :math:`g(n) = n`, :math:`h(n) = n \log n`, and :math:`g(n) = o(h(n))`. The Lower Bound Theorem gives us an :math:`\Omega(n \log n)` lower bound on **PAIRING**. This also goes the other way. **Upper Bound Theorem**: If :math:`P_2` has time complexity :math:`O(h(n))` and :math:`P_1 \leq_{O(g(n))} P_2`, then :math:`P_1` has time complexity :math:`O(g(n) + h(n))`. So, given good transformations, both problems take at least :math:`\Omega(P_1)` and at most :math:`O(P_2)`. The Cost of Making a Simple Polygon ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ **SIMPLE POLYGON**: Given a set of :math:`n` points in the plane, find a simple polygon with those points as vertices. (Here, "simple" means that no lines cross.) We will show that **SORTING** :math:`\leq_{O(n)}` **SIMPLE POLYGON**. We start with an instance of **SORTING**: :math:`\{x_1, x_2, \cdots, x_n\}`. In linear time, find :math:`M = \max|x_i|`. Let :math:`C` be a circle centered at the origin, of radius :math:`M`. We will generate an instance of **SIMPLE POLYGON** by replacing each value in the array to be sorted with a corresponding point defined as .. math:: \{(x_1, \sqrt{M^2 - x_i^2}), \cdots, (x_n, \sqrt{M^2 - x_n^2})\}. .. inlineav:: ReduceSimpPCON dgm :links: AV/SeniorAlgAnal/ReduceSimpPCON.css :scripts: AV/SeniorAlgAnal/ReduceSimpPCON.js :align: center :keyword: Reductions Input to SORTING: the values 5, -3, 2, 0, 10. When converted to points, they fall on a circle as shown. It is an important fact that all of these points fall on :math:`C`. Furthermore, when we find a simple polygon, the points all fall along the circle in sort order. This is because the only simple polygon having all of its points on :math:`C` as vertices is the convex one. Therefore, by the Lower Bound Theorem, **SIMPLE POLYGON** is in :math:`\Omega(n \log n)`.