The authors introduce the concept of dynamic programming in this chapter where programming refers to tabular method. Unlike divide-and-conquer technique, where a problem is divided into independent sub-problems, dynamic programming is applicable when the sub-problems are not independent and share common sub-sub-problems. A divide-and-conquer approach does more work than necessary solving the common sub-sub-problems repeatedly. A dynamic programming algorithm solves every sub-sub-problem once and stores the result in a table thereby avoiding the work of recomputing the answer every time the sub-sub-problem is encountered. This simple technique sometimes transforms exponential-time algorithms to polynomial-time algorithms. Dynamic programming is typically applicable to optimization problems where there are multiple solutions with one value attached to each solution and we need to find the solution with an optimal value.
The development of a dynamic programming algorithm is divided in four steps:
The authors then apply the above steps to this problem as below:
$$ P(n) = \begin{cases}
1 & \text{ if }n = 1,\\
\displaystyle\sum\limits_{k=1}^{n-1}P(k) \times P(n-k) & \text{ if }n >= 2,
\end{cases} $$
The authors then apply the dynamic programming steps to the problem as below:
The development of a dynamic programming algorithm is divided in four steps:
- Characterize the structure of an optimal solution.
- Recursively define the value of an optimal solution.
- Compute the value of an optimal solution in a bottom-up approach.
- Construct an optimal solution from computed information.
The authors then apply the above steps to this problem as below:
- The structure of an optimal solution: The authors prove that an optimal solution to the problem contains an optimal solution to sub-problem, i.e. fastest way through station $ S_{i,j} $ contains the fastest way through either $ S_{1, j-1} $ or $ S_{2, j-1} $.
- The recursive solution: The authors define $ f_i[j] $ to be the fastest time to get a chassis all the way from start to station $ S_{i,j} $ and derive following recursive relationship
- $$ f_1[j] = \begin{cases}
e_1 + a_{1,1}, & \text{ if }j = 1 \\
min(f_1[j-1]+a_{1,j},f_2[j-1]+t_{2,j-1}+a_{1,j}), & \text{ if }j >= 2 \end{cases} $$ - $$ f_2[j] = \begin{cases} e_2 + a_{2,1}, & \text{ if } j = 1 \\ min(f_2[j-1]+a_{2,j},f_1[j-1]+t_{1,j-1}+a_{2,j}), & \text{ if } j >= 2 \end{cases} $$
- Computing the fastest times: The authors provide an algorithm which finds the fastest assembly line in $ \Theta(n) $ time.
- Constructing the fastest way: The authors then show the solution for a sample problem given.
$$ P(n) = \begin{cases}
1 & \text{ if }n = 1,\\
\displaystyle\sum\limits_{k=1}^{n-1}P(k) \times P(n-k) & \text{ if }n >= 2,
\end{cases} $$
The authors then apply the dynamic programming steps to the problem as below:
- The structure of an optimal parenthesization: The authors prove that the optimal solution for parenthesization of matrix chain $A_iA_{i+1}...A_j$ already contain the optimal solution for parenthesization of matrix chain $A_iA_{i+1}...A_k$ and $A_{k+1}...A_j$ where $i<=k<j$.
- A recursive solution: Let $m[i,j]$ be the minimum number of scalar multiplications needed to compute the matrix $A_{i...j}$. It can be recursively defined as follows:
- $$ m[i,j] = \begin{cases}
0, & \text{ if }i=j \\
min_{i<=k<j}\{m[i,k] + m[k+1,j] + p_{i-1}p_{k}p_{j}\}, & \text{ if } i<j
\end{cases}$$ - Computing the optimal costs: The authors point out that deciding $i$ and $j$ such that $1 <= i <= j <= n$ requires $ \Theta(n) $ choices in all and provide an algorithm to compute $m[1,n]$ with a running time of $ O(n^3) $.
- Constructing an optimal solution: The authors provide a method that prints the parenthesization required to generate the minimum number of scalar multiplication to get the matrix multiplication.
- Optimal substructure: If optimal solution to the problem contains optimal solution to sub-problems, the problem is a probable candidate for dynamic programming (and greedy method too).
- Overlapping sub-problems: The space of the sub-problems must be small and a recursive algorithm must be solving the same sub-problem over and over. Typically, the total number of distinct sub-problems is a polynomial in input size.
- Characterizing the longest common sub-sequence: If $ X = <x_1,x_2,...,x_m> $ and $ Y = <y_1,y_2,...,y_n> $ be sequences, and let $ Z = <z_1,z_2,...,z_k> $ be any longest common prefix of $X$ and $Y$. Then one of the following three holds:
- If $ x_m = y_n $ then $ z_k = x_m = y_n $ and $ Z_{k-1} $ is a longest common sub-sequence of $X_{m-1}$ and $Y_{n-1}$.
- If $ x_m != y_n $ then $ z_k != x_m $ implies that $ Z $ is a longest common sub-sequence of $X_{m-1}$ and $Y$.
- If $ x_m != y_n $ then $ z_k != y_n $ implies that $ Z $ is a longest common sub-sequence of $X$ and $Y_{n-1}$.
- A recursive solution: If the length of a longest common sub-sequence of the sequences $ X_{i} $ and $ Y_{j} $ is defined as $ c[i,j] $ then the following recursive formula can be derived based on the observation in 1 above
- $$ c[i,j] =
\begin{cases} 0 & \text{ if } i=0 \text{ or } j=0,\\
c[i-1,j-1] + 1 & \text{ if } i,j > 0 \text{ and } x_i = y_j, \\
max(c[i-1,j],c[i,j-1]) & \text{ if } i,j > 0 \text{ and } x_i \ne y_j.
\end{cases}$$
- Computing the length of a longest common sub-sequence: The authors then provide a $ O(m \times n) $ algorithm to compute the length of a longest common sub-sequence.
- Constructing a longest common sub-sequence: The authors then provide an algorithm to print the longest common sub-sequence from the tables generated in 3.
- The structure of an optimal binary search tree: The authors describe that the optimal binary search tree will also contain optimal binary search trees as sub-trees rooted at the left and right child of the root of the tree.
- A recursive solution: If $ e[i,j] $ is the expected cost of searching an optimal binary search tree containing the keys $ k_i,...,k_j $ and if the sum of all the probabilities in the sub-tree with keys $ k_i,...,k_j $ is $ w(i,j) = \displaystyle\sum\limits_{l=i}^{j}p_l + \sum\limits_{l=i-1}^{j}q_l $ then $$ e[i,j] = \begin{cases} q_{i-1} & \text{ if } j=i-1, \\
min_{i<=r<=j} { e[i,r-1] + e[r+1,j] + w[i,j] } & \text{ if } i<=j. \end{cases} $$
- Computing the expected search cost of an optimal binary search tree: The authors give $ \Omega (n^3) $ time algorithm to calculate the expected search cost of an optimal binary search tree.
- Constructing an optimal binary search tree: The authors leave it to an exercise to write an algorithm to output an optimal binary search tree constructed from the root table generated in 3.
No comments:
Post a Comment