Always choose the best choice every single step. Never look back, the past is in the past.

Applicability

The greedy algorithm may be used when the problem has the two characteristics:

If a problem doesn't have one of these characteristics, then using greedy algorithm will usually not get the best answer, for greedy algorithm often gets an answer too early.

Activity Selection Problem

Now given several activities all need to use the same playground, and their scheduled start & end time are also known. Make an arrangement to get most activities arranged.

Untitled

Use $a_i(i=1,2,\cdots,n)$ to present activities, $s_i$ is the start time of $a_i$ and $f_i$ is the end time of it. $S_{ij}$ means activities between time points $i$ and $j$. This problem can be solved using greedy algorithm.

Optimal Sub-structure

Assume $a_k$ is in the maximum compatible activities subset of $S_{ij}$—$A_{ij}$, then $S_{ik}$ and $S_{kj}$ will also have their maximum compatible subsets $A_{ik}$ and $A_{kj}$. Otherwise, there'll be a better choice $A'$ in $S_{ik}$ ($S_{kj}$) which can make a better choice $A_{ij}'$. It's impossible.

Greedy Choice Property

We follow this rule: always pick the activities with the earliest end time. For $S_{ij}$, we pick $f_m=\min\{f_k|a_k\in S_{ij}\}$. Then:

  1. $S_{im}=\empty$, then the only non-empty sub-problem of $S_{ij}$ is $S_{mj}$. It's obvious.

  2. $a_m$ will be included in one of $S_{ij}$'s maximum compatible activities subset. Proof:

    Assume $A_{ij}$ is a maximum compatible subset of $S_{ij}$. Now we sort the activities in it by their end time. Pick the earliest one $a_k$. Then:

    1. If $a_k$ is $a_m$, then $a_m$ is in $A_{ij}$. QED.
    2. Otherwise, use $a_m$ to replace $a_k$ in $A_{ij}$, and we get $A_{ij}'$. Because $a_m$ is the first activity to end in $S_{ij}$, which means $a_k$ will end no earlier than $a_m$, namely $A_{ij}'$ will also be a maximum compatible subset, and $a_m$ is in it. QED.

To sum up, by always choosing the first activity that ends first, we will get a maximum compatible subset of $S_{ij}$.

Huffman Coding

The process of using Huffman coding is introduced in both Discrete Mathematics and Data Structure, so omitted here...