Analysis of Algorithms

 

Ø   An algorithm is a clearly specified set of instructions the computer will follow to solve a problem.  We will look at the time analysis of algorithms, that is, how long an algorithm takes to run as a function of its input.

 

Ø   Start with a simple non-programming example, the stair-counting problem:

Suppose that you and a friend are at the top of a lighthouse and you wonder how many stairs there are to the bottom.  We'll look at three different methods that you could use to answer this question and we will analyze the time requirements of each.


Method 1: Walk down and keep a tally.   In this method, you take a pen and paper and walk down the stairs.  Each time you step down one stair you make a mark on the paper.  When you reach the bottom you run back up to the top of the lighthouse and show the paper to your friend.



Method 2: Walk down and let your friend keep a tally.  In this method you have no pen and paper so your friend keeps a tally by making marks in the dust at the top of the stairs.  You help by giving her a count of the stairs as you go down them; your procedure is as follows: each time you go down a step you lay your hat on that step and run back to your friend telling her to make a mark for that step.  You then run back down to your hat, move it to the next step and repeat the procedure.  This continues until you reach the bottom and you speed back up the steps one more time to look at the count.


Method 3: Ask the lighthouse keeper.  Just as the question is posed you see the lighthouse keeper and ask her what the stair count is.  She gives you a pen and paper and says, "The count is 2689, write it down so you don't forget."

 

Ø   Now that each method has been described we can analyze the time required by each.  Rather than measuring the actual elapsed time, we will count the operations performed while carrying out each method.  There are two kinds of operations that we will count, and we will count them equally even though they may take differing amounts of time:

(1) walking up or down one step counts as one operation

(2) making a mark or a symbol counts as one operation

 

Ø   In our time analysis we answer the question: How many operations are needed for each of the three methods?


Method 1:

Steps down:   2689
Steps up:   + 2689
Marks:      + 2689
------------------
Operations:   8067

 

Method 2:

Steps down:   3,616,705   (which is 1+2+...+2689)
Steps up:   + 3,616,705
Marks:      +     2,689
-----------------------
Operations:   7,236,099

 

Method 3:

Steps down:   0
Steps up:   + 0
Marks:      + 4
---------------
Operations:   4
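Each tally above can be reproduced with a short computation.  The following is a minimal C++ sketch (the variable names are our own, for illustration) that recomputes the three operation counts for a 2689-stair lighthouse:

#include <iostream>

int main()
{
    const long stairs = 2689;

    // Method 1: walk down once, run back up once, one mark per stair
    long method1 = stairs + stairs + stairs;                  // 8067

    // Method 2: 1+2+...+stairs steps down, the same number back up,
    // plus one mark per stair
    long steps = stairs * (stairs + 1) / 2;                   // 3,616,705
    long method2 = steps + steps + stairs;                    // 7,236,099

    // Method 3: no steps at all; one mark per digit of the answer
    long method3 = 0;
    for (long d = stairs; d > 0; d /= 10)
        method3++;                                            // 4

    std::cout << method1 << " " << method2 << " " << method3 << "\n";
    return 0;
}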

 

Ø   Doing a time analysis of an algorithm is similar to the analysis of the stair-counting methods: we measure time not as elapsed time but by counting the number of operations that must be performed, even though each operation may require a slightly different amount of time.  There is no precise definition of what constitutes an operation, although it should be a small step.  In analyzing programs, each program statement is typically considered to be one operation.

 


Ø   For most programs, the number of operations performed depends on the program's input.  This is also true of the stair-counting example: the number of operations performed depends on the number of stairs in the lighthouse.

 

Ø   When a method's time depends on the size of the input then that time can be expressed as a function of the input's size.  In the stair-counting example, if we let n represent the number of stairs in the lighthouse, then the time required for each method can be expressed as follows:

Method 1: 3n

Method 2: n + 2(1+2+3+...+n)

Method 3: the number of digits in n

 

Ø   The expression for the second method is not easy to interpret and can be simplified using the following trick:

    (1 + 2 + 3 + ... + n)
  + (n + ... + 3 + 2 + 1)
  -----------------------
    (n+1) + (n+1) + ... + (n+1)     (n columns, each summing to n+1)

The two rows together total n(n+1), so 1+2+3+...+n = n(n+1)/2.

Therefore, n + 2(1+2+3+...+n) can be written as:

n + 2n(n+1)/2 = n + n(n+1) = n² + 2n
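As a quick sanity check, the closed form can be compared against a brute-force sum.  This small sketch (ours, not part of the original analysis) confirms that n + 2(1+2+...+n) equals n² + 2n for the first few values of n:

#include <iostream>

int main()
{
    for (long n = 1; n <= 10; n++)
    {
        long sum = 0;
        for (long i = 1; i <= n; i++)     // compute 1+2+...+n directly
            sum += i;
        std::cout << n + 2 * sum << " == " << n * n + 2 * n << "\n";
    }
    return 0;
}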

 

Ø   The number of operations required for Method 3 is the number of digits in the number n.  When n is written in base 10 notation, this is approximately equal to the base-10 logarithm of n, log₁₀ n (more precisely, it is ⌊log₁₀ n⌋ + 1).
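Counting digits can itself be done with a short loop that repeatedly divides by 10; the helper below (our own, for illustration) also shows the logarithm formula giving the same answer:

#include <cmath>
#include <iostream>

// Number of base-10 digits in n, for n >= 1: one loop pass per digit
int digitCount(long n)
{
    int digits = 0;
    while (n > 0)
    {
        n /= 10;        // drop the last digit
        digits++;
    }
    return digits;
}

int main()
{
    std::cout << digitCount(2689) << "\n";                    // prints 4
    std::cout << std::floor(std::log10(2689)) + 1 << "\n";    // also prints 4
    return 0;
}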

 

Ø   With these simplifications the revised time analysis for each stair-counting method is:

Method 1: 3n

Method 2: n² + 2n

Method 3: log₁₀ n

 

                     Big-Oh Notation


Ø   Above we computed the exact number of operations required for each method, but often it is enough to know roughly how the number of operations is affected by the input size.  For example, if we apply the stair-counting methods to a lighthouse with 10 times as many steps as were in the first lighthouse, how would the time required by each counting method grow?

Method 1: the number of operations increases tenfold (from 3n to 3(10n)=30n).

Method 2: the number of operations increases approximately hundredfold (from about n² to about (10n)² = 100n²)

Method 3: the number of operations increases by 1 (from the number of digits in n to the number of digits in 10n)

 


Ø   We can express this kind of growth information using big-Oh notation.  In big-Oh notation, only the largest term in the formula is used, that is, the term with the largest exponent on n or the term that grows the fastest as n becomes large.  In big-Oh notation, all constant multipliers are omitted.  The time requirements of the stair-counting methods in big-Oh notation are as follows:

Method 1: O(n)        Linear time

Method 2: O(n²)       Quadratic time

Method 3: O(log n)    Logarithmic time

 

 


Ø   When a time analysis is expressed in big-Oh notation, the result is called the order of the algorithm.  The stair-counting example also illustrates one of the most important concepts behind the use of big-Oh notation: the order of the algorithm is generally more important than the speed of the processor.  For example, a very fast stair climber using a slow counting method is unlikely to beat a slowpoke who uses one of the faster counting methods.

 

Ø   The table from last week's lecture illustrates the fact that the growth rate (i.e., the order) of an algorithm is what matters most when n (i.e., the input size) becomes sufficiently large.


              Time Analysis of C++ Functions

Ø   An example of a linear-time algorithm: a function that finds the maximum value in an array of n elements.  In this example, an array of n elements is the input to the algorithm.

 

template <class Etype>
Etype max_element(const Etype array[ ], size_t size)
{
    Etype max = array[0];                       // one operation before the loop
    for (size_t i = 1; i < size; i++)           // body executes (size - 1) times
        max = (max < array[i]) ? array[i] : max;
    return max;                                 // one operation after the loop
}
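A minimal driver for the function (with our own sample data, and assuming the max_element template above is in scope) might look like this:

#include <iostream>

int main()
{
    int data[ ] = {3, 41, 26, 9, 58, 12};
    std::cout << max_element(data, 6) << "\n";   // prints 58
    return 0;
}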

 

Ø   Time analysis:


- The parameter size holds the size of the array (i.e., the input to the function), therefore, n is the value of size.

- Prior to the for-loop, there is one operation.

- After the for-loop there is one operation.

- The body of the for-loop consists of one statement (i.e., one operation) that is executed (n-1) times.

- Therefore, the total number of operations is 2 + (n-1) = n + 1, which is O(n) in big-Oh notation.

 

Ø   In fact, if the body of the for-loop had contained any fixed number of operations, say k operations, the function would require 2 + (n-1)k operations, which is still linear time.  This can be summarized as follows: a loop that does a fixed number of operations n times requires O(n) time.

 

Ø   An example of a quadratic-time algorithm: the insertion sort algorithm.

 

 


template <class Etype>
void insertionSort(Etype array[ ], size_t n)
{
    for (size_t p = 1; p < n; p++)          // outer loop executes n-1 times
    {
        Etype tmp = array[p];               // the element to be inserted
        size_t j;
        for (j = p; j > 0 && tmp < array[j-1]; j--)   // at most p passes
            array[j] = array[j-1];          // shift larger elements to the right
        array[j] = tmp;                     // drop tmp into its sorted position
    }
}
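Assuming the insertionSort template above is in scope, a short driver (the sample data is ours) would be:

#include <iostream>

int main()
{
    int data[ ] = {34, 8, 64, 51, 32, 21};
    insertionSort(data, 6);
    for (int i = 0; i < 6; i++)
        std::cout << data[i] << " ";    // prints: 8 21 32 34 51 64
    std::cout << "\n";
    return 0;
}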

 

Ø   Time analysis:

- There are 2 nested for-loops.  The body of the outer for-loop is executed n-1 times.


- The body of the inner for-loop is executed at most p times for each value of p.  Summing over all values of p gives a total of at most (1+2+3+...+n) = n(n+1)/2 = ½n² + ½n, which is O(n²).

 

Ø   A simple rule of thumb is that if you have 2 nested loops, each of which can be executed at most n times, then the algorithm is O(n²).

 

Ø   An example of a logarithmic-time algorithm: binary search.  If an input array is sorted and you want to find the index position of a particular value in that array, you can perform a binary search to find that position.

template <class Etype>
int binarySearch(const Etype array[ ], const Etype & value, size_t n)
{
    int low = 0, high = static_cast<int>(n) - 1, mid;
    while (low <= high)                  // search range is array[low..high]
    {
        mid = (low + high) / 2;          // midpoint of the current range
        if (array[mid] < value)
            low = mid + 1;               // discard the lower half
        else if (array[mid] > value)
            high = mid - 1;              // discard the upper half
        else
            return mid;                  // found: return its index
    }
    return -1;                           // this indicates the value was not found
}
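Note that binary search only works if the array is already sorted.  Assuming the binarySearch template above is in scope, a small driver (sample data ours) would be:

#include <iostream>

int main()
{
    int data[ ] = {2, 5, 9, 13, 21, 34, 55};          // must already be sorted
    std::cout << binarySearch(data, 13, 7) << "\n";   // prints 3
    std::cout << binarySearch(data, 10, 7) << "\n";   // prints -1 (not found)
    return 0;
}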

 

Ø   Time analysis:

- On each pass through the while-loop, the range of array positions to be searched is cut in half.


- Starting with n positions, after 1 pass through the while-loop we will have n/2 positions remaining to check; after 2 passes we will have ½ × n/2 = n/4 positions that still need to be checked; after 3 passes the number is reduced to n/8; and after k passes that number is n/2ᵏ.

- In the case of an unsuccessful search, the number of times, k, that the while-loop will be executed is bounded by the requirement that n/2ᵏ ≥ 1, that is, 2ᵏ ≤ n, thus k ≤ log₂ n.

- Therefore, the time required by this algorithm is O(log n) in the worst case.

 

Ø   A simple rule of thumb is that an algorithm is O(log n) if it repeatedly cuts the problem size in half using a constant number of operations each time.
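This rule can be seen directly by counting halvings.  The little sketch below (ours) counts how many times n can be cut in half before the problem size reaches 1:

#include <iostream>

int main()
{
    for (long n = 10; n <= 1000000; n *= 10)
    {
        int k = 0;
        for (long m = n; m > 1; m /= 2)   // one pass per halving
            k++;
        std::cout << "n = " << n << ": " << k << " halvings\n";
    }
    return 0;
}

For n = 1,000,000 the loop reports 19 halvings, close to log₂ 1,000,000 ≈ 19.9; each tenfold increase in n adds only 3 or 4 passes.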

 


Ø   In this discussion, we have talked about the worst-case performance of an algorithm.  This is measured by counting the maximum number of operations required over all inputs of a given size.  During a time analysis you may find that you are unable to provide an exact count of the operations required, but if the analysis is a worst-case analysis you may estimate the number of operations, always making sure that your estimate is on the high side.  Later on you will see average-case and best-case analyses as well.