Big-O Notation

Introduction

Big-O notation is used to estimate time or space complexities of algorithms according to their input size. Big-O notation usually only provides an upper bound on the growth rate of the function, so people can expect the guaranteed performance in the worst case. Due to the reason, Big-O is more widely used by developers compared to Big-Theta and Big-Omega.

Definition

Let f and g be two functions defined on some subset of the real numbers. If there is a positive constant C such that for all sufficiently large values of n, the absolute value of f(n) is at most C multiplied by the absolute value of g(n). That is, f(n)=O(g(n)) if and only if there exists a positive real number C and a real number n0 such that
|f(n)|C|g(n)| for all nn0.

Let’s see examples for better understanding. There are two algorithms written in C for sum from 1 to n. (The comment of each line means the operation number.)

// Algorithm1
int calcSum(int n) {
	int i = 1; // 1
	int sum = 0; // 1
	for(; i<=n; ++i)
	{
		sum = sum+i;
	} // 3n+1
	return sum;
}
// Total operation number: 1+1+3n+1 = 3n+3
// Algorithm2
int calcSum(int n) {
	int count = n; // 1
	int sum = 1+n; // 1
	sum = sum*count; // 1
	sum = sum/2; // 1
	return sum;
}
// Total operation number: 1+1+1+1 = 4
Big-O notation for Algorithm1

There exists C and n0 such that |3n+3|C|n| for all nn0. (e.g. C=4,n0=3)
Therefore, 3n+3=O(n).

Big-O notation for Algorithm2

There exists C such that |4|C|1|. (e.g. C=4)
That means, 4=O(1).

Polynomial Expression

In real situations, we may face more complicated expressions than the examples above. Let’s suppose that the complexity function f(n) is n2+2n+1, and we need to simplify this expression using Big-O notation. To say the conclusion first, among the three terms n2, 2n, and 1, the one with the highest growth rate will be used as follows: O(n2).

Theorem

f(n)=O(nk), where f(n)=annk+an1nk1+...+a1n+a0 and a0,a1,a2,...,an1,an are real numbers.

Proof:
For all n1,
|f(n)|=|annk+an1nk1+...+a1n+a0| =|an|nk+|an1|nk1+...+|a1|n+|a0| =nk(|an|+|an1|/n+...+|a1|/nk1+|a0|/nk)
nk(|an|+|an1|+...+|a1|+|a0|)
Therefore, |f(n)|Cnk where C=|an|+|an1|+...+|a1|+|a0| and n0=1. Consequently, f(n)=O(nk)

Going back to f(n)=n2+2n+1, we can use the same approach in order to make Big-O notation for f(n)=n2+2n+1.

For all n1, (n0=1)
|f(n)|=n2+2n+1
=n2(1+2n1+n2)
n2(1+2+1)=4n2
That is, f(n)=O(n2) (C=4, n0=1)

Other Useful Theorems

Many algorithms consist of two or more sub-procedures. We can derive their proper Big-O notations using the sub-procedure’s Big-O.

Theorem 1

Let f1(n)=O(g1(n)) and f2(n)=O(g2(n)).
Then, (f1+f2)(n)=O(max(|g1(n)|,|g2(n)|)).

Proof:
By the definition of Big-O, there exists C1,C2,k1,k2 such that |f1(n)|C1|g1(n)| for all n>k1 and |f2(n)|C2|g2(n)| for all n>k2.
Also, let nmax(k1,k2), g(n)=max(|g1(n)|,|g2(n)|), and C=C1+C2.
|f1(n)+f2(n)|
C1|g1(n)|+C2|g2(n)|
C1|g(n)|+C2|g(n)|
=(C1+C2)|g(n)|
=C|g(n)|

Corollary:
(f1+f2)(n)=O(g(n)), where f1(n)=O(g(n)) and f2(n)=O(g(n)).

Theorem 2

Let f1(n)=O(g1(n)) and f2(n)=O(g2(n)).
Then, (f1f2)(n)=O(g1(n)g2(n)).

Proof:
Let nmax(k1,k2) and C=C1C2
|(f1f2)(n)|=|f1(n)||f2(n)|
C1|g1(n)|C2|g2(n)|
C1C2|(g1g2)(n)|
C|g1g2|(n)

References