Introduction
Big-O notation is used to estimate time or space complexities of algorithms according to their input size. Big-O notation usually only provides an upper bound on the growth rate of the function, so people can expect the guaranteed performance in the worst case. Due to the reason, Big-O is more widely used by developers compared to Big-Theta and Big-Omega.
Definition
Let f and g be two functions defined on some subset of the real numbers.
If there is a positive constant C such that for all sufficiently large values of n, the absolute value of f(n) is at most C multiplied by the absolute value of g(n). That is, f(n)=O(g(n)) if and only if there exists a positive real number C and a real number n0 such that
|f(n)|≤C|g(n)|
for all
n≥n0.
Let’s see examples for better understanding. There are two algorithms written in C for sum from 1 to n. (The comment of each line means the operation number.)
// Algorithm1
int calcSum(int n) {
int i = 1; // 1
int sum = 0; // 1
for(; i<=n; ++i)
{
sum = sum+i;
} // 3n+1
return sum;
}
// Total operation number: 1+1+3n+1 = 3n+3
// Algorithm2
int calcSum(int n) {
int count = n; // 1
int sum = 1+n; // 1
sum = sum*count; // 1
sum = sum/2; // 1
return sum;
}
// Total operation number: 1+1+1+1 = 4
Big-O notation for Algorithm1
There exists C and n0 such that
|3n+3|≤C|n|
for all
n≥n0. (e.g. C=4,n0=3)
Therefore, 3n+3=O(n).
Big-O notation for Algorithm2
There exists C such that
|4|≤C|1|. (e.g. C=4)
That means, 4=O(1).
Polynomial Expression
In real situations, we may face more complicated expressions than the examples above. Let’s suppose that the complexity function f(n) is n2+2n+1, and we need to simplify this expression using Big-O notation. To say the conclusion first, among the three terms n2, 2n, and 1, the one with the highest growth rate will be used as follows: O(n2).
Theorem
f(n)=O(nk), where f(n)=annk+an−1nk−1+...+a1n+a0 and a0,a1,a2,...,an−1,an are real numbers.
Proof:
For all n≥1,
|f(n)|=|annk+an−1nk−1+...+a1n+a0|
=|an|nk+|an−1|nk−1+...+|a1|n+|a0|
=nk(|an|+|an−1|/n+...+|a1|/nk−1+|a0|/nk)
≤nk(|an|+|an−1|+...+|a1|+|a0|)
Therefore, |f(n)|≤Cnk where C=|an|+|an−1|+...+|a1|+|a0| and n0=1. Consequently, f(n)=O(nk)
Going back to f(n)=n2+2n+1, we can use the same approach in order to make Big-O notation for f(n)=n2+2n+1.
For all n≥1, (n0=1)
|f(n)|=n2+2n+1
=n2(1+2n−1+n−2)
≤n2(1+2+1)=4n2
That is, f(n)=O(n2) (C=4, n0=1)
Other Useful Theorems
Many algorithms consist of two or more sub-procedures. We can derive their proper Big-O notations using the sub-procedure’s Big-O.
Theorem 1
Let f1(n)=O(g1(n)) and f2(n)=O(g2(n)).
Then, (f1+f2)(n)=O(max(|g1(n)|,|g2(n)|)).
Proof:
By the definition of Big-O, there exists C1,C2,k1,k2
such that |f1(n)|≤C1|g1(n)| for all n>k1 and |f2(n)|≤C2|g2(n)| for all n>k2.
Also, let n≥max(k1,k2), g(n)=max(|g1(n)|,|g2(n)|), and C=C1+C2.
|f1(n)+f2(n)|
≤C1|g1(n)|+C2|g2(n)|
≤C1|g(n)|+C2|g(n)|
=(C1+C2)|g(n)|
=C|g(n)|
Corollary:
(f1+f2)(n)=O(g(n)), where f1(n)=O(g(n)) and f2(n)=O(g(n)).
Theorem 2
Let f1(n)=O(g1(n)) and f2(n)=O(g2(n)).
Then, (f1f2)(n)=O(g1(n)g2(n)).
Proof:
Let n≥max(k1,k2) and C=C1C2
|(f1f2)(n)|=|f1(n)||f2(n)|
≤C1|g1(n)|C2|g2(n)|
≤C1C2|(g1g2)(n)|
≤C|g1g2|(n)
References
- Big O notation, Wikipedia, https://en.wikipedia.org/wiki/Big_O_notation
- Discrete Mathematics and its Applications 5th edition, Kenneth H. Rosen, McGraw-Hill Education
- Data Structures for Game Programmers, Ron Penton, PremierPress
- Big-O Notation, openparadigm’s blog, http://openparadigm.tistory.com/20