# Pollard’s Rho Algorithm for Prime Factorization

Given a positive integer **n**, and that it is composite, find a divisor of it.

Example:

Input: n = 12; Output: 2 [OR 3 OR 4] Input: n = 187; Output: 11 [OR 17]

**Brute approach:** Test all integers less than n until a divisor is found.

**Improvisation: **Test all integers less than √n

A large enough number will still mean a great deal of work. **Pollard’s Rho** is a prime factorization algorithm, particularly fast for a large composite number with small prime factors. The Rho algorithm’s most remarkable success was the factorization of eighth Fermat number: 1238926361552897 * 93461639715357977769163558199606896584051237541638188580280321.

The Rho algorithm was a good choice because the first prime factor is much smaller than the other one.

**Concepts used in Pollard’s Rho Algorithm:**

- Two numbers x and y are said to be congruent modulo n (x = y modulo n) if
- their absolute difference is an integer multiple of n, OR,
- each of them leaves the same remainder when divided by n.

- The Greatest Common Divisor is the largest number which divides evenly into each of the original numbers.
- Birthday Paradox: The probability of two persons having same birthday is unexpectedly high even for small set of people.
- Floyd’s cycle-finding algorithm: If tortoise and hare start at same point and move in a cycle such that speed of hare is twice the speed of tortoise, then they must meet at some point.

**Algorithm**:

- Start with random x and c. Take y equal to x and f(x) = x
^{2}+ c. - While a divisor isn’t obtained
- Update x to f(x) (modulo n) [Tortoise Move]
- Update y to f(f(y)) (modulo n) [Hare Move]
- Calculate GCD of |x-y| and n
- If GCD is not unity
- If GCD is n, repeat from step 2 with another set of x, y and c
- Else GCD is our answer

**Illustration:**

Let us suppose **n = 187** and consider different cases for different random values.

- An Example of random values such that algorithm
**finds result**:

y = x = 2 and c = 1, Hence, our f(x) = x^{2}+ 1.

- An Example of random values such that algorithm
**finds result faster**:

y = x = 110 and ‘c’ = 183. Hence, our f(x) = x^{2}+ 183.

- An Example of random values such that algorithm
**doesn’t find result**:

x = y = 147 and c = 67. Hence, our f(x) = x^{2}+ 67.

Below is C/C++ implementation of above algorithm:

## C/C++

`/* C++ program to find a prime factor of composite using ` ` ` `Pollard's Rho algorithm */` `#include<bits/stdc++.h> ` `using` `namespace` `std; ` ` ` `/* Function to calculate (base^exponent)%modulus */` `long` `long` `int` `modular_pow(` `long` `long` `int` `base, ` `int` `exponent, ` ` ` `long` `long` `int` `modulus) ` `{ ` ` ` `/* initialize result */` ` ` `long` `long` `int` `result = 1; ` ` ` ` ` `while` `(exponent > 0) ` ` ` `{ ` ` ` `/* if y is odd, multiply base with result */` ` ` `if` `(exponent & 1) ` ` ` `result = (result * base) % modulus; ` ` ` ` ` `/* exponent = exponent/2 */` ` ` `exponent = exponent >> 1; ` ` ` ` ` `/* base = base * base */` ` ` `base = (base * base) % modulus; ` ` ` `} ` ` ` `return` `result; ` `} ` ` ` `/* method to return prime divisor for n */` `long` `long` `int` `PollardRho(` `long` `long` `int` `n) ` `{ ` ` ` `/* initialize random seed */` ` ` `srand` `(` `time` `(NULL)); ` ` ` ` ` `/* no prime divisor for 1 */` ` ` `if` `(n==1) ` `return` `n; ` ` ` ` ` `/* even number means one of the divisors is 2 */` ` ` `if` `(n % 2 == 0) ` `return` `2; ` ` ` ` ` `/* we will pick from the range [2, N) */` ` ` `long` `long` `int` `x = (` `rand` `()%(n-2))+2; ` ` ` `long` `long` `int` `y = x; ` ` ` ` ` `/* the constant in f(x). ` ` ` `* Algorithm can be re-run with a different c ` ` ` `* if it throws failure for a composite. */` ` ` `long` `long` `int` `c = (` `rand` `()%(n-1))+1; ` ` ` ` ` `/* Initialize candidate divisor (or result) */` ` ` `long` `long` `int` `d = 1; ` ` ` ` ` `/* until the prime factor isn't obtained. ` ` ` `If n is prime, return n */` ` ` `while` `(d==1) ` ` ` `{ ` ` ` `/* Tortoise Move: x(i+1) = f(x(i)) */` ` ` `x = (modular_pow(x, 2, n) + c + n)%n; ` ` ` ` ` `/* Hare Move: y(i+1) = f(f(y(i))) */` ` ` `y = (modular_pow(y, 2, n) + c + n)%n; ` ` ` `y = (modular_pow(y, 2, n) + c + n)%n; ` ` ` ` ` `/* check gcd of |x-y| and n */` ` ` `d = __gcd(` `abs` `(x-y), n); ` ` ` ` ` `/* retry if the algorithm fails to find prime factor ` ` ` `* with chosen x and c */` ` ` `if` `(d==n) ` `return` `PollardRho(n); ` ` ` `} ` ` ` ` ` `return` `d; ` `} ` ` ` `/* driver function */` `int` `main() ` `{ ` ` ` `long` `long` `int` `n = 10967535067; ` ` ` `printf` `(` `"One of the divisors for %lld is %lld."` `, ` ` ` `n, PollardRho(n)); ` ` ` `return` `0; ` `} ` |

*chevron_right*

*filter_none*

## Python3

`# Python 3 program to find a prime factor of composite using ` `# Pollard's Rho algorithm ` `import` `random ` `import` `math ` ` ` `# Function to calculate (base^exponent)%modulus ` `def` `modular_pow(base, exponent,modulus): ` ` ` ` ` `# initialize result ` ` ` `result ` `=` `1` ` ` ` ` `while` `(exponent > ` `0` `): ` ` ` ` ` `# if y is odd, multiply base with result ` ` ` `if` `(exponent & ` `1` `): ` ` ` `result ` `=` `(result ` `*` `base) ` `%` `modulus ` ` ` ` ` `# exponent = exponent/2 ` ` ` `exponent ` `=` `exponent >> ` `1` ` ` ` ` `# base = base * base ` ` ` `base ` `=` `(base ` `*` `base) ` `%` `modulus ` ` ` ` ` `return` `result ` ` ` `# method to return prime divisor for n ` `def` `PollardRho( n): ` ` ` ` ` `# no prime divisor for 1 ` ` ` `if` `(n ` `=` `=` `1` `): ` ` ` `return` `n ` ` ` ` ` `# even number means one of the divisors is 2 ` ` ` `if` `(n ` `%` `2` `=` `=` `0` `): ` ` ` `return` `2` ` ` ` ` `# we will pick from the range [2, N) ` ` ` `x ` `=` `(random.randint(` `0` `, ` `2` `) ` `%` `(n ` `-` `2` `)) ` ` ` `y ` `=` `x ` ` ` ` ` `# the constant in f(x). ` ` ` `# Algorithm can be re-run with a different c ` ` ` `# if it throws failure for a composite. ` ` ` `c ` `=` `(random.randint(` `0` `, ` `1` `) ` `%` `(n ` `-` `1` `)) ` ` ` ` ` `# Initialize candidate divisor (or result) ` ` ` `d ` `=` `1` ` ` ` ` `# until the prime factor isn't obtained. ` ` ` `# If n is prime, return n ` ` ` `while` `(d ` `=` `=` `1` `): ` ` ` ` ` `# Tortoise Move: x(i+1) = f(x(i)) ` ` ` `x ` `=` `(modular_pow(x, ` `2` `, n) ` `+` `c ` `+` `n)` `%` `n ` ` ` ` ` `# Hare Move: y(i+1) = f(f(y(i))) ` ` ` `y ` `=` `(modular_pow(y, ` `2` `, n) ` `+` `c ` `+` `n)` `%` `n ` ` ` `y ` `=` `(modular_pow(y, ` `2` `, n) ` `+` `c ` `+` `n)` `%` `n ` ` ` ` ` `# check gcd of |x-y| and n ` ` ` `d ` `=` `math.gcd(` `abs` `(x ` `-` `y), n) ` ` ` ` ` `# retry if the algorithm fails to find prime factor ` ` ` `# with chosen x and c ` ` ` `if` `(d ` `=` `=` `n): ` ` ` `return` `PollardRho(n) ` ` ` ` ` `return` `d ` ` ` `# Driver function ` `if` `__name__ ` `=` `=` `"__main__"` `: ` ` ` ` ` `n ` `=` `10967535067` ` ` `print` `(` `"One of the divisors for"` `, n , ` `"is "` `,PollardRho(n)) ` ` ` `# This code is contributed by chitranayal ` |

*chevron_right*

*filter_none*

Output:

One of the divisors for 10967535067 is 104729

**How does this work?**

Let **n be a composite (non-prime)**. Since n is composite, it has a non trivial factor **f** < n. In fact, there is at least one **f <= √n **.

Now suppose we have to pick two numbers x and y from the range [0, n-1]. The only time we get x = y modulo n is when x and y are identical. However, since f < √n, there is a good chance x = y modulo f even when x and y are not identical (Birthday Paradox).

We begin by randomly selecting x with replacement from the set {0, 1, …, n-1} to form a sequence s_{1}, s_{2}, s_{3} … Defining** ś _{i} = s_{i} mod f**, our sequence now has each ś

_{i}belonging to {0, 1, …, f-1}. Because both the sets are finite, eventually there will be a repeated integer in both. We expect to achieve the repeat earlier in ś

_{i}, since f < n.

Now, say ś_{i} = ś_{j} for i ≠ j, then, s_{i} = s_{j} modulo d. And hence, |s_{i} – s_{j}| will be a multiple of f. As per assumed above, n is also a multiple of f. Together this means that GCD of |s_{i} – s_{j}| and n will be positive integral multiple of f, and also our **candidate divisor d!** The catch here is that we just knew there had to be some divisor of n, and we didn’t even care of its value. Once we hit s_{i} and s_{j} (our final x and y) then each element in the sequence starting with s_{i} will be congruent modulo f to the corresponding element in the sequence starting with s_{j}, and hence, a cycle. If we graph the sequence s_{i}, we will observe the shape of Greek letter Rho (ρ).

At the heart of Rho algorithm is picking up random values and evaluating GCDs. To decrease the costly GCD calculations, we can implement the Pollard’s Rho with Floyd’s cycle detection (which can be understood with the tortoise-hare analogy where the tortoise moves through each element one at a time in order, and the hare starts at the same point but moves twice as fast as the tortoise). We shall have some polynomial f(x) for the same, start with random x_{0}, y_{0} = x_{0}, and compute x_{i+1} = f(x_{i}) and y_{i+1} = f(f(y_{i})). Since we don’t know much about d, a typical choice for the polynomial is f(x) = x^{2} + c (modulo n) (Yes, ‘c’ **is also be chosen randomly**).

**Note:**

- Algorithm will run indefinitely for prime numbers.
- The algorithm may not find the factors and return a failure for composite n. In that case, we use a different set of x, y and c and try again.
- The above algorithm only finds a divisor. To find a prime factor, we may
**recursively factorize**the divisor d, run algorithm for d and n/d. The cycle length is typically of the order √d.

**Time Complexity analysis: **

The algorithm offers a trade-off between its running time and the probability that it finds a factor. A prime divisor can be achieved with a probability around 0.5, in O(√d) <= O(n^{1/4}) iterations. This is a heuristic claim, and rigorous analysis of the algorithm remains open.

This article is contributed by **Yash Varyani**. Please write comments if you find anything incorrect, or you want to share more information about the topic discussed above

Don’t stop now and take your learning to the next level. Learn all the important concepts of Data Structures and Algorithms with the help of the most trusted course: **DSA Self Paced**. Become industry ready at a student-friendly price.

## Recommended Posts:

- Trial division Algorithm for Prime Factorization
- Count occurrences of a prime number in the prime factorization of every element from the given range
- Wheel Factorization Algorithm
- Sum of Factors of a Number using Prime Factorization
- Prime Factorization using Sieve O(log n) for multiple queries
- Generating all divisors of a number using its prime factorization
- New Algorithm to Generate Prime Numbers from 1 to Nth Number
- How to generate Large Prime numbers for RSA Algorithm
- Fermat's Factorization Method
- Euler's Factorization method
- Dixon's Factorization Method with implementation
- Fermat's Factorization method for large numbers
- Check if a number is Prime, Semi-Prime or Composite for very large numbers
- Permutation of first N positive integers such that prime numbers are at prime indices
- Permutation of first N positive integers such that prime numbers are at prime indices | Set 2
- Find coordinates of a prime number in a Prime Spiral
- Print prime numbers with prime sum of digits in an array
- Count all prime numbers in a given range whose sum of digits is also prime
- Check if a prime number can be expressed as sum of two Prime Numbers
- Check whether the sum of prime elements of the array is prime or not