Given two strings S and T, count the distinct occurrences of T in S as a subsequence.
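Input: S = banana, T = ban
Output: 3
Explanation: T appears in S as the three subsequences below (a gap stands for skipped characters of S).
[ban], [ba n], [b an]

Input: S = geeksforgeeks, T = ge
Output: 6
Explanation: T appears in S as the six subsequences below. Counting positions from 0, the g at position 0 pairs with the e at positions 1, 2, 9 and 10, and the g at position 8 pairs with the e at positions 9 and 10.
[ge], [ ge], [g e], [g e], [g e] and [ g e]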
Approach: Create a recursive function that returns the number of subsequences of S that match T. Here m is the length of T and n is the length of S. The problem can be defined recursively as follows.
- If T is an empty string, return 1, because the empty string is a subsequence of every string.
- If S is an empty string (and T is not), return 0, because a non-empty string cannot be a subsequence of an empty string.
- If the last characters of S and T do not match, remove the last character of S and call the recursive function again. The last character of S cannot be part of a matching subsequence, so it can be dropped and the remaining characters checked.
- If the last characters of S and T match, there are two possibilities: subsequences in which the last character of S is used, and subsequences in which it is not. The required value is the sum of both. Call the recursive function once with the last character of both strings removed, and once with only the last character of S removed.
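The four rules above can be sketched in Python as follows. This is a minimal illustration rather than a reference implementation; the function names `count_recursive` and `count` are hypothetical, and prefix lengths are passed instead of slicing the strings.

```python
def count_recursive(s: str, t: str) -> int:
    """Count occurrences of t in s as a subsequence (plain recursion).

    Hypothetical helper for illustration; exponential time in the worst case.
    """

    def count(n: int, m: int) -> int:
        # n and m are the lengths of the prefixes of s and t still considered.
        if m == 0:
            # An empty t is a subsequence of every string.
            return 1
        if n == 0:
            # A non-empty t cannot be found in an empty s.
            return 0
        if s[n - 1] != t[m - 1]:
            # Last characters differ: drop the last character of s.
            return count(n - 1, m)
        # Last characters match: either use the last character of s or skip it.
        return count(n - 1, m - 1) + count(n - 1, m)

    return count(len(s), len(t))
```

For the first example, `count_recursive("banana", "ban")` evaluates to 3. Because the same `(n, m)` pair can be reached along many paths, this version repeats a lot of work on longer inputs.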
In the recursion tree for this approach, blue rounded rectangles mark accepting states (a subsequence has been found) and red rounded rectangles mark states from which no subsequence can be formed.
Since the recurrence above has overlapping subproblems, a Dynamic Programming approach can be applied. Store the results of subproblems in a hash map or an array and return the stored value when the function is called again with the same arguments.
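A top-down version of this idea might look like the sketch below, which keeps the results in a dictionary keyed by the pair of prefix lengths. The names `count_memoized`, `count` and `memo` are assumptions made for illustration.

```python
def count_memoized(s: str, t: str) -> int:
    """Count occurrences of t in s as a subsequence, caching subproblems.

    Hypothetical helper for illustration; uses a dict as the hash map.
    """
    memo: dict[tuple[int, int], int] = {}

    def count(n: int, m: int) -> int:
        # n and m are the lengths of the prefixes of s and t still considered.
        if m == 0:
            return 1
        if n == 0:
            return 0
        key = (n, m)
        if key not in memo:
            # Occurrences that skip the last character of s.
            result = count(n - 1, m)
            if s[n - 1] == t[m - 1]:
                # Occurrences that use the last character of s.
                result += count(n - 1, m - 1)
            memo[key] = result
        return memo[key]

    return count(len(s), len(t))
```

Each `(n, m)` pair is computed at most once, so the work is bounded by the number of pairs. The recursion depth still grows with the length of `s`, so very long inputs can exceed Python's default recursion limit; the bottom-up table avoids that.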
- Create a 2D array mat[m+1][n+1], where m is the length of string T and n is the length of string S. mat[i][j] denotes the number of distinct occurrences of the prefix T(1..i) as a subsequence of the prefix S(1..j), so mat[m][n] contains the solution.
- Initialize the first column with all 0s. An empty string cannot have a non-empty string as a subsequence.
- Initialize the first row with all 1s, including mat[0][0]. An empty string is a subsequence of every string.
- Fill the matrix in a bottom-up manner, i.e. all subproblems of the current prefixes are calculated first.
- Traverse the string T from start to end (0-based counter i).
- For every iteration of the outer loop, traverse the string S from start to end (0-based counter j).
- If the character at index i of T matches the character at index j of S, the value is the sum of two cases: the occurrences that do not use the last character of S, and the occurrences found after dropping the last characters of both, i.e. mat[i+1][j+1] = mat[i+1][j] + mat[i][j].
- Otherwise the value is the same as when the character at index j of S is removed, i.e. mat[i+1][j+1] = mat[i+1][j].
- Print the value of mat[m][n] as the answer.
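The tabulation steps can be sketched as below. The function name `count_subsequences` is hypothetical. The loop counters `i` and `j` are 0-based indices into T and S, so the cell being filled is `mat[i + 1][j + 1]`, and the function returns `mat[m][n]`, the entry for the full strings.

```python
def count_subsequences(s: str, t: str) -> int:
    """Count occurrences of t in s as a subsequence (bottom-up table).

    Hypothetical helper for illustration.
    mat[i][j] = occurrences of the first i characters of t
                in the first j characters of s.
    """
    m, n = len(t), len(s)

    # All cells start at 0, which also covers the first column.
    mat = [[0] * (n + 1) for _ in range(m + 1)]

    # First row: the empty prefix of t occurs once in every prefix of s.
    for j in range(n + 1):
        mat[0][j] = 1

    for i in range(m):
        for j in range(n):
            if t[i] == s[j]:
                # Skip s[j], or match s[j] with t[i].
                mat[i + 1][j + 1] = mat[i + 1][j] + mat[i][j]
            else:
                # s[j] cannot be matched with t[i], so skip it.
                mat[i + 1][j + 1] = mat[i + 1][j]

    return mat[m][n]
```

With the examples above, `count_subsequences("banana", "ban")` gives 3 and `count_subsequences("geeksforgeeks", "ge")` gives 6.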
- Time Complexity: O(m*n).
Each cell of the matrix is computed once in constant time, so the time complexity is O(m*n).
- Auxiliary Space: O(m*n).
A matrix of size (m+1)*(n+1) is needed, so the space complexity is O(m*n).
Note: Since mat[i][j] depends only on elements of the current row and the previous row, auxiliary space can be reduced from about m*n cells to about 2*n by keeping just two rows.
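One way to apply this note is sketched below, keeping only the previous and current rows of the table. The names `count_two_rows`, `prev` and `curr` are assumptions made for illustration.

```python
def count_two_rows(s: str, t: str) -> int:
    """Count occurrences of t in s as a subsequence using two rows.

    Hypothetical helper for illustration; O(n) auxiliary space, n = len(s).
    """
    n = len(s)

    # Row for the empty prefix of t: one occurrence in every prefix of s.
    prev = [1] * (n + 1)

    for ch in t:
        # Row for the prefix of t ending at ch; curr[0] stays 0 because
        # a non-empty string never occurs in an empty prefix of s.
        curr = [0] * (n + 1)
        for j in range(n):
            # Always carry over the count that skips s[j];
            # add the diagonal value from prev when s[j] matches ch.
            curr[j + 1] = curr[j] + (prev[j] if ch == s[j] else 0)
        prev = curr

    return prev[n]
```

The running time is unchanged at O(m*n); only the storage shrinks, at the cost of no longer having the full table available afterwards.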
This article is contributed by Utkarsh Trivedi.