1977. Number of Ways to Separate Numbers
Problem Description
In this problem, we have a string num
which consists of concatenated positive integers. We're told that the original list of integers was written in a non-decreasing order and none of the integers had leading zeros. The challenge is to figure out how many different lists of positive integers could have been combined to form the initial string num
.
Since the answer could be very large, we need to return it modulo 10^9 + 7
.
To illustrate with an example, if the input string num
is "327", there are two possible lists of non-decreasing integers: [3, 27] and [327]. Hence the function would return 2.
Intuition
Approaching this problem requires that we solve it piece by piece. We need to determine under what conditions we can split the string num
into different integers such that they maintain a non-decreasing order. We can start from the first character and continue appending the following characters until we get a valid number, then process the remainder of the string in a similar manner.
Here's how we might think about the solution:
-
Dynamic Programming (DP): We can consider this problem as one that can be solved using dynamic programming. We try to build up the number of ways to form the string
num
from smaller subproblems based on the length of the last number used. -
Longest Common Prefix (LCP): To maintain the non-decreasing order of numbers, we need a way to compare the numbers in
num
efficiently. The LCP (Longest Common Prefix) array can be used to store the length of the longest common prefix between two substrings of num. This helps us compare numbers quickly without generating all substrings, which would be inefficient. -
Construction of DP Table: We construct a 2D DP table with
dp[i][j]
meaning the number of ways to form the firsti
digits ofnum
where the last number used hasj
digits. -
Transition: We consider two cases for the transition. Either the last two numbers used are the same length - in which case we can form a new list of integers by using the same length of the number only if the two numbers are in non-decreasing order. Or, the last number used is shorter than the previous - in which case you can always append it because it won’t violate the non-decrease constraint.
-
Base Case: We initiate
dp[0][0]
as 1 because there is one way to form an empty string with an empty list.
This solution iteratively builds upon smaller subproblems and combines their results to find the total number of combinations for larger lengths of the string num
. At each step, it ensures that numbers are added in a non-decreasing order and without leading zeros.
Learn more about Dynamic Programming patterns.
Solution Approach
The solution utilizes a nested function cmp
to compare substrings of num
, and a Dynamic Programming (DP) table to keep track of the number of valid combinations that can be formed up to certain points in the string num
.
Let's walk through the components of the implementation:
-
Longest Common Prefix (LCP) Calculation: Before the main DP logic, the function computes the LCP array using a nested loop. For each pair of indices
i
andj
in the stringnum
,lcp[i][j]
is calculated to store the length of the longest common prefix between substrings starting fromi
andj
. This pre-computation is essential for thecmp
function. -
Comparison function (
cmp
): The functioncmp(i, j, k)
is used to compare two numbers within the stringnum
of lengthk
starting fromi
andj
respectively. This function will indirectly utilize the precomputed LCP values to efficiently determine if the first number starting at indexi
is greater than or equal to the second number starting at indexj
. -
Dynamic Programming: The 2D DP table
dp
is created to keep track of valid combinations.dp[i][j]
represents the number of ways to partitionnum[0:i]
such that the last number used hasj
digits. -
DP Initialization: The
dp
table is initialized such thatdp[0][0]
is 1, since there is exactly one way to create an empty partition. -
DP Iteration: The code iterates through each end index
i
of the substring and considers all possible lengthsj
for the last number. Inside this nested loop, the algorithm decides whether the substring of lengthj
ending ati
can form a valid number (it cannot start with '0') and if it maintains the non-decreasing property compared to the previous number. -
DP Transition: The crucial part of this solution is how the DP values transition from one state to another. For each
i
andj
, the implementation considers the number of combinations if the last two numbers are of equal length (and the second is greater or equal) or if the second number may length less.- If the last two numbers are of equal length and the comparison via
cmp
is true, it takes the value fromdp[i-j][j]
(which represents the count up to the first number's end index and the subsequent number also ending ati-j
and also has lengthj
). - Otherwise, it uses the value from
dp[i-j][min(j-1,i-j)]
to represent scenarios wherein the most recent number is of smaller size and doesn't threaten the non-decreasing order.
- If the last two numbers are of equal length and the comparison via
-
DP State Updates: After ascertaining which previous DP state can be transitioned from,
dp[i][j]
is updated with the count of combinations by considering the previous number lengths into account. It sums the count from the previously calculated number of ways and the new count, keeping in mind that we must perform this addition modulo10^9 + 7
. -
Result Extraction: Lastly, after populating the DP table, the result is the total combinations possible which is
dp[n][n]
, meaning the number of combinations to form the entire string with all possible last number lengths considered.
The solution is a well-crafted example of combining DP with string comparisons optimized by LCP to ensure polynomial time complexity and manage the large answer space creatively with modular arithmetic.
Example Walkthrough
Let's walk through a small example to illustrate the solution approach. Suppose our input string num
is "1123". We want to find out in how many different ways we can split this string into a list of non-decreasing positive integers.
Step 1: Longest Common Prefix (LCP) Calculation:
We calculate the LCP for every pair of starting positions in the string num
. For simplicity, let's assume we have a function that does this efficiently.
Step 2: Comparison function (cmp
):
This function will compare two numbers within the string num
based on their starting indices and length using the LCP information.
Step 3: Dynamic Programming (DP):
We initialize a DP table dp
where dp[i][j]
will represent the number of ways to split the substring num[0:i]
such that the last number has length j
.
Step 4: DP Initialization:
We set dp[0][0]
to 1 because there's one way to form an empty string which is an empty list of integers.
Step 5: DP Iteration:
- For
i = 1
, the substring "1" can form one list,[1]
. - For
i = 2
, the substring "11" can form two lists,[1, 1]
or[11]
. - For
i = 3
, the substring "112" can form three lists,[1, 1, 2]
,[1, 12]
, or[11, 2]
. - For
i = 4
, the substring "1123" can form the following lists: [1, 1, 2, 3], [1, 1, 23], [1, 12, 3], [1, 123], [11, 2, 3], [11, 23]. That's six valid lists.
Step 6: DP Transition:
- We check if we can append a number to the existing one to make sure they are in non-decreasing order.
- For instance, at
i = 4, j = 2
, we compare "23" with "12" (previous number of length 2). Since "23" >= "12", we can consider the count fromdp[2][2]
.
Step 7: DP State Updates:
We update the dp
table by adding the number of ways to the current count, ensuring we consider each possible length of the last number.
Step 8: Result Extraction:
We find the total count of combinations to form the string "1123", which will be dp[4][4] + dp[4][3] + dp[4][2] + ... + dp[4][1]
after iterating through all possible lengths for the last number.
For this example, by walking through the DP table, we find that there are six valid lists that can be formed from "1123". This approach ensures we explore all possibilities iteratively while keeping the problem manageable using the principles of dynamic programming.
Solution Implementation
1class Solution:
2 def numberOfCombinations(self, num: str) -> int:
3 # Helper function to compare the lexicographical order of two numbers within 'num'
4 # representing the start indices and their length 'length'.
5 def is_greater_or_equal(start1, start2, length):
6 common_prefix_length = longest_common_prefix[start1][start2]
7 return common_prefix_length >= length or num[start1 + common_prefix_length] >= num[start2 + common_prefix_length]
8
9 MOD = 10**9 + 7
10 num_length = len(num)
11
12 # Pre-compute the longest common prefix (LCP) array.
13 longest_common_prefix = [[0] * (num_length + 1) for _ in range(num_length + 1)]
14 for i in range(num_length - 1, -1, -1):
15 for j in range(num_length - 1, -1, -1):
16 if num[i] == num[j]:
17 longest_common_prefix[i][j] = 1 + longest_common_prefix[i + 1][j + 1]
18
19 # Dynamic programming table where dp[i][j] represents the number of combinations
20 # of valid integers with length 'j' that end at index 'i - 1'.
21 dp = [[0] * (num_length + 1) for _ in range(num_length + 1)]
22 dp[0][0] = 1
23
24 # Build up the DP table.
25 for i in range(1, num_length + 1):
26 for j in range(1, i + 1):
27 current_value = 0
28 if num[i - j] != '0': # Skip numbers with leading zero.
29 if i - j - j >= 0 and is_greater_or_equal(i - j, i - j - j, j):
30 # The substring is greater or equal to the previous,
31 # so we can safely append to the previous.
32 current_value = dp[i - j][j]
33 else:
34 # Take the combination counts from the smaller number if present.
35 current_value = dp[i - j][min(j - 1, i - j)]
36
37 # Update the current dp value including the current value with MOD.
38 # Accumulate with the previous j-1 combinations.
39 dp[i][j] = (dp[i][j - 1] + current_value) % MOD
40
41 # Total number of combinations is the last value in the dp table.
42 return dp[num_length][num_length]
43
1class Solution {
2 private static final int MOD = (int) 1e9 + 7;
3
4 public int numberOfCombinations(String num) {
5 int length = num.length();
6 // Initialize the longest common prefix (LCP) array.
7 int[][] longestCommonPrefix = new int[length + 1][length + 1];
8
9 // Calculate the LCP for each substring pair.
10 for (int i = length - 1; i >= 0; --i) {
11 for (int j = length - 1; j >= 0; --j) {
12 if (num.charAt(i) == num.charAt(j)) {
13 longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
14 }
15 }
16 }
17
18 // dp[i][j] will hold the number of ways to partition the string ending at i using a number ending at j.
19 int[][] dp = new int[length + 1][length + 1];
20 dp[0][0] = 1; // Base case: one way to partition an empty string.
21
22 // Build the dp array from the bottom up.
23 for (int i = 1; i <= length; ++i) {
24 for (int j = 1; j <= i; ++j) {
25 int currentVal = 0;
26
27 // We should not start with a leading zero.
28 if (num.charAt(i - j) != '0') {
29 // There should be enough characters for the previous number.
30 if (i >= 2 * j) {
31 int commonPrefixLength = longestCommonPrefix[i - j][i - 2 * j];
32
33 // Check if the current number is greater than or equal to the previous number.
34 if (commonPrefixLength >= j || num.charAt(i - j + commonPrefixLength) >= num.charAt(i - 2 * j + commonPrefixLength)) {
35 currentVal = dp[i - j][j];
36 }
37 }
38 // If the number is not valid, use the number of ways of the smaller previous number.
39 if (currentVal == 0) {
40 currentVal = dp[i - j][Math.min(j - 1, i - j)];
41 }
42 }
43
44 // Sum the ways, and do mod to avoid overflow.
45 dp[i][j] = (dp[i][j - 1] + currentVal) % MOD;
46 }
47 }
48 // The answer is the number of ways to form the number sequence using the entire string.
49 return dp[length][length];
50 }
51}
52
1class Solution {
2public:
3 const int MOD = 1e9 + 7; // Define modulo value for the results to prevent overflow
4
5 // This function counts the number of non-empty increasing sequences of integers
6 // that can be formed by the digits of the given string 'num'
7 int numberOfCombinations(string num) {
8 int n = num.size(); // Size of the input string
9 // 2D vector to store the longest common prefix (LCP) lengths between suffixes of 'num'
10 vector<vector<int>> longestCommonPrefix(n + 1, vector<int>(n + 1));
11
12 // Preprocess to fill in the longestCommonPrefix array
13 for (int i = n - 1; i >= 0; --i) {
14 for (int j = n - 1; j >= 0; --j) {
15 if (num[i] == num[j]) {
16 longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
17 }
18 }
19 }
20
21 // Comparator function to compare two substrings in 'num' when one is a prefix of the other
22 auto canPlace = [&](int start, int prevStart, int len) {
23 int commonPrefixLen = longestCommonPrefix[start][prevStart];
24 return commonPrefixLen >= len || num[start + commonPrefixLen] >= num[prevStart + commonPrefixLen];
25 };
26
27 // 2D vector for dynamic programming (dp), where dp[i][j] represents the number of ways
28 // to form an increasing sequence upto index i, using a last number of length j
29 vector<vector<int>> dp(n + 1, vector<int>(n + 1));
30 dp[0][0] = 1; // Base case: 1 way to form a sequence of length 0
31
32 // Populate the dp array
33 for (int i = 1; i <= n; ++i) {
34 for (int j = 1; j <= i; ++j) {
35 int count = 0;
36 // Check if the current number doesn't start with '0' and if it can be placed in the sequence
37 if (num[i - j] != '0') {
38 if (i - j - j >= 0 && canPlace(i - j, i - j - j, j)) {
39 count = dp[i - j][j];
40 } else {
41 count = dp[i - j][min(j - 1, i - j)];
42 }
43 }
44 // Update the dp table taking the modulo to prevent overflow
45 dp[i][j] = (dp[i][j - 1] + count) % MOD;
46 }
47 }
48 // Final answer: sum of all sequences ending at the last digit 'n'
49 return dp[n][n];
50 }
51};
52
1const MOD: number = 1e9 + 7; // Define modulo value for the results to prevent overflow
2
3function numberOfCombinations(num: string): number {
4 const n: number = num.length; // Size of the input string
5 // 2D array to store the longest common prefix (LCP) lengths between suffixes of 'num'
6 let longestCommonPrefix: number[][] = Array.from({length: n + 1}, () => Array(n + 1).fill(0));
7
8 // Preprocess to fill in the longestCommonPrefix array
9 for (let i = n - 1; i >= 0; --i) {
10 for (let j = n - 1; j >= 0; --j) {
11 if (num[i] === num[j]) {
12 longestCommonPrefix[i][j] = 1 + longestCommonPrefix[i + 1][j + 1];
13 }
14 }
15 }
16
17 // Comparator function to compare two substrings in 'num' when one is a prefix of the other
18 const canPlace: (start: number, prevStart: number, len: number) => boolean = (start, prevStart, len) => {
19 const commonPrefixLen: number = longestCommonPrefix[start][prevStart];
20 return commonPrefixLen >= len || num[start + commonPrefixLen] >= num[prevStart + commonPrefixLen];
21 };
22
23 // 2D array for dynamic programming (dp), where dp[i][j] represents the number of ways
24 // to form an increasing sequence up to index i, using a last number of length j
25 let dp: number[][] = Array.from({length: n + 1}, () => Array(n + 1).fill(0));
26 dp[0][0] = 1; // Base case: 1 way to form a sequence of length 0
27
28 // Populate the dp array
29 for (let i = 1; i <= n; ++i) {
30 for (let j = 1; j <= i; ++j) {
31 let count: number = 0;
32 // Check if the current number doesn't start with '0' and if it can be placed in the sequence
33 if (num[i - j] !== '0') {
34 if (i - j - j >= 0 && canPlace(i - j, i - j - j, j)) {
35 count = dp[i - j][j];
36 } else {
37 count = dp[i - j][Math.min(j - 1, i - j)];
38 }
39 }
40 // Update the dp table taking the modulo to prevent overflow
41 dp[i][j] = (dp[i][j - 1] + count) % MOD;
42 }
43 }
44 // Final answer: sum of all sequences ending at the last digit 'n'
45 return dp[n][n];
46}
47
Time and Space Complexity
The time complexity of the provided algorithm is primarily determined by two nested loops that iterate over the length of the input string num
and the computation of the longest common prefix (LCP) for all substrings. Calculating the LCP takes O(n^2)
time since it requires two nested loops, each running for n
iterations, where n
is the length of the string num
. Afterward, the main part of the algorithm uses dynamic programming where there are two nested loops that also run for n
iterations each, thus also taking O(n^2)
time. In the innermost part of these loops, the cmp
function is called, which takes constant time due to the precomputed LCP values. Therefore, the dominant part of the time complexity is O(n^2)
.
The space complexity of the algorithm comes from the storage of the dynamic programming table dp
, and the longest common prefix table lcp
, each of size n x n
. Therefore, the space complexity is O(n^2)
to store these two tables.
Learn more about how to find time and space complexity quickly using problem constraints.
Which of the following problems can be solved with backtracking (select multiple)
What is Dynamic Programming Prerequisite DFS problems dfs_intro Backtracking problems backtracking Memoization problems memoization_intro Pruning problems backtracking_pruning Dynamic programming is an algorithmic optimization technique that breaks down a complicated problem into smaller overlapping sub problems in a recursive manner and uses solutions to the sub problems to construct a solution
Coding Interview Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Want a Structured Path to Master System Design Too? Don’t Miss This!