Balanced number partitioning explained

Balanced number partitioning is a variant of multiway number partitioning in which there are constraints on the number of items allocated to each set. The input to the problem is a set of n items of different sizes, and two integers mk. The output is a partition of the items into m subsets, such that the number of items in each subset is at most k. Subject to this, it is required that the sums of sizes in the m subsets are as similar as possible.

An example application is identical-machines scheduling where each machine has a job-queue that can hold at most k jobs.[1] The problem has applications also in manufacturing of VLSI chips, and in assigning tools to machines in flexible manufacturing systems.[2]

In the standard three-field notation for optimal job scheduling problems, the problem of minimizing the largest sum is sometimes denoted by "P | # ≤ k | Cmax". The middle field "# ≤ k" denotes that the number of jobs in each machine should be at most k. This is in contrast to the unconstrained version, which is denoted by "

P\parallelCmax

".

Two-way balanced partitioning

A common special case called two-way balanced partitioning is when there should be two subsets (m = 2). The two subsets should contain floor(n/2) and ceiling(n/2) items. It is a variant of the partition problem. It is NP-hard to decide whether there exists a partition in which the sums in the two subsets are equal; see [3] problem [SP12]. There are many algorithms that aim to find a balanced partition in which the sum is as nearly-equal as possible.

n+
4
1
2n+2
. The expected work-difference (difference between largest and smallest sum) is

\Theta(1/n)

.[2]

\Theta(1/n)

.

O(logn/n2)

almost surely.

n-\Theta(log

.

Balanced triplet partitioning

Another special case called 3-partitioning is when the number of items in each subset should be at most 3 (k = 3). Deciding whether there exists such a partition with equal sums is exactly the 3-partition problem, which is known to be strongly NP-hard. There are approximation algorithms that aim to find a partition in which the sum is as nearly-equal as possible.

4m-1
3m
of the minimum largest sum, which is the same approximation ratio that LPT attains for the unconstrained problem. The bound is tight for MLPT.
3m-1
4m-2
of the maximum smallest sum, which is again the same ratio that LPT attains for the unconstrained problem.

7/6

of the minimum largest sum.

Balanced partitioning with larger cardinalities

A more general case, called k-partitioning, is when the number of items in each subset should be at most k, where k can be any positive integer..

2-1
m
.
2-2
m+1
. It is not known if it runs in polynomial time.

4/3

. It is tight for k = 4 when m is sufficiently large; the precise lower bound is
4m
3m+1
).
4m-1
3m
. Currently, this conjecture is known to be true only for k = 3. For k > 3, it is known that its approximation ratio is at most 2.

O(nlogn)

. They prove that its approximation ratio for the minimum largest sum is exactly 4/3 for k = 3, 19/12 for k = 4, 103/60 for k = 5, 643/360 for k = 6, and 4603/2520 for k = 7. The ratios were found by solving a mixed integer linear program. In general (for any k), the approximation ratio is at least
k-1
2-\sum
j=0
j!
k!
and at most
2-1
k-1
. The exact MILP results for 3,4,5,6,7 correspond to the lower bound. For k>7, no exact results are known, but the difference between the lower and upper bound is less than 0.3%. When the parameter is the number of subsets (m), the approximation ratio is exactly
2-1
m
.

max((\sumxi)/m,x1)

, where xi are the inputs ordered from large to small, is a lower bound for the optimal largest-sum, and its worst-case ratio is 1/2 in both variants. The improved expression

max((\sumxi)/m,x1,xk+xm+1)

has worst-case ratio 2/3 in the unconstrained variant and 1/2 in the constrained variant. The approximation ratio of the modified list scheduling is 1/2 for the unconstrained variant, but it is 0 for the constrained variant (it can be arbitrarily bad). The approximation ratio of the modified LPT algorithm is at most 2. They also show that the lower bound of has a tight worst-case performance ratio of 3/4, and that their PD algorithm has a tight performance ratio of 4/3 (when m is sufficiently large).
max\left(2
k

,

1
m

\right)

. They present a new algorithm, HARMONIC1, with worst-case ratio at least
max\left(1
k

,

1
\lceil
m
\sum
i=1
1
i
\rceil+1

\right)

. Both these algorithms are ordinal – they partition the items based only on the order between them rather than their exact values. They prove that any ordinal algorithm has ratio at most

O(1/ln{m})

for maximizing the smallest sum. This indicates that HARMONIC1 is asymptotically optimal. For any fixed k, any ordinal algorithm has ratio at most the smallest root of the equation
m
\sum
i=1

\left\lfloor\left\lfloor

k+i-1
i

\right\rfloorx\right\rfloor=k

. When k tends to infinity, this upper bound approaches 0.

Relations between balanced and unconstrained problems

There are some general relations between approximations to the balanced partition problem and the standard (unconstrained) partition problem.

2-2
m
, and it is tight.

max\left(r,

k+2-
k+1
1
m(k+1)

\right)

. In particular, their

7/6

-approximation algorithm for triplet partitioning (k = 3) can be used to obtain a heuristic for unconstrained partitioning with approximation-ratio
max\left(7
6

,

5-
4
1
4m

\right)

.

Different cardinality constraints

The cardinality constraints can be generalized by allowing a different constraint on each subset. This variant is introduced in the "open problems" section of, who call the ki-partitioning problem. He, Tan, Zhu and Yao present an algorithm called HARMONIC2 for maximizing the smallest sum with different cardinality constraints. They prove that its worst-case ratio is at least

max\left(1
km

,

k1
km
1
\left\lceil
m
\sum
i=1
1
i
\right\rceil+1

\right)

.

Categorized cardinality constraints

Another generalization of cardinality constraints is as follows. The input items are partitioned into k categories. For each category h, there is a capacity constraint kh. Each of the m subsets may contain at most kh items from category h. In other words: all m subsets should be independent set of a particular partition matroid. Two special cases of this problem have been studied.

Kernel partitioning

In the kernel balanced-partitioning problem, some m pre-specified items are kernels, and each of the m subsets must contain a single kernel (and an unlimited number of non-kernels). Here, there are two categories: the kernel category with capacity 1, and the non-kernel category with unlimited capacity.

4m-1
3m
for the minimum largest sum. However, Chen, He and Lin claim that its tight approximation ratio is
3m-1
2m
for the minimum largest sum, and
2m-1
3m-2
for the maximum smallest sum.

One-per-category partitioning

In another variant of this problem, there are some k categories of size m, and each subset should contain exactly one item from each category. That is, kh = 1 for each category h.

2-1
m
for minimizing the largest sum;
1
m
for maximizing the smallest sum in the general case; and in some special cases, can be improved to
m
2m-1
for general k and
m-1
2m-3
for k = 3.

See also

Matroid-constrained number partitioning is a generalization in which a fixed matroid is given as a parameter, and each of the m subsets should be an independent set or a base of this matroid.

Notes and References

  1. Zhang. Jilian. Mouratidis. Kyriakos. Pang. HweeHwa. 2011-06-28. Heuristic Algorithms for Balanced Multi-Way Number Partitioning. Twenty-Second International Joint Conference on Artificial Intelligence. en.
  2. Tsai. Li-Hui. 1992-02-01. Asymptotic Analysis of an Algorithm for Balanced Parallel Processor Scheduling. SIAM Journal on Computing. 21. 1. 59–64. 10.1137/0221007. 0097-5397.
  3. Book: Garey. Michael. Computers and Intractability; A Guide to the Theory of NP-Completeness. Johnson. David. 1979. 978-0-7167-1045-5. 96–105. registration.
  4. Coffman. E. G.. Frederickson. G. N.. Lueker. G. S.. 1984-05-01. A Note on Expected Makespans for Largest-First Sequences of Independent Tasks on Two Processors. Mathematics of Operations Research. 9. 2. 260–266. 10.1287/moor.9.2.260. 0364-765X.
  5. Lueker. George S. 1987-12-01. A note on the average-case behavior of a simple differencing method for partitioning. Operations Research Letters. en. 6. 6. 285–287. 10.1016/0167-6377(87)90044-7. 0167-6377.
  6. Yakir. Benjamin. 1996-02-01. The Differencing Algorithm LDM for Partitioning: A Proof of a Conjecture of Karmarkar and Karp. Mathematics of Operations Research. 21. 1. 85–99. 10.1287/moor.21.1.85. 0364-765X.
  7. Mertens. Stephan. 1999-03-11. A complete anytime algorithm for balanced number partitioning. cs/9903011.
  8. Kellerer. Hans. Woeginger. Gerhard. 1993-09-07. A tight bound for 3-partitioning. Discrete Applied Mathematics. en. 45. 3. 249–259. 10.1016/0166-218X(93)90013-E. 0166-218X. free.
  9. Chen. Shi Ping. He. Yong. Lin. Guohui. 2002-03-01. 3-Partitioning Problems for Maximizing the Minimum Load. Journal of Combinatorial Optimization. en. 6. 1. 67–80. 10.1023/A:1013370208101. 9053629. 1573-2886.
  10. Kellerer. Hans. Kotov. Vladimir. 1999-02-01. A 7/6–Approximation Algorithm For 3-Partitioning And Its Application To Multiprocessor Scheduling. INFOR: Information Systems and Operational Research. 37. 1. 48–56. 10.1080/03155986.1999.11732368. 0315-5986.
  11. Babel. Luitpold. Kellerer. Hans. Kotov. Vladimir. 1998-02-01. The k-partitioning problem. Mathematical Methods of Operations Research. en. 47. 1. 59–82. 10.1007/BF01193837. 5594197. 1432-5217.
  12. Michiels. W.. Aarts. E.. Korst. J.. van Leeuwen. J.. Spieksma. F. C. R.. 2012-02-01. Computer-assisted proof of performance ratios for the Differencing Method. Discrete Optimization. en. 9. 1. 1–16. 10.1016/j.disopt.2011.10.001. 1572-5286. free.
  13. Hochbaum. Dorit S.. Shmoys. David B.. 1987-01-01. Using dual approximation algorithms for scheduling problems theoretical and practical results. Journal of the ACM. 34. 1. 144–162. 10.1145/7531.7535. 0004-5411. 9739129. free.
  14. Dell'Amico. Mauro. Martello. Silvano. 2001. Bounds for the cardinality constrained P∥Cmax problem. Journal of Scheduling. en. 4. 3. 123–138. 10.1002/jos.68. 1099-1425.
  15. He. Yong. Tan. Zhiyi. Zhu. Jing. Yao. Enyu. 2003-11-01. κ-Partitioning problems for maximizing the minimum load. Computers & Mathematics with Applications. en. 46. 10. 1671–1681. 10.1016/S0898-1221(03)90201-X. 0898-1221. free.
  16. Chen. S. -P.. He. Y.. Yao. E. -Y.. 1996-09-01. Three-partitioning containing kernels: Complexity and heuristic. Computing. 57. 3. 255–271. 10.1007/bf02247409. 21935917. 0010-485X.
  17. Wu. Biao. Yao. Enyue. 2007-04-20. m-Partitioning problems with partition matroid constraint. Theoretical Computer Science. en. 374. 1. 41–48. 10.1016/j.tcs.2006.11.016. 0304-3975.
  18. Wu. Biao. Yao. En-yu. 2008-03-01. Lower bounds and modified LPT algorithm for m-partitioning problems with partition matroid constraint. Applied Mathematics-A Journal of Chinese Universities. en. 23. 1. 1–8. 10.1007/s11766-008-0101-8. 16565038. 1993-0445.
  19. Li. Weidong. Li. Jianping. 2013-04-06. Approximation algorithms for $$m$$-partitioning problems with partition matroid constraint. Optimization Letters. 8. 3. 1093–1099. 10.1007/s11590-013-0637-2. 3385030. 1862-4472.
  20. Dell’Olmo. Paolo. Hansen. Pierre. Pallottino. Stefano. Storchi. Giovanni. 2005-09-01. On uniform k-partition problems. Discrete Applied Mathematics. en. 150. 1. 121–139. 10.1016/j.dam.2005.02.013. 0166-218X.