Abstract
We consider the minimization of input cost for a selective assembly system which features two random inputs and a finite number of matching classes. This setup frequently arises in high-precision manufacturing when input tolerances are not tight enough for the required output precision. We determine optimality conditions for the cost-optimal input portfolio given an expected-output target, first using a normal approximation of the multinomial binning distribution, and second employing a simple upper envelope of the output objective. We show that the relative error tends to zero as the production scale becomes sufficiently large. The envelope optimization problem also yields a closed-form solution for the cost-minimizing inputs as well as total costs, which are easy to understand for managers. A numerical study tests the practicality of the envelope solution. The latter can be used as seed for a numerical solution of the exact problem, as well as a closed-form approximate solution which allows for an analysis of structural properties.
Disclaimer
As a service to authors and researchers we are providing this version of an accepted manuscript (AM). Copyediting, typesetting, and review of the resulting proofs will be undertaken on this manuscript before final publication of the Version of Record (VoR). During production and pre-press, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal relate to these versions also.1 Introduction
By sorting low-precision inputs into predefined tolerance classes, so as to obtain a grade-specific matching for components of different types, it is possible to produce relatively high-precision output. This process of “selective assembly” has been known for more than 100 years—as part of interchangeable manufacturing. While much of the literature thus far has been concerned with optimizing the design of the matching bins, the available work on cost-effectively procuring the inputs for a given system of matching bins with arbitrary distributions of part characteristics is rather sparse. This paper provides such an analysis, characterizing the optimal order quantities for two different part types, together with a closed-form approximation of the optimal input quantities. The latter allows for structural insights and suggests a straightforward generalization to an arbitrary number of part types.
In the batch production of movements for mechanical watches, the oscillator—assembled from a hairspring and a balance wheel—must exhibit a well-defined resonance frequency within a small tolerance. 1 The accuracy of this output characteristic is key for the quality of the watch, and this is why many high-end horology companies have adopted a system of selective assembly, sorting hairsprings and balance wheels into different grades, so as to guarantee a sufficiently high clocking precision after joining any two components with matching grades. Naturally, when using selective assembly the probability that, say, of 100 hairsprings not all can be matched with 100 balance wheels is generally positive, and this mismatch likelihood increases in the number of matching classes. That is, a greater output precision comes at the price of a bigger loss of input components. Finding the cost-optimal input quantities needs to trade off the cost of the inputs against the benefits of the outputs, which would vary across the different assembly grades. Given two multinomial distributions for the input parts being in the various matching classes, we characterize the cost-minimizing input quantities so as to achieve a given expected output quantity. We also show that a good approximation of the output objective (with arbitrarily small relative error for sufficiently large production lots) leads to a constant ratio between inputs and outputs, taking the production process to be a weighted sum of perfectly complementary systems. The latter may then allow for the separation of input parts into “principal components” and “buffer components.” In mechanical watchmaking, for example, hairsprings turn out to be buffer components for the significantly more expensive balance wheels; cf. Sec. 4.
1.1 Literature
The idea of sorting inputs of a given type into different bins so as to be able to interchangeably match components across corresponding bins for different part types, also termed “selective assembly,” was discussed more than a century ago by Buckingham (1921) who notes: “[t]he chief purpose of manufacturing, by selective assembly (…), is the production of large quantities of duplicate parts as economically as possible, within such limits that they may be assembled without further machining” (p. 224). Our concern here is related to the requirement of “as economically as possible,” by specifically considering the tradeoff of input costs against output benefits. The latter may naturally depend on the specific “grade” of the output, that is, on the matching class from which inputs were taken to assemble a given unit of a finished product. Mansoor (1961) points out that in practice assembly processes with more than 20 matching classes can be observed, but that one of the main problems of selective assembly is evidently the possibility for mismatch. In contrast to our input-output view, which is driven by the input-cost minimization problem and an interest in the dependence of input cost on required output precision and the number of matching classes, Caputo and Di Salvo (2019) are interested in different manufacturing methods for which they disaggregate production-specific cost parameters including processing times, sorting costs, and so forth, which are essentially invariant once the production process has been fixed, leaving open the question about the optimal inputs.
There are numerous practical applications for selective assembly systems, such as hole-and-shaft combinations (Asha et al., 2008; Kannan and Pandian, 2021), sheet metal assembly (Rezaei Aderiani et al., 2019), flat-panel display manufacturing (Duenyas et al., 1997), camshafts in the automobile industry (Mease et al., 2004), as well as bimetal thermostats, Fortini’s clutch, and knuckle-joint assembly (Tan and Wu, 2012). Mease et al. (2004) note that cost imbalances may put into question an otherwise yield optimized selective assembly system. In the case where input prices differ substantially, as for hairsprings versus the at least 10-times more expensive balance wheels, the less costly parts should be ordered in excess so that the expensive parts are paired more often.
One main concern of the literature thus far has been the design of these matching classes, in the form of bins in the space of part characteristics which are often assumed to be normally distributed. For example, Pugh (1986) considers an equidistant partition of scalar part characteristics. Kwon et al. (1999) improve on the equidistant partition by determining matching classes so as to minimize a square deviation of the difference of the (assumed to be) commensurate parts from some target. While Matsuura (2011) allows for more general convex loss functions, the general problem with the loss-function approach is that it aggregates penalizations (by taking the expectation) irrespective of whether parts fit or not. In other words, under a set of matching classes which was designed using the penalization approach it is generally possible that parts do not match, even though they were taken from corresponding bins and therefore should match. We therefore follow here the robust binning approach by Weber (2021) which guarantees that parts across corresponding bins always match. This allows us to concentrate on the design of the minimum-cost input portfolio. We also mention in passing that in addition to the fixed-bin selective assembly discussed thus far there is the notion of a “direct” selective assembly which requires to keep track of every single part so as to be able to match parts one-to-one by solving a matching problem on a bipartite graph (Coullard et al., 1998). Since tracking very similar individual parts would often prove impractical and very costly in production settings with significant output volumes, we restrict attention to a selective assembly where bins are given, so that for each part type it is possible to specify a discrete probability distribution of parts being associated with a given bin. As noted earlier, we also assume that parts of different types can always be matched when taken from corresponding bins.
In Economics, the perfect complementarity of inputs for production, with transformation to outputs in constant proportions, goes back to Leontief (1941) who used it as a simplifying assumption to analyze a large economy. In assembly operations, constant proportions in the conversion of inputs to outputs arise because the required amount of input for a unit of output is typically fixed. Without any significant loss of generality, the different part types are assumed to be matched one-to-one, i.e., at equal proportions. For example, if each finished ball bearing requires 8 balls and 1 shaft, then we can think of the balls as being provided in lots of 8, so as to maintain equal proportions of inputs and outputs. Given that for any given matching class, the number of components of each type is ex-ante uncertain, the deterministic production function mapping inputs to expected outputs no longer exhibits perfect complementarity but rather a certain input substitutability, since additional quantities of type-1 parts (e.g., ‘balls’) provides an insurance against being long with type-2 inputs (e.g., ‘shafts’) in a given matching class. Thus, increasing the quantity of one input type must always lead to a nonzero increase in the expected output, despite the perfect complementarity applied to any realization of the “sorting product” (i.e., the vector of measured parts in the different matching classes). Our paper extends the work by Weber (2022) which focuses on the special case of binary random part matching, while we here allow for a selective assembly system with an arbitrary number of matching classes and also examine the scaling behavior.
2 Model
2.1 Matching Classes and Sorting Products
The selective assembly problem considered here involves components of two types (“1” and “2”) and matching classes. A component of type features a (random) characteristic , with realizations in the (measurable) space which has at least M elements; see Sec. 3.7. The “matching classes” , for , are nonempty subsets which together form a partition of , in the sense that
for all . Let(1) (1)
be a given probability that a given component of type i has characteristics that lie in the matching class m. To avoid trivialities, we exclude the case where . Indeed, for no successful grade-m matching is possible and the class should be suppressed. Conversely if , then no other matching class can have successful matches, contradicting the assumption of having at least two functional matching classes, the minimum number of classes for a meaningful selective assembly.
Two parts of characteristics and a referred to as “matched” (across types) if the vector is an element of the joint matching class , for some . Since by assumption the set of matching classes forms a partition of the space of characteristics , all incoming parts can be sorted into matching classes; thus, taking two parts from corresponding matching classes and always produces a “matched pair;” see Fig. 1. In practice, the characteristics of different types of parts (e.g., hairsprings and balance wheels for mechanical watches) can vary substantially and be measured in different units (cf. footnote 1). When matching parts of corresponding grades, the number of feasible assemblies for any given matching class (referred to as “grade”) corresponds to the minimum number of parts available from either type. To clarify this strict complementarity, Fig. 1 depicts a situation where the smallest matching class (for each component type) produces zero output: indeed, the two type-1 components in cannot be matched with any type-2 components, as is empty.
Let be the chosen input quantity for type-i components. Sorting them into matching classes yields the vector (henceforth referred to as “sorting product”), where denotes the number of components of type i in matching class m, so that
The multinomial probability that a size- ordering of type-i components into M matching classes produces a particular sorting product is
where is a given vector of positive likelihoods such that(2) (2)
The preceding normalization constraint may be relaxed to include the possibility of wasteful off-spec components (cf. Remark 5).
2.2 Normal Approximation
We use the standard normal approximation of the multinomial distribution of the random vector with the mean , where , and the symmetric positive-definite covariance matrix . The latter features the diagonal elements and the off-diagonal elements for with . In what follows, we usually suppress the dependence on for simplicity, as it is not a decision variable. Under the proposed normal approximation,
denotes the cumulative distribution function (cdf) for the random number of type-i components in the m-th matching class, where the cdf of the standard normal distribution is
with probability density function (pdf)
Clearly, the functions , for , constitute marginal distributions of the joint cdf that describes the random sorting product of type-i components under the normal approximation, but it turns out that they are all that is required for our purposes.
Remark 1 (Approximation Error).
By the Berry-Esseen theorem (Berry, 1941; Esseen, 1942) the absolute deviation of the marginal distribution from the underlying binomial distribution can be bounded as follows:
for all , with (Shevtsova, 2011), which lies above the theoretical lower bound for of (Esseen, 1956). Hence, as long as
the approximation error corresponds to the probability of ‘extreme events,’ further than standard deviations away from the mean.
Remark 2 (Continuous Input Values).
The normal approximation allows for nonnegative input quantities , lifting the burden of the integer constraint for the purposes of optimization. A practically implementable input vector can then usually be found ex post by rounding to the nearest integer or the nearest feasible batch size.
2.3 Matching Output
Given its type-1 and type-2 inputs, the firm produces the (sorted) output vector . The m-th random matching output , generated by using components with joint characteristics in matching class , is(3) (3)
It is evident that the intrinsic value of the firm’s output may not be the same for all matching classes. For example, the matching-class index m can denote different quality grades in which case high-quality output would yield a higher price than low-quality output, without any difference in the unit cost for each random input component. To capture the differentiation of benefits across matching classes, let(4) (4)
denote the firm’s total (weighted) output, where is a vector which contains the net unit value for the m-th matching output; see Fig. 2. The net unit value can be viewed as an absolute or a relative measure of benefits. When absolute, it may incorporate the revenues from selling the assembled grade-m product minus production costs, whereas when relative it serves to quantify a normalized net benefit. In the latter case, the normalizations are commonly used; for instance, by setting one obtains a simple physical output count, , relevant for many (if not most) manufacturing applications. The assumption that is strictly positive reflects the firm’s concern for all units and its option of “free disposal.” If a certain matching class (e.g., ) represents “junk” components, then it is possible to reduce that weight reflecting any possible scrap value.
2.4 Expected Output
The need for grade-specific component matching renders the firm’s production stochastic, allowing generically for the possibility of even zero production output (with positive probability), irrespective of the chosen input quantities. While this may be true in principle, of course the likelihood of such poor output realizations becomes negligible in any realistic setting. For example, the likelihood of observing a type-i quantity of two standard deviations below the expected quantity in any given matching class is below 5%; for three standard deviations it is below .
Since the firm is risk-neutral, its focus lies on its expected total (weighted) output,(5) (5)
which is a deterministic function of the corresponding input , where the total output is obtained as the weighted sum of the expected grade-m outputs . For any given m, the expected grade-m output depends on the statistical properties of the difference, , referred to as the (grade-m) “parts defect,” which has the mean
and the standard deviation
The expected output in any given matching class can be expressed as a convex combination of the expected type-1 and type-2 quantities, minus a positive loss term that increases superlinearly in the aforementioned standard deviation of the parts defect.
Lemma 1 (Grade-m Output).
Given any , the expected grade-m output is given by (6) (6)
for all , where denotes the probability that the grade-m part defect reaches one standard deviation, and is the probability density at that point.
When the expected grade-m parts defect is positive (resp., negative), then the weight placed on the expected number of type-1 parts decreases (resp., increases) in , since with an increased coefficient of variation for parts defects the expected number of type-2 parts increases (resp., decreases) in relative importance. In other words, perhaps counterintuitively, the nominally expected matches (as measured by the convex combination of the expected type-specific yields and ) goes up in the dispersion of the grade-m parts defect. However, this effect is offset by an increased “matching loss” : independent of the sign of , the latter grows in superlinearly, for also grows in .
Lemma 2 (Monotonicity).
Given any , the expected grade-m output is increasing, so
for any two input vectors .
By increasing the amount of any input, the expected output increases over all matching classes. By increasing the type-i input from to the grade-m distribution first-order stochastically dominates the grade-m distribution . Since first-order stochastic dominance is preserved by taking the minimum across the component types, the grade-m output (with in ) first-order stochastically dominates the output (with in ), all else equal, which in turn implies the (strict) monotonicity of the firm’s total output in its inputs.
3 Minimum-Cost Selective Matching
3.1 Problem Statement
Each unit of a type-i input is procured at the cost , which implies that the firm is interested in minimizing the cost of generating a desired (expected) total output . Thus, given the cost vector , the firm’s (minimum-cost) selective matching problem is to determine the optimal input (also referred to as its “output-driven demand”) which achieves its minimum cost,(*) (*)
for any output . Clearly, the solution to the selective matching problem does not depend on the individual input costs, but only on their ratio .
3.2 Optimal Input
The firm’s output driven demand (or optimal input) minimizes the cost of its inputs to achieve a given output objective . It will come as no surprise that at the optimum the expected total output must exactly equal the output objective. This “output efficiency” is a consequence of the strict monotonicity of expected output in its inputs and the fact that inputs are costly at the margin. In addition, the technical rate of substitution between the two inputs must, at the optimum, be equal to the cost ratio . This provides optimality conditions for the otherwise nonconvex optimization problem ( ), summarized by the following result.
Theorem 1 .
For any expected total output , a solution to the minimum-cost selective matching problem ( ) satisfies output efficiency, (7) (7)
in addition, at the marginal rate of technical substitution between type-1 and type-2 components is equal to the cost ratio , so (8) (8)
where and , for all , are specified in La. 1.
Remark 3 (Nonconvexity).
Generally, the output objective is can be nonconcave in the vector of inputs . Indeed, in the simple case of two grades, where the firm only cares about grade-1 output, it is and . The corresponding Hessian of F is:
for all . The preceding determinant of the Hessian is equal to the product of its eigenvalues, which must have therefore different signs. As a consequence the output objective is nonconvex in this case. Thus, the solution to the optimality conditions (7)–(8) may not be unique. As shown in Sec. 3.4, a simplified approximate output objective may yield a reasonable approximate solution (in closed-form).
3.3 Envelope Approximation
Instead of the normal approximation of the expected output F we now consider the envelope output,(9) (9)
which is well-defined for all . The envelope output is an upper bound for F.
Lemma 3 .
, for all .
As the positive linear combination of minimum functions is concave, Jensen’s inequality implies that the expectation of the minimum is weakly smaller that the minimum of the expectations. An important question that arises of course is how far, for a given input , the envelope output may deviate from the reference output ? We thus consider the absolute deviation,referred to as the envelope approximation error. The next result shows that this error increases only sublinearly with the inputs.
Lemma 4 (Approximation Error).
For any it is
where .
In other words, the envelope approximation error is at most proportional to the weighted sum of the grade-specific standard deviations , using the given output weight , for each grade . Note also that the preceding error bound is tight when , , and . In a heterogeneous setting, the derived bound may be conservative as it presumes that holds for all ; cf. Weber (2022). While the absolute deviation between F and may grow with the size of the inputs, in terms of the relative error,
we obtain the following strong global approximation result.
Lemma 5 (Relative Approximation).
For any , there exists a finite minimal envelope output , which is such that: 2
For small , the minimal envelope output is .
Instead of a minimal envelope output it is possible to specify a minimal input , which is also ), since by the homogeneity of it is proportional to ; cf. footnote 2 . Then also implies . A key driver of the relative approximation result in La. 5 is that, as a consequence of La. 4, the given upper bound of the relative error decreases with the inverse square-root of the minimal input (i.e., with ). The given lower bounds for input or output are fairly conservative. In practice, it is preferable to perform an error correction based on the relative error at the solution to an envelope optimization problem.
3.4 Envelope Optimization
Based on the upper bound for the production function introduced in Sec. 3.3, we now consider the envelope optimization problem,(**) (**)
for any given target output . By construction, this optimization problem is convex, and in order to find a solution to ( ), which will be referred to as an optimal envelope input , it is useful to combine all grades which have the same ratio of likelihoods (across component types),(10) (10)
to obtain combined grades, where each such combined grade is characterized by a distinct likelihood ratio . We can then rewrite the envelope output in Eq. (9) as follows:(11) (11)
where
are the corresponding combined weights. The alternative representation of the envelope output in Eq. (11) allows for a closed-form representation of the solution to the envelope optimization problem ( ).
Theorem 2 .
For any , a solution to the envelope optimization problem ( ) is (12) (12)
where each , for , denotes a vertex of the iso-output contour for a unit output, with
The optimal combined grade, , determines the critical vertex , where the iso-output contour slopes are such that
for all .
The optimal envelope solution is such that the slope of the iso-cost curves lies in the subgradient, i.e., , and at the same time envelope output efficiency holds, in the sense that . The following example illustrates the result.
Example 1 .
Consider a selective-assembly system with matching classes featuring the matching-probability vectors for parts of type 1 and for parts of type 2. The unit-cost vector is , so . Since the set of likelihood ratios, , features only distinct elements, by Eq. (10) the relevant likelihood ratios (in increasing order) are , , and . Given a uniform output-weight vector , the envelope output can be written as in Eq. (11), with the combined weights (for ), (for ), and (for ):
The corresponding unit-output vertices are , , and , resulting in the iso-output contour slopes . Comparing the latter with the cost ratio , we find , so that becomes the critical vertex. By Thm. 2, the solution to the envelope optimization problem ( ) is therefore , for any . Fig. 3 depicts the corresponding situation comparing, for , the iso-F contour with the iso- contour. We also note that to there correspond two critical grades, . The solution to the envelope optimization problem is consistent with minimizing input cost only with respect to the critical grades.
Remark 4 (Alternative Representation).
From a computational viewpoint, instead of determining the set from the likelihood ratios , it is usually more convenient to compute for all M candidate solutions , with(13) (13)
instead of using Eq. (12), and to then select the optimal envelope input as the cheapest of these, so , with(14) (14)
The critical grade does thereby not depend on the magnitude of the output, since both the cost and the envelope output function are homogeneous (of degree 1).
Example 2 .
In the setting of Ex. 1, we find , , and . Since , this implies the critical grades , and thus the solution to the envelope optimization problem, , for all , as before.
3.5 Approximate Solution
By La. 3 the envelope output exceeds the expected output F. For any target output , output efficiency (i.e., the fact that ) means the solution to the envelope optimization problem ( ) is not a feasible point of the original selective matching problem ( ). However, since the envelope objective is homogeneous (of degree 1), it is enough to scale the optimal envelope input to achieve feasibility, leading to an approximate solution to the selective matching problem, 3(15) (15)
which in itself is sublinear in q (as F is convex in q due to a risk-pooling effect). Consequently, the approximate cost function,(16) (16)
is an upper bound for in ( ), where the critical grade is determined by Eq. (14). On the other hand, the optimal envelope cost, , must constitute a lower bound for the optimal cost, so(17) (17)
for all . At the approximate solution , the relative output error becomes(18) (18)
since, by construction, . We note that the term on the right-hand side of the preceding inequality (18) also bounds the relative cost overage,(19) (19)
where we have used Eqs. (15) and (17). Based on the preceding inequality, it is now possible to provide an a priori performance guarantee for the relative cost overage which depends only on the desired level of production output.
Lemma 6 (Relative Cost Overage).
For any target output , the relative cost overage is such that (20) (20)
where .
In particular, the relative cost overage is , in the sense that it is inversely proportional to the square root of the target output. For example, if , the upper bound for the relative cost overage (corresponding to the right-hand side in Eq. (20)) becomes ; this conservative bound drops below 10% for target outputs of 1,146 units and beyond. The latter value corresponds to the input bound implied by La. 5; cf. footnote 2 .
Example 3 .
In the setting of Exs. 1 and 2, the solution to the envelope optimization problem, , produces the expected output of about 94.6345 for (resp., an expected output of about 983.2032 for ). Accordingly, the approximate solution of the selective matching problem is with relative output error (resp., with ). By Eq. (19) the upper bound of 5.67% (resp., 1.71%) also majorizes the relative cost overage incurred by the approximate solution to the selective matching problem compared to the optimal solution , for (resp., for ).
3.6 Cost of Selective Matching
To gauge the cost of selective matching as a function of the grade cardinality , we consider a symmetric setting with maximum entropy, where all grades are attained with equal probability (i.e., and ), and where the output weights and the unit costs are all equal (i.e., ). Assuming a symmetric input for some , La. 1 yields the expected grade-m output,
Summing over all matching grades , with ) yields the (symmetric) relative output loss for selective-matching,
which is increasing (sublinearly) in and decreasing in . Since by symmetry Eq. (8) in Thm. 1 is automatically satisfied, the cost-minimizing input quantity for both type-i inputs, given a desired output , is entirely determined by the output-efficiency condition (7),
This output-driven demand leads to a relative input overage,
which is decreasing in the target output quantity q, due to risk pooling across matching classes. The value of is also equal to the relative cost overage (i.e., , for any with ) compared with perfect matching. 4 However, this relative overage increases almost linearly in the number of excess matching classes .
Example 4 .
When transitioning from to matching classes, the relative overage in input or cost increases by a factor of at least 1.9279 (for ), all else equal. For , the factor approximates . That is, for small target outputs q we observe a slight sublinear increase of the relative overage in the number of matching classes, which becomes somewhat superlinear when target outputs are relatively large.
3.7 Multi-Type Matching
When there are more than two part types (i.e., for with ), an explicit formula for the expected output under the normal approximation is no longer available. This in turn makes it more difficult to derive solid approximation results. The corresponding I-type envelope optimization problem can still be solved explicitly. Eqs. (13) and (14) remain valid, and they determine an optimal solution of the I-type envelope optimization problem (**). The re-scaling in Eq. (15) still applies and so does Eq. (16) for the approximate cost. To illustrate the behavior of the approximation error for compared to , we can “double” Ex. 1 by setting and for the additional component types and . This ensures a fair comparison when the number of component types increases by a factor of 2. Accordingly, the optimal solution to the envelope optimization problem for becomes (compared to for ; cf. Ex. 2). Fig. 4(a) shows the behavior of the relative approximation error, (defined after La. 4), as a function of . The (true) expected output is obtained (approximately) as the sample-average using the corresponding normal distribution (for joint samples) for each matching class, according to Eq. (9). Indeed, the law of large numbers guarantees a consistent approximation of the population mean by the sample mean.
The relative error, while increasing on average by a factor of about 2.94 on the given range (decreasing in q; see Fig. 4(b)) drops below 5% shortly after passing , so that for industrial-size problems the provided approximation continues to be reasonable. A more precise treatment of multi-type matching remains an interesting topic for further research.
4 Application
Consider the production of the oscillator for a mechanical watch movement where hairsprings (i.e., type-1 components) and balance wheels (i.e., type-2 components) are mated across matching classes, where M is an even number. Modern hairsprings are often made in silicon for its antimagnetic properties and are delivered in silicon wafers, with substantial variations of their stiffness across any sample. We assume that the spring coefficient [in Nm/rad] is random with realizations in the interval . Balance wheels are usually made from metal alloys (e.g., containing beryllium or gold among other constituents) and are characterized by the moment of inertia [in ]. The latter can be considered a random variable with realizations in the interval . The prespecified constants and are such that . As already pointed out in footnote 1, the key characteristic of the assembled oscillator is the oscillation period,
which, in itself a random variable, should have realizations close to the target period for the particular movement under consideration. For example, corresponds to today’s most common beat rates, in , which are measured in half-periods (“ticks”) per hour.
4.1 Model Identification
The four sets of primitives for the selective-assembly model, (i)–(iv), are now identified in turn, first the matching classes (i), then the parameters of the multinomial distribution for the sorting products (ii), the output values (iii), and finally the unit input costs (iv).
(i) Matching Classes.
The M type-i matching classes , for , and for feature breakpoints such that
for . According to Weber (2021), the optimal set of breakpoints, which minimize the maximum absolute deviation of T from , in this setup are such that the maximum matching error e is equal over all matching classes, so
where denotes the m-th joint matching class. This approach requires a consistency condition, namely that the ratio of the endpoints is constant across the component types , so
The latter can always be achieved by increasing (resp., decreasing) one of the four endpoints by extending the range of one characteristic (or by adding a zero-value matching class). The optimal set of breakpoints is then given by
for and , where we limit attention to the case where M is even.
(ii) Distributional Parameters.
Given a distribution of the random part characteristics, the multinomial probabilities , for , follow directly from Eq. (1) in Sec. 2.1. Alternatively, it is possible to avoid distributional assumptions about the underlying distribution of characteristics and proceed in a fully data-driven way, based on repeated observation of realizations for the random sorting product . This yields the maximum-likelihood estimates of the unknown parameters of the multinomial distribution,(21) (21)
where is the batch number, denotes the number of grade-m components in batch k, and K the number of observed batches. That is, the maximum-likelihood estimates for the component likelihoods in their respective type-grade combinations correspond simply to their respective frequencies over all observed samples.
Remark 5 (Off-Spec Components).
Because of the empirical regularity that a received batch of input parts may well contain items that do not fit into any of the matching classes, the following question arises: What happens when relaxing the normalization in Eq. (2) to account for “off-spec” components? Specifically, if denotes the size of batch k, then the expression for in Eq. (21) implicitly allows for off-spec components (which do not fit in any ), since generally , while at the same time satisfying Eq. (2). Yet, to explicitly account for the possibility of “off-spec” components let
denote the observed frequency of type-i reject components, as the maximum-likelihood estimate of the true underlying reject probability . Then by adding a matching class 0 with output weight , by Eq. (13) one can simply rescale the approximate solution to the selective matching problem in Eq. (15) to account for the reject probability:(22) (22)
where is any given output target and is given in Thm. 2 (for ). Hence, we can decouple the consideration of off-spec components from the main analysis (which is purely “on-spec”) and then rescale the approximate solution ex post.
(iii) Net Unit Values.
The weighted total output in Eq. (4) requires a vector of positive weights . Since the quality of the assembled mechanical oscillators does not depend on the index m of the matching grade, it is most natural for the firm to set all and consider standard output, i.e., . In standard selective-manufacturing applications, no grade is privileged over others (except possibly for a designated “reject” grade of negligible net unit value); in that case, the grade sorting satisfies the sole purpose of sharpening the precision of the assembly output given the quality imprecisions of the component inputs.
(iv) Unit Input Costs.
The cost function in the selective matching problem ( ) depends on the vector of the unit costs for the two component types. However, as evident from Thm. 1, for any given output quantity the optimal input vector depends only on the ratio of the unit costs, so that there is no need to know their absolute values to determine the composition of the firm’s optimal input portfolio. We assume that the unit cost of a balance wheel is about 10 to 20 times the unit cost of a silicon hairspring, so . In the numerical example, we set , so ; the results for are qualitatively very similar.
4.2 Data Generation
To generate synthetic samples representing the observed stochastic supply characteristics, assume that the true distribution of the normalized type-i component characteristic,
follows a standard normal distribution with cdf , where is the mean and the standard deviation of the (stationary) component population. Given the distributional assumption, the multinomial probabilities (conditional on parts being on-spec) can be obtained by adjusting Eq. (1) as follows:
Note that with positive probability some realizations of the random component characteristic may be “off-spec,” in which case we use input scaling as noted in Sec. 4.1 (ii). The off-spec probability in the component population is . As explained in Remark 5 on off-spec components, the renormalization of the multinomial probabilities has no effect on the likelihood ratios in Eq. (13), so that it is enough to rescale the approximate solution as in Eq. (22), an approach which is implemented here. Finally, we assume that both distributions are centered, so for , but that the coefficient of variation is twice as large for hairsprings than for balance wheels, with and .
4.3 Performance Analysis
Let [in Nm/rad] be the boundaries of the hairspring stiffness and [in ] the boundaries of the balance-wheel inertia (in either case corresponding to a spread of 2% around a central value), so , as required in Sec. 4.1 (i) where we also provide the optimal set of breakpoints for the movement frequency of , and thus the optimal matching classes . Clearly, the maximum matching error e decreases in the number of matching classes , which is a choice variable. For practical purposes it is useful to interpret in terms of “deviation per day” (dpd) [in seconds/day, i.e., s/d] by multiplying numerator and denominator by 345,600 ( ). Thus, since , we obtain that , so(23) (23)
where . That is, Eq. (23) provides the number of matching classes necessary to guarantee a given dpd, as shown in Fig. 5. For example, it is , , and . Finally, for a rather fine partition of part characteristics into matching classes it would be possible (at least theoretically) to achieve the COSC standard of 5s/d without any watchmaker intervention after assembly. Indeed, the Contrôle Officiel Suisse des Chronomètres (COSC) requires mechanical watch movements to run within –4s and +6s per day to issue a “chronometer”' certification, which is equivalent to a dpd of 5. For practical purposes, and to stay exactly in the framework of Weber 2021 used in Sec. 4.1 (i), we round up to the smallest even number of matching classes, thus considering instead of in Eq. (23), as indicated in Fig. 5.
To guarantee practical implementability and feasibility, the approximate solution in Eq. (15) is rounded to an input vector , with the next-highest integers as components (i.e., , for ). An integer discretization (determined by optimality) is also applied to the solution of the original selective matching problem ( ). This solution discretization accounts for some of the fluctuations in Fig. 6, which depicts the optimal costs (in terms of C and ; cf. Eq. (17)) and optimal inputs. To ensure comparability, the costs are normalized to the input cost without matching classes (where ), and the input solutions are normalized to the “overhead-free” input q for all types in that case. As shown in Fig. 6(a), the normalized cost decreases in the target quantity q, which reflects a risk-pooling effect implied by the sublinear increase of the standard deviation of the part characteristics in the amount ordered. It is also apparent that the deviation between the optimal cost C (for optimal integer inputs determined via grid search) and the cost approximation is quite negligible for realistic output targets. It is also apparent that as the dpd-precision requirement becomes more demanding (i.e., smaller), the normalized cost increases very fast—well exceeding 4 for COSC chronometer specifications. Fig. 6(b) illustrates (for ) that these cost increases are driven by a massive over-ordering of the 10-times cheaper hairsprings as “buffer components.” The over-ordering increases both in the production target and the precision requirement.
5 Conclusion
The envelope approximation of the firm’s output objective proved effective in determining a simple closed-form solution for the cost-optimal input portfolio in Eqs. (13) and (22), which includes the possibility of scrap parts. In that solution, somewhat intentionally, the notation of the index set for the different part types (e.g., hairsprings and balance wheels in the watchmaking application) has been used for displaying the results. Indeed, an extension to with part types (e.g., balls, inner/outer rings, and cages, in the case of a ball-bearing assembly) is quite straightforward, and requires no adjustment of these formulas. While this naturally applies to the envelope output in Eq. (9), the same cannot be said for the expected output in Eq. (6), for which a simple closed-form expression may not be within easy reach for more than two input part types.
From a practical viewpoint, a doubling of the number of matching classes tends to roughly double the precision of output parts. 5 As noted in Sec. 3.6, it also leads to almost a doubling of the relative input overage, which is the number of extra units required compared with perfect matching, where the latter corresponds to random matching with equal inputs without quality concerns—equivalent to using a single matching class. That is, all else equal, improving the output tolerance by a certain factor basically multiplies the overage cost by that factor. In addition, one should not overlook the fact that unused parts appear in matching classes for which the counterpart of the other type is stocked-out and on backorder. Yet, the residual inventory vector serves as a buffer for the next order and thus increases expected output relative to an empty stock. A formulation of the corresponding dynamic reordering policy, taking into account parts residuals, is an interesting topic for future research.
Appendix: Proofs
Proof of Lemma 1. Let be a given matching grade and let be an arbitrary input vector. By Eq. (3) the firm’s random grade-m output (conditional on ),
can be viewed as a sum of the grade-m type-1 output and the negative part of the grade-m parts defect, . As the difference of two correlated and normally distributed random variables, the latter follows a normal distribution with mean , and variance . Thus, the expected grade-m output is , where
On the other hand, with following a standard normal distribution,
Taking into account the symmetry of the last term can be computed explicitly,
Hence, the expected grade-m output becomes
By virtue of the fact that , for all , the preceding expression simplifies,
where we have set and . This establishes the claim.
Proof of Lemma 2. Let be a given matching grade and let be an arbitrary input vector. For any , the number of type-i components in grade-m follows a normal distribution with cdf . The cdf of the random grade-m output becomes
Consider now an increase of type-i input from to , and note that
This implies first-order stochastic dominance (FOSD) of the type-i quantity distribution after the quantity increase, . Thus, if we denote by the grade-m yields under , then , where marks the (strict) FOSD-order. But this implies that , so that we can now compare the expected values, , establishing the claimed monotonicity.
Proof of Theorem 1. Let . The Lagrangian associated with the cost-minimization problem ( ) is , where the adjoint variable measures the “shadow cost” of relaxing the output-attainment constraint. The first-order necessary optimality conditions are therefore(24) (24)
for all (Zorich, 2004, p. 527ff). Since by La. 2 it is , which in particular applies to input vectors such that , the output-attainment constraint must be binding, so that (by complementary slackness) necessarily and Eq. (7) must be satisfied if is to be a solution candidate for the selective matching problem ( ). Substitution of Eqs. (5) and (6) (in La. 1) into Eq. (24) for yields the marginal rate of technical substitution,
where
and
As a result,
which yields Eq. (8) for . This concludes our proof.
Proof of Lemma 3. Fix any input bundle . By Jensen’s inequality, it is
Hence, using Eq. (5) and the linearity of the expectation operator, one obtains
which concludes our proof.
Proof of Lemma 4. Let . Consider first the case where , for some . Then by La. 1 it is
On the other hand, since , for all , the expression on the right-hand side of the preceding equality is downward-sloping in , so(25) (25)
A similar reasoning applies for , in which case La. 1 yields
Thus, taking account of the fact that , for all , the expression on the right-hand side of the preceding inequality must be upward-sloping in , whence(26) (26)
Combining Eqs. (25) and (26), and repeating this exercise for all grades, yields that in fact(27) (27)
Using the definitions of the weighted output F and the (equally weighted) envelope output , one obtains by mere summation that , where . The (maximum) approximation error is therefore at most linear in the aggregate standard deviation , so , which concludes our proof.
Proof of Lemma 5. To exclude trivialities, we restrict attention to positive inputs, so . By Eq. (27) we have
so that
On the other hand, if we set , then
where the right-hand side of the preceding inequality does not depend on the grade m. Hence, it is easy to verify that
The latter implies:
Combining this with La. 4, the relative error decreases in the smallest input,
as long as the scale of the application is sufficiently large, so
Thus, for any :
as claimed. But this means that
which in turn implies the minimal output objective,
Finally, we note that is , so that for small the minimum envelope input is also , as claimed. This concludes the proof.
Proof of Theorem 2. As a positive linear combination of concave functions, the envelope output,
is also a concave function. Thus, for any the upper contour set, , is convex. In addition, it is also possible to achieve at least as much envelope output with more inputs, so
As a result, the upper contour set is a polygonal chain (i.e., a connected series of line segments),
in the sense that , where the vertices of the polygonal chain are implied by the conditions(28) (28)
for ; the edges of have the slopes of absolute values , and , for all . The vertex conditions in Eq. (28) imply
taking account of the fact that the likelihood ratios are size-ordered (i.e., ) by construction. Hence, the coordinates of the vertices of are fully described by
for all . By the Hahn-Banach theorem (Berge, 1963, p. 157) there exists a hyperplane separating the convex set , for some positive constant C, from the convex set as long as they are disjoint, i.e., as long as
In particular, if , then necessarily . The aforementioned separation for occurs at the vertex , with
since that avoids an intersection of the separating hyperplane with the budget set . Using the fact that the cost objective and the envelope output are each homogeneous of degree 1, we obtain that the input vector solves the envelope optimization problem ( ).
Proof of Lemma 6. Consider any fixed target output and let the minimal matching probability (which is positive). By La. 5, the right-hand side of Eq. (19) is such that
where we have used the fact that , as pointed out in footnote 3. Thus, by eliminating , the relative cost overage is such that
which corresponds to Eq. (20). This concludes our proof.
Acknowledgements
The author would like to thank several anonymous referees, as well as participants of the 2022 CORS/INFORMS International Conference in Vancouver, Canada, and the 2022 EURO Conference in Espoo, Finland, for helpful suggestions.
Notes on Contributor
Thomas A. Weber is full professor of Operations, Economics and Strategy at the Swiss Federal Institute of Technology in Lausanne (EPFL). Earlier he was a faculty member in the Economics and Finance Group of the Department of Management Science and Engineering at Stanford University. Prof. Weber is an Ingénieur des Arts et Manufactures (Ecole Centrale Paris) and a Diplom-Ingenieur in Electrical Engineering (Technical University Aachen). He holds master’s degrees in Technology and Policy and Electrical Engineering and Computer Science from MIT, and a PhD in Applied Economics and Managerial Science from the Wharton School of the University of Pennsylvania. He was a visiting faculty in Economics at Cambridge University and in Mathematics at Moscow State University. Between 1998 and 2002, he was a senior consultant with the Boston Consulting Group. His current research interests include the economics of information and uncertainty, robust optimization, and strategy. Prof. Weber’s more than 100 scholarly publications have appeared, for example, in American Economic Journal: Microeconomics, Decision Support Systems, Economics Letters, Economic Theory, Health Care Management Science, Information Systems Research, Journal of Economic Dynamics and Control, Journal of Environmental Economics and Management, Journal of Management Information Systems, Journal of Mathematical Economics, Journal of Optimization Theory and Applications, Journal of Regulatory Economics, Management Science, Mathematical Reviews, Medical Decision Making, Operations Research, Operations Research Letters, Optimal Control: Applications and Methods, Physical Review E, Sustainable Production and Consumption, as well as Theory and Decision. He has been associate editor for Management Science and is the author of Optimal Control Theory with Applications in Economics (MIT Press, 2011).
Notes
1 The hairspring features a stiffness (described by [in Nm/rad]) while the balance wheel is characterized by a moment of inertia (given by I [in kg ]). Their ratio determines the oscillation period, .
2 As established in the proof of La. 5, a (very conservative) minimal envelope output is obtained by setting , where is the corresponding minimal input, with being the smallest grade-achievement probability across all part types. For example, given and , the minimum input becomes units.
3 That is feasible in (*) can be seen as follows. By Eq. (5) the production function F is homogeneous of degree 1 (i.e., , for any ). As a consequence, Thus, not only is feasible in (*), but it also satisfies output efficiency (since is in fact binding).
4 Perfect matching happens when there is only a single matching class (i.e., for ); then all parts are matched. Naturally, this indiscriminate approach propagates lax input tolerances into the assembly which drives up other costs (e.g., in terms of rework or rejected output parts for a failure to satisfy tight output tolerances).
5 See, e.g., the deviation-per-day (dpd) precision of a mechanical watch movement in Fig. 5, which increases by a factor of 2 (from 35s to 17.5s) when doubling the matching classes (from to ).
Reproducibility Report
Download PDF (183.3 KB)References
- Asha, A., S. Kannan, and V. Jayabalan (2008). Optimization of clearance variation in selective assembly for components with multiple characteristics. International Journal of Advanced Manufacturing Technology 38(9-10), 1026–1044.
- Berge, C. (1963). Topological Spaces. Edinburgh, UK: Oliver and Boyd.
- Berry, A. (1941). The accuracy of the Gaussian approximation to the sum of independent variates. Transactions of the American Mathematical Society 49(1), 122–136.
- Buckingham, E. (1921). Principles of Interchangeable Manufacturing. New York, NY: Industrial Press.
- Caputo, A. and G. Di Salvo (2019). An economic decision model for selective assembly. International Journal of Production Economics 207, 56–69.
- Coullard, C., A. Gamble, and P. Jones (1998). Matching problems in selective assembly operations. Annals of Operations Research 76, 95–107.
- Duenyas, I., M. Keblis, and S. Pollock (1997). Dynamic type mating. Management Science 43(6), 751–763.
- Esseen, C.-G. (1942). On the Liapunoff limit of error in the theory of probability. Arkiv för Matematik, Astronomi och Fysik A28(9), 1–19.
- Esseen, C.-G. (1956). A moment inequality with an application to the central limit theorem. Skandinavisk Aktuarietidskrift 39, 160–170.
- Kannan, S. and G. Pandian (2021). A new selective assembly model for achieving specified clearance in radial assembly. Materials Today: Proceedings 46(17), 7411–7417.
- Kwon, H.-M., K.-J. Kim, and M. Chandra (1999). An economic selective assembly procedure for two mating components with equal variance. Naval Research Logistics 46(7), 809–821.
- Leontief, W. (1941). The Structure of American Economy 1919–1929: An Empirical Application of Equilibrium Analysis. Cambridge, MA: Harvard University Press.
- Mansoor, E. (1961). Selective assembly – Its analysis and applications. International Journal of Production Research 1(1), 13–24.
- Matsuura, S. (2011). Optimal partitioning of probability distributions under general convex loss functions in selective assembly. IIE Transactions 40(9), 1545–1560.
- Mease, D., V. Nair, and A. Sudjianto (2004). Selective assembly in manufacturing: Statistical issues and optimal binning strategies. Technometrics 46(2), 165–175.
- Pugh, G. (1986). Partitioning for selective assembly. Computers and Industrial Engineering 11(1-4), 175–179.
- Rezaei Aderiani, A., K. Wärmefjord, R. Söderberg, and L. Lindkvist (2019). Developing a selective assembly technique for sheet metal assemblies. International Journal of Production Research 57(22), 7174–7188.
- Shevtsova, I. (2011). On the absolute constants in the Berry-Esseen type inequalities for identically distributed summands. Working Paper, Lomonosov Moscow State University, Moscow, Russia. arXiv:1111.6554v1.
- Tan, M. and C. Wu (2012). Generalized selective assembly. IIE Transactions 44(1), 27–42.
- Weber, T. (2021). Minimum-error classes for matching parts. Operations Research Letters 49(1), 106–112.
- Weber, T. (2022). Optimal matching of random parts. Journal of Mathematical Economics 101, 102688:1–14.
- Zorich, V. (2004). Mathematical Analysis I. New York, NY: Springer.