June 19th, 2024

Algorithmic fitting of Japanese candy

The article discusses algorithmically fitting Japanese candies into a subscription box for Candy Japan. The programmer developed an algorithm considering candy sizes, permutations, and NP-hard complexity. Despite challenges, the project led to discontinuing Candy Japan due to postal rate impacts.

Read original articleLink Icon
Algorithmic fitting of Japanese candy

The article discusses the process of algorithmically fitting Japanese candies into a specific box size for a subscription box service called Candy Japan. The author, a programmer, developed an algorithm to determine the best candy combinations that would fit within the box dimensions. The algorithm considers factors such as total volume, individual candy sizes, and permutations of candy placements. By optimizing the testing process and reducing unnecessary permutations, the algorithm aims to efficiently find suitable candy arrangements. However, the complexity of the problem is highlighted as NP-hard, leading the author to consider existing solutions like the Python package pyShipping for faster results. Despite the challenges faced in optimizing the algorithm, the author acknowledges the learning experience gained from the project. The article concludes with the author's decision to discontinue Candy Japan due to increased postal rates impacting profit margins.

Link Icon 19 comments
By @crazygringo - 5 months
I loved reading this, because it feels like it's only the start. Now I'm really curious (and if anyone can point me to resources) --

1) Surely there are a bunch of more obvious optimizations, like don't place candies where they would fall because of gravity and nothing supporting them underneath, or always placing the candies in order of largest to smallest, and ignore anything symmetrical or rotationally equivalent to a previous solution... and how much would this reduce the search space?

2) What is the actual metric for fitting in a box "in the best way"? That lower layers are as full as possible before adding upper layers? That shifting of contents is minimized while upright? That shifting of contents is minimized in any orientation? To minimize small gaps and try to make non-used space as contiguous and compact as possible? Or is the box size arbitrary and the goal is to find the minimal box volume?

3) Is there separately a metric for "good enough"? Should the metric not to be to fit in a specified box, but rather find the smallest box of a set of standard box sizes that can fit the candies, with any arbitrary arrangement rather than a "best"? Is this a much easier problem, or is it just as hard or harder?

I feel like Amazon must have solved this for all practical purposes.

I suspect that it would work well and be fast if you simply place items in order from largest to smallest by volume, filling from bottom to top, quickly determine which box sizes are too small, and then just find the first arrangement that works. Curious if there are pathological cases where that wouldn't work?

By @resolutebat - 5 months
A friend of a friend worked at NASA on essentially this, except that instead of fitting Japanese candy in a box, they were fitting cargo into the Space Shuttle. As you can imagine this introduces a whole slew of new complications, notably mass distribution and that things must be unloaded in a certain order.
By @alexpotato - 5 months
While working at a Big Bank, a new policy was instituted where we had to fill out time sheets. We also had to allocate our "budgeted time" into these timesheets (Regardless of what we actually did).

It ended up being a bin packing problem that no one actually did till I wrote a script to do it for me. This in turn led to some interesting conversations with my manager and the head of Capital Markets and Banking's CFO.

You can read more details about it here: https://twitter.com/alexpotato/status/1296856648435326976

By @GuB-42 - 5 months
Not only that, but the optimal solution may involve packing at an angle, the most obvious being fitting a candy bar that is too long across the package, in diagonal. If you look at some of the best square packings, the results can be surprisingly messy[1], and few of them have been proven.

It is clear that optimal algorithms are out of the question, which is the conclusion of the article, even with the constraints of orthogonal placement of simple boxes. In fact, in real life, when taking into account packages that are not simple boxes, items that can be squished a little and those that shouldn't be, etc... I wouldn't be surprised if humans were actually better than computers.

[1] https://kingbird.myphotos.cc/packing/squares_in_squares.html

By @zimpenfish - 5 months
> "How hard could it be?"

Having worked on something that required "optimal" box packing for deliveries[1], the answer is "very, and then some (and that was with the help of that fancy USMIL pallet-packing algorithm, IIRC)".

[1] They wanted to use the smallest possible box plus some other constraints like "product X cannot have anything on top of it", etc.

By @vslira - 5 months
A couple of months ago I tried using z3 to create an itinerary for my honeymoon. It seemed like a very straightforward implementation of a TSP with a few extra restrictions. It's 2024 and I had an M2 - which is basically alien tech for the period when these problems started being solved - so I figured that in 5 minutes' time I'd have the perfect walking trip in Rome. To quote TFA, how hard could it be?

The whole story might become a blog post some day, but the punchline is that Moore's law ain't got nothing on NP-hard problems

By @reedf1 - 5 months
If this is presented at programmers, why not just say "Here is a cool practical case of a knapsack/bin packing problem."? The article seems to be intentionally avoiding this.
By @jiggawatts - 5 months
The author mentioned 10^20 combinations taking millions of years, but a modern server GPU can put out 10^15 computations per second (about a petaflop), assuming you use them in FP16 mode. Keep in mind that 65K divisions of a box one meter per side is 15 micrometers! This means that roughly speaking, it would be possible to brute-force through all possibilities in about 10^5 seconds, which is just one day. It helps that this type of algorithm is almost all computation with very little data transfer to and from main memory, and is "embarrassingly parallel".

Some light optimisation such as utilising symmetries to reduce work, combined with multiple GPUs in parallel could bring this down to hours or even minutes.

It would be a fun thing to cloud-host, spinning up spot-priced GPUs on demand.

Similar brute-force tricks can be applied to other NP-hard problems such as optimising electronic circuit board wiring, factory floor planning, etc...

The ridiculous amount of compute we have available to us in a GPU reminds me of this quote from Stargate Atlantis:

JEANNIE: The energy you'd need would be enormous to the point of absurd.

McKAY: Absurd we can do. We have something called a Zero Point Module which essentially does what we're attempting on a smaller scale -- extract energy from subspace time.

By @madaxe_again - 5 months
Used to provide ecommerce solutions, from purchase through to despatch and beyond.

A frequently asked for feature was box-packing - clients would complain that their packers would chuck something tiny in a huge box, use loads of packaging, and still manage to have the thing damaged on arrival due to shaking about in there.

So we implemented the feature. It worked great. Clients bought it.

Literally nobody used it, as humans inevitably decide they know better. Instead, we just ended up with clients reporting “the boxer told our packer to put a stick of gum in a 1m2 palletised delivery!”, as that was what the packer would report, when it had done no such thing, so all we had done was move the point of blame from minimum wage warehouse workers to ourselves.

We withdrew the feature.

By @bencyoung - 5 months
I once had a financial bin packing problem at a previous company. Rather than going down a dynamic programming route which was highly exponential, we decided to use a combination of monte-carlo simulation and a very basic genetic algorithm to try and find a solution. This made use of the fact that we could try solutions extremely quickly, and also timebound the overall time taken (something that was important as we had a limited time window). We'd still be able to run 1-2e6 different approaches in under a second.

Although we couldn't guarentee the "best" approach possible we generally came up with something good enough, often in many orders of magnitude less time.

A good example of the performance improvements you can make when you move from thinking of what the best solution is to what a "good enough" one is.

By @freeone3000 - 5 months
“A few minutes” seems… reasonable? Like, you can let it run for a few hours on a list, come back in the morning, see which combos work. You ship it out once a month, it’s gotta be quicker than testing them by hand.
By @Cthulhu_ - 5 months
(2016). Also posted 8 years ago, 50 comments: https://news.ycombinator.com/item?id=12973964
By @cypherpunks01 - 5 months
Wouldn't using a constraint programming solver like CP-SAT be overall a faster way to do this?

Then one would only have to specify the constraints and let the solver optimize a solution, rather than trying to write a customized packing problem algorithm from scratch. I'm sure it'd be fun, but when trying to best solve a real-world problem, I'd think using well-researched tools would be the quickest and safest way to go.

By @poopcat - 5 months
I loved how you used your skills to address a ~seemingly~ simple problem. The explanation was also cool
By @Minor49er - 5 months
I stumbled across a PHP library that aims to solve this kind of thing

https://github.com/dvdoug/BoxPacker?tab=readme-ov-file#boxpa...

By @lkdfjlkdfjlg - 5 months
> Hey I know, I'm a programmer, I'll just write an algorithm to do it for me. How hard could it be?

Hey I'm a programmer, I'll tell a computer to brute-force it.

To someone who only has a hammer everything looks like a nail.

By @nayuki - 5 months
I see all the <canvas> elements are blank because of JavaScript code errors.
By @deusum - 5 months
This would be a fascinating Code Golf challenge.