Performance

Introduction

A great survey of nonogram solvers was done by Jan Wolter several years ago; it is available at webpbn.com/survey.

It provides a selection of puzzles of all difficulties (from very easy to almost impossible) that I used for my own testing. I added a few files to that list.

Test configuration

The methodology follows that of the webpbn survey: the Picross Solver is used in validation mode. The goal is to determine whether a puzzle has a unique solution, so the number of solutions returned by the solver is limited to 2.

A typical command line for a single file would be the following (several files can also be passed to the CLI at once):

picross_solver_cli.exe --validation webpbn-00065.non

Equivalently, the same setup can be obtained with:

picross_solver_cli.exe --max-nb-solutions 2 webpbn-00529.non

This produces a more detailed output than the validation mode.
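Why a limit of 2 is enough for validation can be illustrated with a small brute-force sketch. It is written in Python purely for illustration and is not the Picross Solver's code: the names line_arrangements, segments_of and find_solutions are hypothetical, and the exhaustive search is only usable on tiny grids. The point is that the search can stop as soon as a second solution is found, since that already proves the puzzle is not unique.

```python
from itertools import product


def line_arrangements(segments, length):
    """Yield every way to place the segments on a line, as tuples of 0/1."""
    if not segments:
        yield (0,) * length
        return
    first, rest = segments[0], segments[1:]
    # Cells needed by the remaining segments, each preceded by one blank.
    tail = sum(rest) + len(rest)
    for start in range(length - first - tail + 1):
        head = (0,) * start + (1,) * first
        if rest:
            for sub in line_arrangements(rest, length - start - first - 1):
                yield head + (0,) + sub
        else:
            yield head + (0,) * (length - start - first)


def segments_of(line):
    """Return the lengths of the filled runs found on a line of 0/1 cells."""
    runs, run = [], 0
    for cell in line:
        if cell:
            run += 1
        elif run:
            runs.append(run)
            run = 0
    if run:
        runs.append(run)
    return runs


def find_solutions(rows, cols, max_solutions=2):
    """Brute-force search that stops once max_solutions grids are found."""
    found = []
    candidates = [list(line_arrangements(r, len(cols))) for r in rows]
    for grid in product(*candidates):
        columns = zip(*grid)
        if all(segments_of(col) == list(c) for col, c in zip(columns, cols)):
            found.append(grid)
            if len(found) >= max_solutions:
                break  # a second solution is enough to reject uniqueness
    return found


# A 2x2 puzzle that both diagonal grids satisfy.
solutions = find_solutions(rows=[[1], [1]], cols=[[1], [1]])
print("unique" if len(solutions) == 1 else "not unique")
```

With the cap in place, the number of stored grids never exceeds 2, whatever the total number of solutions of the puzzle.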

Version 0.3.0, 2023/03/01

New Test Files

The archive of test files used for the benchmark is shared as part of the release files for 0.3.0. All those files have been converted to the NON format, which has the advantage of properly documenting the puzzle source and copyright information. I highly recommend keeping the same format when sharing those files.

A few files were added compared to the previous set:

Gargantua and Cauldron are examples of very large puzzles (200x200 and 512x512). They are line solvable.
Owl is the most difficult puzzle in the collection from Jakub Wilk's nonogram project.
Raccoon is among the hardest black-white puzzles on nonograms.org (but still line solvable as are all the puzzles on that platform), according to a survey done by Izaron.
Kaa and Mozaic are among the most difficult puzzles from the webpbn collection that still have a unique solution. Kaa is currently the puzzle that takes the Picross Solver the longest to solve, although it still completes in under one hour. But the resulting image is really quite nice and worth the wait!

Results

Test Hardware: My laptop

Processor   Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz
RAM         16 GB
OS          Windows 10
Compiler    MSVC 19.34.31942.0

File	Grid	Size	Difficulty	Timing
webpbn-00001.non	Dance	5x10	LINE	0.064 ms
webpbn-00021.non	Skid	14x25	LINE	0.2 ms
webpbn-00023.non	Edge Pattern	10x11	BRANCH	0.2 ms
webpbn-00006.non	Cat	20x20	LINE	0.33 ms
webpbn-00027.non	Bucks	27x23	BRANCH	0.46 ms
webpbn-00016.non	Knot	34x34	LINE	0.51 ms
webpbn-02413.non	Smoke	20x20	BRANCH	1.0 ms
webpbn-00529.non	Swing	45x45	LINE	1.2 ms
nonograms-org-04010.non	Jellyfish	33x40	LINE	1.3 ms
nonograms-org-42735.non	Wolverine	40x40	LINE	2.6 ms
qnonograms-sun.non	Sun	50x60	LINE	3.23 ms
pattern-5-dom.non	5-Dom	11x11	BRANCH	3.8 ms
nonograms-org-03016.non	Barque	50x60	LINE	4.4 ms
qnonograms-tiger.non	Tiger	75x50	LINE	4.5 ms
webpbn-35105.non	Wild Tattoo	60x45	BRANCH	5.1 ms
webpbn-00436.non	Petroglyph	40x35	BRANCH	5.1 ms
webpbn-34824.non	Wishbone	39x47	BRANCH	5.54 ms
webpbn-01611.non	Merka	55x60	BRANCH	6.17 ms
webpbn-00065.non	Mum	34x40	BRANCH	6.5 ms
webpbn-02556.non	Flag	65x45	MULT	10 ms
webpbn-07604.non	No DiCaprio	55x55	LINE	13 ms
jwilk-091.non	Owl	25x25	BRANCH	40.6 ms
nonograms-org-51237.non	Gargantua	200x200	LINE	45.8 ms
nonograms-org-17403.non	Raccoon	80x65	LINE	51.1 ms
webpbn-10810.non	Centerpiece	60x60	MULT	53.2 ms
cauldron.non	Cauldron	512x512	LINE	53.4 ms
webpbn-01694.non	Tragic	45x50	BRANCH	57.4 ms
webpbn-06574.non	Forever	25x25	BRANCH	60.3 ms
pattern-7-dom.non	7-Dom	15x15	BRANCH	156.3 ms
webpbn-04645.non	Marilyn	50x70	BRANCH	160.5 ms
webpbn-03541.non	Sign	60x50	BRANCH	183.3 ms
webpbn-00803.non	Lighthouse	50x45	BRANCH	508 ms
webpbn-02712.non	Lion	47x47	MULT	2.06 s
webpbn-02040.non	Hot	55x60	BRANCH	4.41 s
pattern-9-dom.non	9-Dom	19x19	BRANCH	22 s
webpbn-29723.non	Mozaic	40x20	BRANCH	450 s
webpbn-06739.non	Karate	40x40	MULT	11 min
webpbn-03379.non	Kaa	40x40	BRANCH	24.5 min
webpbn-09892.non	Nature	50x40		+
webpbn-10088.non	Marley	52x63		+
webpbn-12548.non	Sierpinsky	47x40		+
webpbn-18297.non	Thing	36x42		+
webpbn-22336.non	Gettysburg	99x59		+

Difficulty:
- LINE if the grid is line solvable.
- BRANCH if the grid is not line solvable but still has a unique solution.
- MULT if there are multiple solutions.
Timing: + indicates that the solver was deliberately interrupted because it had been running for more than 1 hour.

Conclusion

There is a very significant gain in performance, mostly due to the improved complexity of the "full" reduction algorithm, which now uses memoization to greatly speed up the previous implementation. A "full" line reduction identifies all tiles that can be logically solved on an individual line, given a constraint and the set of already known tiles. Its time complexity is now O(k.n^3), where k is the number of segments of the constraint on the line and n is the line length. This could probably be brought down to O(k.n^2). The gain comes at the price of a higher memory consumption, with a memory complexity of O(k.n^2).
The "partial" reduction was replaced by the "linear" reduction, which is able to find more tiles (although not all logically deducible ones, contrary to the "full" reduction) using a simple heuristic, and is very fast with a time complexity of O(k.n). The solver's strategy is to make very fast progress on the grid using the "linear" line reduction, then to fall back on the more costly but more accurate "full" line reduction for the toughest lines. It is remarkable that some puzzles (nine in the benchmark set) can be solved using only "linear" reductions.
The solver now always precisely identifies line-solvable picross puzzles, due to the removal of the complexity limit on the full reduction. (Previously, there was a maximum number of alternatives above which the full reduction was not even attempted.) As a result, there is no more ambiguity on puzzles like Tiger.
The "probing" technique is used in more cases than in version 0.2.0, although it is now limited to the initial search depth (depth = 0). Indeed, if the algorithm is already on a search path (i.e. a hypothesis has been made on at least one line), the cost of probing all the hypothesis cases, most of which lead to no solution, did not seem worth it. Benchmarking both alternatives tends to confirm that impression, except in one case: the puzzle Karate. It is indeed the only one for which the performance has not improved since 0.2.0.
With this major gain in performance, and the ability of the solver to handle many of the most difficult puzzles, the Picross Solver is now on par with the best solvers in the webpbn survey.
Memory consumption remains low in this version. For instance, the O(k.n^2) memory cost of the full line reduction is mitigated by using a single memory buffer common to all lines. The search and probing algorithms also use a localized line cache, but its size is bounded: it is equal to the size of two full grids (O(n*n)) per search level.
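To make the notion of a "full" line reduction more concrete, here is a minimal Python sketch of one textbook way to compute it with memoization. It is an illustration under stated assumptions, not the Picross Solver's implementation: the names is_feasible and full_reduction and the "?", "." and "#" tile encoding are hypothetical, and the sketch simply re-runs a memoized feasibility check for each unknown tile, so it makes no claim to match the solver's actual algorithm or its complexity.

```python
from functools import lru_cache

UNKNOWN, EMPTY, FILLED = "?", ".", "#"


def is_feasible(line, segments):
    """True if the segments fit on the line without contradicting known tiles."""
    n, k = len(line), len(segments)

    @lru_cache(maxsize=None)
    def fits(pos, seg):
        # Can line[pos:] be completed using segments[seg:]?
        if seg == k:
            return FILLED not in line[pos:]
        if pos >= n:
            return False
        # Option 1: leave the tile at `pos` empty.
        if line[pos] != FILLED and fits(pos + 1, seg):
            return True
        # Option 2: start the next segment at `pos`.
        end = pos + segments[seg]
        if end > n or EMPTY in line[pos:end]:
            return False
        if end == n:
            return seg + 1 == k
        # A segment must be followed by an empty tile.
        return line[end] != FILLED and fits(end + 1, seg + 1)

    return fits(0, 0)


def full_reduction(line, segments):
    """Return the line with every forced tile set, or None on contradiction."""
    if not is_feasible(line, segments):
        return None
    result = list(line)
    for i, tile in enumerate(line):
        if tile != UNKNOWN:
            continue
        can_fill = is_feasible(line[:i] + FILLED + line[i + 1:], segments)
        can_empty = is_feasible(line[:i] + EMPTY + line[i + 1:], segments)
        if can_fill and not can_empty:
            result[i] = FILLED
        elif can_empty and not can_fill:
            result[i] = EMPTY
    return "".join(result)


print(full_reduction("?????", (3,)))   # ??#??
print(full_reduction("#????", (2,)))   # ##...
```

The memoization table is indexed by (position, segment index), which is why this kind of approach trades extra memory for speed compared to enumerating every alternative of the line.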

Version 0.2.0, 2023/01/28

New Test Files

The set of test files is part of the release files for 0.2.0. A few files were added compared to the previous set:

5-Dom and 7-Dom test patterns. The previous set only had 9-Dom.
Wild Tattoo, created by yours truly with the help of the Picross Solver GUI.
Tiger is a difficult puzzle that is line-solvable, but requires very long reductions to do so. Depending on the settings of the solver, it may be reported as non-line-solvable (that is the case in the result table below). The CLI has a --line-solver option to fully reduce all lines no matter their difficulty.

Results

Test Hardware: My laptop

Processor   Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz
RAM         16 GB
OS          Windows 10
Compiler    MSVC 19.16.27045.0

Puzzles sorted by solving time:

File	Grid	Size	Status	Difficulty	Timing
webpbn-00001.non	Dance	5x10	OK	LINE	0.05 ms
webpbn-00023.non	Edge	10x11	OK		0.20 ms
webpbn-00021.non	Skid	14x25	OK	LINE	0.28 ms
webpbn-00006.non	Cat	20x20	OK	LINE	0.58 ms
webpbn-00027.non	Bucks	27x23	OK		0.80 ms
webpbn-02413.non	Smoke	20x20	OK		1.59 ms
nonograms-org-4010.txt	Jelly-fish	33x40	OK	LINE	2.14 ms
webpbn-00529.non	Swing	45x45	OK	LINE	4.72 ms
pattern-5-dom.txt	5-Dom	11x11	OK		5.9 ms
webpbn-00016.non	Knot	34x34	OK	LINE	9.56 ms
nonograms-org-42735.txt	Wolverine	40x40	OK	LINE	51.1 ms
webpbn-35105.non	Wild Tattoo	60x45	OK		62.4 ms
webpbn-02556.non	Flag	65x45	MULT		62.5 ms
webpbn-00436.non	Petroglyph	40x35	OK		76.3 ms
qnonograms-sun.non	Sun	50x60	OK	LINE	135.4 ms
webpbn-00065.non	Mum	34x40	OK		167.6 ms
webpbn-34824.non	Wishbone	39x47	OK		223.8 ms
pattern-7-dom.txt	7-Dom	15x15	OK		275.8 ms
webpbn-01694.non	Tragic	45x50	OK		556.7 ms
webpbn-06574.non	Forever	25x25	OK		1.3 s
webpbn-07604.non	No DiCaprio	55x55	OK	LINE	4.3 s
nonograms-org-3016.txt	Barque	50x60	OK	LINE	4.6 s
webpbn-00803.non	Lighthouse	50x45	OK		9.75 s
webpbn-01611.non	Merka	55x60	OK		10.2 s
webpbn-04645.non	Marilyn	50x70	OK		22.3 s
qnonograms-tiger.non	Tiger	75x50	OK		36.6 s
webpbn-03541.non	Sign	60x50	OK		44.3 s
webpbn-08098.non	9-Dom	19x19	OK		57.6 s
webpbn-10810.non	Centerpiece	60x60	MULT		93.6 s
webpbn-02712.non	Lion	47x47	MULT		112 s
webpbn-06739.non	Karate	40x40	MULT		259 s
webpbn-02040.non	Hot	55x60	OK		51 m
webpbn-09892.non	Nature	50x40			+
webpbn-10088.non	Marley	52x63			+
webpbn-12548.non	Sierpinsky	47x40			+
webpbn-18297.non	Thing	36x42			+
webpbn-22336.non	Gettys	99x59			+

Status: OK if the solution is unique. MULT if there are multiple solutions. Nothing if the solving process was aborted.
Difficulty: LINE if the grid is line solvable. Otherwise, solving it requires making a hypothesis on at least one line.
Timing: + indicates the solver was interrupted because it was taking too long (usually after 1 hour).

Conclusion

The performance has improved on most puzzles, and the solver is now able to tackle some of the hardest puzzles of the set (Lion, Karate and Hot, which version 0.1.0 could not solve in a reasonable amount of time).

For the most part, the improvements are due to the new "partial reduction" step in the solver, which quickly discovers some of the tiles of a line without doing a full reduction. This allows for very fast progress on the easy parts of a puzzle. The easiest puzzles of the test set are in fact solved solely thanks to that method.
The new "probing" method is a key asset to tackle the hardest puzzles. It consists of a search step limited to depth D+1, performed on the edges of the currently unsolved part of a grid. All tiles identified thanks to that method are set in the grid at depth D, and the line solver resumes from that point on. The most spectacular demonstration of that method is the puzzle Forever, which used to take hours to solve and is now solved in a few seconds.
Unfortunately, on some puzzles the "probing" method consumes a lot of CPU time for nothing, since it does not lead to any progress. This is the case with the most geometric patterns, like the Domino test patterns (9-Dom for instance) and Centerpiece. With the "probing" step disabled, version 0.2.0 is able to solve Centerpiece (i.e. find at least 2 solutions) in less than a second.
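The "probing" step described above can be sketched as follows. This is a simplified Python illustration, not the solver's code: probe, toy_propagate and the dictionary-based grid are hypothetical, and the propagate callback stands in for the real line solver. For each candidate tile, both hypotheses are tested one level deeper; when one of them leads to a contradiction, the other value is proven and kept in the grid.

```python
UNKNOWN, EMPTY, FILLED = "?", ".", "#"


def probe(grid, candidates, propagate):
    """Probe each candidate tile one level deeper and keep what is proven.

    grid       -- dict mapping (row, col) to UNKNOWN, EMPTY or FILLED
    candidates -- positions to probe (e.g. the edges of the unsolved area)
    propagate  -- hypothetical line-solver callback: returns the reduced
                  grid, or None if the hypothesis leads to a contradiction
    """
    for pos in candidates:
        if grid[pos] != UNKNOWN:
            continue
        # Depth D+1: test both hypotheses on a copy of the grid.
        outcome = {}
        for value in (FILLED, EMPTY):
            trial = dict(grid)
            trial[pos] = value
            outcome[value] = propagate(trial)
        if outcome[FILLED] is None and outcome[EMPTY] is None:
            return None  # neither value is possible: contradictory grid
        if outcome[FILLED] is None:
            grid = outcome[EMPTY]   # FILLED refuted, so EMPTY is proven
        elif outcome[EMPTY] is None:
            grid = outcome[FILLED]  # EMPTY refuted, so FILLED is proven
        # If both hypotheses survive, nothing is learned from this tile.
    return grid


def toy_propagate(grid):
    """Toy stand-in for the line solver.

    Rule: tile (0, 1) may not be filled, and tiles (0, 0) and (0, 1)
    must differ once (0, 0) is known.
    """
    if grid[(0, 1)] == FILLED:
        return None
    if grid[(0, 0)] != UNKNOWN and grid[(0, 0)] == grid[(0, 1)]:
        return None
    return grid


start = {(0, 0): UNKNOWN, (0, 1): UNKNOWN}
result = probe(start, [(0, 1), (0, 0)], toy_propagate)
print(result)
```

The sketch also shows where the cost comes from: every probed tile triggers two runs of the line solver, and on a tile where both hypotheses survive that work yields nothing.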

Version 0.1.0, 2021/05/03

Results

Test Hardware: My laptop

Processor   Intel(R) Core(TM) i7-8750H CPU @ 2.20GHz
RAM         16 GB
OS          Windows 10
Compiler    MSVC 19.28.29914.0

Puzzles sorted by solving time:

File	Grid	Size	Status	Difficulty	Timing
webpbn-00001.non	Dance	5x10	OK	LINE	0.05 ms
webpbn-00021.non	Skid	14x25	OK	LINE	0.21 ms
webpbn-00023.non	Edge	10x11	OK		0.26 ms
webpbn-00006.non	Cat	20x20	OK	LINE	0.36 ms
webpbn-00027.non	Bucks	27x23	OK		0.59 ms
webpbn-02413.non	Smoke	20x20	OK		2.81 ms
webpbn-00016.non	Knot	34x34	OK	LINE	9.35 ms
webpbn-02556.non	Flag	65x45	MULT		9.38 ms
nonograms-org-4010.txt	Jelly-fish	33x40	OK	LINE	13.4 ms
webpbn-00529.non	Swing	45x45	OK	LINE	13.4 ms
webpbn-00436.non	Petroglyph	40x35	OK		107.4 ms
nonograms-org-42735.txt	Wolverine	40x40	OK	LINE	311.3 ms
webpbn-00065.non	Mum	34x40	OK		522.6 ms
webpbn-01694.non	Tragic	45x50	OK		1.172 s
webpbn-00803.non	Lighthouse	50x45	OK		3.778 s
webpbn-07604.non	No DiCaprio	55x55	OK	LINE	4.340 s
webpbn-10810.non	Centerpiece	60x60	MULT		18.236 s
nonograms-org-3016.txt	Barque	50x60	OK	LINE	23.195 s
webpbn-08098.non	9-Dom	19x19	OK		67.585 s
webpbn-04645.non	Marilyn	50x70	OK		137.72 s
webpbn-03541.non	Sign	60x50	OK		159.9 s
webpbn-01611.non	Merka	55x60	OK		167.5 s
webpbn-06574.non	Forever	25x25	OK		2.5 h
webpbn-02040.non	Hot	55x60			+
webpbn-02712.non	Lion	47x47			+
webpbn-06739.non	Karate	40x40			+
webpbn-09892.non	Nature	50x40			+
webpbn-10088.non	Marley	52x63			+
webpbn-12548.non	Sierpinsky	47x40			+
webpbn-18297.non	Thing	36x42			+
webpbn-22336.non	Gettys	99x59			+

Status: OK if the solution is unique. MULT if there are multiple solutions. Nothing if the solving process was aborted.
Difficulty: LINE if the grid is line solvable. Otherwise, solving it requires making a hypothesis on at least one line.
Timing: + indicates the solver was manually interrupted because it was taking too long (usually after 1 hour).

Memory

Memory usage is very low: the CLI process stays below 1 MB in validation mode for the puzzles listed above.

Using the library in a more general context, one has to be careful to limit the max number of solutions (in the case of a validation, it is limited to 2 solutions) because some ill-formed puzzles can have a high number of solutions and the solver will allocate a huge quantity of memory to store them all. One such example is Centerpiece if the solver is used with no limit on the number of solutions:

picross_solver_cli.exe webpbn-10810.non ⚠️

Conclusion

My "end game" at the moment is the very well named puzzle Forever (webpbn #6574) for which the Picross Solver correctly finds the unique solution, albeit after several hours of processing!

The robustness of the Picross Solver is satisfactory in the sense that it correctly finds the solution(s) of a wide variety of puzzles, even hard ones. The limiting factor on the most difficult puzzles is really the processing time.

By design, the solver will not start a search (i.e. test several hypotheses with a backtracking technique) before it reaches a state where no further progress can be made with line solving alone. This has the advantage that the solver is good at detecting whether a puzzle is line solvable (the label LINE in the table above). But it comes at a cost performance-wise. In some examples like Barque, which is line solvable, it would probably be more advantageous to do a search on a line with a small number of alternatives rather than exploring the huge solution space of a few lines. The same should hold for puzzles that are not line solvable but have a unique solution.
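To give a sense of scale for the number of alternatives on a line, the count for an empty line (no tile known yet) has a closed form: k segments of total length s can be placed on a line of length n in C(n - s + 1, k) ways. The Python helper below is a hypothetical illustration of that formula (the name nb_alternatives is not taken from the solver).

```python
from math import comb


def nb_alternatives(segments, length):
    """Number of ways to place the segments on an empty line."""
    # Free positions once every segment and mandatory separator is accounted for.
    slack = length - sum(segments) + 1
    if slack < len(segments):
        return 0  # the segments do not fit on the line
    return comb(slack, len(segments))


print(nb_alternatives([1, 1], 5))      # 6
print(nb_alternatives([3], 5))         # 3
print(nb_alternatives([1] * 10, 50))   # grows very quickly with short segments
```

Lines with many short segments on a long line have by far the most alternatives, while lines with few alternatives are cheap candidates for a hypothesis.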
