Clustering algorithm #1

dbalabka · 2021-09-21T20:52:24Z

What clustering algorithm this library implements?
What distance between colors/pixels is used to calculate clusters?

tyt2y3 · 2021-09-22T01:16:08Z

Thank you for sticking around. Sorry for missing the the other issue in vtracer.

This is ancient stuff haha.

Basically the idea is https://www.researchgate.net/publication/8199997_Statistical_Region_Merging (Although I did not start from this paper, the idea is the same).

The key feature of the technique is to able to handle natural images with 'infinite' number of colours and retain sane amount of information to process on.

There is a fundamental trade off between efficiency and statistical optimum, so my suggestion is to start with smaller regions first, then merge those regions hierarchically.

If you knew how many colours there are, or 100% sure it is a graphical image composed with solid colour patches, then perhaps K-means clustering will produce better results. (The question is K=?)

dbalabka · 2021-09-22T06:57:52Z

@tyt2y3, thanks for explaining. It really helps!

Currently, I have to post-process the SVG. I tried to cluster color with K-means, but the problem that I should know exact amount of colors. To fully automate this process, I used DBSCAN which produces the same clustering result but w/o need to know colors amount. The only thing that epsilon input parameter should be tuned properly.

IMO it would great to have different clustering algorithms.

Also, I think about to implement shapes antialiasing. Do you have any ideas? I can creat additional ticket to discuss this.

tyt2y3 · 2021-09-22T07:15:54Z

Definitely great to have different clustering algorithms.
Though, it seems some refactoring has to be done to make the Cluster interface algo-agnostic.

Also, I think about to implement shapes antialiasing.

Um... I don't get it. You want to remove the jagges from where?

dbalabka · 2021-09-22T09:39:02Z

@tyt2y3 here is two examples to illustrate my idea:
Original (VTracer):

Antialiased (VectorMagic):

tyt2y3 · 2021-09-23T08:26:41Z

The above image, are not generated from vtracer right?

I think it can be done by first reducing the number of points in the shape first, then subdivide-smoothing it afterwards.

There are such methods within the codebase.

dbalabka · 2021-09-27T19:46:30Z

@tyt2y3 the first variant has been generated with Vtracer and the second generated with VectorMagic.
Here is an example:

Here is image example:

tyt2y3 · 2021-09-28T02:51:43Z

Oh that's why, I mean the source image is already full of jaggies.

Then yes, it's possible to remove the jaggies by reducing the path with the 'radius' algorithm prior to curve fitting.

Just a side question, in what business scenario are you doing this? Can you share a bit?

That may motivate me a bit or quench my curiosity.

dbalabka · 2021-09-28T07:46:10Z

The goal is to restore the vector image. Provided image is good example of noisy image that previously had vector version. I'm sorry but I can not share much.

Previously, had similar idea to overcome this problem.

tyt2y3 · 2021-09-30T08:15:01Z

Well it looks cute. Not all graphics are made the same.

The more assumptions we can make on the original graphic, the better the recovered result will be.

Or, if we can model the degradation process, we can develop special processing to counteract it.

dbalabka · 2021-10-21T15:55:28Z

Here is a library which might help to implement different clustering algorithms: https://github.com/rust-ml/linfa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clustering algorithm #1

Clustering algorithm #1

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clustering algorithm #1

Clustering algorithm #1

Comments

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!