Graphical Abstract


A lightweight model based on a series of ConvNets that recolors images for users with protanopia. Suitable for edge devices and small adjustments, and a simple introduction to the field of Daltonization and visual recoloring through practical deep-learning frameworks.
Building on new research in CVD recoloring, we designed a model that enhances the salient regions of images to reduce the visual fatigue that colorblind users may face. Suitable for cases where distinguishability is critical.
This model brings in a wave of new data: improving on Dalt-NET, it introduces a new transformer dataset and a teacher-student (knowledge-distillation) architecture. A step toward future research, for cases where transformational integrity is needed but efficiency is even more critical.
The flagship model builds on all of its predecessors, combining the top-down distillation structure, custom datasets, and saliency. It introduces the chromaticity-mapping method, which is more efficient and learnable. Suitable for all types of recoloring, even beyond Color Vision Deficiency.

Color Vision Deficiency (CVD, commonly called color blindness) affects about 8% of men and 0.44% of women worldwide. That's roughly 300 million people, all facing barriers in a world that is becoming increasingly digital, and thus visually oriented. Contrary to common perception, CVD is not a single condition; it spans a long, continuous spectrum of visual impairment.


When the underlying genetics cause one type of cone photoreceptor in the eye to be inhibited or entirely absent, sensitivity to a range of light wavelengths is lost. Because color perception is essentially 3-dimensional (one dimension per cone type), losing a photoreceptor collapses color perception into 2 dimensions, a condition called dichromacy.
Translated into other color spaces, such as LMS and YUV, this manifests as "confusion lines": lines of colors that a dichromatic observer cannot distinguish. In the YUV color space, the confusion lines intersect at a single "confusion point". This also means that the only color information a pure dichromat receives is the "angle" of a color with respect to the confusion point (when represented in YUV space). This representation helps stabilize color spaces for the process of enhancing images for the colorblind, known as "Daltonization".
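As a rough illustration of the idea above, the sketch below converts an RGB color to YUV and measures its angle around a protan confusion point. The conversion matrix is the standard BT.601 one, but the confusion-point coordinates here are an illustrative assumption, not the published protan values:

```python
import numpy as np

# Standard BT.601 RGB -> YUV conversion matrix.
RGB2YUV = np.array([
    [ 0.299,  0.587,  0.114],
    [-0.147, -0.289,  0.436],
    [ 0.615, -0.515, -0.100],
])

# Illustrative protan confusion point in the (U, V) plane. This is an
# assumed value for demonstration only, not a published coordinate.
CONFUSION_UV = np.array([0.0, 0.6])

def perceived_angle(rgb):
    """Angle of a color around the confusion point in the U-V plane.

    Colors on the same ray from the confusion point sit on one confusion
    line, so for a pure dichromat this angle is essentially the only
    chromatic information that survives."""
    _, u, v = RGB2YUV @ np.asarray(rgb, dtype=float)
    du, dv = u - CONFUSION_UV[0], v - CONFUSION_UV[1]
    return float(np.degrees(np.arctan2(dv, du)))
```

Two colors with (nearly) equal angles here would land on the same confusion line and be indistinguishable to a protanope, whatever their distance from the confusion point.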




Another way of understanding dichromacy: the 2nd picture is the protanopic (red colorblind) simulation of the original color gamut (range) in the 1st. Note that the gamuts above (and any 2D gamut) are a cross section of the full human color range (since color is 3D!). In the 2D protanopic gamut, however, only 1 dimension remains, because a point's distance from the confusion point becomes irrelevant. Extended to 3D, the gamut would have only 2 dimensions for protanopes.
Daltonization is the overarching method of recoloring images to make them understandable for CVD individuals. In 2023, I defined 4 criteria: a) contrast is maintained or improved in the Daltonized image; b) colors remain context-consistent across different images; c) real objects keep a context-appropriate, natural color in Daltonized images; d) the algorithm is fast enough to run in real time.
A common Daltonization method is to rotate the "hue" of color clusters around the white point (which has no hue). The white point lies inside the color gamut, so most colors rotate to another color that is also inside the gamut. For protanopia, red usually rotates toward blue: the original red, which appears essentially black to CVD observers, becomes magenta for normal observers but blue for CVD observers.
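A minimal sketch of this static hue rotation, working in YUV so the white point sits at U = V = 0. The BT.601 matrices are standard, but the function itself is an illustrative toy, not a specific published algorithm:

```python
import numpy as np

# Standard BT.601 RGB <-> YUV conversion matrices.
RGB2YUV = np.array([
    [ 0.299,  0.587,  0.114],
    [-0.147, -0.289,  0.436],
    [ 0.615, -0.515, -0.100],
])
YUV2RGB = np.linalg.inv(RGB2YUV)

def rotate_hue(img, degrees):
    """Rotate every pixel's chroma (U, V) around the white point (U = V = 0),
    leaving luma (Y) untouched. img is an H x W x 3 float array in [0, 1]."""
    t = np.radians(degrees)
    rot = np.array([[np.cos(t), -np.sin(t)],
                    [np.sin(t),  np.cos(t)]])
    yuv = img @ RGB2YUV.T
    yuv[..., 1:] = yuv[..., 1:] @ rot.T          # spin chroma around white
    return np.clip(yuv @ YUV2RGB.T, 0.0, 1.0)    # back to RGB, stay in gamut
```

Because grays have zero chroma, they are fixed points of the rotation, which is exactly why rotating "around the white point" preserves neutral colors.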


However, this kind of static hue rotation has many disadvantages. For one, while it can make reds in the original image distinguishable, it will inevitably make some previously distinguishable colors or objects indistinguishable. Additionally, a blue apple might not look right to CVD observers; they would likely prefer a more muted color that is consistent with what the object looks like in the real world around them. Thus, a newer method rotates each color cluster differently, displacing the clusters from a shared confusion line. This is shown in the adjacent image.
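To make the cluster-wise idea concrete, here is a toy sketch: a tiny k-means groups pixels by chroma, then each cluster gets its own rotation angle so clusters are pushed off a shared confusion line. The angle schedule and the clustering details are illustrative assumptions, not the cited study's method:

```python
import numpy as np

def cluster_rotate(uv, k=3, spread_deg=30.0, iters=10):
    """Toy cluster-wise hue rotation on chroma coordinates.

    uv: array of (U, V) chroma pairs, shape (..., 2), white point at origin.
    Runs a tiny k-means, then rotates cluster i by i * spread_deg degrees
    around the white point, so clusters that sat on one confusion line
    end up on different ones."""
    pts = uv.reshape(-1, 2)

    # -- tiny k-means on chroma --
    rng = np.random.default_rng(0)
    centers = pts[rng.choice(len(pts), k, replace=False)]
    for _ in range(iters):
        labels = np.argmin(((pts[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        for i in range(k):
            if np.any(labels == i):
                centers[i] = pts[labels == i].mean(axis=0)

    # -- each cluster gets its own rotation angle (illustrative schedule) --
    out = np.empty_like(pts)
    for i in range(k):
        t = np.radians(i * spread_deg)
        rot = np.array([[np.cos(t), -np.sin(t)],
                        [np.sin(t),  np.cos(t)]])
        out[labels == i] = pts[labels == i] @ rot.T
    return out.reshape(uv.shape)
```

Since each cluster is only rotated, every pixel keeps its distance from the white point (its saturation); only the hue relationships between clusters change.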
Building on this, an even newer study used Generative Adversarial Networks (yes, deep learning!) to represent Daltonization transformations. The authors designed a mini-recoloring algorithm, an "Improved Octree Quantification Method", similar to the method in the paragraph above, and combined it with another study's algorithm (a fixed hue rotation) to create a dataset, which was then used to train several image-to-image GAN networks. The issue here, however, is that GANs are slowww... especially when applied to 30 FPS video.

In 2024, an additional objective, saliency, was drawn from the literature review. There are now five objectives, covering practical usage, quality of life, and scalability, for InnoColor:
InnoColor v3 consists of a chromaticity-mapping architecture built around knowledge distillation. There are 3 versions of the deep-learning framework, each employing methods that enable different objectives. Check them out at the top of the page!
In terms of objectives met, InnoColor surpasses all previous methods, save the distillation data source, in across-the-board performance. Check some of InnoColor's result comparisons here.
InnoColor uses deep-learning-guided image transformation to modify visual content, such as images, video, and GUIs, across both digital software and physical interfaces.