ColorPCA: Scalable Colored Dimensionality Reduction for Unlabeled High-Dimensional Data
KVL Staff on Project
Sohaib Ghani
sohaib.ghani@kaust.edu.sa
Building 1, Level 0, Office 0121
KAUST PI on Project
Ibrahim Hoteit
ibrahim.hoteit@kaust.edu.sa
Overview
KVL in collaboration with Prof. Ibrahim Hoteit's group, designed an interactive technique for dimensionality reduction of high-dimensional data.
Mapping labeled high-dimensional data to colors based on class labels in low-dimensional projections is effective for enhancing pattern recognition and reducing misinterpretation of clusters. However, automatic coloring of unlabeled high-dimensional data remains challenging for revealing unknown patterns or class structures in the data. To address this, we propose ColorPCA, a scalable method that improves existing dimensionality reduction-based automatic coloring by integrating Principal Component Analysis with alpha compositing. Rather than mapping reduced dimensions to color coordinates, ColorPCA directly encodes data into color space to enhance pattern discovery in unlabeled datasets. We implemented ColorPCA in a web-based visual analytics system for interactive exploration and evaluated it through three case studies using benchmark, simulated, and real-world climate datasets. Additionally, we conducted three user studies, two with generic users and one with climate domain experts. Comparisons with two state-of- the-art coloring methods based on PCA and t-SNE demonstrate that ColorPCA improves visual separability and facilitates deeper insight extraction in high-dimensional data visualization.
The paper entitled “ColorPCA: Scalable Colored Dimensionality Reduction for Unlabeled High-Dimensional Data” can be accessed here.