CLUSTERING INCOMPLETE SPECTRAL DATA WITH ROBUST METHODS
- Faculty of Information Technology, University of Jyväskylä, Jyväskylä, Finland
Keywords: Robust, Clustering, Spectral data, Interpolation, K-means, nan-K-spatmed
Abstract. Missing value imputation is a common approach for preprocessing incomplete data sets. In case of data clustering, imputation methods may cause unexpected bias because they may change the underlying structure of the data. In order to avoid prior imputation of missing values the computational operations must be projected on the available data values. In this paper, we apply a robust nan-K-spatmed algorithm to the clustering problem on hyperspectral image data. Robust statistics, such as multivariate medians, are more insensitive to outliers than classical statistics relying on the Gaussian assumptions. They are, however, computationally more intractable due to the lack of closed-form solutions. We will compare robust clustering methods on the bands incomplete data cubes to standard K-means with full data cubes.