Publications

For recent publications, refer to my Google Scholar profile.
† denotes equal contribution, * denotes corresponding author.

Extremal PCA-based synthetic event generation for the inference of extreme precipitation
Kyusoon Kim†, Seeun Park†, Hyeok-Yong Shin†, Jeongmin Choi, Jihee Ha, Junhyeon Kim, Yongho Kwon, Seoncheol Park, Seongjin Kang
Under review, 2026
Graph canonical coherence analysis
Kyusoon Kim, Hee-Seok Oh
Under review, 2026
Statistical graph empirical mode decomposition by graph denoising and boundary treatment
Haeran Cho†, Kyusoon Kim†, Hee-Seok Oh
Under review, 2026
Graph frequency-domain factor modeling
Kyusoon Kim, Hee-Seok Oh
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026

We propose a novel factor model in the graph frequency domain for multivariate data lying on the vertices of a graph, called a multivariate graph signal. By utilizing graph filters, our model extends the frequency-domain approach of the dynamic factor model from time series to graphs, enabling a graph-aware and multiscale interpretation of factors across graph frequencies. This approach reduces the dimensionality of graph signals and improves the understanding of their structure. It also allows the use of the extracted factors for subsequent analyses, such as clustering. We describe the estimation of factors and their loadings and investigate the consistency of the factor estimator. In addition, we propose two consistent estimators for determining the number of factors. The finite sample performance of the proposed method is demonstrated through simulation studies under different graph structures, including a comparison with classical factor analysis and an exploration of how the graph structure affects the results. Furthermore, we demonstrate the effectiveness of the proposed method by applying it to G20 economic data, water quality data from the Geum River, and passenger data from the Seoul Metropolitan subway.

Graph frequency-domain regression
Kyusoon Kim*
Under review, 2026
Principal component analysis in the graph frequency domain
Kyusoon Kim, Hee-Seok Oh
Under review, 2026
Absolute average and median treatment effects as causal estimands on metric spaces
Hyeok-Yong Shin, Kyusoon Kim*, Kyeongyeol Lee, Hee-Seok Oh
Under review, 2026
Cross-spectral analysis of bivariate graph signals
Kyusoon Kim, Hee-Seok Oh
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025

With the advancements in technology and monitoring tools, we often encounter multivariate graph signals, which can be seen as the realizations of multivariate graph processes, and revealing the relationship between their constituent quantities is one of the important problems. To address this issue, we propose a cross-spectral analysis tool for bivariate graph signals. The main goal of this study is to extend the scope of spectral analysis of graph signals to the multivariate case. In this study, we define joint weak stationarity and introduce cross-spectral density and coherence for multivariate graph processes. We propose estimators for the cross-spectral density and investigate the theoretical properties of the proposed estimators. Furthermore, we demonstrate the effectiveness of these tools through numerical experiments, including simulation studies and a real data application. Finally, as an interesting extension, we discuss a robust spectral analysis of graph signals.

Quantile-based fitting for graph signals
Kyusoon Kim, Hee-Seok Oh
Statistics and Computing, 2025

We propose a quantile based fitting method for analyzing graph signals. Unlike traditional approaches for data fitting such as smoothing splines and quantile smoothing splines working on Euclidean space, the proposed method is designed for graph domain, considering the inherent graph structure. In contrast to prevalent graph signal denoising methods that rely on optimization problem with L2-norm fidelity, our approach provides denoised signals that are robust to the existence of outliers, and identifies varying structural relationships within graph signals. We validate the efficacy of our method through comprehensive simulation studies and real data analysis.

Semiparametric approaches for the inference of univariate and multivariate extremes
Seongjin Kang†, Kyusoon Kim†, Yongho Kwon†, Seeun Park†, Seoncheol Park†, Hyeok-Yong Shin†, Junhyeon Kim, Hee-Seok Oh
Extremes, 2025

In this paper, we present several semiparametric approaches for the inference of univariate and multivariate extremes to resolve the tasks from the Data Challenge at the 13th Conference on Extreme Value Analysis. We implement generalized additive models to capture the flexible relationship for point and interval estimations of the conditional quantiles. We also adopt Lp-quantile to estimate the marginal quantiles of extreme levels. To predict probabilities of multivariate extreme events, we implement conditional methods by Heffernan and Tawn (2004) and Keef et al. (2013). When estimating the excess probability of 50-dimensional data, we cluster variables with high correlation after simple data exploration and combine the results obtained from each cluster. We further validate predicted models based on cross-validation and select the best estimates to achieve high accuracy.

Prediction of wafer performance: Use of functional outlier detection and regression
Kyusoon Kim†, Seongoh Oh†, Kyubaek Bae, Hee-Seok Oh
IEEE Access, 2025

Optical emission spectroscopy (OES) data is essential for virtual metrology, enabling accurate predictions of wafer performance in plasma etching processes. This approach not only leads to resource savings but also supports better decision-making. To exploit the consecutive nature of OES data, we propose a prediction method based on a functional approach using multivariate functional partial least squares regression, coupled with dimension reduction and a novel outlier detection technique via functional independent component analysis. The proposed approach improves prediction performance by capturing the continuous nature of OES data and effectively extracting the components that describe the data structure. Numerical experiments, including simulation studies and real-world applications of OES data, demonstrate the effectiveness of the proposed method, especially in the presence of outliers.

Network time series forecasting using spectral graph wavelet transform
Kyusoon Kim, Hee-Seok Oh
International Journal of Forecasting, 2024

We proposed a novel method for forecasting network time series that occur in graphs or networks. Our approach is based on a spectral graph wavelet transform (SGWT) that provides the localized behavior of graph signals around each node. The proposed method improves forecasting performance over other existing methods. In particular, the advantages of the proposed method stand out when signals observed on a graph are inhomogeneous or non-stationary.

Principal component analysis for river network data: Use of spatio-temporal correlation and heterogeneous covariance structure
Kyusoon Kim, Hee-Seok Oh, Minsu Park
Environmetrics, 2023

We developed a PCA method to reflect the unique characteristics of river networks. The strengths of our approach are that it can (i) reduce dimensionality for streamflow data while effectively removing correlation among them and (ii) identify the group structure of data.