Yazar "Sun, Xiaofei" seçeneğine göre listele
Listeleniyor 1 - 3 / 3
Sayfa Başına Sonuç
Sıralama seçenekleri
Öğe Convolution smoothing and non-convex regularization for support vector machine in high dimensions(Elsevier, 2024) Wang, Kangning; Yang, Junning; Polat, Kemal; Alhudhaif, Adi; Sun, XiaofeiThe support vector machine (SVM) is a well-known statistical learning tool for binary classification. One serious drawback of SVM is that it can be adversely affected by redundant variables, and research has shown that variable selection is crucial and necessary for achieving good classification accuracy. Hence some SVM variable selection studies have been devoted, and they have an unified empirical hinge loss plus sparse penaltyformulation. However, a noteworthy issue is the computational complexity of existing methods is high especially for large-scale problems, due to the non-smoothness of the hinge loss. To solve this issue, we first propose a convolution smoothing approach, which turns the non-smooth hinge loss into a smooth surrogate one, and they are asymptotically equivalent. Moreover, we construct computationally more efficient SVM variable selection procedure by implementing non-convex penalized convolution smooth hinge loss. In theory, we prove that the resulting variable selection possesses the oracle property when the number of predictors is diverging. Numerical experiments also confirm the good performance of the new method.Öğe Distributed non-convex regularization for generalized linear regression(Pergamon-Elsevier Science Ltd, 2024) Sun, Xiaofei; Zhang, Jingyu; Liu, Zhongmo; Polat, Kemal; Gai, Yujie; Gao, WenliangDistributed penalized generalized linear regression algorithms have been widely studied in recent years. However, they all assume that the data should be randomly distributed. In real applications, this assumption is not necessarily true, since the whole data are often stored in a non-random manner. To tackle this issue, a non- convex penalized distributed pilot sample surrogate negative log-likelihood learning procedure is developed, which can realize distributed high-dimensional variable selection for generalized linear models, and be adaptive to the non-random situations. The established theoretical results and numerical studies all validate the proposed method.Öğe Smooth quantile regression and distributed inference for non-randomly stored big data(Pergamon-Elsevier Science Ltd, 2023) Wang, Kangning; Jia, Jiaojiao; Polat, Kemal; Sun, Xiaofei; Alhudhaif, Adi; Alenezi, FayadhIn recent years, many distributed algorithms towards big data quantile regression have been proposed. However, they all rely on the data are stored in random manner. This is seldom in practice, and the violation of this assumption can seriously degrade their performance. Moreover, the non-smooth quantile loss brings inconvenience in both computation and theory. To solve these issues, we first propose a convex and smooth quantile loss, which converges to the quantile loss uniformly. Then a novel pilot sample surrogate smooth quantile loss is constructed, which can realize communication-efficient distributed quantile regression, and overcomes the non-randomly distributed nature of big data. In theory, the estimation consistency and asymptotic normality of the resulting distributed estimator are established. The theoretical results guarantee that the new method is adaptive to the situation where the data are stored in any arbitrary way, and can work well just as all the data were pooled on a single machine. Numerical experiments on both synthetic and real data verify the good performance of the new method.