Smooth quantile regression and distributed inference for non-randomly stored big data

Yükleniyor...
Küçük Resim

Tarih

2023

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Pergamon-Elsevier Science Ltd

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

In recent years, many distributed algorithms towards big data quantile regression have been proposed. However, they all rely on the data are stored in random manner. This is seldom in practice, and the violation of this assumption can seriously degrade their performance. Moreover, the non-smooth quantile loss brings inconvenience in both computation and theory. To solve these issues, we first propose a convex and smooth quantile loss, which converges to the quantile loss uniformly. Then a novel pilot sample surrogate smooth quantile loss is constructed, which can realize communication-efficient distributed quantile regression, and overcomes the non-randomly distributed nature of big data. In theory, the estimation consistency and asymptotic normality of the resulting distributed estimator are established. The theoretical results guarantee that the new method is adaptive to the situation where the data are stored in any arbitrary way, and can work well just as all the data were pooled on a single machine. Numerical experiments on both synthetic and real data verify the good performance of the new method.

Açıklama

The research was supported by NNSF project of China (11901356) .

Anahtar Kelimeler

Quantile Regression, Big Data, Communication Efficiency, Median Regression, Selection

Kaynak

Expert Systems with Applications

WoS Q Değeri

Q1

Scopus Q Değeri

Q1

Cilt

215

Sayı

Künye

Wang, K., Jia, J., Polat, K., Sun, X., Alhudhaif, A., & Alenezi, F. (2023). Smooth quantile regression and distributed inference for non-randomly stored big data. Expert Systems with Applications, 215, 119418.