Suling Jia

AI
4papers
19citations
Novelty33%
AI Score19

4 Papers

MEJun 26, 2022
fETSmcs: Feature-based ETS model component selection

Lingzhi Qi, Xixi Li, Qiang Wang et al.

The well-developed ETS (ExponenTial Smoothing or Error, Trend, Seasonality) method incorporating a family of exponential smoothing models in state space representation has been widely used for automatic forecasting. The existing ETS method uses information criteria for model selection by choosing an optimal model with the smallest information criterion among all models fitted to a given time series. The ETS method under such a model selection scheme suffers from computational complexity when applied to large-scale time series data. To tackle this issue, we propose an efficient approach for ETS model selection by training classifiers on simulated data to predict appropriate model component forms for a given time series. We provide a simulation study to show the model selection ability of the proposed approach on simulated data. We evaluate our approach on the widely used forecasting competition data set M4, in terms of both point forecasts and prediction intervals. To demonstrate the practical value of our method, we showcase the performance improvements from our approach on a monthly hospital data set.

LGApr 6, 2021
A hybrid ensemble method with negative correlation learning for regression

Yun Bai, Ganglin Tian, Yanfei Kang et al.

Hybrid ensemble, an essential branch of ensembles, has flourished in the regression field, with studies confirming diversity's importance. However, previous ensembles consider diversity in the sub-model training stage, with limited improvement compared to single models. In contrast, this study automatically selects and weights sub-models from a heterogeneous model pool. It solves an optimization problem using an interior-point filtering linear-search algorithm. The objective function innovatively incorporates negative correlation learning as a penalty term, with which a diverse model subset can be selected. The best sub-models from each model class are selected to build the NCL ensemble, which performance is better than the simple average and other state-of-the-art weighting methods. It is also possible to improve the NCL ensemble with a regularization term in the objective function. In practice, it is difficult to conclude the optimal sub-model for a dataset prior due to the model uncertainty. Regardless, our method would achieve comparable accuracy as the potential optimal sub-models. In conclusion, the value of this study lies in its ease of use and effectiveness, allowing the hybrid ensemble to embrace diversity and accuracy.

AINov 1, 2019
Research and application of time series algorithms in centralized purchasing data

Yun Bai, Suling Jia, Xixi Li

Based on the online transaction data of COSCO group's centralized procurement platform, this paper studies the clustering method of time series type data. The different methods of similarity calculation, different clustering methods with different K values are analysed, and the best clustering method suitable for centralized purchasing data is determined. The company list under the corresponding cluster is obtained. The time series motif discovery algorithm is used to model the centroid of each cluster. Through ARIMA method, we also made 12 periods of prediction for the centroid of each category. This paper constructs a matrix of "Customer Lifecycle Theory - Five Elements of Marketing ", and puts forward corresponding marketing suggestions for customers at different life cycle stages.

IRApr 26, 2018
A Hybrid Recommendation Method Based on Feature for Offline Book Personalization

Xixi Li, Jiahao Xing, Haihui Wang et al.

Recommendation system has been widely used in different areas. Collaborative filtering focuses on rating, ignoring the features of items itself. In order to effectively evaluate customers preferences on books, taking into consideration of the characteristics of offline book retail, we use LDA model to calculate customers preference on book topics and use word2vec to calculate customers preference on book types. When forecasting rating on books, we take two factors into consideration: similarity of customers and correlation between customers and books. The experiment shows that our hybrid recommendation method based on features performances better than single recommendation method in offline book retail data.