A Comparative Analysis of Machine Learning and Grey Models

机器学习与灰色模型的比较分析

ABSTRACT Artificial Intelligence (AI) has recently shown its capabilities for almost every field of life. Machine Learning, which is a subset of AI, is a ‘hot’ topic for researchers. Machine Learning outperforms other classical forecasting techniques in almost all-natural applications and it is a crucial part of modern research. Many modern Machine Learning methods require a large amount of training data. Due to the small datasets, the researchers may not prefer to use Machine Learning algorithms that require large training data. To tackle this issue, this survey illustrates, and demonstrates related studies for significance of Grey Machine Learning (GML). Which is capable of handling large datasets as well as small datasets for time series forecasting likely outcomes. This survey presents a comprehensive overview of the existing grey models and machine learning forecasting techniques. To allow an in-depth understanding for the readers, a brief description of Machine Learning, as well as various forms of conventional grey forecasting models are discussed. Moreover, a brief description on the importance of GML framework is presented.

摘要人工智能 (AI) 最近展示了其在几乎所有生活领域的能力。机器学习作为 AI 的一个子集，是研究者的热门话题。在几乎所有自然应用中，机器学习都优于其他经典预测技术，并且是现代研究的关键部分。许多现代机器学习方法需要大量的训练数据。由于数据集较小，研究者可能不愿意使用需要大量训练数据的机器学习算法。为了解决这个问题，本调查展示并演示了灰色机器学习 (Grey Machine Learning, GML) 在时间序列预测中的重要性相关研究。GML 能够处理大数据集以及小数据集的时间序列预测。本调查全面概述了现有的灰色模型和机器学习预测技术。为了让读者深入理解，简要介绍了机器学习以及各种形式的传统灰色预测模型。此外，还简要介绍了 GML 框架的重要性。

INDEX TERMS Machine Learning, Grey Models, Grey Machine Learning, Forecasting, Small Sample Learning

索引术语机器学习 (Machine Learning)，灰色模型 (Grey Models)，灰色机器学习 (Grey Machine Learning)，预测 (Forecasting)，小样本学习 (Small Sample Learning)

INTRODUCTION

引言

ACHINE learning techniques plays an essential role especially for forecasting [1, 2]. Every country has a relevant organization that analyzes, and collects the economic facts, and figures to predict future tendency for several economic indicators and assist policy-makers in their decision-making [3, 4]. Data gathered from industries (e.g., demand and sale) remain insufficient. Recently, several types of forecasting methods were proposed and can be divided into two main categories: (i) Qualitative and (ii) Quantitative. Qualitative methods include expert system, trend prediction, Delphi, etc. Meanwhile, the quantitative methods include multi-linear regression analysis, exponential smoothing, time series analysis, and genetic algorithms [5, 6]. These forecasting methods are constrained by the lack of data, complicated input variables, and predicted environmental changes [7]. There are more than 300 studies related to forecasting. However, only a few of such studies are reliable. Despite the well-developed scientific technologies, There are several social and natural factors which are unexplained, uncertain, or incomplete. Besides the availability of an extensive range of technologies and frameworks, which can be used for big datasets [8, 9].

机器学习技术在预测中扮演着至关重要的角色 [1, 2]。每个国家都有一个相关组织，负责分析和收集经济事实和数据，以预测未来若干经济指标的趋势，并协助决策者进行决策 [3, 4]。从行业收集的数据（例如需求和销售）仍然不足。最近，提出了几种类型的预测方法，可以分为两大类：(i) 定性方法和 (ii) 定量方法。定性方法包括专家系统、趋势预测、德尔菲法等。同时，定量方法包括多元线性回归分析、指数平滑、时间序列分析和遗传算法 [5, 6]。这些预测方法受到数据缺乏、复杂的输入变量和预测环境变化的限制 [7]。有超过 300 项与预测相关的研究。然而，只有少数研究是可靠的。尽管科学技术已经相当发达，但仍有一些社会和自然因素无法解释、不确定或不完整。此外，尽管有广泛的技术和框架可用于处理大数据集 [8, 9]。

Recent forecasting review studies offer a systematic overview of current forecasting models and their classification. Hippert et al. [10] presented a review on short-term load forecasting. Mat Daut et al. presented a review on building electrical energy consumption forecasting analysis using conventional and AI methods [11]. Zhao et al. classified and reviewed the existing methods for building energy consumption prediction [12]. A review on energy demand models for forecasting proposed by Suganthi and Samuel [13]. Fumo et al. presented a detailed study on building energy estimation and classification [14]. Furthermore, Martinez-Alvarez et al. presented a survey on data mining techniques for time series forecasting of electricity [15]. Raza and Khosravi presented a review on short-term load forecasting techniques based on AI techniques [16]. Wang et al. proposed a review of AI based building energy prediction with a focus on ensemble prediction models [17].

最近的预测综述研究对当前的预测模型及其分类进行了系统概述。Hippert 等人 [10] 对短期负荷预测进行了综述。Mat Daut 等人对使用传统和 AI 方法进行建筑电力消耗预测分析进行了综述 [11]。Zhao 等人对现有的建筑能耗预测方法进行了分类和综述 [12]。Suganthi 和 Samuel 对用于预测的能源需求模型进行了综述 [13]。Fumo 等人对建筑能源估算和分类进行了详细研究 [14]。此外，Martinez-Alvarez 等人对用于电力时间序列预测的数据挖掘技术进行了调查 [15]。Raza 和 Khosravi 对基于 AI 技术的短期负荷预测技术进行了综述 [16]。Wang 等人对基于 AI 的建筑能源预测进行了综述，重点关注集成预测模型 [17]。

Recently, AI has shown its capabilities for almost every field of life. In 2015, The authors designed a machine to understand Mongolian and passed the Turing test [18]. Such research has shown that the computer can function as humans in handwriting tasks. In 2016, David Silver et al. published AlphaGo’s first paper, stating that the machine could beat the Go game practitioners for the very first time. This article was published after AlphaGo defeated the World Champion Lee Sedol. The success of AlphaGo had also demonstrated that the machine can not only be like humans, but can also be smarter than humans, and the output of this research gives AI enormous confidence [19]. Chirag et al. presented a review on time series forecasting techniques for building energy consumption [20]. The superior performance of the hybrid and ensemble models for time series forecasting was also proposed in recent review studies [21–25]. All the above surveys and studies provided vital information on forecasting models on different scales.

近年来，AI 几乎在生活的各个领域都展示了其能力。2015 年，作者设计了一台能够理解蒙古语的机器，并通过了图灵测试 [18]。此类研究表明，计算机在手写任务中可以像人类一样工作。2016 年，David Silver 等人发表了 AlphaGo 的第一篇论文，指出该机器首次能够击败围棋从业者。这篇文章是在 AlphaGo 击败世界冠军李世石之后发表的。AlphaGo 的成功也表明，机器不仅可以像人类一样，甚至可以比人类更聪明，这项研究的成果为 AI 带来了巨大的信心 [19]。Chirag 等人对建筑能耗的时间序列预测技术进行了综述 [20]。最近的综述研究也提出了混合模型和集成模型在时间序列预测中的优越性能 [21–25]。上述所有调查和研究为不同规模的预测模型提供了重要信息。

FIGURE 1. Illustration the criteria and selection process of this survey paper within the domain of building optimization.

图 1: 本综述论文在建筑优化领域中的标准和选择过程示意图。

Besides this, a question that needs to be addressed is how can researchers use Machine Learning techniques using extremely small datasets to acquire high accuracy and speed?

此外，一个需要解决的问题是，研究人员如何利用极小的数据集使用机器学习技术来获得高准确性和速度？

To answer this, a comprehensive overview of Machine Learning, grey models, and GML framework illustrated in this survey to highlight the present outlooks, and enhancement for the classical proposed methods.

为了回答这个问题，本调查对机器学习、灰色模型和GML框架进行了全面概述，以突出当前的前景，并对经典提出的方法进行了改进。

A. OBJECTIVES OF THE SURVEY

A. 调查目标

A forecasting model can be rely on static data that compares a dependent variable to a collection of independent variables, or it can be rely on composite or simultaneous time series data [20]. The importance of time series analysis has evolved as people more convinced of the significance of real-time data monitoring and storage [26]. The aim of this survey is to understand more about the existing time series forecasting framework known as GML. The key contributions of this article can be summarized as follows:

预测模型可以依赖于将因变量与一组自变量进行比较的静态数据，也可以依赖于复合或同步时间序列数据 [20]。随着人们越来越认识到实时数据监控和存储的重要性，时间序列分析的重要性也在不断提升 [26]。本次调查的目的是更深入地了解现有的时间序列预测框架 GML。本文的主要贡献可以总结如下：

This subsection describes the process of this paper and provides a summary of the articles discussed in Figure 1. The structure of this survey is given as follows: To illustrate the core concept of GML. Firstly, we provide a primer survey of Machine Learning algorithms and their applications in Section II. The general forms of the conventional grey models, including popular models, are illustrated in Section III. The general formulations including the computational details of GML are discussed in Section IV. A brief discussion and future perspectives of GML have been presented in Section V. Finally, the conclusion is presented in Section VI.

本小节描述了本文的过程，并对图1中讨论的文章进行了总结。本调查的结构如下：为了说明GML的核心概念，首先在第二部分提供了机器学习算法及其应用的初步调查。第三部分展示了传统灰色模型的一般形式，包括流行模型。第四部分讨论了GML的一般公式，包括计算细节。第五部分简要讨论了GML的未来展望。最后，第六部分给出了结论。

II. MACHINE LEARNING

II. 机器学习

Machine Learning is a subset of AI specifically developed to simulate human intelligence. It is essentially a data processing approach that automates the construction of an empirical model. In other words, it is focus on the premise that algorithms can learn from data, recognize patterns and make decisions with minimal human intervention. Recently, Machine Learning techniques have been used for Big Data [27] and has been widely implemented in some of the areas, varying from computer vision [28], finance [29], spacecraft engineering [30], entertainment [31], pattern recognition [32], and computational biology to biomedical applications [33]. In this overview, the types of Machine Learning algorithms as well as popular techniques of Machine Learning are presented.

机器学习是人工智能的一个子集，专门用于模拟人类智能。它本质上是一种自动化构建经验模型的数据处理方法。换句话说，它关注的前提是算法可以从数据中学习、识别模式并在最少人为干预的情况下做出决策。最近，机器学习技术已被用于大数据 [27]，并在一些领域得到了广泛应用，包括计算机视觉 [28]、金融 [29]、航天工程 [30]、娱乐 [31]、模式识别 [32] 以及计算生物学和生物医学应用 [33]。在本概述中，将介绍机器学习算法的类型以及流行的机器学习技术。

A. OVERVIEW OF MACHINE LEARNING

A. 机器学习概述

In this subsection, a brief analysis over time to explore the history of Machine Learning as well as the most significant

在本小节中，我们将简要分析机器学习的历史及其最重要的

milestones are presented. Furthermore, we have divided this overview into several categories to make it more understandable.

里程碑被呈现出来。此外，我们将此概述分为几个类别，以便更易于理解。

1) Classical work

1) 经典工作

Although Arthur Samuel et al. was an American pioneer of AI and inventor of the term “Machine Learning” in International Business Machines (IBM) a leading US computer manufacturer in 1959 [34], the year 1950 marks the first time Alan Turing et al. proposed the “Turing Machine” while posing such questions as, “Can machines think?” and “Can machines do what we can do? (as thinking entities)”. For instance, if a machine has true intellect, then the machine device must have the ability to trick a human being into thinking that it is also a human [35]. In Turings’ proposed study, there are many features that could be exhibited by machine intelligence and the different consequences for architecture are revealed, this is the first discovery in the field of Machine Learning. Earlier research works planned to connect the computer with human interaction. For this purpose, the first Machine Learning program was published in 1953, which is written by Arthur Samuel et al. [34]. The software was a game (Checkers). The IBM machine improved the design of the game and its progress, helped to refine the winning tactics, and integrated certain movements into the software. Based on the above studies, it can be analyzed that, in the early stages of Machine Learning developments, scientists and engineers introduced Machine Learning applications to improve computer intelligence.

虽然 Arthur Samuel 等人是 AI 的先驱，并于 1959 年在美国领先的计算机制造商 IBM (International Business Machines) 中提出了“机器学习 (Machine Learning)”这一术语 [34]，但 1950 年标志着 Alan Turing 等人首次提出了“图灵机 (Turing Machine)”，同时提出了诸如“机器能思考吗？”和“机器能做我们能做的事吗？（作为思考实体）”等问题。例如，如果一台机器具有真正的智能，那么该机器设备必须具备欺骗人类使其认为它也是人类的能力 [35]。在 Turing 提出的研究中，机器智能可以展示许多特征，并揭示了架构的不同后果，这是机器学习领域的首次发现。早期的研究工作计划将计算机与人类互动连接起来。为此，1953 年发布了第一个机器学习程序，由 Arthur Samuel 等人编写 [34]。该软件是一个游戏（跳棋）。IBM 机器改进了游戏的设计及其进展，帮助完善了获胜策略，并将某些动作集成到软件中。基于上述研究，可以分析出，在机器学习发展的早期阶段，科学家和工程师引入了机器学习应用来提高计算机的智能。

In 1957, Frank Rosenblatt et al. proposed the first neural network for computers namely called “the perceptron” [36], which simulates the thinking patterns in the human brain. Moreover, T.M. Cover and P.E. Hart proposed the “nearest neighbor” algorithm in 1967 [37]. This kind of algorithm is used for simple pattern recognition. Essentially, It was used to make a path for passengers beginning with a random location, but to make sure that they reach all cities on a short ride. Moreover, Stanford University students developed the “Stanford Cart” in 1979 [38], which can navigate obstacles automatically. The Cart was a moderately remotely controlled television-equipped mobile robot in the Stanford Artificial Intelligence Laboratory (SAIL). Other researchers proposed several explanation-based approaches, which can be linked back to the MACROPS learning strategies used in STRIPS [39]. The key sheet of Machine Learning algorithms have been discussed in Figure 2.

1957年，Frank Rosenblatt等人提出了第一个用于计算机的神经网络，称为“感知器 (perceptron)” [36]，它模拟了人脑的思维模式。此外，T.M. Cover和P.E. Hart在1967年提出了“最近邻 (nearest neighbor)”算法 [37]。这种算法用于简单的模式识别。本质上，它用于为乘客从随机位置开始规划路径，但确保他们能在短途旅行中到达所有城市。此外，斯坦福大学的学生在1979年开发了“斯坦福车 (Stanford Cart)” [38]，它可以自动避开障碍物。该车是斯坦福人工智能实验室 (SAIL) 中一种中等远程控制的配备电视的移动机器人。其他研究人员提出了几种基于解释的方法，这些方法可以追溯到STRIPS中使用的MACROPS学习策略 [39]。图2讨论了机器学习算法的关键内容。

According to the expansion and enhancement, it must be credited to Silver, Mitchell, and DeJong. At the same period, each of these researchers built very broad Explanation Based Learning (EBL) computer systems-LP (Silver), ESA (DeJong), and LEX2 (Mitchell). This kind of systems analyzes training data and generates a basic law that can be enforced by discarding the redundant data. It presents a historical account of the development of EBL and discusses some of the important outstanding research tasks [40]. Due to the increasing size of data, in 1998, the Scientists attracted a large number of data programs and applications (Data-Driven Approach using Machine Learning) [41, 42].

根据扩展和增强的内容，必须归功于 Silver、Mitchell 和 DeJong。在同一时期，这些研究人员各自构建了非常广泛的基于解释的学习 (Explanation Based Learning, EBL) 计算机系统——LP (Silver)、ESA (DeJong) 和 LEX2 (Mitchell)。这类系统通过分析训练数据并生成一个基本法则，该法则可以通过丢弃冗余数据来强制执行。它提供了 EBL 发展的历史记录，并讨论了一些重要的未解决的研究任务 [40]。由于数据量的不断增加，1998 年，科学家们吸引了大量的数据程序和应用程序（使用机器学习的基于数据驱动的方法）[41, 42]。

FIGURE 2. Popular Machine Learning Algorithms.

图 2: 流行的机器学习算法。

The above history shows that many researchers have been doing their research from time to time and played a key role in the field of Machine Learning. In the 21st century, the new millennium introduced an abundance of integrated technology. There is Machine Learning where adaptive programs are expected. These algorithms are capable of detecting patterns, extracting new knowledge from inputs, learning from practice, and optimizing the efficiency and accuracy of their analysis and output. Essentially, the researchers have taken the main ideas from the earlier proposed works and have enhanced their work in the form of new inventions to develop new algorithms, software, games, as well as probabilistic reasoning, particularly in the field of automated medical diagnosis [43].

上述历史表明，许多研究人员一直在不断进行研究，并在机器学习领域发挥了关键作用。21世纪，新千年带来了大量集成技术。在需要自适应程序的地方，机器学习应运而生。这些算法能够检测模式，从输入中提取新知识，从实践中学习，并优化其分析和输出的效率和准确性。本质上，研究人员从早期提出的工作中汲取了主要思想，并以新发明的形式增强了他们的工作，开发了新的算法、软件、游戏，以及概率推理，特别是在自动化医疗诊断领域 [43]。

In 2000, Thomas G. Dietterich et al. [44] proposed the ensemble methods in Machine Learning. As we know the original set approach is Bayesian averaging, but several techniques offer error-correcting output coding, boosting, and bagging. The authors analyzed these approaches and clarified why the ensembles can often do better than any single classifier [45]. In 2002, the authors designed a new technique for gene selection using Support Vector Machine (SVM) approaches focused on Recursive Feature Elimination (RFE) for cancer classification [46]. Experimentally, they proved that the genes identified by their strategies yield better detection efficiency and are biologically important to cancer.

2000年，Thomas G. Dietterich等人[44]提出了机器学习中的集成方法。众所周知，最初的集成方法是贝叶斯平均法，但还有几种技术提供了纠错输出编码、提升法和装袋法。作者分析了这些方法，并阐明了为什么集成方法通常比任何单一分类器表现更好[45]。2002年，作者设计了一种新的基因选择技术，使用支持向量机（SVM）方法，专注于递归特征消除（RFE）用于癌症分类[46]。实验证明，通过他们的策略识别出的基因具有更好的检测效率，并且对癌症具有重要的生物学意义。

This research shows that Machine Learning models have already been used to diagnose human diseases. In other research, the authors suggested that Machine Learning should also be a faster and better approach to corner-detection [47].

研究表明，机器学习模型已被用于诊断人类疾病。在其他研究中，作者提出机器学习也应成为角点检测更快、更好的方法 [47]。

2) State-of-the-art

2) 最新技术

Recently, several researchers focusing on Machine Learning technologies and algorithms merge diverse areas to increase production efficiencies, such as cloud computing, biomedical engineering, network security, image processing, forecasting, Internet of Things (IoT), and Big Data technology [48– 53]. In 2019, Wenrui Yang et al. introduced a system for sports image detection using Machine Learning. The purpose of this study was to develop the identification of athletes, the judgment on sport behavior, the perception of motion, and the development of a test framework to validate the effectiveness of the research process [54]. In another study, the authors suggested a combination of big data processing using cloud computing and Machine Learning [55]. In this study, they proposed that, Machine Learning seems to be an ideal solution for exploring the possibilities concealed for big data. Recently, the authors published a performance analysis of the new Machine Learning algorithm and the Logistic Prediction Method called MLIA [56]. In this review, the authors demonstrated that Machine Learning has a significant predictive effect of MLIA on the measurement of financial credit risk and can provide a theoretical basis for subsequent relevant studies.

最近，一些专注于机器学习技术和算法的研究人员将云计算、生物医学工程、网络安全、图像处理、预测、物联网 (IoT) 和大数据技术等不同领域结合起来，以提高生产效率 [48–53]。2019 年，Wenrui Yang 等人介绍了一种使用机器学习进行体育图像检测的系统。该研究的目的是开发运动员识别、运动行为判断、运动感知以及开发测试框架以验证研究过程的有效性 [54]。在另一项研究中，作者提出了结合云计算和机器学习的大数据处理方法 [55]。在这项研究中，他们认为机器学习似乎是探索大数据隐藏可能性的理想解决方案。最近，作者发表了一种新的机器学习算法和称为 MLIA 的逻辑预测方法的性能分析 [56]。在这篇综述中，作者证明了机器学习对金融信用风险测量具有显著的预测效果，并可以为后续相关研究提供理论基础。

FIGURE 3. Key sheet of Machine Learning Algorithms.

图 3: 机器学习算法关键表

In addition to the role of Machine Learning in wireless communication, Machine Learning techniques are anticipated to play a significant part in the implementation of the fifth-generation (5G). Researchers have recently presented an overview of wireless communication channel modeling based on Machine Learning [57]. In this overview, the writers addressed 5G with massive Multiple-Input / Multiple-Output (MIMO), quick handover, higher data rate, and channel simulation becoming more complicated than other conventional stochastic or deterministic models. To this purpose, scholars and academics are looking forward to more effective methods that are less complicated and more reliable. For example, Emerging Machine Learning Methods can offer a new direction for the analysis of big data and traffic data. To diagnosing human diseases by using Deep learning techniques [58] such as, the Convolutional Neural Networks (CNNs) have provided significant performance to boost the fields related to diagnosing human diseases. CNN techniques have been applied successfully for several tasks, like computer-aided diagnosis, image enhancement/generation, classification, and segmentation [59–64]. In comparison of Machine Learning with IoT and Big Data, Jangam J.S Mani and Sandhya Rani Kasireddy propounded a framework that classifies the population into four classes based on diet efficiency. After 30 days of dietary retrieval, they devour as normal, unbalanced, almost balanced, and almost unbalanced by using logistic regression, linear discriminant analysis (LDA), and random forest algorithm [65]. In Figure 3, the popular Machine Learning algorithms have been demonstrated.

除了机器学习在无线通信中的作用外，机器学习技术预计还将在第五代（5G）的实现中发挥重要作用。研究人员最近提出了基于机器学习的无线通信信道建模概述 [57]。在这篇概述中，作者讨论了5G的大规模多输入/多输出（MIMO）、快速切换、更高的数据速率以及信道模拟比其他传统的随机或确定性模型更加复杂的问题。为此，学者们期待更有效、更简单且更可靠的方法。例如，新兴的机器学习方法可以为大数据和流量数据的分析提供新的方向。通过使用深度学习技术 [58] 如卷积神经网络（CNNs），在诊断人类疾病的相关领域中提供了显著的性能提升。CNN技术已成功应用于多个任务，如计算机辅助诊断、图像增强/生成、分类和分割 [59–64]。在机器学习与物联网（IoT）和大数据的比较中，Jangam J.S Mani 和 Sandhya Rani Kasireddy 提出了一个框架，该框架根据饮食效率将人群分为四类。在30天的饮食数据检索后，他们使用逻辑回归、线性判别分析（LDA）和随机森林算法将人群分为正常、不平衡、几乎平衡和几乎不平衡 [65]。图3展示了流行的机器学习算法。

To discuss the data processing architecture, Machine Learning approaches are used to allow precise tuning to train a classifier for large-scale datasets. To serve IoT applications, these infrastructures use Machine Learning or AIbased techniques which evaluate entity or system data to produce valuable knowledge that can be used for service or decision-making. Machine Learning methods make it possible for machines to communicate with people, drive cars automatically, forecasting, writing and publishing sport match reports, and identifying criminal suspects as well. Furthermore, Machine Learning has a serious impact on most businesses and employees inside them, that is why a professional will at least have a context about what Machine Learning is, and how it is evolving [2, 66, 67].

为了讨论数据处理架构，机器学习方法被用于精确调整以训练大规模数据集的分类器。为了服务于物联网应用，这些基础设施使用机器学习或基于人工智能的技术来评估实体或系统数据，从而产生可用于服务或决策的有价值知识。机器学习方法使机器能够与人交流、自动驾驶汽车、进行预测、撰写和发布体育比赛报告，以及识别犯罪嫌疑人。此外，机器学习对大多数企业和内部员工产生了重大影响，这就是为什么专业人士至少需要了解机器学习是什么以及它是如何发展的 [2, 66, 67]。

B. POPULAR ALGORITHMS OF MACHINE LEARNING In this subsection, popular algorithms of Machine Learning are presented. Machine Learning algorithms are specifically programmed to create predictive models dependent on the underlying algorithm and dataset. Input data for Machine Learning algorithms usually consist of “label” and “features” over a range of samples. Labels are what the purpose of a Machine Learning algorithm is to determine, which is the output of the model, whereas the features are the quantities of all tests, either raw or mathematically transformed [68]. The most common Machine Learning algorithms can be divided into two key categories-Supervised learning and Unsupervised learning [69, 70]. Apart from these, some other methodologies of Machine Learning are also discussed in Figure 4.

B. 机器学习中的流行算法
在本小节中，介绍了机器学习中的流行算法。机器学习算法经过专门编程，以创建依赖于基础算法和数据集的预测模型。机器学习算法的输入数据通常由一系列样本中的“标签”和“特征”组成。标签是机器学习算法旨在确定的目标，即模型的输出，而特征是所有测试的量，无论是原始的还是经过数学变换的 [68]。最常见的机器学习算法可以分为两个主要类别——监督学习和无监督学习 [69, 70]。除此之外，图 4 中还讨论了其他一些机器学习方法。

1) Supervised Machine Learning Algorithms

1) 监督式机器学习算法

Supervised Machine Learning is the cognitive activity of discovering relationships between parameters in annotated data (training set). Using this knowledge, making a forecasting model capable of inferring annotations for new data in which annotations are unknown. This kind of algorithm uses the characteristics and annotations of the training set to induce the model to predict the annotations of instances in the test set [71, 72].

监督式机器学习是一种在标注数据（训练集）中发现参数之间关系的认知活动。利用这些知识，建立一个能够推断新数据（其中标注未知）的预测模型。这类算法利用训练集的特征和标注来引导模型预测测试集中实例的标注 [71, 72]。

FIGURE 4. Classification of Machine Learning Algorithms.

图 4: 机器学习算法的分类

2) Unsupervised Machine Learning Algorithms

2) 无监督机器学习算法

Compared with Supervised learning algorithms, Unsupervised learning algorithms work without the desired output label. For instance, an unsupervised learning algorithm analyzes the $a$ without needing the $b$ , whereas a Supervised Machine Learning algorithm usually learns from a method that maps an input $a$ into the $b$ output. Unsupervised learning strategies may be motivated by theoretical and Bayesian concepts of intelligence. Unsupervised learning algorithms usually using to expand the data and train a model for finding suitable internal representation, such as sorting data into clusters [73, 74].

与监督学习算法相比，无监督学习算法在没有期望输出标签的情况下工作。例如，无监督学习算法分析 $a$ 时不需要 $b$，而监督机器学习算法通常通过将输入 $a$ 映射到输出 $b$ 的方法进行学习。无监督学习策略可能受到理论和贝叶斯智能概念的启发。无监督学习算法通常用于扩展数据并训练模型以找到合适的内部表示，例如将数据分类为簇 [73, 74]。

3) Semi-supervised Machine Learning Algorithms

3) 半监督机器学习算法

Traditional class if i ers need to train labeled data (features/label pairs). However, labeled instances are sometimes challenging to procure, time taking and costly, since they involve the efforts of the skilled human annotator. In the meantime, unlabeled data can be relatively easy to get but difficult to use. Semi-supervised learning solves this issue by creating stronger class if i ers utilizing vast volumes of unlabeled data, coupled with the labeled data needing fewer human intervention and higher accuracy [75, 76]. Semisupervised learning algorithms are used in such cases where the labels are missing. For instance, only a limited amount of training data is labeled and the goal is to improve the output of the model that can be done either by avoiding the labels and performing unsupervised learning or by ignoring unlabeled data and performing supervised learning. It is the great interest in both practical and theoretical aspects [68, 77, 78].

传统的分类器需要训练带标签的数据（特征/标签对）。然而，带标签的实例有时难以获取，耗时且成本高，因为它们需要熟练的人工标注者的努力。与此同时，未标记的数据相对容易获取，但难以使用。半监督学习通过利用大量未标记数据和少量带标签数据来创建更强的分类器，从而解决了这个问题，减少了人工干预并提高了准确性 [75, 76]。半监督学习算法用于标签缺失的情况。例如，只有少量训练数据被标记，目标是通过避免标签并执行无监督学习，或忽略未标记数据并执行监督学习来改进模型的输出。这在实践和理论方面都引起了极大的兴趣 [68, 77, 78]。

4) Reinforcement Machine Learning Algorithms

4) 强化机器学习算法

Reinforcement learning is a branch of Machine Learning algorithms in which the learner or software entity tries to perform a sequence of acts that will optimize accumulated incentives, such as winning a checker or chess game. It is an area of research that has been able to overcome a broad variety of complicated decision-making problems which were historically been out of control for the machine. It also opens up a range of new opportunities in areas such as infrastructure, automation, smart grids, banking, and much more [79– 81].

强化学习是机器学习算法的一个分支，其中学习者或软件实体尝试执行一系列行为，以优化累积的激励，例如赢得跳棋或国际象棋比赛。这是一个能够克服历史上机器无法控制的多种复杂决策问题的研究领域。它还在基础设施、自动化、智能电网、银行等领域开辟了一系列新的机会 [79–81]。

These methods led to impressive advances in AI, going beyond human performance in domains ranging from Atari to Go to no-limit poker [82]. These signs of progress attracted the attention of cognitive scientists interested in understanding human learning. Over the last few years, because of its performance in solving the complexities of sequential decision-making, it has become increasingly popular. Some of the achievements were attributed to the combination of rein for cement learning and deep learning methodologies [83– 86].

这些方法在人工智能领域取得了令人瞩目的进展，在从Atari到围棋再到无限注德州扑克等多个领域超越了人类表现 [82]。这些进步迹象吸引了关注人类学习理解的认知科学家的注意。过去几年中，由于其在解决序列决策复杂性方面的表现，该方法变得越来越受欢迎。部分成就归功于强化学习与深度学习方法的结合 [83–86]。

5) 推荐系统

Recommend er systems have been built in coexistence with the internet. Initially, this kind of systems were focused on statistical, content-based and shared filtering. Such systems currently integrate social knowledge. Recommend er systems can also be described as learning techniques through which online customers can design their websites to match the customer’s tastes such as, an internet customer may get a product and/or associated products ranking while looking for things based on an established recommendation system. There are mainly two methods, namely content-based recommendation, and collective recommendation. This type of system allows users to get access to it and collect info, principles, intelligent and novel suggestions. Several e-commerce pages use this program [87–90].

推荐系统与互联网共存而生。最初，这类系统主要关注统计、基于内容和共享过滤的方法。如今，这些系统已整合了社交知识。推荐系统也可被视作一种学习技术，通过它，在线客户可以设计自己的网站以匹配客户的喜好。例如，互联网客户在搜索商品时，可能会根据已建立的推荐系统获得产品及/或相关产品的排名。主要有两种方法，即基于内容的推荐和集体推荐。这类系统允许用户访问并收集信息、原则、智能和新颖的建议。多个电子商务页面都采用了这一程序 [87–90]。

FIGURE 5. Popular Applications of Machine Learning.

图 5: 机器学习的流行应用

C. POPULAR APPLICATIONS OF MACHINE LEARNING Many companies and industries dealing with data packages have recognized the importance of Machine Learning technology. By using Machine Learning approaches, businesses can work more reliably and effectively as well as gain an advantage over competitors [91, 92].

C. 机器学习的流行应用

许多处理数据包的公司和行业已经认识到机器学习技术的重要性。通过使用机器学习方法，企业可以更可靠、更有效地工作，并在竞争中占据优势 [91, 92]。

Furthermore, with the aid of compelling articles for clarity to the reader, several applications of Machine Learning are also discussed and divided into different sections in Figure 5, which are given below:

此外，为了帮助读者更好地理解，本文还通过引人入胜的文章讨论了机器学习的几种应用，并在图 5 中将其分为不同的部分，具体如下：

1) Precision Agriculture

1) 精准农业

The most common concepts of Machine Learning in the field of agriculture were proposed by A. Kukuta et al. [93]. Precision agriculture, satellite farming, or site-specific crop management are agricultural management terms focused on observation, estimation, and the reaction of inter-and intrafield crop variability. The key goal of precision agriculture analysis is to set up a decision support system for agricultural management to optimize the return on inputs while maintaining energy [94–96].

农业领域中最常见的机器学习概念由 A. Kukuta 等人 [93] 提出。精准农业 (Precision Agriculture)、卫星农业 (Satellite Farming) 或特定地点作物管理 (Site-Specific Crop Management) 是农业管理术语，侧重于观察、估计和应对田间和田间作物变异。精准农业分析的关键目标是建立一个农业管理决策支持系统，以优化投入回报，同时保持能源 [94–96]。

2) Health care

2) 医疗保健

Machine Learning is an emerging field and fast-growing phenomenon in the field of health care [97, 98]. The advent of smart applications and devices that can use data to assess the health of patients in real-time. The medical professionals analyze data for the detection of patterns or warning flags that can contribute to better diagnosis and care [99–102].

机器学习是医疗保健领域中的一个新兴且快速发展的现象 [97, 98]。智能应用和设备的出现使得能够利用数据实时评估患者的健康状况。医疗专业人员通过分析数据来检测模式或警示标志，从而有助于更好的诊断和护理 [99–102]。

3) Retail

3) 零售

Websites recommend items that you would buy based on prior purchases using a Machine Learning method called a ‘recommendation system’ to evaluate your experience about

网站会根据你之前的购买记录，使用一种称为“推荐系统 (recommendation system)”的机器学习方法，评估你的体验，从而推荐你可能购买的商品。

purchasing the item. Retailers depend on Machine Learning technology to record, interpret, and customize their shopping experience [103, 104].

购买商品。零售商依赖机器学习技术来记录、解释并定制他们的购物体验 [103, 104]。

4) Government

4) 政府

State departments, such as infrastructure and public health have a strong need for artificial intelligence because they offer several data points that can be exploited for information. For instance, analyzing the data of the system to find opportunities to boost efficiency and save money. Machine Learning can also help spot fraud and prevent data theft [105, 106].

基础设施和公共卫生等政府部门对人工智能有着强烈的需求，因为它们提供了多个可被利用的数据点。例如，通过分析系统数据来寻找提高效率和节省成本的机会。机器学习还能帮助发现欺诈行为并防止数据盗窃 [105, 106]。

5) Computational Finance

5) 计算金融

Banks and certain organizations in the business industry are utilizing Machine Learning technologies for two key purposes: the first is to identify valuable insights from data and the second is to prevent fraud. Insights can recognize investment opportunities and allow shareholders to know when to sell. Data mining can also recognize high-risk clients or use cyber monitoring to detect warning signals of fraud [107, 108].

银行和商业领域的某些组织正在利用机器学习技术实现两个关键目标：一是从数据中识别有价值的洞察，二是预防欺诈。洞察能够识别投资机会，并让股东知晓何时出售。数据挖掘还能识别高风险客户，或通过网络监控发现欺诈的预警信号 [107, 108]。

6) Transportation

6) 交通

Analyzing data to detect patterns and developments is crucial for the transport sector, which focuses on keeping roads more effective and finding possible challenges to improve productivity. Information modeling and simulation elements of Machine Learning are useful tools for logistics firms, urban transportation’s, and other transit organizations [109, 110].

分析数据以检测模式和发展对于交通部门至关重要，该部门专注于提高道路效率并发现可能的挑战以提高生产力。机器学习的信息建模和仿真元素是物流公司、城市交通和其他运输组织的有用工具 [109, 110]。

7) Oil and gas

7) 石油和天然气

Use cases include finding new sources of energy, analyzing elements in rocks, predicting malfunction of refinery sensor, streamlining the production of oil to make it more reliable

用例包括寻找新能源、分析岩石中的元素、预测炼油厂传感器故障、简化石油生产以提高可靠性

and cost-effective. The amount of use cases of Machine Learning in this sector is overwhelming and continues to increase [111–114].

机器学习在该领域的应用案例数量庞大且持续增长 [111–114]。

8) Computational linguistics

8) 计算语言学

Computational linguistics has historically been conducted by computer scientists who have specialized in the use of computers for the analysis of natural languages. Nowadays, computer linguists often work as part of interdisciplinary teams, which can include computer scientists, target language specialists, and professional linguists [115, 116].

历史上，计算语言学一直由专门使用计算机分析自然语言的计算机科学家进行。如今，计算机语言学家通常作为跨学科团队的一部分工作，这些团队可能包括计算机科学家、目标语言专家和专业语言学家 [115, 116]。

III. CONVENTIONAL GREY MODELS

III. 传统灰色模型

Basic Grey Model Theory is an interdisciplinary research discipline that was proposed by [117] in 1980. He provided a classic continuous GM(1,1) model in which procedures begin with a differential equation namely ‘whitening equation’. As long as knowledge is concerned, systems that suffer from a lack of information, such as operating mechanism, structure message, and actions log, are referred to as Grey Systems. Throughout the background, the human body, livestock, climate, etc., are the Grey Systems, where “grey” implies incomplete, unknown, poor, etc. The goal of the Grey Model and its applications is to bridge the distance between natural science and social science [118]. This concept has been very popular in terms of its potential to work with systems that have partly unknown parameters. As an improvement over traditional predictive models, grey forecasting models only need small datasets to determine the actions of unknown processes [119]. Some of the existing grey models with a continuous whitening function follow the same linear formula as follows:

基础灰色模型理论是由 [117] 在 1980 年提出的一个跨学科研究领域。他提供了一个经典的连续 GM(1,1) 模型，该模型的流程从一个微分方程开始，即“白化方程”。就知识而言，缺乏信息的系统，如操作机制、结构信息和行为日志，被称为灰色系统。在背景中，人体、牲畜、气候等都是灰色系统，其中“灰色”意味着不完整、未知、贫乏等。灰色模型及其应用的目标是弥合自然科学与社会科学之间的距离 [118]。这一概念在应对部分参数未知的系统方面非常受欢迎。作为对传统预测模型的改进，灰色预测模型只需要少量数据集即可确定未知过程的行为 [119]。一些现有的具有连续白化函数的灰色模型遵循以下相同的线性公式：

\frac{d X^{(1)}(t)}{d t}+a x_{1}^{(1)}(t)=f(\pmb{\theta};t),```

while the series $X_{1}^{(1)}$ is usually referred to as an input series. Sometimes, the function $f(\pmb\theta;t)$ differs by time $t$ or dependency sequence (or input series) $X_{i}^{(1)},i=\dot{2},3,\dots,n$ , with unknown parameters $\theta$ . In case of discrete grey models, the general linear equation can also be written as,

而序列 $X_{1}^{(1)}$ 通常被称为输入序列。有时，函数 $f(\pmb\theta;t)$ 会随时间 $t$ 或依赖序列（或输入序列） $X_{i}^{(1)},i=\dot{2},3,\dots,n$ 的不同而变化，且参数 $\theta$ 未知。在离散灰色模型的情况下，一般线性方程也可以写成：

x_{1}^{(1)}(k+1)=\alpha x_{1}^{(1)}(k)+f(\pmb\theta;k).```


It is clear that the solutions of grey models in the abovementioned formulations also contain similar formulations and the same equations. Several grey models also use the initial condition   \$\dot{X}_{1}^{(1)}(1)\,=\,X_{1}^{(0)}\check{(1)}\$  . The general formulation of these models can thus easily be accessed.

显然，上述公式中的灰色模型解也包含类似的公式和相同的方程。一些灰色模型还使用初始条件 \$\dot{X}_{1}^{(1)}(1)\,=\,X_{1}^{(0)}\check{(1)}\$。因此，这些模型的一般公式可以很容易地获得。

For continuous models, the answer can always be described as the following convolution equation:

对于连续模型，答案总是可以描述为以下卷积方程：

\hat{X}{1}^{(1)}(t)=X{1}^{(0)}(1)\cdot e^{-a(t-1)}+\int_{1}^{t}e^{-a(t-\tau)}f(\pmb\theta;\tau)d\tau.```

Whereas, in the case of discrete models, the solution can always be described as the following discrete convolution equation:

在离散模型的情况下，解总是可以描述为以下离散卷积方程：

\hat{X}_{1}^{(1)}(k+1)=X_{1}^{(0)}(1)\cdot\alpha^{k}+\sum_{\tau=2}^{k+1}\alpha^{(k+1-\tau)}f(\pmb\theta;\tau).```

Grey forecasting models are essentially divided into two categories: Univariate and Multivariate. Single-variable models are called univariate while multivariable models are called multivariate [120]. For descriptive purposes, the whitening equation, time response function, and the restored values of grey models are provided in this section and also summarized in Figure 6.

灰色预测模型主要分为两类：单变量和多变量。单变量模型被称为单变量模型，而多变量模型被称为多变量模型 [120]。为了便于描述，本节提供了灰色模型的白化方程、时间响应函数和还原值，并在图 6 中进行了总结。

A. EVOLUTION OF GREY FORECASTING MODELS Since the study of Lin and Liu, the propagation of this Grey System theory took place as follows: Scholarly periodical for the presentation of research results followed by ‘Journal of Grey System’ that started to be published in England in 1989. More than 300 various scientific journals recognize and publish papers relevant to the grey system in the world. Moreover, In the early 1990s, several universities based throughout China, Taiwan, Australia, United States, and Japan began offering grey system theory courses. The Chinese Grey System Association (CGSA) was founded in 1996. Every year, CGSA conducts a conference on Grey System Theory and its application [121]. In the last four decades, the Grey System Theory has developed quickly and drawn the interest of several researchers. It has been widely and effectively extended to many applications such as commercial, manufacturing, transport, medical, military, mechanical, meteorological, civil, political, financial, science and technology, agricultural, hydrological, geological, etc. Furthermore, the conventional grey model called GM(1,1) has been widely adopted, and its forecasting efficiency could also be improved. To date, several researchers have proposed new approaches for improving the performance of the model as Deng et al. [122] have proposed the modifiable residual sequence method. Whereas, Mu et al. [123] obtained the formula of optimum grey derivative whitening values. He proposed an unbiased GM(1,1) model to develop a framework for estimating the parameters.

A. 灰色预测模型的演变
自 Lin 和 Liu 的研究以来，灰色系统理论的传播如下：研究成果的学术期刊发布，随后《灰色系统杂志》于 1989 年在英国开始出版。全球有超过 300 种科学期刊认可并发表与灰色系统相关的论文。此外，在 20 世纪 90 年代初，中国、台湾、澳大利亚、美国和日本的几所大学开始开设灰色系统理论课程。中国灰色系统协会 (CGSA) 于 1996 年成立。每年，CGSA 都会举办关于灰色系统理论及其应用的会议 [121]。在过去的四十年中，灰色系统理论发展迅速，并引起了众多研究人员的兴趣。它已被广泛且有效地扩展到许多应用领域，如商业、制造、交通、医疗、军事、机械、气象、土木、政治、金融、科技、农业、水文、地质等。此外，传统的灰色模型 GM(1,1) 已被广泛采用，其预测效率也得到了提升。迄今为止，许多研究人员提出了改进模型性能的新方法，如 Deng 等人 [122] 提出了可修改残差序列方法。而 Mu 等人 [123] 获得了最优灰色导数白化值的公式。他提出了一个无偏 GM(1,1) 模型，用于开发参数估计的框架。

Furthermore, Song and Wang developed the center approach of the alteration of grey model GM(1,1). They designed an adjusting grey model [124, 125]. Tan et al. supported the structure method of the background values in the GM(1,1) model and a basic approximation of the background value function was re-established, which had strong adaptability [126]. In another studies, the authors presented the optimal time-response sequence formula and used the least square approach to measure the constant number in the time-response series of the basic GM(1,1) model [126– 128]. Several researchers examined the appropriate scope and simulation accuracy of the GM(1,1) model [129, 130]. Moreover, some key approaches shall include center approach method [124], discrete models [131–133], correcting the residues [122], constructing background values [126], and optimization of the grey derivative [125]. A part of these, some other grey forecasting theory methods proposed by scholars [134–145].

此外，Song 和 Wang 开发了灰色模型 GM(1,1) 的改进中心方法。他们设计了一种调整灰色模型 [124, 125]。Tan 等人支持 GM(1,1) 模型中背景值的结构方法，并重新建立了背景值函数的基本近似，具有较强的适应性 [126]。在其他研究中，作者提出了最优时间响应序列公式，并使用最小二乘法来测量基本 GM(1,1) 模型时间响应序列中的常数 [126–128]。一些研究人员研究了 GM(1,1) 模型的适用范围和模拟精度 [129, 130]。此外，一些关键方法包括中心方法 [124]、离散模型 [131–133]、修正残差 [122]、构建背景值 [126] 和优化灰色导数 [125]。除此之外，学者们还提出了其他一些灰色预测理论方法 [134–145]。

![](https://u254848-88c6-e493554b.yza1.seetacloud.com:8443/miner/v2/analysis/pdf_img?as_attachment=False&user_id=931&pdf=24bdda8eb4e5e316620d03b71e3ff2c8f4cab2915d1068e46a22a1d35be8a5f51737271564_2104.00871v2.pdf&filename=91de4ed374f5540dd1c917e3d88eb37f6773bcf8423ba1e27420d5358a9d0b0e.jpg)
FIGURE 6. Popular Grey Forecasting Models.

图 6: 常见的灰色预测模型

# 1) Univariate Grey Model

# 1) 单变量灰色模型

The single parameter grey prediction model is GM(1,1) with one vector and one first-order equation, and a simulation unit is a single time sequence. It forecasts the future of the program by defining machine operating rules contained in a series focused on grey generation approaches. The singlevariable grey model might not take into account the effect of related factors on the system and thus, it has the benefit of the basic modeling process [146].

单参数灰色预测模型是GM(1,1)，它包含一个向量和一个一阶方程，模拟单元是一个单一的时间序列。它通过定义一系列专注于灰色生成方法中包含的机器操作规则来预测程序的未来。单变量灰色模型可能没有考虑相关因素对系统的影响，因此它具有基本建模过程的优势 [146]。

Classic GM(1,1) model is an effective method for precise predictions for small samples. Therefore, it is not surprising that GM(1,1) is commonly used as a predictive tool [120]. In order to define the GM(1,1) model, the first ‘1’ represents for the ‘first order’, while the second ‘1’ stands the ‘univariate’ [147]. The system parameters are calculated by de-escalating the whitening equation and using the least square method. For instance, The definition of the Grey Theory is the ‘Grey Box’ where information is established and knowledge is uncertain. Grey system theory is an important tool for determining unknown issues with limited samples and incomplete information [117]. The Whitening equations of the univariate grey forecasting models have been addressed in Table 1.

经典 GM(1,1) 模型是一种针对小样本进行精确预测的有效方法。因此，GM(1,1) 常被用作预测工具并不令人意外 [120]。为了定义 GM(1,1) 模型，第一个“1”代表“一阶”，而第二个“1”代表“单变量” [147]。系统参数通过降阶白化方程并使用最小二乘法计算得出。例如，灰色理论的定义是“灰箱”，其中信息是已知的，而知识是不确定的。灰色系统理论是在样本有限且信息不完整的情况下确定未知问题的重要工具 [117]。单变量灰色预测模型的白化方程已在表 1 中列出。

# 1. GM(1,1)

# 1. GM(1,1)

Among the families of grey models, the GM(1,1) model is the most widely used, due to its simplicity and high accuracy for limited datasets [148]. Based on this essential function, it has been successfully implemented in several fields. Widely used applications of GM (1,1) include energy production [149–152], the prediction of stock price [137], the oil production in China [153, 154], the consumption of energy [155, 156], detection [157], and the electricity consumption [149, 158]. Let  \$X^{(0)}\,=\,\left\{x^{(0)}(1),x^{(0)}(2),\dots,x^{(0)}(n)\bar{\}\right.\$  denote original data,   \$X^{(1)}\,=\,\left\{x^{(1)}(1),x^{(1)}(2),\dots,x_{\mathrm{~}}^{(1)}(n)\right\}\$  is the first order accumulation generator,  \$\begin{array}{r l r l}{Z^{(1)}}&{{}}&{=}\end{array}\$   \$\left\{z^{(1)}(1),z^{(1)}(2),\ldots,z^{(1)}(n)\right\}\$  is the background of  \$X^{(0)}\$  where  \$z^{(1)}(k)=0.5\left(x^{(1)}(k)\bar{+}\,x^{(1)}(k-1)\right)\!.\$  . Hence,

在灰色模型家族中，GM(1,1) 模型因其简单性和对有限数据集的高准确性而得到广泛应用 [148]。基于这一基本功能，它已成功应用于多个领域。GM(1,1) 的广泛应用包括能源生产 [149–152]、股票价格预测 [137]、中国石油生产 [153, 154]、能源消耗 [155, 156]、检测 [157] 以及电力消耗 [149, 158]。设 \$X^{(0)}\,=\,\left\{x^{(0)}(1),x^{(0)}(2),\dots,x^{(0)}(n)\bar{\}\right.\$ 表示原始数据，\$X^{(1)}\,=\,\left\{x^{(1)}(1),x^{(1)}(2),\dots,x_{\mathrm{~}}^{(1)}(n)\right\}\$ 为一阶累加生成序列，\$\begin{array}{r l r l}{Z^{(1)}}&{{}}&{=}\end{array}\$ \$\left\{z^{(1)}(1),z^{(1)}(2),\ldots,z^{(1)}(n)\right\}\$ 是 \$X^{(0)}\$ 的背景值，其中 \$z^{(1)}(k)=0.5\left(x^{(1)}(k)\bar{+}\,x^{(1)}(k-1)\right)\!.\$。因此，

x^{(0)}(k)+a z^{(1)}(k)=b```

is known as Grey model GM(1,1). The restrictions of $-a$ and $b$ in the grey basic form of GM(1,1) model are referred to as development coefficient and grey action quantity, respectively. Whereas the time response signal of GM(1,1) is,

被称为灰色模型 GM(1,1)。GM(1,1) 模型的灰色基本形式中 $-a$ 和 $b$ 的限制分别被称为发展系数和灰色作用量。而 GM(1,1) 的时间响应信号为，

\hat{x}^{(1)}(k+1)=\left(x^{(0)}(1)-\frac{b}{a}\right)\mathrm{e}^{-a k}+\frac{b}{a},\quad k=1,2,\dots,n```

while the predictive value of GM(1,1) is obtained as,

GM(1,1) 的预测值如下：

\begin{array}{c}{{\hat{x}^{(0)}(k+1)=\hat{x}^{(1)}(k+1)-\hat{x}^{(1)}(k)}}\ {{{}}}\ {{{}=(1-\mathrm{e}^{a})\left(x^{(0)}(1)-{\frac{b}{a}}\right)\mathrm{e}^{-a k},}}\ {{{}}}\ {{{\displaystyle k=1,2,\ldots,n}}}\end{array}```

Let $\begin{array}{c c l}{{\widehat{X}^{(0)}}}&{{=}}&{{\left{\hat{x}^{(0)}(1),\hat{x}^{(0)}(2),\ldots,\hat{x}^{(0)}(n)\right}}}\end{array}$ . Therefore, $X^{(0)}$ is the simulation sequence and $X^{(0)},\cdot,\hat{x}^{(1)}(k+1)$ is the simulation data of $x^{(\bar{1})}(k+1)$ . According to the (3), it is simple to show that the simulation data sequence is the geometric series [121], and thus, the growth rate of the simulation series is constant:

设 $\begin{array}{c c l}{{\widehat{X}^{(0)}}}&{{=}}&{{\left{\hat{x}^{(0)}(1),\hat{x}^{(0)}(2),\ldots,\hat{x}^{(0)}(n)\right}}}\end{array}$ 。因此，$X^{(0)}$ 是模拟序列，$X^{(0)},\cdot,\hat{x}^{(1)}(k+1)$ 是 $x^{(\bar{1})}(k+1)$ 的模拟数据。根据 (3)，可以简单证明模拟数据序列是几何级数 [121]，因此模拟序列的增长率为常数：

\hat{u}(k)=\frac{\hat{x}^{(0)}(k+1)-\hat{x}^{(0)}(k)}{\hat{x}^{(0)}(k)}\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad}\\ {=\displaystyle\frac{\hat{x}^{(0)}(k+1)}{\hat{x}^{(0)}(k)}-1={\mathrm{e}}^{-a}-1.\qquad\qquad\qquad}```


# \$2.\mathsf {N G M}(1,1,k,c)\$

# \$2.\mathsf {N G M}(1,1,k,c)\$

Chen and Yu designed a parameter optimization approach to boost the   \$\mathbf{NGM}(1,\,1,\,k,\,c)\$   model combine with grey action quantity   \$b t+c\$   [134]. Based on the specified form of the latest NGM   \$(1,1,K,c)\$   model, the differential equation for the  \$\mathbf{NGM}(1,1,k,c)\$   is obtained as,

Chen 和 Yu 设计了一种参数优化方法，结合灰色作用量 \$b t+c\$ 来提升 \$\mathbf{NGM}(1,\,1,\,k,\,c)\$ 模型 [134]。基于最新的 NGM \$(1,1,K,c)\$ 模型的特定形式，\$\mathbf{NGM}(1,1,k,c)\$ 的微分方程如下：

{\frac{\mathrm{d}x^{(1)}(t)}{\mathrm{d}t}}+a x^{(1)}(t)=b t+c,```

while $a$ is the developing coefficient, and $b t+c$ is a grey action quantity. Furthermore, the time response function for the $\mathbf{NGM}(1,1,k,c)$ is,

其中 $a$ 是发展系数，$b t+c$ 是灰色作用量。此外，$\mathbf{NGM}(1,1,k,c)$ 的时间响应函数为，

\begin{array}{r}{\hat{x}^{(1)}(k)=\bigg(x^{(0)}(1)+\frac{b}{a^{2}}-\frac{b}{a}-\frac{c}{a}\bigg)\,e^{-a(k-1)}}\\ {+\frac{b}{a}k-\frac{b}{a^{2}}+\frac{c}{a},k=2,3,\dots,n,}\end{array}```


Whereas the restored function of  \$\mathbf{NGM}(1,1,k,c)\$   model can be written as,

而 \$\mathbf{NGM}(1,1,k,c)\$ 模型的恢复函数可以表示为,

\begin{array}{r}{\hat{x}^{(0)}(k)=\bigg(x^{(0)}(1)+\cfrac{b}{a^{2}}-\cfrac{b}{a}-\cfrac{c}{a}\bigg)}\ {(1-e^{a}),e^{-a(k-1)}+\cfrac{b}{a},}\ {k=2,3,,.,.,,n.}\end{array}```

3. $},}(1,1,t^)$

In 2012, Qian et al. developed a new forecasting grey model called $\mathrm{GM}(1,1,\mathit{t}^{\alpha})$ with grey action quantity of $b t^{\alpha}+c$ , and used it to forecast the settlement of the foundation [135]. The whitening differential equation of $\mathrm{GM}(1,,1,!t^{\alpha})$ model based on [159] is,

2012年，Qian等人开发了一种新的预测灰色模型，称为$\mathrm{GM}(1,1,\mathit{t}^{\alpha})$，其灰色作用量为$b t^{\alpha}+c$，并用于预测地基沉降[135]。基于[159]的$\mathrm{GM}(1,,1,!t^{\alpha})$模型的白化微分方程为，

\frac{d x^{(1)}(t)}{d t}+a x^{(1)}(t)=b t^{\alpha}+c,r>0,\alpha>0,```

Whereas the time response function of  \$\mathrm{GM}(1,1,\!t^{\alpha})\$   model is,

\$\mathrm{GM}(1,1,\!t^{\alpha})\$ 模型的时间响应函数为

\begin{array}{r}{x^{(r)}(k)=\left(x^{(0)}(1)-\displaystyle\frac{c}{a}\right)e^{-a(k-1)}+\displaystyle\frac{c}{a}+\frac{b}{2}e^{-a(k-1)}}\ {\displaystyle\sum_{-1}^{k-1}\left(\tau^{\alpha}e^{a(\tau-1)}+(\tau+1)^{\alpha}e^{a\tau}\right),}\ {k=2,3,\dots,n,}\end{array}```

and restored value of $\hat{x}^{(0)}(k)k=2,3,\ldots,n$ is given by,

$\hat{x}^{(0)}(k)k=2,3,\ldots,n$ 的还原值由下式给出，

x^{(0)}(k)=x^{(1)}(k)-x^{(1)}(k-1).```


# \$4.\; {\sf G M P}(1,1, {\cal N})\$

# \$4.\; {\sf G M P}(1,1, {\cal N})\$

In 2017, Luo and Wei [160] proposed a grey model with a polynomial term called   \$\mathrm{GMP}(1,1,\!N)\$   where the grey action term is a time polynomial function. The GM(1,1) model, the  \$\mathrm{NGM}(1,1,\boldsymbol{k})\$   model, and the  \$\mathrm{GM}(1,1,\ t^{\alpha})\$   model have been shown to be special cases of the   \$\mathrm{GMP}(1,1,\!N)\$   model. The differential form of the GMP   \$(1,1,\!N)\$   model obtained as,

2017年，Luo和Wei [160] 提出了一种带有多项式项的灰色模型，称为 \$\mathrm{GMP}(1,1,\!N)\$，其中灰色作用项是时间多项式函数。GM(1,1) 模型、\$\mathrm{NGM}(1,1,\boldsymbol{k})\$ 模型和 \$\mathrm{GM}(1,1,\ t^{\alpha})\$ 模型已被证明是 \$\mathrm{GMP}(1,1,\!N)\$ 模型的特殊情况。GMP \$(1,1,\!N)\$ 模型的微分形式如下：

\frac{d x^{(1)}(t)}{d t}+a x^{(1)}(t)=\beta_{0}+\beta_{1}t+\beta_{2}t^{2}+\cdot\cdot\cdot+\beta_{\mu}t^{n},```

Whereas the time response function for the GMP $(1,1,!N)$ is,

而 GMP $(1,1,!N)$ 的时间响应函数为，

x^{(1)}(k)=\left(x^{(0)}(1)-\sum_{r=0}^{k}r_{i}\right)e^{-a(k-1)}+\sum_{i=0}^{N}\left(r^{i}k^{i}\right),```

Finally, the restored value for the GMP   \$\left(1,1,\!N\right)\$   is,

最后，GMP \$\left(1,1,\!N\right)\$ 的恢复值为，

x^{(0)}(k)=x^{(1)}(k)-x^{(1)}(k-1).```

$5.;\mathsf(1,1)$

The Grey Verhulst model GVM(1,1) is suitable for forecasting the frequency of sequences that have a single apex or whose development delayed [164]. Suppose that, there is a positive sequence of data $\begin{array}{r l}{X^{(0)}}&{{}=}\end{array}$ $\left{x^{(0)}(1),x^{(0)}(2),\ldots,\stackrel{\cdot}{x}^{(0)}(n)\right}$ , and $X^{(1)}$ is accumulated generating operation (AGO) of $X^{(0)}$ , written as X(1) $X^{(1)}\ \ =\ \ \left{x^{(1)}(1),x^{(1)}(2),\ldots,x^{(1)}(n)\right}$ , where, $\begin{array}{r l r}{x^{(1)}(k)}&{{}=}&{\sum_{i=1}^{K}\overset{\setminus}{x^{(0)}}(\overset{.}{i})}\end{array}$ , k = 1, 2, . . . , n. $Z^{(1)}\quad=$ $\left{z^{(1)}(1),z^{(1)}({\overline{{2)}}},\ldots,z^{(1)}(n)\right}$ is the mean sequence of $\bar{x}^{(1)}(k)$ , while, $z^{(1)}(k);=;\textstyle{\frac{1}{2}}:\bigl(x^{(1)}(k)+x^{(1)}(k-1)\bigr)$ , $k,=$ $2,3,\ldots,n$ [165].

灰色 Verhulst 模型 GVM(1,1) 适用于预测具有单一顶点或发展延迟的序列频率 [164]。假设存在一个正数据序列 $\begin{array}{r l}{X^{(0)}}&{{}=}\end{array}$ $\left{x^{(0)}(1),x^{(0)}(2),\ldots,\stackrel{\cdot}{x}^{(0)}(n)\right}$，且 $X^{(1)}$ 是 $X^{(0)}$ 的累加生成操作 (AGO)，记为 $X^{(1)}\ \ =\ \ \left{x^{(1)}(1),x^{(1)}(2),\ldots,x^{(1)}(n)\right}$，其中 $\begin{array}{r l r}{x^{(1)}(k)}&{{}=}&{\sum_{i=1}^{K}\overset{\setminus}{x^{(0)}}(\overset{.}{i})}\end{array}$，k = 1, 2, . . . , n。$Z^{(1)}\quad=$ $\left{z^{(1)}(1),z^{(1)}({\overline{{2)}}},\ldots,z^{(1)}(n)\right}$ 是 $\bar{x}^{(1)}(k)$ 的均值序列，其中 $z^{(1)}(k);=;\textstyle{\frac{1}{2}}:\bigl(x^{(1)}(k)+x^{(1)}(k-1)\bigr)$，$k,=$ $2,3,\ldots,n$ [165]。

y^{(1)}(k+1)=\beta_{0}+\beta_{1}k+\beta_{2}y^{(1)}(k),```


Equation (18) is an optimized discrete Verhulst model. Whereas the differential equation of the conventional grey verhulst model [161] is given below:

方程 (18) 是一个优化的离散 Verhulst 模型。而传统的灰色 Verhulst 模型 [161] 的微分方程如下：

\frac{d\left(x^{(1)}\right)}{d t}+a x^{(1)}=b\left(x^{(1)}\right)^{2},```

Whereas the time response function for Grey Verhulst model is,

而灰色 Verhulst 模型的时间响应函数为，

x^{(1)}(k+1)=\frac{a x^{(1)}(0)}{b x^{(1)}(0)+\left(a-b x^{(1)}(0)\right)E X P(a k)},```

while the restored value of grey verhulst model is,

灰色Verhulst模型的恢复值为，

\hat{x}^{(0)}(k)=\hat{x}^{(1)}(k)-\hat{x}^{(1)}(k-1),k=2,3,\cdot\cdot\cdot\cdot,n.```

NGBM(1,1)
NGBM(1,1)

The nonlinear grey Bernoulli model NGBM(1,1) is a branch of the Grey Verhulst model and GM(1,1) [8, 166]. The key advantage of NGBM(1,1) is the power exponents of the model will effectively represent the non-linear features of the true structure and evaluate the form of the model in a versatile manner. Therefore, the framework can not only forecast sequences that increase or decrease monotonically but can also respond favorably to nonlinear processes with small sample sizes [162, 167]. The parameters of the NGBM(1,1) are also calculated by the least square estimation approach known as the Levenberge Marquardt (LM) optimization theory, and then use the modified model to allow analytical comparisons with the regression models, the traditional GM(1,1) and the Grey Verhulst models, which suggests that NGBM(1,1) shows the desirable prediction accuracy [168, 169]. Therefore, it is empirically proved that the performance of the NGBM(1,1) is higher than that of the GM(1,1) and Grey Verhulst model [170]. The differential equation of the NGBM(1,1) model is given by [162],

非线性灰色伯努利模型 NGBM(1,1) 是灰色 Verhulst 模型和 GM(1,1) 模型的一个分支 [8, 166]。NGBM(1,1) 的关键优势在于其幂指数能够有效表示真实结构的非线性特征，并以一种多功能的方式评估模型的形式。因此，该框架不仅能够预测单调递增或递减的序列，还能在小样本情况下对非线性过程做出良好的响应 [162, 167]。NGBM(1,1) 的参数也是通过最小二乘估计方法（称为 Levenberge Marquardt (LM) 优化理论）计算的，然后使用修改后的模型进行与回归模型、传统 GM(1,1) 模型和灰色 Verhulst 模型的对比分析，结果表明 NGBM(1,1) 具有理想的预测精度 [168, 169]。因此，经验证明 NGBM(1,1) 的性能高于 GM(1,1) 和灰色 Verhulst 模型 [170]。NGBM(1,1) 模型的微分方程由 [162] 给出，

TABLE 1. The definition of the comparative Univariate grey models used.

表 1. 使用的比较单变量灰色模型定义

No.	模型名称	微分方程	参考文献
1	GM(1,1)	dx(1)(t) +ax(1)(t) =b p	[121]
2	NGM(1,1,k,c)	x(0)(k) =ax(0)(k - 1) + b	[134]
3	GM(1,1,tα)		[135]
4	GMP(1,1,N)		[160]
5	GVM(1,1)	q(I+-u)+(-)(0)∞==(?)(0)=	[161]
6	NGBM(1,1)	d(l(+ax(1)(t)=b(x(1)(t)² 7P	[162]
7	GBM(1,1)	dx(1)(t) =a(M- x(1)(t) + bx(1)(t) dt	x(1)(t) [163] M

\frac{\mathrm{d}x^{(1)}(t)}{\mathrm{d}t}+a x^{(1)}(t)=b\left(x^{(1)}(t)\right)^{\gamma},```


Furthermore, the time response function of the NGBM(1,1) model is obtained as,

此外，NGBM(1,1) 模型的时间响应函数为，

x^{(r)}(t)=\left[\left(\left(x^{(r)}(1)\right)^{1-\gamma}-\frac{b}{a}\right)e^{-a(1-\gamma)(t-1)}+\frac{b}{a}\right]^{\frac{1}{1-\gamma}},```

while the prediction value of NGBM(1,1) is,

NGBM(1,1) 的预测值为,

\begin{array}{r}{x^{(r)}(k)=\left[\left(\left(x^{(r)}(1)\right)^{1-\gamma}-\frac{b}{a}\right)e^{-a(1-\gamma)(k-1)}+\frac{b}{a}\right]^{\frac{1}{1-\gamma}}}\\ {k=2,3,\ldots,n.}\end{array}```


# 7. GBM(1,1)

# 7. GBM(1,1)

In 1969, Bass et al. proposed once the purchasing model of durable goods by market research on the prevalence of 11 types of durable goods, abbreviated as a Grey Bass model GBM(1,1) [163]. Due to the simple form and the clear economic sense of the parameters, the Bass model is commonly used in new product forecasts [171, 172], technological diffusion [173], and business model diffusion [174–176]. Moreover, the mathematical form of the grey bass model is,

1969年，Bass等人通过对11种耐用品的市场调研，提出了耐用品的购买模型，简称为灰色Bass模型GBM(1,1) [163]。由于模型形式简单且参数具有明确的经济意义，Bass模型常被用于新产品预测 [171, 172]、技术扩散 [173] 以及商业模式扩散 [174–176]。此外，灰色Bass模型的数学形式为，

\frac{d x^{(1)}(t)}{d t}=a\left(M-x^{(1)}(t)\right)+b x^{(1)}(t)\left(1-\frac{x^{(1)}(t)}{M}\right),```

while the time response function of the grey bass model is,

灰色Bass模型的时间响应函数为，

\hat{x}^{(1)}(k)=M\left[\frac{1-e^{-(a+b)k}}{1+\frac{b}{a}e^{-(a+b)k}}\right],k=1,2,\cdot\cdot\cdot\cdot,n.```


At last, the restored value of the grey bass model is,

最后，灰色贝斯模型的恢复值为，

\hat{x}^{(0)}(k+1)=\hat{x}^{(1)}(k+1)-\hat{x}^{(1)}(k),k=1,2,\cdot\cdot\cdot\cdot,n.```

2) Multivariate Grey Models

2) 多元灰色模型

The multivariate grey forecasting model is represented by ${\mathrm{GM}}(1{,}n)$ . This model consists of system-specific sequences (or dependent variable sequences) and $(n{-}1)$ related sequences of variables (or independent variable sequences). The modeling method takes complete account of the effect of relevant variables on system transition and is a standard causal forecasting model. The ${\mathrm{GM}}(1{,}n)$ model has some similarities to the multi-regression model, yet it is fundamentally distinct. The former is focused on grey theory, while the latter is centered on probability statistics. The multivariate gray forecasting model is the limitation of the single-fold framework and the limited simulation potential of single-variable models. This model is mainly used as a tool for determining the similarities between system features sequences and associated factor sequences [140–144]. The detailed differential equation of multivariate grey forecasting models have been summarized in Table 2

多元灰色预测模型由 ${\mathrm{GM}}(1{,}n)$ 表示。该模型由系统特定序列（或因变量序列）和 $(n{-}1)$ 个相关变量序列（或自变量序列）组成。建模方法充分考虑了相关变量对系统转换的影响，是一种标准的因果预测模型。${\mathrm{GM}}(1{,}n)$ 模型与多元回归模型有一些相似之处，但本质上有所不同。前者基于灰色理论，而后者基于概率统计。多元灰色预测模型是单变量模型的局限性和有限模拟潜力的扩展。该模型主要用于确定系统特征序列与相关因子序列之间的相似性 [140–144]。多元灰色预测模型的详细微分方程已在表 2 中总结。

1. $\mathsf(1,!n)$

${\mathrm{GM}}(1{,}n)$ demonstrated that the approximate whitening time response function of ${\mathrm{GM}}(1{\mathrm{,}}n)$ could often contribute to an inappropriate experimental error [177, 178]. Furthermore, $\textstyle\sum_{i=2}^{N}b_{i}x_{i}^{(1)}$ ios fd ethfien edw hbiyt,ening equation $\begin{array}{r l}{\frac{d x_{1}^{(1)}}{d t};+;a x_{1}^{(1)}}&{=}\end{array}$

${\mathrm{GM}}(1{,}n)$ 表明，${\mathrm{GM}}(1{\mathrm{,}}n)$ 的近似白化时间响应函数常常会导致不适当的实验误差 [177, 178]。此外，$\textstyle\sum_{i=2}^{N}b_{i}x_{i}^{(1)}$ ios fd ethfien edw hbiyt,ening 方程 $\begin{array}{r l}{\frac{d x_{1}^{(1)}}{d t};+;a x_{1}^{(1)}}&{=}\end{array}$

\begin{array}{l}{{x_{1}^{(1)}(t)=e^{-a t}}}\\ {{\displaystyle\left[x_{1}^{(1)}(0)-t\sum_{i=2}^{N}b_{i}x_{i}^{(1)}(0)+\sum_{i=2}^{N}\int b_{i}x_{i}^{(1)}(t)e^{a t}d t\right],}}\end{array}```


When  \$\begin{array}{r l r}{X_{i}^{(1)}(i}&{{}=}&{1,2,\dots,N)}\end{array}\$  changes marginally,  \$\sum_{i=2}^{N}b_{i}x_{i}^{(1)}(k)\$   is shown as a grey constant. While the estimated time-response sequence of the  \${\mathrm{GM}}(1{,}n)\$   model is obtained as,

当 \$\begin{array}{r l r}{X_{i}^{(1)}(i}&{{}=}&{1,2,\dots,N)}\end{array}\$ 发生微小变化时，\$\sum_{i=2}^{N}b_{i}x_{i}^{(1)}(k)\$ 表现为一个灰色常数。而 \${\mathrm{GM}}(1{,}n)\$ 模型的估计时间响应序列为，

\begin{array}{r}{\hat{x}{1}^{(1)}(k+1)=\Bigg[x{1}^{(1)}(0)-\displaystyle\frac{1}{a}\sum_{i=2}^{N}b_{i}x_{i}^{(1)}(k+1)\Bigg],e^{-a k}}\ {+\displaystyle\frac{1}{a}\sum_{i=2}^{N}b_{i}x_{i}^{(1)}(k+1),}\end{array}```

where $x_{1}^{(1)}(0)$ is assumed to be $x_{1}^{(0)}(1)$ , which is the initial value of the ${\mathrm{GM}}(1{\mathrm{,}}n)$ model. Restoration of the inverse accumulation of ${\mathrm{GM}}(1{,}n)$ is obtained by,

其中 $x_{1}^{(1)}(0)$ 被假设为 $x_{1}^{(0)}(1)$ ，这是 ${\mathrm{GM}}(1{\mathrm{,}}n)$ 模型的初始值。通过以下方式获得 ${\mathrm{GM}}(1{,}n)$ 的逆累加恢复：

\hat{x}_{1}^{(0)}(k+1)=\alpha^{(1)}\hat{x}_{1}^{(1)}(k+1)=\hat{x}_{1}^{(1)}(k+1)-\hat{x}_{1}^{(1)}(k).```


# 2.  \$ {\mathsf{G M C}}(1,n)\$

# 2.  \$ {\mathsf{G M C}}(1,n)\$

To compare   \${\mathrm{GMC}}(1,\!n)\$   with other prediction models, the  \${\mathrm{GMC}}(1{\mathrm{,}}n)\$   model requires a greater proportion of the forecast core data and is far more significant than the other forecast source data. The accuracy of the indirect estimation and forecast of the   \${\mathrm{GMC}}(1,\!n)\$   model can then be presumed. This framework is used to test indirect measurem

[论文翻译]机器学习与灰色模型的比较分析

原文地址：https://arxiv.org/pdf/2104.00871

A Comparative Analysis of Machine Learning and Grey Models

机器学习与灰色模型的比较分析

INTRODUCTION

引言

A. OBJECTIVES OF THE SURVEY

A. 调查目标

II. MACHINE LEARNING

II. 机器学习

A. OVERVIEW OF MACHINE LEARNING

A. 机器学习概述

1) Classical work

1) 经典工作

2) State-of-the-art

2) 最新技术

1) Supervised Machine Learning Algorithms

1) 监督式机器学习算法

2) Unsupervised Machine Learning Algorithms

2) 无监督机器学习算法

3) Semi-supervised Machine Learning Algorithms

3) 半监督机器学习算法

4) Reinforcement Machine Learning Algorithms

4) 强化机器学习算法

5) Recommend er Systems

5) 推荐系统

1) Precision Agriculture

1) 精准农业

2) Health care

2) 医疗保健

3) Retail

3) 零售

4) Government

4) 政府

5) Computational Finance

5) 计算金融

6) Transportation

6) 交通

7) Oil and gas

7) 石油和天然气

8) Computational linguistics

8) 计算语言学

III. CONVENTIONAL GREY MODELS

III. 传统灰色模型

3. $},}(1,1,t^)$

3. $},}(1,1,t^)$

$5.;\mathsf(1,1)$

$5.;\mathsf(1,1)$

2) Multivariate Grey Models

2) 多元灰色模型

1. $\mathsf(1,!n)$

1. $\mathsf(1,!n)$