International Core Journal of Engineering 2020-26 | Page 105

intelligen*)) OR (AI artificial intellegen*) OR (Deep learning theory) OR (capsule networks) OR (deep reinforcement learning) OR (generative adversarial networks) OR (lean and augmented data learning) OR (unmanned AND (vehicles OR aerospace OR autonomous OR airborne OR (UA aircraft))) OR (UAV aerial vehicl*) OR (Drone) OR (RPA remotely piloted aircraft) OR (uninhabited aircraft) OR (upper atmosphere vehicle) OR (pilotless AND ((automob*) OR (PA airpla*))) (project selected with funding greater than or equal to 500000). field of artificial intelligence, and paid close attention to research hot-spots and development trends in the field, thus achieving scientific layout and rapid development [7]. It is right based on above reasons that this paper takes the data of fund sponsored projects in the field of artificial intelligence from 2013 to 2018 as the research subjects , and the PLDA(Parallel Latent Dirichlet Allocation)model was used to reveals the research front in order to provide reference for the development of artificial intelligence in our country. II. M ETHOD (2) (Probabilistic programming) OR (hybrid learning models) OR (automated machine learning) OR (digital twin) OR (explainable artificial intelligence). A. PLDA topic model The joint generation probability of topics and words in mth articles is : ⃑, ⃑| ⃑, ⃑ = Δ ⃑+ ⃑ Δ( Δ ⃑ After removing duplicate data, a total of 9467 items of fund projects were obtained. ⃑ + ⃑) Δ( ⃑) C. Research process First, the year of 2013 is selected as the starting point of the project, and the year is used as the unit of time for slicing, and a total of six sub-periods are obtained. Then the logarithmic likelihood method is used to calculate the number of topics in each sub-period. Secondly, the PLDA model within Knime platform is selected to identify the research topics in each sub-period. Thirdly, the thematic modeling was carried out for fund projects data set of the six sub-periods from 2013 to 2018. And the multi-dimensional mapping relations of theme-topic words and projects serial number were obtained, so as to calculate the funding time, funding amount and centrality of the topic-related fund projects. Fourthly, according to the Pareto Law, the threshold of the front identification indexes is defined, the research front topic was determined and interpreted. where:α and β are LDA’s prior parameters; ⃗ is the observation data obtained by statistical counting of each word of kth topic according to the word number 1 㸫 V. The probability of the count is the multinomial distribution of (N, V). N is the total number of words in kth topic. V is the total ⃗ is the observation data obtained number of words. from statistical counting of the topic word in the mth article according to the article number from 1 to k. The probability of the count is the multiple distribution of (N, K), N is the total number of words in the mth article, K is the total number of topics [8]. B. Research data Artificial intelligence mainly includes six study aspects: knowledge representation and processing, natural language processing, machine learning, machine perception and pattern recognition, intelligent system and application, cognition and neuroscience inspired artificial intelligence. Because of its wide range of contents, the retrieval scheme is based on the method of literature research [9-15] and expert consultation. Based on the data of artificial intelligence related fund projects funded by NSF in the United States, the time span is from January 1, 2013 to June 1, 2018. The retrieval is as follows: D. Front topic identification index Using the funding time index (FTI), the funding amount index (FAI) and the centrality index (ECI), which are put forward by Wang Yuyue [3] and so on, as the research front topic identification indexes. According to the Pareto Law, the first 20% value of the three indexes were taken as the threshold value of front topic identification indexes, as shown in Table I. The decision basis is that when the three indexes of research topic are all greater than (including equal to) the threshold value of front topic identification indexes, it is judged as the research front topic. (1) (robot*) OR (neural network) OR ((speech OR image) AND recognition) OR (machine learning) OR (computer vision) OR (natural language processing) OR (intelligen* system) OR (fuzzy logic) OR (brain AND (machine OR T ABLE Ⅰ. T HRESHOLD VALUE OF FRONT TOPIC IDENTIFICATION INDEXES ACCORDING TO P ARETO L AW . Year 2018 2017 2016 2015 2014 2013 Total number of fund projects 39 55 38 46 43 37 Total number of top 20% projects 8 11 8 9 9 7 Funding amount index(US$/Project) 353545 540969. 38 627436. 74 690815. 9 900555. 62 1915062. 86 Funding time index(year/project) 4. 7 4. 07 4. 25 4. 76 5. 51 6. 43 Centrality index 7 12 11 18 9 9 an annual increasing trend. Especially in 2015, the number of projects approved nearly doubled compared with the previous years, which indicates that since 2015, the United States Scientific Foundation has attached great importance to the research and development of artificial intelligence. The funding has increased substantially. Since the data acquisition deadline is June 1, 2018, the number of projects approved in 2018 is small. III. R ESULTS A. Annual distribution of the project number funded by NSF in artificial intelligence field From 2013 to 2018 years, the number of projects approved by NSF in the field of artificial intelligence in the United States was 642, 1318, 2110, 2343, 2469, 584, showing 83