Improving the Scopus and Aurora queries to identify research that supports the United Nations Sustainable Development Goals (SDGs) 2021
The United Nations Sustainable Development Goals (SDGs) challenge the global community to build a world where no one is left behind. Since 2018, Elsevier have generated SDG search queries to help researchers and institutions track and demonstrate progress towards the SDG targets. At the end of 2018, Elsevier worked on two versions of the SDG queries: one created by the Elsevier Analytical Services group and another by the Science-Metrix group, which had recently become part of Elsevier. At that time, Science-Metrix was creating queries for 5 of the 16 SDGs as part of pro-bono work for UNESCO.
In 2020, inspired by the earlier queries, Elsevier, through its Science-Metrix group, took a new approach to mapping publications to the SDGs. Taking customer feedback into account, the group significantly increased the number of search terms used to define each SDG. Those queries were then complemented by a machine learning model, which helped increase recall by approximately 10%. As a result, this year’s “Elsevier 2021 SDG mapping” captures on average twice as many articles as the 2020 version, while keeping precision above 80%. The mapping also overlaps better with SDG queries from other independent projects. Times Higher Education (THE) are using the “Elsevier 2021 SDG mapping” as part of their 2021 Impact Rankings.
The documentation below describes the methods used and shares the queries. For each SDG, you can download the query as a text file, along with an HTML file that describes the methodology used to create the search query and gives additional information such as the most influential keyphrases and journals. The HTML file also breaks the query down into digestible chunks. A separate folder contains the methodology for the machine learning component, along with a sample of the top 100 keyphrases per SDG and a stratified sample of 8,000 EIDs that the model identified across the SDGs.
Additional metadata for Elsevier datasets
| Date the data was collected | 2020-12-15T11:00:00.000Z |