Skip to main content
Elsevier BV

Datasets within this collection

Filter Results
1970
2023
1970 2023
31 results
  • September 2022 data-update for "Updated science-wide author databases of standardized citation indicators"
    See file 28oct2022_v5_update_release_notes.txt below for detailed explanation of differences between versions 5 and 4. They both use the same data but version 5 has more appropriate subfield assignment. Citation metrics are widely used and misused. We have created a publicly available database of top-cited scientists that provides standardized information on citations, h-index, co-authorship adjusted hm-index, citations to papers in different authorship positions and a composite indicator (c-score). Separate data are shown for career-long and, separately, for single recent year impact. Metrics with and without self-citations and ratio of citations to citing papers are given. Scientists are classified into 22 scientific fields and 174 sub-fields. Field- and subfield-specific percentiles are also provided for all scientists with at least 5 papers. Career-long data are updated to end-of-2021 and single recent year data pertain to citations received during calendar year 2021. The selection is based on the top 100,000 scientists by c-score (with and without self-citations) or a percentile rank of 2% or above in the sub-field. This version (5) is based on the Sept 1, 2022 snapshot from Scopus, updated to end of citation year 2021. This work uses Scopus data provided by Elsevier through ICSR Lab (https://www.elsevier.com/icsr/icsrlab). Calculations were performed using all Scopus author profiles as of September 1, 2022. If an author is not on the list it is simply because the composite indicator value was not high enough to appear on the list. It does not mean that the author does not do good work. PLEASE ALSO NOTE THAT THE DATABASE HAS BEEN PUBLISHED IN AN ARCHIVAL FORM AND WILL NOT BE CHANGED. The published version reflects Scopus author profiles at the time of calculation. We thus advise authors to ensure that their Scopus profiles are accurate. REQUESTS FOR CORRECIONS OF THE SCOPUS DATA (INCLUDING CORRECTIONS IN AFFILIATIONS) SHOULD NOT BE SENT TO US. They should be sent directly to Scopus, preferably by use of the Scopus to ORCID feedback wizard (https://orcid.scopusfeedback.com/) so that the correct data can be used in any future annual updates of the citation indicator databases. The c-score focuses on impact (citations) rather than productivity (number of publications) and it also incorporates information on co-authorship and author positions (single, first, last author). If you have additional questions, please read the 3 associated PLoS Biology papers that explain the development, validation and use of these metrics and databases. (https://doi.org/10.1371/journal.pbio.1002501, https://doi.org/10.1371/journal.pbio.3000384 and https://doi.org/10.1371/journal.pbio.3000918). Finally, we alert users that all citation metrics have limitations and their use should be tempered and judicious. For more reading, we refer to the Leiden manifesto: https://www.nature.com/articles/520429a
    • Software/Code
    • Tabular Data
    • Dataset
    • Text
  • COVID-19: Public health, and societal and psychological impacts datasets
    We selected Public health, and societal and psychological impacts datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • COVID-19: Epidemiology & infectious modelling datasets
    We selected Epidemiology & infectious modelling datasets that are indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic.
    • Collection
  • COVID-19: Genetics, genomics & molecular structure datasets
    We selected Genetics, genomics & molecular structure datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • COVID-19: Vaccine, prevention, diagnosis & treatment datasets
    We selected Vaccine, prevention, diagnosis & treatment datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • Mendeley Data FAIRest Datasets
    A collection of datasets published on Mendeley Data that recognize researchers or research groups who make their research data available for additional research and do so in a way that exemplifies the FAIR data principles – Findable, Accessible, Interoperable, Reusable. Datasets in this collection have been selected by Elsevier's independent Research Data Management Advisory Board. Read Elsevier's community blog - Elsevier Connect - to discover interviews from researchers who published these datasets. * Prof. Zhiyong Shao, Fudan University China: https://www.elsevier.com/connect/spotlighting-fair-data-and-the-researchers-behind-it * Prof Ricardo Sánchez-Murillo, UNA Costa Rica: https://www.elsevier.com/connect/we-dont-want-data-sitting-in-our-desk-says-tropical-cyclone-researcher * Dr. Vanessa Susini, University of Pisa, Italy: https://www.elsevier.com/connect/for-mendeley-data-winner-sharing-fair-data-helps-researchers-learn-from-each-other
    • Collection
  • Elsevier OA CC-BY Corpus
    This is a corpus of 40k (40,001) open access (OA) CC-BY articles from across Elsevier’s journals represent the first cross-discipline research of data at this scale to support NLP and ML research. This dataset was released to support the development of ML and NLP models targeting science articles from across all research domains. While the release builds on other datasets designed for specific domains and tasks, it will allow for similar datasets to be derived or for the development of models which can be applied and tested across domains.
    • Tabular Data
    • Dataset
    • Text
    • File Set
  • ChEMU dataset for information extraction from chemical patents
    The discovery of new chemical compounds and their synthesis process is of great importance to the chemical industry. Patent documents contain critical and timely information about newly discovered chemical compounds, providing a rich resource for chemical research in both academia and industry. Chemical patents are often the initial venues where a new chemical compound is disclosed. Only a small proportion of chemical compounds are ever published in journals and these publications can be delayed by up to 3 years after the patent disclosure. In addition, chemical patent documents usually contain unique information, such as reaction steps and experimental conditions for compound synthesis and mode of action. These details are crucial for the understanding of compound prior art, and provide a means for novelty checking and validation. Due to the high volume of chemical patents, approaches that enable automatic information extraction from these patents are in demand. To develop natural language processing methods for large-scale mining of chemical information from patent texts, a corpus is created providing chemical patent snippets and annotated entities and reaction steps.
    • Dataset
    • Document
    • Text
    • File Set
  • The researcher journey through a gender lens
    Data underlying the analyses in chapters 1, 2, 3, and 5 of the report "The researcher journey through a gender lens" (www.elsevier.com/connect/gender-report), which provides an analysis of the researcher journey, analysed using a gender lens. Data on authors, grantees and patent applicants pertain to researchers active during two periods, 16 geographies, and 26 subject areas and 11 sub-fields of medicine. Theses data are provided at the aggregated level.
    • Tabular Data
    • Dataset
  • Data for: Build it and they will come: The convening power of the SOLEIL Synchrotron facility
    Data supporting the ICSR Perspectives paper: Build it and they will come: The convening power of the SOLEIL Synchrotron facility.
    • Tabular Data
    • Dataset
1