Skip to main content
Elsevier BV

Datasets within this collection

Filter Results
31 resultsSearch results powered by
  • August 2021 data-update for "Updated science-wide author databases of standardized citation indicators"
    Citation metrics are widely used and misused. We have created a publicly available database of over 100,000 top-scientists that provides standardized information on citations, h-index, co-authorship adjusted hm-index, citations to papers in different authorship positions and a composite indicator. Separate data are shown for career-long and single year impact. Metrics with and without self-citations and ratio of citations to citing papers are given. Scientists are classified into 22 scientific fields and 176 sub-fields. Field- and subfield-specific percentiles are also provided for all scientists who have published at least 5 papers. Career-long data are updated to end-of-2020. The selection is based on the top 100,000 by c-score (with and without self-citations) or a percentile rank of 2% or above. The dataset and code provides an update to previously released version 1 data under https://doi.org/10.17632/btchxktzyw.1; The version 2 dataset is based on the May 06, 2020 snapshot from Scopus and is updated to citation year 2019 available at https://doi.org/10.17632/btchxktzyw.2 This version (3) is based on the Aug 01, 2021 snapshot from Scopus and is updated to citation year 2020.
    • Software/Code
    • Tabular Data
    • Dataset
  • COVID-19: Public health, and societal and psychological impacts datasets
    We selected Public health, and societal and psychological impacts datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • COVID-19: Epidemiology & infectious modelling datasets
    We selected Epidemiology & infectious modelling datasets that are indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic.
    • Collection
  • COVID-19: Genetics, genomics & molecular structure datasets
    We selected Genetics, genomics & molecular structure datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • COVID-19: Vaccine, prevention, diagnosis & treatment datasets
    We selected Vaccine, prevention, diagnosis & treatment datasets indexed by the Mendeley Data Search engine on the 2019-present COVID-19 / Coronavirus pandemic. The aim was to make it easier to find potentially relevant datasets for this specific topic
    • Collection
  • Mendeley Data FAIRest Datasets
    A collection of datasets published on Mendeley Data that recognize researchers or research groups who make their research data available for additional research and do so in a way that exemplifies the FAIR data principles – Findable, Accessible, Interoperable, Reusable. Datasets in this collection have been selected by Elsevier's independent Research Data Management Advisory Board. Read Elsevier's community blog - Elsevier Connect - to discover interviews from researchers who published these datasets. * Prof. Zhiyong Shao, Fudan University China: https://www.elsevier.com/connect/spotlighting-fair-data-and-the-researchers-behind-it * Prof Ricardo Sánchez-Murillo, UNA Costa Rica: https://www.elsevier.com/connect/we-dont-want-data-sitting-in-our-desk-says-tropical-cyclone-researcher * Dr. Vanessa Susini, University of Pisa, Italy: https://www.elsevier.com/connect/for-mendeley-data-winner-sharing-fair-data-helps-researchers-learn-from-each-other
    • Collection
  • Elsevier OA CC-BY Corpus
    This is a corpus of 40k (40,001) open access (OA) CC-BY articles from across Elsevier’s journals represent the first cross-discipline research of data at this scale to support NLP and ML research. This dataset was released to support the development of ML and NLP models targeting science articles from across all research domains. While the release builds on other datasets designed for specific domains and tasks, it will allow for similar datasets to be derived or for the development of models which can be applied and tested across domains.
    • Tabular Data
    • Dataset
    • Text
    • File Set
  • ChEMU dataset for information extraction from chemical patents
    The discovery of new chemical compounds and their synthesis process is of great importance to the chemical industry. Patent documents contain critical and timely information about newly discovered chemical compounds, providing a rich resource for chemical research in both academia and industry. Chemical patents are often the initial venues where a new chemical compound is disclosed. Only a small proportion of chemical compounds are ever published in journals and these publications can be delayed by up to 3 years after the patent disclosure. In addition, chemical patent documents usually contain unique information, such as reaction steps and experimental conditions for compound synthesis and mode of action. These details are crucial for the understanding of compound prior art, and provide a means for novelty checking and validation. Due to the high volume of chemical patents, approaches that enable automatic information extraction from these patents are in demand. To develop natural language processing methods for large-scale mining of chemical information from patent texts, a corpus is created providing chemical patent snippets and annotated entities and reaction steps.
    • Dataset
    • Document
    • Text
    • File Set
  • The researcher journey through a gender lens
    Data underlying the analyses in chapters 1, 2, 3, and 5 of the report "The researcher journey through a gender lens" (www.elsevier.com/connect/gender-report), which provides an analysis of the researcher journey, analysed using a gender lens. Data on authors, grantees and patent applicants pertain to researchers active during two periods, 16 geographies, and 26 subject areas and 11 sub-fields of medicine. Theses data are provided at the aggregated level.
    • Tabular Data
    • Dataset
  • Data for: Build it and they will come: The convening power of the SOLEIL Synchrotron facility
    Data supporting the ICSR Perspectives paper: Build it and they will come: The convening power of the SOLEIL Synchrotron facility.
    • Tabular Data
    • Dataset
1