The NIH Figshare Archive

iCite Database Snapshots (NIH Open Citation Collection)

Version 55 2024-07-16, 13:12
Posted on 2024-07-16 - 13:12 authored by iCite
This is a database snapshot of the iCite database (in CSV and JSON formats). Also provided is a zipped CSV file containing just the citation links in the NIH Open Citation Collection. iCite provides bibliometrics and metadata on publications indexed in PubMed, organized into three modules:

Influence: Delivers metrics of scientific influence, field-adjusted and benchmarked to NIH publications as the baseline

Translation: Measures how Human, Animal, or Molecular/Cellular Biology-oriented each paper is; tracks and predicts citation by clinical articles

Open Cites: Disseminates link-level, public-domain citation data from the NIH Open Citation Collection

Definitions for individual data fields:

pmid: PubMed Identifier, an article ID as assigned in PubMed by the National Library of Medicine

doi: Digital Object Identifier, if available

year: Year the article was published

title: Title of the article

authors: List of author names

journal: Journal name (ISO abbreviation)

is_research_article: Flag indicating whether the Publication Type tags for this article are consistent with those of a primary research article

relative_citation_ratio: Relative Citation Ratio (RCR)--OPA's metric of scientific influence. Field-adjusted, time-adjusted and benchmarked against NIH-funded papers. The median RCR for NIH-funded papers in any field is 1.0. An RCR of 2.0 means a paper is receiving twice as many citations per year as the median NIH-funded paper in its field and year, while an RCR of 0.5 means that it is receiving half as many citations per year. Calculation details are documented in Hutchins et al., PLoS Biol. 2016;14(9):e1002541.

provisional: RCRs for papers published in the previous two years are flagged as "provisional" to reflect that citation metrics for newer articles are not necessarily as stable as they are for older articles. Provisional RCRs are provided for papers published in the previous year if they have received 5 or more citations, even though many of them are less than a year old. All papers published the year before the previous year receive provisional RCRs. The current year is considered to be the NIH Fiscal Year, which starts in October. For example, in July 2019 (NIH Fiscal Year 2019), papers from 2018 receive provisional RCRs if they have 5 citations or more, and all papers from 2017 receive provisional RCRs. In October 2019, at the start of NIH Fiscal Year 2020, papers from 2019 receive provisional RCRs if they have 5 citations or more, and all papers from 2018 receive provisional RCRs.
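
The fiscal-year rule above can be sketched in Python. This is an illustrative reading of the description, not iCite's own code: the function names and the "none"/"final" labels are hypothetical, and the treatment of papers that fall outside the stated cases (fewer than 5 citations in the previous year, or a publication year equal to the current fiscal year) is an assumption.

```python
from datetime import date


def nih_fiscal_year(today: date) -> int:
    """NIH Fiscal Year N starts in October of calendar year N - 1."""
    return today.year + 1 if today.month >= 10 else today.year


def rcr_status(pub_year: int, citation_count: int, today: date) -> str:
    """Classify a paper's RCR as 'provisional', 'final', or 'none' (hypothetical labels)."""
    fy = nih_fiscal_year(today)
    if pub_year == fy - 1:
        # Previous year: provisional only with 5 or more citations.
        # Assumption: otherwise no RCR is reported.
        return "provisional" if citation_count >= 5 else "none"
    if pub_year == fy - 2:
        # Year before the previous year: always provisional.
        return "provisional"
    if pub_year < fy - 2:
        return "final"
    # Assumption: papers dated in the current fiscal year or later have no RCR yet.
    return "none"


# The two examples from the text: July 2019 (FY2019) and October 2019 (FY2020).
print(rcr_status(2018, 5, date(2019, 7, 1)))   # provisional
print(rcr_status(2019, 5, date(2019, 10, 1)))  # provisional
```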

citation_count: Number of unique articles that have cited this one

citations_per_year: Citations per year that this article has received since its publication. If this appeared as a preprint and a published article, the year from the published version is used as the primary publication date. This is the numerator for the Relative Citation Ratio.

field_citation_rate: Measure of the intrinsic citation rate of this paper's field, estimated using its co-citation network.

expected_citations_per_year: Citations per year that NIH-funded articles, with the same Field Citation Rate and published in the same year as this paper, receive. This is the denominator for the Relative Citation Ratio.
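
Since citations_per_year is described as the numerator and expected_citations_per_year as the denominator, the ratio can be recomputed from a row as a sanity check. A minimal sketch, assuming both fields have already been parsed to floats; the function name is hypothetical, and returning None for a missing or zero denominator is an assumption about how to handle such rows.

```python
from typing import Optional


def recompute_rcr(citations_per_year: Optional[float],
                  expected_citations_per_year: Optional[float]) -> Optional[float]:
    """Return citations_per_year / expected_citations_per_year, or None if undefined."""
    if citations_per_year is None or not expected_citations_per_year:
        # Assumption: rows without a usable denominator have no RCR.
        return None
    return citations_per_year / expected_citations_per_year


# A paper cited 6 times per year in a field/year where 3 per year is expected.
print(recompute_rcr(6.0, 3.0))  # 2.0
```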

nih_percentile: Percentile rank of this paper's RCR compared to all NIH publications. For example, 95% indicates that this paper's RCR is higher than that of 95% of all NIH-funded publications.
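
As an illustration of the percentile definition, the sketch below ranks one RCR against a list of benchmark RCRs. The benchmark values are made up for the example, the function name is hypothetical, and counting only strictly lower values is one reading of "higher than"; iCite's exact handling of ties is not stated here.

```python
from bisect import bisect_left
from typing import Sequence


def percentile_rank(rcr: float, benchmark_rcrs: Sequence[float]) -> float:
    """Percentage of benchmark RCRs that are strictly lower than `rcr`."""
    ordered = sorted(benchmark_rcrs)
    if not ordered:
        raise ValueError("benchmark_rcrs must not be empty")
    # bisect_left returns the number of values strictly less than rcr.
    below = bisect_left(ordered, rcr)
    return 100.0 * below / len(ordered)


# Hypothetical benchmark set of four NIH-funded papers.
print(percentile_rank(2.0, [0.5, 1.0, 1.5, 2.5]))  # 75.0
```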

human: Fraction of MeSH terms that are in the Human category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)

animal: Fraction of MeSH terms that are in the Animal category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)

molecular_cellular: Fraction of MeSH terms that are in the Molecular/Cellular Biology category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
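
The three fractions above share one denominator: the article's MeSH terms that fall into any of the three categories. A minimal sketch of that arithmetic, assuming the per-category term counts are already known; the function name and the all-zero result for articles with no categorized terms are assumptions.

```python
from typing import Tuple


def mesh_fractions(human_terms: int, animal_terms: int,
                   molecular_cellular_terms: int) -> Tuple[float, float, float]:
    """Return (human, animal, molecular_cellular) fractions of categorized MeSH terms."""
    total = human_terms + animal_terms + molecular_cellular_terms
    if total == 0:
        # Assumption: no categorized terms means all three fractions are zero.
        return (0.0, 0.0, 0.0)
    return (human_terms / total,
            animal_terms / total,
            molecular_cellular_terms / total)


# An article with 3 Human terms, 1 Animal term, and no Molecular/Cellular terms.
print(mesh_fractions(3, 1, 0))  # (0.75, 0.25, 0.0)
```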

x_coord: X coordinate of the article on the Triangle of Biomedicine

y_coord: Y coordinate of the article on the Triangle of Biomedicine

is_clinical: Flag indicating that this paper meets the definition of a clinical article.

cited_by_clin: PMIDs of clinical articles that this article has been cited by.

apt: Approximate Potential to Translate is a machine learning-based estimate of the likelihood that this publication will be cited in later clinical trials or guidelines.

cited_by: PMIDs of articles that have cited this one.

references: PMIDs of articles in this article's reference list.
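
The cited_by and references fields can be turned into citation links with a few lines of Python. This sketch assumes that, in the CSV snapshot, each of these fields holds PMIDs separated by whitespace; that delimiter is an assumption and should be checked against the actual files. The sample rows and helper names are made up for illustration.

```python
import csv
import io
from typing import Iterable, Iterator, List, Tuple


def parse_pmids(cell: str) -> List[int]:
    """Split a whitespace-separated PMID list (assumed format) into integers."""
    return [int(token) for token in cell.split()]


def citation_edges(rows: Iterable[dict]) -> Iterator[Tuple[int, int]]:
    """Yield (citing_pmid, cited_pmid) pairs from each row's references field."""
    for row in rows:
        citing = int(row["pmid"])
        for cited in parse_pmids(row.get("references") or ""):
            yield (citing, cited)


# Hypothetical three-article snapshot; real files contain many more columns.
sample = io.StringIO(
    "pmid,cited_by,references\n"
    "100,200 300,\n"
    "200,,100\n"
    "300,,100 200\n"
)
edges = list(citation_edges(csv.DictReader(sample)))
print(edges)  # [(200, 100), (300, 100), (300, 200)]
```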

Large CSV files are zipped using zip version 4.5, which is more recent than the default unzip command line utility in some common Linux distributions. These files can be unzipped with tools that support version 4.5 or later such as 7zip.
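
Python's standard zipfile module is another option for extraction. The sketch below assumes that "version 4.5" refers to the ZIP64 extensions, which zipfile is documented to support; the function name and the file names in the usage comment are hypothetical.

```python
import zipfile
from pathlib import Path
from typing import List, Union


def extract_snapshot(zip_path: Union[str, Path], dest_dir: Union[str, Path]) -> List[str]:
    """Extract every member of the archive into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        return archive.namelist()


# Usage (hypothetical file name):
# extract_snapshot("icite_metadata.zip", "icite_snapshot")
```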

Comments and questions can be addressed to




National Institutes of Health Official Duty

I confirm there is no human identifiable information in this dataset.

  • Yes

