The Citation Cloud of a biomedical article: a free, public, web-based tool enabling citation analysis




Keywords: citation analysis, bibliometrics, information retrieval, evidence-based medicine, science of science


Background: An article’s citations are useful for finding related articles that may not be readily found by keyword searches or textual similarity. Citation analysis is also important for analyzing scientific innovation and the structure of the biomedical literature. We wanted to facilitate citation analysis for the broad community by providing a user-friendly interface for accessing and analyzing citation data for biomedical articles.

Case Presentation: We seeded the Citation Cloud dataset with over 465 million open access citations culled from six sources: PubMed Central, Microsoft Academic Graph, ArnetMiner, Semantic Scholar, Open Citations, and the NIH iCite dataset. We implemented a free, public extension to PubMed that allows any user to visualize and analyze the entire citation cloud around any article of interest, A: the set of articles cited by A, those that cite A, those that are co-cited with A (cited together with A by at least one other article), and those that are bibliographically coupled to A (sharing at least one cited reference with A).
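The four relations that make up the citation cloud can be illustrated with a minimal Python sketch. It is not the tool's actual implementation, which is not described here. It assumes citations are available as (citing, cited) pairs, uses made-up article identifiers, and treats a single shared citing article or a single shared reference as sufficient for co-citation or bibliographic coupling. The function name `citation_cloud` is hypothetical.

```python
from collections import defaultdict


def citation_cloud(edges, a):
    """Return the four relation sets around article `a`.

    `edges` is an iterable of (citing_id, cited_id) pairs (assumed input format).
    """
    refs = defaultdict(set)    # article -> articles it cites
    citers = defaultdict(set)  # article -> articles that cite it
    for citing, cited in edges:
        refs[citing].add(cited)
        citers[cited].add(citing)

    cited_by_a = set(refs.get(a, ()))
    citing_a = set(citers.get(a, ()))

    # Co-cited with A: articles appearing in the reference list of
    # at least one article that also cites A.
    co_cited = set()
    for p in citing_a:
        co_cited |= refs[p]
    co_cited.discard(a)

    # Bibliographically coupled to A: articles sharing at least one
    # cited reference with A.
    coupled = set()
    for r in cited_by_a:
        coupled |= citers[r]
    coupled.discard(a)

    return {
        "cited_by_a": cited_by_a,
        "citing_a": citing_a,
        "co_cited": co_cited,
        "coupled": coupled,
    }


# Toy example with hypothetical identifiers.
edges = [
    ("P1", "A"), ("P1", "B"),
    ("P2", "A"), ("P2", "C"),
    ("A", "R1"), ("A", "R2"),
    ("D", "R1"),
]
cloud = citation_cloud(edges, "A")
print(sorted(cloud["co_cited"]))  # articles cited alongside A by P1 or P2
print(sorted(cloud["coupled"]))   # articles sharing a reference with A
```

In this toy graph, B and C are co-cited with A because P1 and P2 cite them alongside A, and D is bibliographically coupled to A because both cite R1. A real analysis would typically also count how many citing articles or references are shared, to rank the related articles.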

Conclusions: Citation Cloud makes the study of citations much easier for the scientific community, including relatively advanced analyses (co-citations and bibliographic coupling) that cannot be undertaken using other available tools. The tool can be accessed by running any PubMed query on the Anne O’Tate value-added search interface and clicking on the Citations button next to any retrieved article.









Case Report