The top 100 most cited articles in medical artificial intelligence: a bibliometric analysis

Subhashaan Sreedharan; Mustafa Mian; Ross A. Robertson; Natalie Yang

doi:10.21037/jmai.2019.11.04

Original Article

The top 100 most cited articles in medical artificial intelligence: a bibliometric analysis

Subhashaan Sreedharan¹, Mustafa Mian¹, Ross A. Robertson¹, Natalie Yang^1,2

¹Melbourne Medical School, University of Melbourne, Melbourne, VIC 3010, Australia; ²Department of Radiology, Austin Health, Heidelberg, VIC 3084, Australia

Contributions: (I) Conception and design: All authors; (II) Administrative support: All authors; (III) Provision of study materials or patients: All authors; (IV) Collection and assembly of data: All authors; (V) Data analysis and interpretation: All authors; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

Correspondence to: Subhashaan Sreedharan. Melbourne Medical School, University of Melbourne, Melbourne, VIC 3010, Australia. Email: subhashaan.sreedharan@gmail.com.

Background: There is growing interest in the use of artificial intelligence (AI) in medicine. Our objective was to identify and analyse the characteristics of the top 100 most cited articles relating to the use of AI in medicine in order to identify research trends and help direct future research.

Methods: A retrospective bibliometric analysis of the 100 most cited articles relating to the use of AI in medicine between the years 1950 and 2019 was performed. Data extracted for analysis included year of publication, authorship, journal title and impact factor, institution, country of origin, article type, keywords, and field of medicine.

Results: The number of citations for the top 100 articles ranged from 176 to 1,475, with a median [interquartile range (IQR)] of 238 [205–347]. The median [IQR] number of citations per year was 21 [16–41]. The majority of the top 100 articles [85] were published in the last two decades. Over half of the top 100 articles [55] originated from the United States. There were 60 original research articles featured, with 11 of these clinical studies. The most represented fields were medical informatics [25] and radiology [21]. Oncology is pioneering the clinical integration of AI applications, while cardiovascular medicine is lacking in AI research despite its high disease burden.

Conclusions: This study highlights that the current citation classics are largely in the non-clinical, experimental phase and have yet to progress to the clinical, integration phase of medical AI.

Keywords: Artificial intelligence (AI); deep learning; machine learning; medical AI

Received: 19 October 2019; Accepted: 11 November 2019; Published: 25 March 2020.

doi: 10.21037/jmai.2019.11.04

Introduction

Artificial intelligence (AI) is a general term used to describe machines and computers performing tasks usually requiring human intelligence (1). There has been interest in the application of AI in medicine since the 1950s (2). Since then, computer scientists and medical researchers have continued to investigate the use of AI in almost every field of medicine (2). Proponents of medical AI argue that it can help the clinician in all aspects of medicine, from formulating a diagnosis to making therapeutic decisions and predicting patient outcomes (3,4).

An extensive body of literature has become available regarding the use of AI in medicine, with considerable variation in the quality of articles, as well as in the study methodologies and AI techniques used. This can lead to difficulty in identifying articles of significance, particularly for clinicians with limited technical knowledge in AI. A bibliometric analysis of the most highly cited publications relating to medical AI may provide a better understanding of the progress made to date in this field, as well as identify areas for future research efforts (5-7). Citation frequency is a method of bibliometric analysis that involves examining the publications that have been most cited by other researchers. Citation count serves as an indicator of the influence and quality of a scientific publication (8).

A number of bibliometric analyses from various fields of medicine have been published (9-13). While the international interest and academic output relating to medical AI has risen significantly in recent years, a bibliometric analysis of the citation classics in this field, to the best of our knowledge, has not yet been performed. As such, this study aimed to identify and examine the characteristics of the top 100 most cited articles relating to the application of AI in medicine.

Methods

Literature search

A retrospective bibliometric analysis of the 100 most cited peer-reviewed journal articles related to the use of AI in medicine was performed in April 2019. Articles from the MEDLINE^® (U.S. National Library of Medicine, Bethesda, USA) database were identified using the Web of Science (Clarivate Analytics, Philadelphia, USA) citation indexing service (14,15). The MEDLINE^® database indexes more than 5,000 journals, comprising more than 25 million references in medicine and life sciences published since 1950. MEDLINE^® uses Medical Subject Headings (MeSH), which imposes uniformity and consistency to the indexing of biomedical literature.

The following search strategy was used: [“artificial intelligence” OR “machine learning” OR “deep learning” OR “natural language processing” OR “support vector machine” OR “naïve bayes” OR “bayesian learning” OR “artificial neural network” OR “random forest” OR “machine intelligence” OR “k-nearest neighbor” OR “decision tree learning” OR “data mining” OR “fuzzy” OR “computational intelligence” OR “computer reasoning”] AND [“medicine” OR “medical” OR “surgery” OR “surgical” OR “healthcare”]. Articles were identified if they included these search terms in either its title, abstract or MeSH terms. There were no restrictions on language or year of publication. A total of 16,025 articles were returned from this search. The top 100 articles ranked by citation count were identified and downloaded to a local database.

Analysis

Analysis of individual articles was performed by three reviewers (S Sreedharan, M Mian, RA Robertson) in order to extract relevant information relating to year of publication, authorship, journal title and impact factor (IF), institution, country of origin, article type, key words and field of medicine. Journal IFs were obtained from the SCImago Journal Rank (Elsevier, Amsterdam, Netherlands) issued in 2017, the most recent year of available data at the time of search. Institution and country of origin were based on the corresponding author’s affiliations. Article type was dichotomised into original research article or review article. The original articles were further dichotomised into clinical or non-clinical, where clinical papers were defined as those where the primary research question was clinical and the study involved human participants or data. Keywords were recorded for each article by screening their titles, abstracts and MeSH terms for the search terms used. Articles that could not be assigned to a specific field of medicine were grouped in a “general” category. The “medical image analysis” category included any papers relating to the analysis of non-radiological medical images, such as histopathological images or clinical photographs. The average citations per year for each article was also included as an adjunctive measure of overall article impact. Average citation per year was calculated by dividing each article’s total number of citations by the number of years since that article had been published.

Results

The top 100 articles relating to the use of AI in medicine were ranked according to citation count (Table 1). The median [IQR] number of citations was 238 [205–347]. The median [IQR] number of citations per year was 21 [16–41]. “The American College of Rheumatology preliminary diagnostic criteria for fibromyalgia and measurement of symptom severity”, published in Arthritis Care & Research in 2010, was ranked first with 1,475 citations. When ranked according to citations per year (Table 2), it became the fourth highest ranked article with an average of 164 citations per year.

Table 1

The 100 “citation classics” regarding the use of artificial intelligence in medicine

Rank	Article	Citations	Citations per year
1	Wolfe F, et al. The American College of Rheumatology preliminary diagnostic criteria for fibromyalgia and measurement of symptom severity. Arthritis Care & Research, 2010;62(5):600-10.	1,475	163.89
2	Radke RJ, et al. Image change detection algorithms: a systematic survey. IEEE Transactions on Image Processing: a publication of the IEEE Signal Processing Society, 2005;14(3):294-307.	930	66.43
3	Strobl C, et al. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychological Methods, 2009;14(4):323-48.	730	73.00
4	Aronson AR. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proceedings. AMIA Symposium, 2001:17-21.	665	36.94
5	Heimann T, Meinzer HP. Statistical shape models for 3D medical image segmentation: a review. Medical Image Analysis, 2009;13(4):543-63.	608	60.80
6	Tu JV. Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. Journal of Clinical Epidemiology, 1996;49(11):1225-31.	587	25.52
7	Gillies RJ, et al. Radiomics: Images Are More than Pictures, They Are Data. Radiology, 2016;278(2):563-77.	582	194.00
8	Noble JA, Boukerroui D. Ultrasound image segmentation: a survey. IEEE Transactions on Medical Imaging, 2006;25(8):987-1010.	552	42.46
9	Rosse C, Mejino JL Jr. A reference ontology for biomedical informatics: the Foundational Model of Anatomy. Journal of Biomedical Informatics, 2003;36(6):478-500.	539	33.69
10	Yao X, Liu Y. A new evolutionary system for evolving artificial neural networks. IEEE Transactions on Neural Networks, 1997;8(3):694-713.	505	22.95
11	Savova GK, et al. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. Journal of the American Medical Informatics Association, 2010;17(5):507-13.	497	55.22
12	Gulshan V, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016;316(22):2402-2410.	485	161.67
13	Reyna VF, et al. How numeracy influences risk comprehension and medical decision making. Psychological Bulletin, 2009;135(6):943-73.	483	48.30
14	Benenson Y, et al. An autonomous molecular computer for logical control of gene expression. Nature, 2004;429(6990):423-9.	482	32.13
15	Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA, 2013;309(13):1351-2.	429	71.50
16	Dreiseitl S, Ohno-Machado L. Logistic regression and artificial neural network classification models: a methodology review. Journal of Biomedical Informatics, 2002;35(5-6):352-9.	427	25.12
17	Mitra S, Hayashi Y. Neuro-fuzzy rule generation: survey in soft computing framework. IEEE Transactions on Neural Networks, 2000;11(3):748-68.	427	22.47
18	Litjens G, et al. A survey on deep learning in medical image analysis. Medical Image Analysis, 2017;42:60-88.	403	201.50
19	Baxt WG. Application of artificial neural networks to clinical medicine. The Lancet, 1995;346(8983):1135-8.	398	16.58
20	Lopes R, Betrouni N. Fractal and multifractal analysis: a review. Medical Image Analysis, 2009;13(4):634-49.	388	38.80
21	Saeed M, et al. Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database. Critical Care Medicine, 2011;39(5):952-60.	374	46.75
22	Qian L, et al. Neural network computation with DNA strand displacement cascades. Nature, 2011;475(7356):368-72.	370	46.25
23	Li K, et al. Optimal surface segmentation in volumetric images--a graph-theoretic approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2006;28(1):119-34.	361	27.77
24	Iizuka N, et al. Oligonucleotide microarray for prediction of early intrahepatic recurrence of hepatocellular carcinoma after curative resection. The Lancet, 2003;361(9361):923-9.	360	22.50
25	Chapman WW, et al.A simple algorithm for identifying negated findings and diseases in discharge summaries. Journal of Biomedical Informatics, 2001;34(5):301-10.	349	19.39
26	Lu, G, Fei B. Medical hyperspectral imaging: a review. Journal of Biomedical Optics, 2014;19(1):10901.	346	69.20
27	Kononenko I. Machine learning for medical diagnosis: history, state of the art and perspective. Artificial Intelligence in Medicine, 2001;23(1):89-109.	332	18.44
28	Ott PA, et al. An immunogenic personal neoantigen vaccine for patients with melanoma. Nature, 2017;547(7662):217-221.	330	165.00
29	Xiong HY, et al. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science, 2015;347(6218):1254806.	328	82.00
30	Friedman C, et al. A general natural-language text processor for clinical radiology. Journal of the American Medical Informatics Association, 1994;1(2):161-74.	325	13.00
31	Forbes SA, et al. The Catalogue of Somatic Mutations in Cancer (COSMIC). Current Protocols in Human Genetics, 2008;Chapter 10:Unit 10.11.	323	29.36
32	Szarfman A, et al. Use of screening algorithms and computer systems to efficiently signal higher-than-expected combinations of drugs and events in the US FDA's spontaneous reports database. Drug Safety, 2002;25(6):381-92.	314	18.47
33	Muller HM, et al. Textpresso: an ontology-based information retrieval and extraction system for biological literature. PLoS Biology, 2004;2(11):e309.	306	20.40
34	Bellazzi R, Zupan B. Predictive data mining in clinical medicine: current issues and guidelines. International Journal of Medical Informatics, 2008;77(2):81-97.	298	27.09
35	Tajbakhsh N, et al. Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? IEEE Transactions on Medical Imaging, 2016;35(5):1299-1312.	294	98.00
36	Kamnitsas K, et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Medical Image Analysis, 2017;36:61-78.	286	143.00
37	Markelj P, et al. A review of 3D/2D registration methods for image-guided interventions. Medical Image Analysis, 2012;16(3):642-61.	286	40.86
38	El-Naqa I, et al. A support vector machine approach for detection of microcalcifications. IEEE Transactions on Medical Imaging, 2002;21(12):1552-63.	284	16.71
39	Lanitis A, et al. Comparing different classifiers for automatic age estimation. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics: a publication of the IEEE Systems, Man, and Cybernetics Society, 2004;34(1):621-8.	280	18.67
40	Rice TW, et al. Cancer of the esophagus and esophagogastric junction: data-driven staging for the seventh edition of the American Joint Committee on Cancer/International Union Against Cancer Cancer Staging Manuals. Cancer, 2010;116(16):3763-73.	277	30.78
41	Sluimer I, et al. Computer analysis of computed tomography scans of the lung: a survey. IEEE Transactions on Medical Imaging, 2006;25(4):385-405.	274	21.08
42	Shahar Y, et al. The Asgaard project: a task-specific framework for the application and critiquing of time-oriented clinical guidelines. Artificial Intelligence in Medicine, 1998;14(1-2):29-51.	272	12.95
43	Denny JC, et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nature Biotechnology, 2013;31(12):1102-10.	265	44.17
44	Zhang DQ, Chen SC. A novel kernelized fuzzy C-means algorithm with application in medical image segmentation. Artificial Intelligence in Medicine, 2004;32(1):37-50.	258	17.20
45	Zhang H, et al. Discontinuation of statins in routine care settings: a cohort study. Annals of Internal Medicine, 2013;158(7):526-34.	256	42.67
46	Bates DW, et al. Detecting adverse events using information technology. Journal of the American Medical Informatics Association, 2003;10(2):115-28.	254	15.88
47	Dayhoff JE, DeLeo JM. Artificial neural networks: opening the black box. Cancer, 2001;91(8 Suppl):1615-35.	253	14.06
48	Uzuner O, et al. 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. Journal of the American Medical Informatics Association, 2011;18(5):552-6.	243	30.38
49	Cios KJ, Moore GW. Uniqueness of medical data mining. Artificial Intelligence in Medicine, 2002;26(1-2):1-24.	241	14.18
50	Erho N, et al. Discovery and validation of a prostate cancer genomic classifier that predicts early metastasis following radical prostatectomy. PloS One, 2013;8(6):e66855.	239	39.83
51	Iorio F, et al. A Landscape of Pharmacogenomic Interactions in Cancer. Cell, 2016; 166(3):740-754.	238	79.33
52	Obermeyer Z, Emanuel EJ. Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. New England Journal of Medicine, 2016;375(13):1216-9.	235	78.33
53	Lisboa, PJ. A review of evidence of health benefit from artificial neural networks in medical intervention. Neural Networks, 2016;375(13):1216-9.	233	13.71
54	Pauker SG, et al. Towards the simulation of clinical cognition. Taking a present illness by computer. The American Journal of Medicine, 1976;60(7):981-96.	233	5.42
55	Muller FJ, et al. Regulatory networks define phenotypic classes of human stem cell lines. Nature, 2008;455(7211):401-5.	232	21.09
56	Meystre SM, et al. Extracting information from textual documents in the electronic health record: a review of recent research. Yearbook of Medical Informatics, 2008:128-44.	229	20.82
57	Nauck D, Kruse R. Obtaining interpretable fuzzy classification rules from medical data. Artificial Intelligence in Medicine, 1999;16(2):149-69.	229	11.45
58	Crum WR, et al. Generalized overlap measures for evaluation and validation in medical image analysis. IEEE Transactions on Medical Imaging, 2006;25(11):1451-61.	228	17.54
59	Saad ZS, et al. A new method for improving functional-to-structural MRI alignment using local Pearson correlation. NeuroImage, 2009;44(3):839-48.	225	22.50
60	Mazurowski MA, et al. Training neural network classifiers for medical decision making: the effects of imbalanced datasets on classification performance. Neural Networks, 2008;21(2-3):427-36.	224	20.36
61	Bigio IJ, et al. Diagnosis of breast cancer using elastic-scattering spectroscopy: preliminary clinical results. Journal of Biomedical Optics, 2000;5(2):221-8.	224	11.79
62	Barnett GO, et al. DXplain. An evolving diagnostic decision-support system. JAMA, 1987; 258(1):67-74.	223	6.97
63	Woodhams R, et al. ADC mapping of benign and malignant breast tumors. Magnetic Resonance in Medical Sciences, 2005;4(1):35-42.	221	15.79
64	Libbrecht MW, Noble WS. Machine learning applications in genetics and genomics. Nature Reviews Genetics, 2015;16(6):321-32.	220	55.00
65	Pedersen T, et al. Measures of semantic similarity and relatedness in the biomedical domain. Journal of Biomedical Informatics, 2007;40(3):288-99.	220	18.33
66	Xu Q, et al. Low-dose X-ray CT reconstruction via dictionary learning. IEEE Transactions on Medical Imaging, 2012;31(9):1682-97.	218	31.14
67	Acampora G, et al. A Survey on Ambient Intelligence in Health Care. Proceedings of the IEEE, 2013; 101(12):2470-2494.	215	35.83
68	Rizk NP, et al. Optimum lymphadenectomy for esophageal cancer. Annals of Surgery, 2010; 251(1):46-50.	212	23.56
69	Lisboa PJ, Taktak AF. The use of artificial neural networks in decision support in cancer: a systematic review. Neural Networks, 2006;19(4):408-15.	212	16.31
70	Terwee CB, et al. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Quality of Life Research, 2009;18(8):1115-23.	210	21.00
71	Friedman C, et al. Automated encoding of clinical documents based on natural language processing. Journal of the American Medical Informatics Association, 2004; 11(5):392-402.	210	14.00
72	Nanni L, et al. Local binary patterns variants as texture descriptors for medical image analysis. Artificial Intelligence in Medicine, 2010;49(2):117-25.	208	23.11
73	Rebhan M, et al. GeneCards: a novel functional genomics compendium with automated data mining and query reformulation support. Bioinformatics, 1998;14(8):656-64.	208	9.90
74	Gurcan MN, et al. Histopathological image analysis: a review. IEEE Reviews in Biomedical Engineering, 2009;2:147-71.	207	20.70
75	Madhavan S, et al. Rembrandt: helping personalized medicine become a reality through integrative translational research. Molecular Cancer Research, 2009;7(2):157-67.	205	20.50
76	Spackman KA, et al. SNOMED RT: a reference terminology for health care. Proceedings: a conference of the American Medical Informatics Association. AMIA Fall Symposium, 1997:640-4.	205	9.32
77	Hripcsak G, et al. Unlocking clinical data from narrative reports: a study of natural language processing. Annals of Internal Medicine, 1995;122(9):681-8.	203	8.46
78	Wolf I, et al. The medical imaging interaction toolkit. Medical Image Analysis, 2005;9(6):594-604.	202	14.43
79	Cruz JA, Wishart DS. Applications of machine learning in cancer prediction and prognosis. Cancer Informatics, 2007;2:59-77.	197	16.42
80	Chuang LY, et al. Improved binary PSO for feature selection using gene expression data. Computational Biology and Chemistry, 2008;32(1):29-37.	194	17.64
81	Rindflesch TC, Fiszman M. The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. Journal of Biomedical Informatics, 2003;36(6):462-77.	192	12.00
82	Nadkarni PM, et al. Chapman, Natural language processing: an introduction. Journal of the American Medical Informatics Association, 2011; 18(5):544-51.	191	23.88
83	Xu H, et al. MedEx: a medication information extraction system for clinical narratives. Journal of the American Medical Informatics Association, 2010;17(1):19-24.	190	21.11
84	Grabner G, et al.Symmetric atlasing and model based segmentation: an application to the hippocampus in older adults. International Conference on Medical Image Computing and Computer-Assisted Intervention, 2006;9(Pt 2):58-66.	190	14.62
85	Rossini PM, et al. Double nerve intraneural interface implant on a human amputee for robotic hand control. Clinical Neurophysiology, 2010;121(5):777-83.	188	20.89
86	Ritchie MD, et al. Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. American Journal of Human Genetics, 2010; 86(4):560-72.	188	20.89
87	Shen D, et al. Deep Learning in Medical Image Analysis. Annual Review of Biomedical Engineering, 2017;19:221-248.	187	93.50
88	Wein W, et al. Automatic CT-ultrasound registration for diagnostic imaging and image-guided intervention. Medical Image Analysis, 2008;12(5):577-85.	187	17.00
89	Abbass HA. An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine, 2002;25(3):265-81.	186	10.94
90	Zang Y, et al. Flexible suspended gate organic thin-film transistors for ultra-sensitive pressure detection. Nature Communications, 2015;6:6269.	184	46.00
91	Li BN, et al. Integrating spatial fuzzy clustering with level set methods for automated medical image segmentation. Computers in Biology and Medicine, 2011; 41(1):1-10.	183	22.88
92	Heinrich MP, et al. MIND: modality independent neighbourhood descriptor for multi-modal deformable registration. Medical Image Analysis, 2012;16(7):1423-35.	182	26.00
93	Bodenreider O, Stevens R. Bio-ontologies: current trends and future directions. Briefings in Bioinformatics, 2006;7(3):256-74.	182	14.00
94	Friedman C, et al. GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles. Bioinformatics, 2001;17 Suppl 1:S74-82.	182	10.11
95	Fox J, et al. Disseminating medical knowledge: the PROforma approach. Artificial Intelligence in Medicine, 1998;14(1-2):157-81.	181	8.62
96	Musen MA. Dimensions of knowledge sharing and reuse. Computers and Biomedical Research, 1992;25(5):435-67.	180	6.67
97	Duda RO, Shortliffe EH. Expert Systems Research. Science, 1983;220(4594):261-8.	180	5.00
98	Sharma N, Aggarwal LM. Automated medical image segmentation techniques. Journal of Medical Physics, 2010;35(1):3-14.	177	19.67
99	Sorzano CO, et al. Elastic registration of biological images using vector-spline regularization. IEEE Transactions on Biomedical Engineering, 2005; 52(4):652-63.	177	12.64
100	Hripcsak G, et al. Rationale for the Arden Syntax. Computers and Biomedical Research, 1994;27(4):291-324.	176	7.04

Table 2

Top 20 articles by “citations per year”

Rank	Article	Citations per year	Citations	Rank by citations
1	Litjens G, et al. A survey on deep learning in medical image analysis. Medical Image Analysis, 2017;42:60-88.	201.50	403	18
2	Gillies RJ, et al. Radiomics: Images Are More than Pictures, They Are Data. Radiology, 2016;278(2):563-77.	194.00	582	7
3	Ott PA, et al. An immunogenic personal neoantigen vaccine for patients with melanoma. Nature, 2017;547(7662):217-221.	165.00	330	28
4	Wolfe F, et al. The American College of Rheumatology preliminary diagnostic criteria for fibromyalgia and measurement of symptom severity. Arthritis Care & Research, 2010;62(5):600-10.	163.89	1475	1
5	Gulshan V, et al. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 2016;316(22):2402-2410.	161.67	485	12
6	Kamnitsas K, et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Medical Image Analysis, 2017;36:61-78.	143.00	286	36
7	Tajbakhsh N, et al. Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? IEEE Transactions on Medical Imaging, 2016; 35(5):1299-1312.	98.00	294	25
8	Shen D, et al. Deep Learning in Medical Image Analysis. Annual Review of Biomedical Engineering, 2017;19:221-248.	93.50	187	87
9	Xiong HY, et al. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science, 2015;347(6218):1254806.	82.00	328	29
10	Iorio F, et al. A Landscape of Pharmacogenomic Interactions in Cancer. Cell, 2016;166(3):740-754.	79.33	238	51
11	Obermeyer Z, Emanuel EJ. Predicting the Future - Big Data, Machine Learning, and Clinical Medicine. The New England Journal of Medicine, 2016; 375(13):1216-9.	78.33	235	52
12	Strobl C, et al. An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychological Methods, 2009;14(4):323-48.	73.00	730	3
13	Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA, 2013;309(13):1351-2.	71.50	429	15
14	Lu G, Fei B. Medical hyperspectral imaging: a review. Journal of Biomedical Optics, 2014;19(1):10901.	69.20	346	26
15	Radke RJ, et al. Image change detection algorithms: a systematic survey. IEEE transactions on image processing: a publication of the IEEE Signal Processing Society, 2005; 14(3):294-307.	66.43	930	2
16	Heimann T, Meinzer HP. Statistical shape models for 3D medical image segmentation: a review. Medical Image Analysis, 2009;13(4):543-63.	60.80	608	5
17	Savova GK, et al. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. Journal of the American Medical Informatics Association, 2010;17(5):507-13.	55.22	497	11
18	Libbrecht MW, Noble WS. Machine learning applications in genetics and genomics. Nature reviews. Genetics, 2015;16(6):321-32.	55.00	220	64
19	Reyna VF, et al. How numeracy influences risk comprehension and medical decision making. Psychological Bulletin, 2009; 135(6):943-73.	48.30	483	13
20	Saeed M, et al. Multiparameter Intelligent Monitoring in Intensive Care II: a public-access intensive care unit database. Critical Care Medicine, 2011;39(5):952-60.	46.75	374	21

Only 15 articles were published prior to 2000. Thirty-five articles were published within the last decade, and the majority of articles were published between 2000 and 2010 (Table 3). The oldest article on the list was “Towards the stimulation of clinical cognition. Taking a present illness by computer” published in The American Journal of Medicine in 1976. This original research article presented a novel computer software that was developed to take a clinical history for a patient presenting with oedema in order to determine the most likely underlying illness.

Table 3

The number of articles per decade

Decade	No of articles
1970–1979	1
1980–1989	2
1990–1999	12
2000–2009	50
2010–2019	35

A total of 55 journals contributed articles to the top 100 list, with 15 contributing two or more articles. With eight articles each, Artificial Intelligence in Medicine (IF: 3.62) and Medical Image Analysis (IF: 6.50) contributed the most articles, followed by Journal of the American Medical Informatics Association (IF: 3.97) and Journal of Biomedical Informatics (IF: 7.22) with seven articles each. The journals with the highest IFs were New England Journal of Medicine (IF: 42.18) and Nature Reviews Genetics (IF: 38.94), each contributing one article.

There were 16 institutions contributing two or more articles each to the top 100 list. Harvard Medical School was the biggest contributor with five articles, followed by Vanderbilt University [3] and the National Library of Medicine [3]. The United States of America was the leading country of origin with 55 articles, followed by the United Kingdom [9] and Canada [5]. A further nine countries contributed two or more articles each (Figure 1).

Figure 1 The countries of origin of the top 100 articles.

The top 100 articles list comprised 60 original research articles and 40 review articles. The 60 original research articles included 11 clinical studies (Figure 2). Of note, 8 of the 11 clinical papers were published in the last decade, and 6 of the 11 clinical studies were related to oncology. The most frequently represented field in the top 100 list was medical informatics [25], followed by radiology [21], oncology [13] and non-radiological medical image analysis [7] (Figure 3). Papers relating to medical image analysis had the highest average citations per year at 69.03, followed by radiology (43.15) and genetics (42.16). The 11 clinical papers were from oncology [6], ophthalmology [1], internal medicine [2], critical care [1] and radiology [1].

Figure 2 Breakdown of the top 100 list by article type.

Figure 3 Breakdown of the top 100 list by field.

“Artificial intelligence” [46], “natural language processing” [20], “machine learning” [18], “data mining” [14] and “artificial neural network” [13] were the most frequently used keywords among the top 100 list (Table 4). Eighteen of the 20 articles that included the keyword “natural language processing” were from the field of medical informatics.

Table 4

The keywords used in the top 100 articles

Institution	No of articles
Artificial intelligence	46
Natural language processing	20
Machine learning	18
Data mining	14
Artificial neural network	13
Fuzzy	6
Random forest	5
Deep learning	4
Support vector machine	4
K-nearest neighbor	1

Discussion

There has been a recent surge of interest in the use of AI in medicine. However, many of the top 100 articles identified in this study were of low-level evidence and limited clinical significance, comprised of review articles and commentaries providing only a general overview of medical AI. Original research articles investigating the use of AI in clinical populations were lacking, with only 11 of the top 100 articles identified as clinical studies. The lack of highly cited clinical studies demonstrates the need for more studies investigating the integration of AI into clinical medicine. High-level evidence, including randomised controlled trials and meta-analyses, is required for clinicians to gain confidence in the capabilities of AI. Improved collaboration between clinicians and computer scientists or streamlined pathways for dual qualification in health and computer science may be feasible strategies to facilitate improved medical AI research.

Medical informatics contributed the most articles to the top 100 list. With the implementation of electronic medical records (EMR), medical informatics is becoming increasingly important. In order to reach their full potential, EMRs require automated applications to manage the huge amounts of clinical information available in them. Within the field of medical informatics, “natural language processing (NLP)” was the most frequently used keyword. NLP began in the 1950s and represents the intersection of AI and linguistics (16). Clinical information is often not coded or structured but recorded in natural language text that may not be readily accessible by informaticians. NLP may be able to overcome this problem by extracting text-based information and structuring it into useful data (17).

Radiology was the leading field of clinical medicine represented in the citation classics, contributing just over one fifth of the top 100 articles. While medical informatics had the highest number of articles, the average annual citations per article in radiology was more than double that of medical informatics, demonstrating the substantial interest in the use of AI in radiology. Radiology is a unique field of medicine in that it encompasses many of the common applications of AI. AI techniques have been experimented with in various aspects of radiology, from assisting clinicians to determine the most appropriate imaging procedure, to image interpretation and computer-assisted diagnosis, and lastly results reporting and the extraction of information from radiologist reports (18).

It is worth noting that there is an overlap in the approaches used in applying AI to radiological and non-radiological medical images. Non-radiological medical image analysis was the fourth leading field in our study. This category refers to histopathological slides and clinical photographs, such as endoscopic images or images of dermatological lesions (19). The separation of this category from radiology was made in order to recognise solely radiological studies from those involving wider medical image analysis. As with radiology, the application of AI in medical image analysis relies on the ability to collect and use large datasets in order to train AI systems in pattern recognition. It is likely that both fields will continue to be a major focus of medical AI in the future (19).

The majority of the clinical studies in the top 100 list were oncological papers. This is likely due to a number of factors. First, cancer is among the leading causes of mortality in developed countries (20), driving a scientific interest to innovate in this field. AI also has the potential to improve outcomes in oncology because of the variety of cancer types and presentations, and the risk of patients being asymptomatic until late and severe stages of disease. Lastly, oncology relies on a range of data rich modalities such as genomics and metabolomics, which enables the generation of large clinical datasets useful in the building and validation of AI models (21). In contrast, AI may be harder to implement in fields with fewer objective investigations and data available, such as psychiatry. Interestingly, cardiovascular disease did not feature in the top 100 list despite being the leading cause of mortality globally (20). This highlights a potential mismatch between disease burden and AI research efforts.

There are several limitations of our study that should be considered. A source of bias in the use of citation analysis is that older articles are more likely to be cited, independent of quality of the article. Further, total citation count does not provide information about the temporal profile of citations for each paper. We addressed this by including an average annual citation number as another indicator for article impact and contemporary influence (13). The use of journal IFs from 2017 only does not account for changes in journal IFs over time and does not account for the journal IF at the time of article publication. Another limitation is that the search terms used in our study may have excluded some relevant or highly cited articles (12). We attempted to mitigate this risk by including both general search terms, such as “artificial intelligence” or “machine learning”, as well as detailed search terms of specific AI techniques, such as “random forest” or “neural network”. It should be noted that applying these search terms to only titles, abstracts and MeSH keywords meant that articles may have not employed the mentioned AI technique, but only discussed it. However, this still highlights the AI techniques most frequently discussed across the highly cited literature in the field of medical AI.

Conclusions

This study provides a comprehensive overview of the top 100 most cited articles relating to the use of AI in medicine over the past 70 years. It highlights that the current citation classics are largely in the non-clinical, experimental phase and have yet to progress to the clinical, integration phase of medical AI. While medical informatics and radiology were heavily featured in the citation classics, oncology was the leading field with clinical integration of AI. There is an apparent mismatch between disease burden and AI research efforts, with a lack of representation of cardiovascular medicine in the top 100 list despite cardiovascular disease being the leading cause of mortality globally. These results offer important insights into current research trends in medical AI and could help direct future research in this highly active and exciting field.

Acknowledgments

Funding: None.

Footnote

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at http://dx.doi.org/10.21037/jmai.2019.11.04). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.

References

Hamet P, Tremblay J. Artificial intelligence in medicine. Metabolism 2017;69S:S36-S40. [Crossref] [PubMed]
Ramesh A, Kambhampati C, Monson JR, et al. Artificial intelligence in medicine. Ann R Coll Surg Engl 2004;86:334. [Crossref] [PubMed]
Schwartz WB, Patil RS, Szolovits P. Artificial intelligence in medicine. N Engl J Med 1987;316:685-8. [Crossref] [PubMed]
Szolovits P, Patil RS, Schwartz WB. Artificial intelligence in medical diagnosis. Ann Intern Med 1988;108:80-7. [Crossref] [PubMed]
Choudhri AF, Siddiqui A, Khan NR, et al. Understanding bibliometric parameters and analysis. Radiographics 2015;35:736-46. [Crossref] [PubMed]
Abramo G, D’Angelo CA. Evaluating research: from informed peer review to bibliometrics. Scientometrics 2011;87:499-514. [Crossref]
Goodall A. The place of citations in today's academy. International Higher Education 2015. doi: https://doi.org/10.6017/ihe.2006.43.7888
Godin B. On the origins of bibliometrics. Scientometrics 2006;68:109-33. [Crossref]
Kelly J, Glynn R, O’Briain D, et al. The 100 classic papers of orthopaedic surgery: a bibliometric analysis. J Bone Joint Surg Br 2010;92:1338-43. [Crossref] [PubMed]
Yoon DY, Yun EJ, Ku YJ, et al. Citation classics in radiology journals: the 100 top-cited articles, 1945–2012. AJR Am J Roentgenol 2013;201:471-81. [Crossref] [PubMed]
Brandt JS, Downing AC, Howard DL, et al. Citation classics in obstetrics and gynecology: the 100 most frequently cited journal articles in the last 50 years. Am J Obstet Gynecol 2010;203:355.e1-7. [Crossref] [PubMed]
Mohammed MF, Chahal T, Gong B, et al. Trends in CT colonography: bibliometric analysis of the 100 most-cited articles. Br J Radiol 2017;90:20160755. [Crossref] [PubMed]
Maingard J, Phan K, Ren Y, et al. The 100 most cited articles in the endovascular management of intracranial aneurysms. J Neurointerv Surg 2018;10:859-68. [Crossref] [PubMed]
Kulkarni AV, Aziz B, Shams I, et al. Comparisons of Citations in Web of Science, Scopus, and Google Scholar for Articles Published in General Medical Journals. JAMA 2009;302:1092-6. [Crossref] [PubMed]
Falagas ME, Pitsouni EI, Malietzis GA, et al. Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses. FASEB J 2008;22:338-42. [Crossref] [PubMed]
Nadkarni PM, Ohno-Machado L, Chapman WW. Natural language processing: an introduction. J Am Med Inform Assoc 2011;18:544-51. [Crossref] [PubMed]
Friedman C, Hripcsak G. Natural language processing and its future in medicine. Acad Med 1999;74:890-5. [Crossref] [PubMed]
Kahn CE Jr. Artificial intelligence in radiology: decision support systems. Radiographics 1994;14:849-61. [Crossref] [PubMed]
Litjens G, Kooi T, Bejnordi BE, et al. A survey on deep learning in medical image analysis. Med Image Anal 2017;42:60-88. [Crossref] [PubMed]
World Health Organization. Global health estimates 2016: disease burden by cause, age, sex, by country and by region, 2000–2016. Geneva: World Health Organization, 2018.
Ramaswamy S. Translating cancer genomics into clinical oncology. N Engl J Med 2004;350:1814. [Crossref] [PubMed]

doi: 10.21037/jmai.2019.11.04
Cite this article as: Sreedharan S, Mian M, Robertson RA, Yang N. The top 100 most cited articles in medical artificial intelligence: a bibliometric analysis. J Med Artif Intell 2020;3:3.

The top 100 most cited articles in medical artificial intelligence: a bibliometric analysis

Introduction

Methods

Literature search

Analysis

Results

Table 1

Table 2

Table 3

Table 4

Discussion

Conclusions

Acknowledgments

Footnote

References

Article Options

Download Citation

Share