Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial
Tissot, Hegler C; Shah, Anoop D; Brealey, David; Harris, Steve; Agbakoba, Ruth; Folarin, Amos; Romao, Luis; Roguski, Lukasz; Dobson, Richard; Asselbergs, Folkert W
(2020) IEEE journal of biomedical and health informatics, volume 24, issue 10, pp. 2950 - 2959
(Article)
Abstract
Clinical trials often fail to recruit an adequate number of appropriate patients. Identifying eligible trial participants is resource-intensive when relying on manual review of clinical notes, particularly in critical care settings where the time window is short. Automated review of electronic health records (EHR) may help, but much of the
... read more
information is in free text rather than a computable form. We applied natural language processing (NLP) to free text EHR data using the CogStack platform to simulate recruitment into the LeoPARDS study, a clinical trial aiming to reduce organ dysfunction in septic shock. We applied an algorithm to identify eligible patients using a moving 1-hour time window, and compared patients identified by our approach with those actually screened and recruited for the trial, for the time period that data were available. We manually reviewed records of a random sample of patients identified by the algorithm but not screened in the original trial. Our method identified 376 patients, including 34 patients with EHR data available who were actually recruited to LeoPARDS in our centre. The sensitivity of CogStack for identifying patients screened was 90% (95% CI 85%, 93%). Of the 203 patients identified by both manual screening and CogStack, the index date matched in 95 (47%) and CogStack was earlier in 94 (47%). In conclusion, analysis of EHR data using NLP could effectively replicate recruitment in a critical care trial, and identify some eligible patients at an earlier stage, potentially improving trial recruitment if implemented in real time.
show less
Download/Full Text
The full text of this publication is not available.
Keywords: electronic medical records, Health information management, natural language processing, patient monitoring, text processing, Health Information Management, Electrical and Electronic Engineering, Biotechnology, Computer Science Applications, Journal Article
ISSN: 2168-2194
Publisher: Institute of Electrical and Electronics Engineers Inc.
Note: Funding Information: Manuscript received August 6, 2019; revised February 1, 2020; accepted February 24, 2020. Date of publication March 9, 2020; date of current version October 5, 2020. This work was supported in part by the National Institute for Health Research (NIHR) University College London Hospitals (UCLH) Biomedical Research Centre (BRC) Clinical and Research Informatics Unit (CRIU), NIHR Health Informatics Collaborative (HIC), in part by awards establishing the Institute of Health Informatics at University College London (UCL), and in part by Health Data Research UK under Grant LOND1, which is funded in part by the UK Medical Research Council, in part by Engineering and Physical Sciences Research Council, in part by Economic and Social Research Council, in part by Department of Health and Social Care (England), in part by Chief Scientist Office of the Scottish Government Health and Social Care Directorates, in part by Health and Social Care Research and Development Division (Welsh Government), in part by Public Health Agency (Northern Ireland), and in part by British Heart Foundation and Wellcome Trust. The work of A. D. Shah was supported by postdoctoral fellowship from THIS Institute. The work of D. Brealey was supported in part by the Division of Critical Care, University College Hospital and in part by NIHR University College London Hospitals Biomedical Research Centre. The work of R. Dobson was supported in part by the NIHR Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and Kings College London, in part by Health Data Research UK which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome Trust, in part by The Big-Data@Heart Consortium, funded by the Innovative Medicines Initiative-2 Joint Undertaking under Grant 116074. This Joint Undertaking receives support from the European Unions Horizon 2020 research and innovation programme and EFPIA, which is chaired by D. E. Grobbee and S. D. Anker, partnering with 20 academic and industry partners and ESC, and in part by The National Institute for Health Research University College London Hospitals Biomedical Research Centre. The work of F. W. Asselbergs was supported by the UCL Hospitals NIHR Biomedical Research Centre. (Corresponding author: Hegler Tissot.) Hegler C. Tissot, Anoop D. Shah, David Brealey, Steve Harris, Ruth Agbakoba, Luis Romao, and Lukasz Roguski are with the Institute of Health Informatics, University College London, London WC1E 6BT, U.K., and with the Health Data Research U.K. London, University College London, London WC1E 6BT, U.K., and also with the University College London Hospitals, London WC1N 3BG, U.K. (e-mail: h.tissot@ucl.ac.uk; a.shah@ucl.ac.uk; d.brealey@nhs.net; doc@steveharris.me; ruth.agbakoba@nhs.net; luis.romao@nhs.net; l.roguski@ucl.ac.uk). Publisher Copyright: © 2013 IEEE. Copyright: Copyright 2020 Elsevier B.V., All rights reserved.
(Peer reviewed)