Exploring Embedding Spaces for more Coherent Topic Modeling in Electronic Health Records
Rijcken, Emil; Zervanou, Kalliopi; Spruit, Marco; Mosteiro, Pablo; Scheepers, Floortje; Kaymak, Uzay
(2022)
2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022 - Proceedings, volume 2022-October, pp. 2669 - 2674
Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics, volume 2022-October, pp. 2669 - 2674
2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022, volume 2022-October, pp. 2669 - 2674
(Part of book)
Abstract
The written notes in the Electronic Health Records contain a vast amount of information about patients. Implementing automated approaches for text classification tasks requires the automated methods to be well-interpretable, and topic models can be used for this goal as they can indicate what topics in a text are relevant
... read more
to making a decision. We propose a new topic modeling algorithm, FLSA-E, and compare it with another state-of-the-art algorithm FLSA-W. In FLSA-E, topics are found by fuzzy clustering in a word embedding space. Since we use word embeddings as the basis for our clustering, we extend our evaluation with word-embeddings-based evaluation metrics. We find that different evaluation metrics favour different algorithms. Based on the results, there is evidence that FLSA-E has fewer outliers in its topics, a desirable property, given that within-topic words need to be semantically related.
show less
Download/Full Text
The full text of this publication is not available.
Keywords: Electronic Health Records, Fuzzy Clustering, Fuzzy Methods, Natural Language Processing, Neural Network methods, Psychiatry, Topic Modeling, Word Embeddings, Electrical and Electronic Engineering, Control and Systems Engineering, Human-Computer Interaction
ISSN: 1062-922X
ISBN: 9781665452588
Publisher: Institute of Electrical and Electronics Engineers Inc.
Note: Publisher Copyright: © 2022 IEEE.
(Peer reviewed)