RDBU | Repositório Digital da Biblioteca da Unisinos

Anamtech: speech-based automatic structuring of medical anamnesis

Author Fraga, Ygor Allan de;
Advisor Costa, Cristiano André da;
Advisor's Lattes CV http://lattes.cnpq.br/9637121030877187;
Institution Universidade do Vale do Rio dos Sinos;
Title Anamtech: speech-based automatic structuring of medical anamnesis;
Abstract Taking the medical history is critical to diagnosing a patient correctly. Filling out medical documents is costly for the doctor and can cause details of the conversation to go unnoticed, resulting in a poor patient experience or a wrong diagnosis. Improving the experience of both physician and patient is the motivation behind this work. The main objective is to create an application that automatically uses speech recognition to transcribe the interview, identifies the entities relevant to the anamnesis document, and structures a digital document. The developed model, Anamtech, integrates different services to recognize the anamnesis automatically. Voice recognition was used to capture the conversation between doctor and patient, and existing open libraries transcribed the audio into text. The transcribed text was then used to identify the terms essential to the anamnesis, which healthcare professionals reviewed, by means of an entity recognition algorithm. This algorithm was trained beforehand on available anamnesis records that had gone through a labeling process. The last Anamtech component organizes all the recognized entities into a document that follows a defined medical standard. The result is a fully automatic application, ready to use with minimal intervention from the physician. Because the final document is divided by entity, with each entry prefixed by its anamnesis phase, any information in it is easy to change. Overall, the named entity recognition (NER) model, which is the heart of this project, achieved a precision of 85.1%, a recall of 87.6%, and an F1-score of 86.3%. Metrics for each entity were also captured and described. Entities related to patient identification had the best results. Entities associated with symptoms, diseases, and treatments could also be identified, but with some mismatches, because some of them were difficult to classify during pre-processing.;
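As a quick consistency check on the overall figures in the abstract, the F1-score can be recomputed from the reported precision and recall. The sketch below assumes the overall F1 is the harmonic mean of those two overall values, which the record does not state explicitly. The function name `f1_score` is illustrative and does not come from the Anamtech code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both metrics are zero
    return 2 * precision * recall / (precision + recall)


# Overall NER figures reported in the abstract, written as fractions.
precision = 0.851
recall = 0.876

f1 = f1_score(precision, recall)
print(f"F1 = {f1:.1%}")  # prints "F1 = 86.3%"
```

Under that assumption, the recomputed value agrees with the 86.3% stated in the abstract.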
Keywords Speech recognition; Named entity recognition; Anamnesis; Natural Language Processing; Medical Informatics;
Type TCC (undergraduate final thesis);
Defense date 2021-12-09;
URI http://repositorio.jesuita.org.br/handle/UNISINOS/13270;
Level Undergraduate;
Program Information Systems;

