Abstract
The growing availability of biomedical research has increased demand for accessible scientific communication, especially for patients, caregivers, and non-expert readers. However, medical texts, such as
journal abstracts, full-length articles, and clinical content, often contain dense information and specialized terminology that can hinder comprehension or cause the ingestion of incomplete or misinterpreted
information. This thesis focuses on lay language summarization: the task of transforming complex
biomedical paragraphs into simplified, layperson-friendly versions without compromising factual correctness, technical fidelity or essential content.
We propose a modular pipeline that leverages domain-specific language models, structured biomedical knowledge, and decision-guided simplification strategies to perform high-quality text simplification
in the medical domain. The system is built around three core components:
1. A BioBERT-based term identification module, trained to detect biomedical jargon with high precision.
2. A UMLS-backed definition retriever, which integrates with standard medical ontologies to provide concise lay-level definitions of complex terms.
3. A powerful LLM, which acts as the brain of the operation and implements the reconstruction and
iterative rewriting of the content into lay-person friendly summaries.
These 3 components were combined into a modular pipeline, capable of simplifying, rephrasing and
reconstructing the content, while incorporating the retrieved definitions and maintaining factual consistency.
The system is evaluated on a corpus of paragraphs and abstracts drawn from PLOS and eLife journals. Evaluation is conducted using a comprehensive set of metrics across three dimensions:
1. Relevance and Content Preservation, using ROUGE, BLEU, METEOR and BERTScore
2. Readability via FKGL, DCRS, CLI, and LENS.
3. Factual Consistency, measured by AlignScore and SummaC.We also conducted an ablation study to understand the contribution of each component. Additionally, to cover dimensions not captured by the automated evaluation metrics, human feedback was also
collected to assess summary quality across readability, clarity, and factuality.
The results demonstrated that the proposed pipeline significantly improved readability and user comprehension over current baselines while maintaining a high degree of factual consistency. The modular
design allows for transparent interpretation of simplification steps and facilitates future integration with
electronic health records, clinical trial explanations, and public health platforms.
Overall, this work marks an important step towards accessible medical communication, leveraging
the power of domain-specific AI to bridge the knowledge gap between experts and the public.