[BibTeX] [RIS]
Diversity-Augmented Diffusion Network With LLM Assistance For Radiology Report Generation
Tipo de publicação: Artigo
Citação:
Journal: WWW '25: Companion Proceedings of the ACM on Web Conference 2025
Ano: 2025
Mês: May
Páginas: 2288-2296
ISSN: 979-8-4007-1331-6/2025/04
DOI: https://doi.org/10.1145/3701716.3717555
Resumo: Radiology report generation (RRG) is a demanding yet challenging task that involves producing multi-sentence diagnostic narratives, requiring long-form text with high diversity while addressing inherent data bias. Sentence-level diversity is therefore crucial for capturing varying diagnostic details across multiple regions of interest (ROIs) within a single report, yet it remains underexplored in the field. In this paper, we propose DADNET, a novel diffusion-based framework that leverages the inherent ability of diffusion models to generate diverse text. We make the first attempt to integrate large language models (LLMs) to bridge the inherent training-inference gap in diffusion models. Specifically, LLMs are used to draft a preliminary report, which is subsequently incorporated into the diffusion process to enhance report diversity. Additionally, we introduce a bias equalization technique using domain-specific priors to mitigate data distribution biases, improving the quality and reliability of generated reports under various scenarios. Experimental results demonstrate that DADNET outperforms existing approaches under the same non-autoregressive (NAR) mechanism and sets a new benchmark for diversity in RRG. This work underscores the importance of diversity in RRG and establishes DADNET as a pioneering framework for addressing this challenge with NAR methods.
Palavras-chave: DiffusionModel, large language model, Radiology Report Generation
Autores Long, Jieting
Li, Zhiyuan
Fan, Jianan
Liang, Zhuonan
Ma, Ao
Müller, Henning
Cai, Weidong
Adicionado por: []
Total mark: 0
Anexos
  • Diversity-Augmented Diffusion ...
Notas
    Tópicos