TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

Zhang, Zizhao; Chen, Pingjun; Sapkota, Manish; Yang, Lin

Computer Science > Computer Vision and Pattern Recognition

arXiv:1708.03070 (cs)

[Submitted on 10 Aug 2017]

Title:TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

Authors:Zizhao Zhang, Pingjun Chen, Manish Sapkota, Lin Yang

View PDF

Abstract:In this paper, we introduce the semantic knowledge of medical images from their diagnostic reports to provide an inspirational network training and an interpretable prediction mechanism with our proposed novel multimodal neural network, namely TandemNet. Inside TandemNet, a language model is used to represent report text, which cooperates with the image model in a tandem scheme. We propose a novel dual-attention model that facilitates high-level interactions between visual and semantic information and effectively distills useful features for prediction. In the testing stage, TandemNet can make accurate image prediction with an optional report text input. It also interprets its prediction by producing attention on the image and text informative feature pieces, and further generating diagnostic report paragraphs. Based on a pathological bladder cancer images and their diagnostic reports (BCIDR) dataset, sufficient experiments demonstrate that our method effectively learns and integrates knowledge from multimodalities and obtains significantly improved performance than comparing baselines.

Comments:	MICCAI2017 Oral
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1708.03070 [cs.CV]
	(or arXiv:1708.03070v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1708.03070

Submission history

From: Zizhao Zhang [view email]
[v1] Thu, 10 Aug 2017 04:12:00 UTC (1,439 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-08

Change to browse by:

References & Citations

1 blog link

(what is this?)

DBLP - CS Bibliography

listing | bibtex

Zizhao Zhang
Pingjun Chen
Manish Sapkota
Lin Yang

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:TandemNet: Distilling Knowledge from Medical Images Using Diagnostic Reports as Optional Semantic References

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators