Computer Science > Computation and Language

arXiv:2212.10192 (cs)

[Submitted on 20 Dec 2022 (v1), last revised 6 Jun 2024 (this version, v2)]

Title: Adam: Dense Retrieval Distillation with Adaptive Dark Examples

Authors: Chongyang Tao, Chang Liu, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang

Abstract: To improve the performance of the dual-encoder retriever, one effective approach is knowledge distillation from the cross-encoder ranker. Existing works construct the candidate passages following the supervised learning setting, where a query is paired with a positive passage and a batch of negatives. However, through empirical observation, we find that even the hard negatives from advanced methods are still too trivial for the teacher to distinguish, which prevents the teacher from transferring abundant dark knowledge to the student through its soft labels. To alleviate this issue, we propose ADAM, a knowledge distillation framework that can better transfer the dark knowledge held in the teacher with Adaptive Dark exAMples. Unlike previous works that rely only on one positive and hard negatives as candidate passages, we create dark examples that all have moderate relevance to the query through mixing-up and masking in discrete space. Furthermore, because the quality of knowledge held in different training instances varies, as measured by the teacher's confidence score, we propose a self-paced distillation strategy that adaptively concentrates on a subset of high-quality instances to conduct our dark-example-based knowledge distillation and help the student learn better. We conduct experiments on two widely used benchmarks and verify the effectiveness of our method.
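The ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so every function name, the prefix-based mix-up rule, the `[MASK]` token, the use of KL divergence as the distillation loss, and the choice of "teacher probability on the positive passage" as the confidence score are assumptions made purely for illustration. Which direction of confidence counts as "high quality" is also not stated in the abstract; the sketch simply keeps the highest-confidence instances.

```python
import math

def softmax(scores, temperature=1.0):
    # Numerically stable softmax over one query's candidate scores.
    top = max(scores)
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    # KL(p || q): how far the student distribution q is from the teacher's p.
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def mask_tokens(tokens, positions, mask_token="[MASK]"):
    # Hypothetical discrete-space masking: hide the tokens at `positions`.
    hidden = set(positions)
    return [mask_token if i in hidden else tok for i, tok in enumerate(tokens)]

def mix_up(positive, negative, keep):
    # Hypothetical discrete-space mix-up: the first `keep` tokens come from
    # the positive passage, the remainder from the negative passage.
    return positive[:keep] + negative[keep:]

def self_paced_kd_loss(teacher_scores, student_scores, keep_ratio=0.5):
    # teacher_scores[i] / student_scores[i]: scores over the candidates of
    # query i, with the positive passage at index 0 (an assumption).
    teacher_probs = [softmax(row) for row in teacher_scores]
    student_probs = [softmax(row) for row in student_scores]

    # Assumed confidence score: teacher probability on the positive passage.
    confidence = [probs[0] for probs in teacher_probs]

    # Keep only the top `keep_ratio` fraction of instances by confidence.
    n_keep = max(1, int(len(confidence) * keep_ratio))
    ranked = sorted(range(len(confidence)),
                    key=lambda i: confidence[i], reverse=True)
    selected = ranked[:n_keep]

    # Average the soft-label distillation loss over the selected instances.
    loss = sum(kl_divergence(teacher_probs[i], student_probs[i])
               for i in selected) / n_keep
    return loss, selected
```

In a training loop, `keep_ratio` would typically be raised over time so that more instances are admitted as the student improves; that schedule is likewise an assumption and not something the abstract specifies.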

Comments:	13 pages, 3 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.10192 [cs.CL]
	(or arXiv:2212.10192v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10192

Submission history

From: Chongyang Tao
[v1] Tue, 20 Dec 2022 12:03:19 UTC (95 KB)
[v2] Thu, 6 Jun 2024 15:20:27 UTC (108 KB)
