Heterogeneous Hypergraph Learning for Literature Retrieval Based on Citation Intents

Literature retrieval helps scientists find previous work that is relative to their own research or even get new research ideas. However, the discrepancy between retrieval results and the ultimate intention of citation is neglected by most literature retrieval models. Citation intent refers to the researcher’s motivation for citing a paper. A citation intent graph with homogeneous nodes and heterogeneous hyperedges can represent different types of citation intents. By leveraging the citation intent information included in a hypergraph, a retrieval model can guide researchers on where to cite its retrieval result by understanding the citation behaviour in the graph. We present a ranking model called CitenGL (Citation Intent Graph Learning) that aims to extract citation intent information and textual matching signals. The proposed model consists of a heterogeneous hypergraph encoder and a lightweight deep fusion unit for efficiency trade-offs. Compared to traditional literature retrieval, our model fills the gap between retrieval results and citation intention and yields an understandable graph-structured output. We evaluated our model on publicly available full-text paper datasets. Experimental results show that CitenGL outperforms most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating citation intent information with textual information. Further ablation analyses show how citation intent information complements text-matching signals and citation networks.

Project Main Structure

-util
    dataTool  # graph data api
    graph_model
    textEncoderTool  # graph + text encoder (train text embedding)

run_*  # main file

Name		Name	Last commit message	Last commit date
< 8000 div class="LatestCommit-module__Box--En0AE"> Latest commit History 2 Commits
util		util
README.md		README.md
requirements.txt		requirements.txt
run_BM25.py		run_BM25.py
run_CitenGL.py		run_CitenGL.py
run_baseline.py		run_baseline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Heterogeneous Hypergraph Learning for Literature Retrieval Based on Citation Intents

Project Main Structure

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Hipkevin/CitenGL

Folders and files

Latest commit

History

Repository files navigation

Heterogeneous Hypergraph Learning for Literature Retrieval Based on Citation Intents

Project Main Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages