Knowledge Graph Reasoning with Self-supervised Reinforcement Learning | allainews.com

May 24, 2024, 4:45 a.m. | Ying Ma, Owen Burns, Mingqiu Wang, Gang Li, Nan Du, Laurent El Shafey, Liqiang Wang, Izhak Shafran, Hagen Soltau

cs.LG updates on arXiv.org arxiv.org

arXiv:2405.13640v1 Announce Type: cross
Abstract: Reinforcement learning (RL) is an effective method of finding reasoning pathways in incomplete knowledge graphs (KGs). To overcome the challenges of a large action space, a self-supervised pre-training method is proposed to warm up the policy network before the RL training stage. To alleviate the distributional mismatch issue in general self-supervised RL (SSRL), in our supervised learning (SL) stage, the agent selects actions based on the policy network and learns from generated labels; this self-generation …

arxiv cs.ai cs.cl cs.lg graph knowledge knowledge graph reasoning reinforcement reinforcement learning type

More from arxiv.org / cs.LG updates on arXiv.org

Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior 2 days, 17 hours ago | arxiv.org

arxiv consistent cs.cv cs.lg +6

Machine-learned models for magnetic materials 2 days, 17 hours ago | arxiv.org

abstract arxiv autoencoder cond-mat.mtrl-sci +17

Revisiting RIP guarantees for sketching operators on mixture models 2 days, 17 hours ago | arxiv.org

abstract alternative analysis arxiv +9

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata 2 days, 17 hours ago | arxiv.org

abstract accuracy arxiv assessment +16

Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification 2 days, 17 hours ago | arxiv.org

abstract arxiv audio cs.cv +18

Neural-network quantum state study of the long-range antiferromagnetic Ising chain 2 days, 17 hours ago | arxiv.org

abstract arxiv boltzmann cond-mat.quant-gas +12

Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on … 2 days, 17 hours ago | arxiv.org

abstract arxiv assumptions cs.lg +22

Vortex Feature Positioning: Bridging Tabular IIoT Data and Image-Based Deep Learning 2 days, 17 hours ago | arxiv.org

abstract arxiv cs.cv cs.lg +19

Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret 2 days, 17 hours ago | arxiv.org

abstract algorithms arxiv attention +20

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Werkstudent Product Data Management (m/w/d)

@ ABOUT YOU SE & Co. KG | Hamburg, Germany

View on ai-jobs.net

Data Scientist

@ Meta | Sunnyvale, CA

View on ai-jobs.net

Data Scientist, Analytics

@ Meta | Menlo Park, CA

View on ai-jobs.net

Principal AI Engineer

@ Blankfactor | Romania - Bucharest

View on ai-jobs.net

Data Engineer

@ DigiOutsource | Cape Town - Waterview Park

View on ai-jobs.net