Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction

Hannaneh Hajishirzi; Luheng He; Mari Ostendorf; Yi Luan

arxiv: 1808.09602 · v1 · pith:OYCFPFD4new · submitted 2018-08-29 · 💻 cs.CL

Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction

Yi Luan , Luheng He , Mari Ostendorf , Hannaneh Hajishirzi This is my paper

classification 💻 cs.CL

keywords scientificmulti-taskcoreferenceinformationrelationsconstructionentitiesframework

0 comments

read the original abstract

We introduce a multi-task setup of identifying and classifying entities, relations, and coreference clusters in scientific articles. We create SciERC, a dataset that includes annotations for all three tasks and develop a unified framework called Scientific Information Extractor (SciIE) for with shared span representations. The multi-task setup reduces cascading errors between tasks and leverages cross-sentence relations through coreference links. Experiments show that our multi-task model outperforms previous models in scientific information extraction without using any domain-specific features. We further show that the framework supports construction of a scientific knowledge graph, which we use to analyze information in scientific literature.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CHIMERA: A Knowledge Base of Scientific Idea Recombinations for Research Analysis and Ideation
cs.CL 2025-05 unverdicted novelty 7.0

CHIMERA is the first large-scale mined KB of concept recombinations from scientific literature, created via a new IE task and LLM extraction, with demonstrated uses in pattern analysis and hypothesis generation.
MeasHalu: Mitigation of Scientific Measurement Hallucinations for Large Language Models with Enhanced Reasoning
cs.CL 2026-04 unverdicted novelty 6.0

MeasHalu reduces LLM hallucinations in scientific measurement extraction via a fine-grained taxonomy, reasoning-aware fine-tuning, and progressive rewards, improving accuracy on the MeasEval benchmark.
Capturing Monetarily Exploitable Vulnerability in Smart Contracts via Auditor Knowledge-Learning Fuzzing
cs.CR 2026-04 unverdicted novelty 5.0

FAUDITOR is a specialized fuzzer that detected 220 zero-day monetarily exploitable vulnerabilities in smart contracts by combining finance-interface targeting, NLP from auditor reports, and self-learning.