hub Mixed citations

ArXiv abs/2004.07180 (2020)

Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey, Daniel Weld · 2020 · Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics · DOI 10.18653/v1/2020.acl-main.207 · arXiv 2004.07180

Mixed citation behavior. Most common role is background (33%).

26 Pith papers citing it

274 external citations · Crossref

Background 33% of classified citations

open at publisher browse 26 citing papers arXiv PDF

hub tools

JSON dossier citing papers JSON publisher DOI arXiv source

citation-role summary

background 3 dataset 2 other 1

citation-polarity summary

background 2 unclear 2 use dataset 2

representative citing papers

How Does Research Evolve? Tracing Cross-Domain Trajectories in NLP, ML, and CV with Claim-Grounded Typed Citations

cs.CL · 2026-06-21 · unverdicted · novelty 7.0

SciTraj is the first claim-grounded typed citation graph with 32,559 papers and 573,126 edges across six relation types, plus a temporally split link-prediction benchmark.

Forecasting Conceptual Diffusion in Science: The Case of Quantum Computing

cs.SI · 2026-06-02 · unverdicted · novelty 7.0

LightGBM models on citation and diversity features predict exogenous diffusion of quantum computing concepts with R² up to 0.78 while endogenous reinforcement remains largely unpredictable after growth controls, with replications in other fields.

The Harder Text Embedding Benchmark (HTEB): Beyond One-dimensional Static Robustness

cs.CL · 2026-05-27 · unverdicted · novelty 7.0

HTEB introduces dynamic, multi-axis evaluation of text embedding robustness using LLM transformations, finding decoupled profiles across models and that scaling does not close all robustness gaps.

Re$^2$Math: Benchmarking Theorem Retrieval in Research-Level Mathematics

cs.AI · 2026-05-09 · unverdicted · novelty 7.0

Re²Math is a new benchmark that evaluates AI models on retrieving and verifying the applicability of theorems from math literature to advance steps in partial proofs, accepting any sufficient theorem while controlling for leakage.

Beyond coauthorship: semantic structure and phantom collaborators in transportation research, 1967--2025

cs.DL · 2026-04-26 · unverdicted · novelty 7.0

Phantom collaborators—topically similar authors distant in the coauthor graph—become actual coauthors 16-33 times more often than baselines, with a 68-fold similarity gradient.

MasterSet: A Large-Scale Benchmark for Must-Cite Citation Recommendation in the AI/ML Literature

cs.IR · 2026-04-20 · unverdicted · novelty 7.0

MasterSet is a new large-scale benchmark for must-cite citation recommendation in AI/ML, using LLM-annotated tiers on 150k papers and Recall@K evaluation.

Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation

cs.HC · 2024-09-23 · unverdicted · novelty 7.0

Scideator enables facet-based scientific ideation through LLM-driven extraction, human-guided recombination, analogous retrieval, and facet-grounded novelty verification, showing significantly higher creativity support than a baseline LLM in a user study with CS researchers.

MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval

cs.CL · 2026-06-16 · unverdicted · novelty 6.0

MCompassRAG adds topic metadata to chunk representations and uses LLM distillation to train a lightweight topic-aware retriever, reporting 8.24% average information efficiency gain and over 5x lower latency than strong baselines across six benchmarks.

SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG

cs.CL · 2026-06-16 · unverdicted · novelty 6.0

SproutRAG introduces an attention-guided hierarchical framework that constructs a binary chunking tree for multi-granularity retrieval in RAG systems and reports a 6.1% average gain in information efficiency.

Explainable Forecasting of Scientific Breakthroughs from Concept Network Dynamics

cs.SI · 2026-06-02 · unverdicted · novelty 6.0

A two-stage LightGBM model on 59 features from concept networks forecasts link formation and intensity with ROC-AUC 0.95-0.967 across domains.

Learning Faster with Better Tokens: Parameter-Efficient Vocabulary Adaptation for Specialized Text Summarization

cs.CL · 2026-05-17 · unverdicted · novelty 6.0

Vocabulary adaptation via targeted token addition and replacement improves semantic similarity, domain word usage, and training efficiency for LLM summarization in legal and medical domains.

Unlocking LLM Creativity in Science through Analogical Reasoning

cs.AI · 2026-05-11 · conditional · novelty 6.0

Analogical reasoning increases LLM solution diversity by 90-173% and novelty rate to over 50%, delivering up to 13-fold gains on biomedical tasks including perturbation prediction and cell communication.

CAR: Query-Guided Confidence-Aware Reranking for Retrieval-Augmented Generation

cs.CL · 2026-05-06 · unverdicted · novelty 6.0

CAR reranks documents in RAG by promoting those that increase generator confidence (via answer consistency sampling) and demoting those that decrease it, yielding NDCG@5 gains on BEIR datasets that correlate with F1 improvements.

Aspect-Aware Content-Based Recommendations for Mathematical Research Papers

cs.IR · 2026-05-05 · unverdicted · novelty 6.0

The authors introduce aspect-aware datasets GoldRiM and SilverRiM for math papers and AchGNN, a heterogeneous GNN that outperforms prior methods by jointly modeling textual semantics, citations, and author lineage across aspects.

Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models

cs.IR · 2026-04-27 · conditional · novelty 6.0

RouteHead trains a lightweight router to dynamically select optimal LLM attention heads per query for improved attention-based document re-ranking.

Data, Not Model: Explaining Bias toward LLM Texts in Neural Retrievers

cs.IR · 2026-04-07 · unverdicted · novelty 6.0

Bias toward LLM texts in neural retrievers arises from artifact imbalances between positive and negative documents in training data that are absorbed during contrastive learning.

Beyond Single-Score Ranking: Facet-Aware Reranking for Controllable Diversity in Paper Recommendation

cs.IR · 2026-03-11 · unverdicted · novelty 6.0

SciFACE improves facet-specific paper ranking NDCG scores by training separate cross-encoders for Background and Method similarity on 5,891 GPT-4o-mini labeled pairs, outperforming SPECTER by up to 31 points.

Traditional statistical representations outperform generative AI in identifying expert peer reviewers

cs.IR · 2026-05-18 · unverdicted · novelty 5.0

TF-IDF identifies labeled experts in the top 25 recommendations 79.5% of the time versus 51.5% for GPT-4o mini on an astronomy observatory dataset.

To MRL or not to MRL: Text Embeddings are Robust to Truncation Without Matryoshka Learning, Except In Heavy Truncation Scenarios

cs.LG · 2026-05-15 · unverdicted · novelty 5.0

Truncated embeddings from non-MRL models perform comparably to or better than MRL-trained models for most truncation levels, except heavy truncation of 80% or more.

Contradictions in Context: Challenges for Retrieval-Augmented Generation in Healthcare

cs.IR · 2025-11-10 · unverdicted · novelty 5.0

Contradictions between highly similar medical abstracts degrade the factual accuracy and consistency of LLM responses in retrieval-augmented generation.

Data-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning

cs.CL · 2024-01-07 · unverdicted · novelty 5.0

Data-CUBE applies a two-level curriculum (TSP-based task ordering via simulated annealing plus difficulty-sorted mini-batches) to multi-task instruction tuning and reports gains on MTEB sentence representation tasks.

A Reproducible Benchmark and Evidence-Retrieval Software Framework for Silicon Detector R&D Literature

physics.ins-det · 2026-06-23 · accept · novelty 4.0

Hybrid sparse-dense retrieval achieves Hit@5 of 0.917 on a new curated benchmark of silicon detector papers with released code and annotations.

PeeriScope: A Multi-Faceted Framework for Evaluating Peer Review Quality

cs.CL · 2026-04-27 · unverdicted · novelty 4.0

PeeriScope is an open modular framework that integrates structured features, LLM rubric assessments, and supervised prediction to evaluate peer review quality for self-assessment, editorial triage, and large-scale auditing.

From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems

cs.CL · 2025-07-10 · unverdicted · novelty 4.0

Coreference resolution improves retrieval relevance and QA performance in RAG systems, with mean pooling performing best and smaller models benefiting more.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Human-LLM Compound System for Scientific Ideation through Facet Recombination and Novelty Evaluation cs.HC · 2024-09-23 · unverdicted · none · ref 15
Scideator enables facet-based scientific ideation through LLM-driven extraction, human-guided recombination, analogous retrieval, and facet-grounded novelty verification, showing significantly higher creativity support than a baseline LLM in a user study with CS researchers.

ArXiv abs/2004.07180 (2020)

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer