pith. sign in

hub Mixed citations

Nougat: Neural Optical Understanding for Academic Documents

Mixed citation behavior. Most common role is background (56%).

31 Pith papers citing it
Background 56% of classified citations
abstract

Scientific knowledge is predominantly stored in books and scientific journals, often in the form of PDFs. However, the PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents. The proposed approach offers a promising solution to enhance the accessibility of scientific knowledge in the digital age, by bridging the gap between human-readable documents and machine-readable text. We release the models and code to accelerate future work on scientific text recognition.

hub tools

citation-role summary

background 6 method 2 baseline 1

citation-polarity summary

representative citing papers

The Shrinking Lifespan of LLMs in Science

cs.DL · 2026-04-08 · unverdicted · novelty 7.0

LLM adoption in science follows a compressing inverted-U trajectory where release year predicts time-to-peak and lifespan better than model attributes.

Ryze: Evidence-Enriched Data Synthesis from Biomedical Papers

cs.AI · 2026-05-30 · unverdicted · novelty 6.0

Ryze automates evidence-enriched QA synthesis from biomedical papers to produce BioVLM-8B, which reaches 48.0% weighted accuracy on LAB-Bench (+12.6pp over base, +3.8pp over GPT-5.2) at under $200 cost.

DeepSeek-OCR: Contexts Optical Compression

cs.CV · 2025-10-21 · unverdicted · novelty 6.0

DeepSeek-OCR compresses text contexts up to 20x via 2D optical mapping while achieving 97% OCR accuracy below 10x and 60% at 20x, outperforming prior OCR tools with fewer vision tokens.

An AI-ready, Polarized Electron-Positron Collision Dataset

hep-ex · 2026-05-29 · unverdicted · novelty 5.0

Release of an AI-ready dataset containing approximately 660,000 reconstructed polarized e+e- collision events at 91.2 GeV from the SLD experiment, translated from legacy formats with accompanying digitized documentation.

CogVLM2: Visual Language Models for Image and Video Understanding

cs.CV · 2024-08-29 · conditional · novelty 5.0

CogVLM2 family achieves state-of-the-art results on image and video understanding benchmarks through improved visual expert architecture, higher resolution inputs, and automated temporal grounding for videos.

citing papers explorer

Showing 31 of 31 citing papers.