pith. sign in

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

dataset 1

citation-polarity summary

fields

cs.LG 2 cs.CL 1

years

2026 1 2025 2

roles

dataset 1

polarities

use dataset 1

representative citing papers

Dolph2Vec: Self-Supervised Representations of Dolphin Vocalizations

cs.LG · 2026-06-10 · unverdicted · novelty 7.0

Dolph2Vec is the first species-specific self-supervised model for dolphin vocalizations, trained on longitudinal recordings from five dolphins, that outperforms general baselines on signature whistle classification and detection while producing embeddings aligned with known whistle categories.

Multilingual Vision-Language Models, A Survey

cs.CL · 2025-09-26 · accept · novelty 3.0

The survey identifies a key tension in multilingual vision-language models between language neutrality via contrastive learning and cultural awareness via diverse data, with most benchmarks relying on translation-based evaluation.

citing papers explorer

Showing 3 of 3 citing papers.

  • Dolph2Vec: Self-Supervised Representations of Dolphin Vocalizations cs.LG · 2026-06-10 · unverdicted · none · ref 19

    Dolph2Vec is the first species-specific self-supervised model for dolphin vocalizations, trained on longitudinal recordings from five dolphins, that outperforms general baselines on signature whistle classification and detection while producing embeddings aligned with known whistle categories.

  • Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach cs.LG · 2025-02-07 · unverdicted · none · ref 112

    A recurrent-depth architecture enables language models to improve reasoning performance by iterating computation in latent space, achieving gains equivalent to much larger models on benchmarks.

  • Multilingual Vision-Language Models, A Survey cs.CL · 2025-09-26 · accept · none · ref 100

    The survey identifies a key tension in multilingual vision-language models between language neutrality via contrastive learning and cultural awareness via diverse data, with most benchmarks relying on translation-based evaluation.