Silhouettes: a graphical aid to the interpretation and validation of cluster analysis

Peter J. Rousseeuw · 1987 · Journal of Computational and Applied Mathematics · DOI 10.1016/0377-0427(87)90125-7

39 Pith papers cite this work, alongside 16,729 external citations (Crossref). Polarity classification is still indexing.

citation-role summary

background: 2 · method: 2

citation-polarity summary

background: 2 · use method: 2

authors

Peter J. Rousseeuw

representative citing papers

An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA

cs.CL · 2026-04-21 · unverdicted · novelty 8.0

InsightGen uses thematic clustering and graph neighborhood selection to generate diverse, relevant insights for open-ended document-grounded questions and releases the SCOpE-QA dataset of 3000 questions.

Fast Computation of Free-Support Wasserstein Medians

stat.CO · 2026-06-17 · unverdicted · novelty 7.0

Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.

Multilingual Coreference Resolution via Cycle-Consistent Machine Translation

cs.CL · 2026-06-03 · unverdicted · novelty 7.0

A cycle-consistent MT pipeline generates and similarity-weights training data for coreference resolution, producing gains on four low-resource languages and enabling the task where no corpora existed.

Code Generation by Differential Test Time Scaling

cs.SE · 2026-05-19 · unverdicted · novelty 7.0

DiffCodeGen clusters code candidates by behavioral similarity from fuzzing-synthesized inputs and selects the largest cluster's medoid, matching or exceeding prior test-time scaling methods with far less token and time cost.

Soft-MSM: Differentiable Context-Aware Elastic Alignment for Time Series

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

Soft-MSM is a smooth, gradient-enabled version of the context-aware MSM distance for time series alignment that outperforms Soft-DTW alternatives in clustering and nearest-centroid classification.

Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication

cs.LG · 2026-06-25 · unverdicted · novelty 6.0

BISN achieves 0.93 mean leave-one-batch-out accuracy on 2700 NIR spectra from three insect species across three batches, outperforming baselines by 4% while decisions align with lipid and protein absorption regions.

Soft Token Alignment for Cross-Lingual Reasoning

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

SOLAR aligns soft-token probability mixtures across languages in embedding space during SFT and raises multilingual reasoning accuracy by up to 17.7 points over the base model.

FOSC-X: An Extended Framework for Optimal Local Cuts and Non-Horizontal Cluster Selection from Clustering Hierarchies

stat.ML · 2026-06-17 · unverdicted · novelty 6.0

FOSC-X uses bounded dynamic programming to compute top-M optimal non-horizontal cuts from clustering hierarchies in linear time, with or without cluster-count constraints.

Multilingual Fine-Tuning via Localized Gradient Conflict Resolution

cs.AI · 2026-06-04 · unverdicted · novelty 6.0

Bucket-Level MOO reformulates multilingual fine-tuning as localized multi-objective optimization and proves it enforces a tighter Pareto stationarity condition while improving cross-lingual performance on four LLMs.

Automatic Discovery of Disease Subgroups by Contrasting with Healthy Controls

cs.LG · 2026-05-20 · conditional · novelty 6.0

Deep UCSL uses a contrastive EM loss on patient-control labels to isolate disease-driven subgroups in medical imaging by suppressing shared healthy variability.

One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception

cs.CV · 2026-05-18 · conditional · novelty 6.0

UniTrans pretrains a bank of translator experts and learns combination coefficients from modality mappings in a scene-invariant latent space to enable zero-shot any-to-any feature translation for heterogeneous collaborative perception.

Geometric Prototype Learning in Quantum Hilbert Space with Matrix Product States

quant-ph · 2026-05-18 · unverdicted · novelty 6.0

A quantum prototype learning scheme encodes class representatives as generative matrix product states and performs classification and clustering via geometric measures in Hilbert space, outperforming classical prototypes on Fashion-MNIST and ECG data.

The Infinite Mutation Engine? Measuring Polymorphism in LLM-Generated Offensive Code

cs.CR · 2026-05-05 · unverdicted · novelty 6.0 · 2 refs

A single commercial LLM can cheaply generate large populations of behaviorally equivalent yet structurally diverse malware payloads.

Generalized Category Discovery in Federated Graph Learning

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

GCD-FGL mitigates neighborhood absorption and global semantic inconsistency in federated generalized category discovery, delivering +4.86 average HRScore gain over baselines on five graph datasets.

Class Angular Distortion Index for Dimensionality Reduction

cs.LG · 2026-05-01 · unverdicted · novelty 6.0

CADI quantifies the preservation of relative cluster angles in low-dimensional projections using internal angles from point triples.

Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts

cs.SE · 2026-04-25 · unverdicted · novelty 6.0

A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across image, text, and Android data.

A Machine Learning Approach to Meteor Classification

astro-ph.EP · 2026-04-24 · unverdicted · novelty 6.0

Machine learning clustering of meteor observations produces a new hardness classification H_class that refines traditional Kb models using more parameters and reveals compositional structure in meteoroid populations.

ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation

cs.IR · 2026-04-14 · unverdicted · novelty 6.0

ClusterRAG applies density-based clustering to user profiles for collaborative retrieval in personalized RAG and reports best performance on LaMP tasks by combining target and similar-user profiles.

AFGNN: API Misuse Detection using Graph Neural Networks and Clustering

cs.SE · 2026-04-09 · unverdicted · novelty 6.0

AFGNN detects API misuses in Java code more effectively than prior methods by representing usage as graphs and clustering learned embeddings from self-supervised training.

Do Good, Stay Longer? Temporal Patterns and Predictors of Newcomer-to-Core Transitions in Conventional OSS and OSS4SG

cs.SE · 2026-01-30 · unverdicted · novelty 6.0

OSS4SG projects retain contributors at 2.2X higher rates with 19.6% higher core status probability than conventional OSS, and a late-spike temporal pattern enables faster core achievement (21 weeks) than early intensive contributions.

LandSegmenter: Towards a Flexible Foundation Model for Land Use and Land Cover Mapping

cs.CV · 2025-11-11 · unverdicted · novelty 6.0

LandSegmenter creates a task-specific foundation model for LULC mapping using weak labels from existing products, an RS adapter, text encoder, and confidence-guided fusion to achieve competitive zero-shot performance across modalities and taxonomies.

The bixplot: A variation on the boxplot suited for bimodal data

stat.ME · 2025-10-10 · unverdicted · novelty 6.0

Presents the bixplot as an extension of the boxplot incorporating contiguous clustering to visualize bimodality and multimodality while displaying individual data points, with Python and R implementations.

A Robust Nonparametric Framework for Detecting Repeated Spatial Patterns

stat.ME · 2025-06-17 · unverdicted · novelty 6.0

A nonparametric framework detects repeated spatial patterns via constrained clustering followed by MMD-based reassignment and block permutation under stationarity and mixing conditions.

Do Waders, Swimmers, and Divers Exist? A GPS-Based Pilot Study of Site-Dependent Visitor Movement in Theme Parks

physics.soc-ph · 2026-06-23 · unverdicted · novelty 5.0

GPS tracking across theme parks shows visitor movement forms a continuum rather than discrete types, diverges from self-reports, and reverses feature relationships from site to site, requiring local calibration.

citing papers explorer

Showing 39 of 39 citing papers.

An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA cs.CL · 2026-04-21 · unverdicted · none · ref 47
InsightGen uses thematic clustering and graph neighborhood selection to generate diverse, relevant insights for open-ended document-grounded questions and releases the SCOpE-QA dataset of 3000 questions.
Fast Computation of Free-Support Wasserstein Medians stat.CO · 2026-06-17 · unverdicted · none · ref 146
Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.
Multilingual Coreference Resolution via Cycle-Consistent Machine Translation cs.CL · 2026-06-03 · unverdicted · none · ref 26
A cycle-consistent MT pipeline generates and similarity-weights training data for coreference resolution, producing gains on four low-resource languages and enabling the task where no corpora existed.
Code Generation by Differential Test Time Scaling cs.SE · 2026-05-19 · unverdicted · none · ref 69
DiffCodeGen clusters code candidates by behavioral similarity from fuzzing-synthesized inputs and selects the largest cluster's medoid, matching or exceeding prior test-time scaling methods with far less token and time cost.
Soft-MSM: Differentiable Context-Aware Elastic Alignment for Time Series cs.LG · 2026-04-30 · unverdicted · none · ref 92
Soft-MSM is a smooth, gradient-enabled version of the context-aware MSM distance for time series alignment that outperforms Soft-DTW alternatives in clustering and nearest-centroid classification.
Batch-Invariant Spectral Intelligence for Robust and Explainable Insect Authentication cs.LG · 2026-06-25 · unverdicted · none · ref 36
BISN achieves 0.93 mean leave-one-batch-out accuracy on 2700 NIR spectra from three insect species across three batches, outperforming baselines by 4% while decisions align with lipid and protein absorption regions.
Soft Token Alignment for Cross-Lingual Reasoning cs.CL · 2026-06-25 · unverdicted · none · ref 48
SOLAR aligns soft-token probability mixtures across languages in embedding space during SFT and raises multilingual reasoning accuracy by up to 17.7 points over the base model.
FOSC-X: An Extended Framework for Optimal Local Cuts and Non-Horizontal Cluster Selection from Clustering Hierarchies stat.ML · 2026-06-17 · unverdicted · none · ref 30
FOSC-X uses bounded dynamic programming to compute top-M optimal non-horizontal cuts from clustering hierarchies in linear time, with or without cluster-count constraints.
Multilingual Fine-Tuning via Localized Gradient Conflict Resolution cs.AI · 2026-06-04 · unverdicted · none · ref 9
Bucket-Level MOO reformulates multilingual fine-tuning as localized multi-objective optimization and proves it enforces a tighter Pareto stationarity condition while improving cross-lingual performance on four LLMs.
Automatic Discovery of Disease Subgroups by Contrasting with Healthy Controls cs.LG · 2026-05-20 · conditional · none · ref 40
Deep UCSL uses a contrastive EM loss on patient-control labels to isolate disease-driven subgroups in medical imaging by suppressing shared healthy variability.
One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception cs.CV · 2026-05-18 · conditional · none · ref 48
UniTrans pretrains a bank of translator experts and learns combination coefficients from modality mappings in a scene-invariant latent space to enable zero-shot any-to-any feature translation for heterogeneous collaborative perception.
Geometric Prototype Learning in Quantum Hilbert Space with Matrix Product States quant-ph · 2026-05-18 · unverdicted · none · ref 40
A quantum prototype learning scheme encodes class representatives as generative matrix product states and performs classification and clustering via geometric measures in Hilbert space, outperforming classical prototypes on Fashion-MNIST and ECG data.
The Infinite Mutation Engine? Measuring Polymorphism in LLM-Generated Offensive Code cs.CR · 2026-05-05 · unverdicted · none · ref 63 · 2 links
A single commercial LLM can cheaply generate large populations of behaviorally equivalent yet structurally diverse malware payloads.
Generalized Category Discovery in Federated Graph Learning cs.LG · 2026-05-05 · unverdicted · none · ref 29
GCD-FGL mitigates neighborhood absorption and global semantic inconsistency in federated generalized category discovery, delivering +4.86 average HRScore gain over baselines on five graph datasets.
Class Angular Distortion Index for Dimensionality Reduction cs.LG · 2026-05-01 · unverdicted · none · ref 42
CADI quantifies the preservation of relative cluster angles in low-dimensional projections using internal angles from point triples.
Empirical Insights of Test Selection Metrics under Multiple Testing Objectives and Distribution Shifts cs.SE · 2026-04-25 · unverdicted · none · ref 68
A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across image, text, and Android data.
A Machine Learning Approach to Meteor Classification astro-ph.EP · 2026-04-24 · unverdicted · none · ref 36
Machine learning clustering of meteor observations produces a new hardness classification H_class that refines traditional Kb models using more parameters and reveals compositional structure in meteoroid populations.
ClusterRAG: Cluster-Based Collaborative Filtering for Personalized Retrieval-Augmented Generation cs.IR · 2026-04-14 · unverdicted · none · ref 86
ClusterRAG applies density-based clustering to user profiles for collaborative retrieval in personalized RAG and reports best performance on LaMP tasks by combining target and similar-user profiles.
AFGNN: API Misuse Detection using Graph Neural Networks and Clustering cs.SE · 2026-04-09 · unverdicted · none · ref 45
AFGNN detects API misuses in Java code more effectively than prior methods by representing usage as graphs and clustering learned embeddings from self-supervised training.
Do Good, Stay Longer? Temporal Patterns and Predictors of Newcomer-to-Core Transitions in Conventional OSS and OSS4SG cs.SE · 2026-01-30 · unverdicted · none · ref 46
OSS4SG projects retain contributors at 2.2X higher rates with 19.6% higher core status probability than conventional OSS, and a late-spike temporal pattern enables faster core achievement (21 weeks) than early intensive contributions.
LandSegmenter: Towards a Flexible Foundation Model for Land Use and Land Cover Mapping cs.CV · 2025-11-11 · unverdicted · none · ref 5
LandSegmenter creates a task-specific foundation model for LULC mapping using weak labels from existing products, an RS adapter, text encoder, and confidence-guided fusion to achieve competitive zero-shot performance across modalities and taxonomies.
The bixplot: A variation on the boxplot suited for bimodal data stat.ME · 2025-10-10 · unverdicted · none · ref 6
Presents the bixplot as an extension of the boxplot incorporating contiguous clustering to visualize bimodality and multimodality while displaying individual data points, with Python and R implementations.
A Robust Nonparametric Framework for Detecting Repeated Spatial Patterns stat.ME · 2025-06-17 · unverdicted · none · ref 12
A nonparametric framework detects repeated spatial patterns via constrained clustering followed by MMD-based reassignment and block permutation under stationarity and mixing conditions.
Do Waders, Swimmers, and Divers Exist? A GPS-Based Pilot Study of Site-Dependent Visitor Movement in Theme Parks physics.soc-ph · 2026-06-23 · unverdicted · none · ref 32
GPS tracking across theme parks shows visitor movement forms a continuum rather than discrete types, diverges from self-reports, and reverses feature relationships from site to site, requiring local calibration.
Pareto-Guided Teacher Alignment for Fair Personalized Text Generation cs.CL · 2026-06-08 · unverdicted · none · ref 263
Fairness mitigation in personalized text generation is objective-dependent with methods occupying different regions of the fairness-personalization Pareto frontier rather than any single strategy dominating all objectives.
On solving symmetric multi-type orthogonal non-negative matrix tri-factorization problem cs.LG · 2026-06-06 · unverdicted · none · ref 33
Two heuristic algorithms (fixed-point from penalized KKT and staged ADAM) are proposed for symmetric multi-type orthogonal NMF tri-factorization and evaluated on synthetic noisy data and citation networks for recovery and downstream tasks.
Functional Clustering of Survival Data via Smoothed Log-Hazard Trajectories: A Risk-Dynamics Perspective stat.ME · 2026-05-31 · unverdicted · none · ref 15
A new functional clustering framework for survival data that smooths log-hazard trajectories with B-splines, applies FPCA, and clusters on the scores to group by temporal risk dynamics.
Density Evolution: A Multiscale View of Density Estimation math.ST · 2026-05-29 · unverdicted · none · ref 102
A review reframing density estimation as 'density evolution' across scales, linking kernel smoothing to heat flow, mixtures to compression, and topology to level sets, while stating three structural results on modes, Gaussian semigroups, and log-concavity.
SmartIterator: Visual Analytics Workflows for Supervising Unsupervised Data Grouping cs.HC · 2026-05-27 · unverdicted · none · ref 47
SmartIterator supplies method-specific workflows and coordinated visualizations to systematically supervise and interpret parameter sweeps of unsupervised data grouping techniques.
COPRA: Conditional Parameter Adaptation with Reinforcement Learning for Video Anomaly Detection cs.CV · 2026-05-14 · unverdicted · none · ref 61
COPRA introduces conditional parameter adaptation via RL to dynamically tune frozen VLMs for video anomaly detection, outperforming static methods in in-domain and cross-domain settings while generalizing to other video tasks.
On Similarity of Computational Kernels in our Codes and Proxies cs.DC · 2026-05-07 · unverdicted · none · ref 32
New hardware-usage-based similarity metrics can identify matching computational kernels between proxy applications and performance suites on both CPU and GPU systems.
AI-Derived Reproductive Phenotypes and Explainable ML for Concurrent Early Multimorbidity in U.S. Women: NHANES 2017-March 2020 q-bio.OT · 2026-04-24 · unverdicted · none · ref 19
PCA and k-means on NHANES data identified four reproductive phenotypes in U.S. women aged 20-44, with one fragile subgroup showing 77.5% early multimorbidity prevalence; XGBoost improved discrimination over logistic regression but had worse calibration.
Leveraging Weighted Syntactic and Semantic Context Assessment Summary (wSSAS) Towards Text Categorization Using LLMs cs.CL · 2026-04-13 · unverdicted · none · ref 44
wSSAS is a two-phase deterministic framework that uses hierarchical text organization and SNR-based feature prioritization to improve clustering integrity, categorization accuracy, and reproducibility when applying LLMs to large review datasets.
SCULPT: An Interactive Machine Learning Platform for Analyzing Multi-Particle Coincidence Data from Cold Target Recoil Ion Momentum Spectroscopy physics.atm-clus · 2025-11-14 · unverdicted · none · ref 42
SCULPT is an interactive machine learning platform combining UMAP, clustering, and adaptive confidence scoring for analyzing COLTRIMS multi-particle coincidence data.
Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models cs.CL · 2025-09-24 · unverdicted · none · ref 49
Fine-tuning on annotated English and Japanese dialogues improves clustering of backchannels and fillers and makes generated utterances closer to human ones.
Robustness Analysis of USmorph: II. Optimizing Feature Extraction, Dimensionality Reduction, and Clustering for Unsupervised Galaxy Morphology Classification astro-ph.GA · 2026-05-20 · unverdicted · none · ref 131
Optimizes ImageNet-pretrained AlexNet, UMAP, and a bagging multi-cluster voting scheme with K-means, Birch and Agg for unsupervised galaxy morphology classification, reporting improved stability and consistency with galaxy evolution expectations.
An Explainable Unsupervised-to-Supervised Machine Learning Framework for Dietary Pattern Discovery Using UK National Dietary Survey Data q-bio.QM · 2026-05-07 · unverdicted · none · ref 18
An unsupervised-to-supervised ML pipeline on UK NDNS data discovers four dietary patterns, reproduces them with macro-F1 0.963 using a surrogate classifier, and interprets them via SHAP for potential clinical use.
Linking the "inner" and "outer" self to mental health and brain networks physics.soc-ph · 2026-06-26 · unverdicted · none · ref 35
HCP data analysis clusters individuals by social profiles into two groups where the more socially beneficial cluster scores higher on positive mental health measures and shows lower interconnectivity especially in the default mode network.
Robust discriminant analysis stat.ME · 2024-08-28 · unverdicted · none · ref 23
A review paper that identifies the outlier sensitivity of classical discriminant analysis and summarizes robust versions based on resistant location and scatter estimators plus diagnostic graphics.
