InsightGen uses thematic clustering and graph neighborhood selection to generate diverse, relevant insights for open-ended document-grounded questions and releases the SCOpE-QA dataset of 3000 questions.
super hub
APACrefauthors \ 1987
26 Pith papers cite this work, alongside 16,729 external citations. Polarity classification is still indexing.
hub tools
citation-role summary
citation-polarity summary
authors
co-cited works
representative citing papers
DiffCodeGen clusters code candidates by behavioral similarity from fuzzing-synthesized inputs and selects the largest cluster's medoid, matching or exceeding prior test-time scaling methods with far less token and time cost.
Soft-MSM is a smooth, gradient-enabled version of the context-aware MSM distance for time series alignment that outperforms Soft-DTW alternatives in clustering and nearest-centroid classification.
Deep UCSL uses a contrastive EM loss on patient-control labels to isolate disease-driven subgroups in medical imaging by suppressing shared healthy variability.
UniTrans pretrains a bank of translator experts and learns combination coefficients from modality mappings in a scene-invariant latent space to enable zero-shot any-to-any feature translation for heterogeneous collaborative perception.
A quantum prototype learning scheme encodes class representatives as generative matrix product states and performs classification and clustering via geometric measures in Hilbert space, outperforming classical prototypes on Fashion-MNIST and ECG data.
A single commercial LLM can cheaply generate large populations of behaviorally equivalent yet structurally diverse malware payloads.
GCD-FGL mitigates neighborhood absorption and global semantic inconsistency in federated generalized category discovery, delivering +4.86 average HRScore gain over baselines on five graph datasets.
CADI quantifies the preservation of relative cluster angles in low-dimensional projections using internal angles from point triples.
A broad empirical benchmark shows how 15 existing test selection metrics perform for fault detection, performance estimation, and retraining under corrupted, adversarial, temporal, natural, and label shifts across image, text, and Android data.
Machine learning clustering of meteor observations produces a new hardness classification H_class that refines traditional Kb models using more parameters and reveals compositional structure in meteoroid populations.
ClusterRAG applies density-based clustering to user profiles for collaborative retrieval in personalized RAG and reports best performance on LaMP tasks by combining target and similar-user profiles.
AFGNN detects API misuses in Java code more effectively than prior methods by representing usage as graphs and clustering learned embeddings from self-supervised training.
OSS4SG projects retain contributors at 2.2X higher rates with 19.6% higher core status probability than conventional OSS, and a late-spike temporal pattern enables faster core achievement (21 weeks) than early intensive contributions.
LandSegmenter creates a task-specific foundation model for LULC mapping using weak labels from existing products, an RS adapter, text encoder, and confidence-guided fusion to achieve competitive zero-shot performance across modalities and taxonomies.
Presents the bixplot as an extension of the boxplot incorporating contiguous clustering to visualize bimodality and multimodality while displaying individual data points, with Python and R implementations.
A nonparametric framework detects repeated spatial patterns via constrained clustering followed by MMD-based reassignment and block permutation under stationarity and mixing conditions.
COPRA introduces conditional parameter adaptation via RL to dynamically tune frozen VLMs for video anomaly detection, outperforming static methods in in-domain and cross-domain settings while generalizing to other video tasks.
New hardware-usage-based similarity metrics can identify matching computational kernels between proxy applications and performance suites on both CPU and GPU systems.
PCA and k-means on NHANES data identified four reproductive phenotypes in U.S. women aged 20-44, with one fragile subgroup showing 77.5% early multimorbidity prevalence; XGBoost improved discrimination over logistic regression but had worse calibration.
wSSAS is a two-phase deterministic framework that uses hierarchical text organization and SNR-based feature prioritization to improve clustering integrity, categorization accuracy, and reproducibility when applying LLMs to large review datasets.
SCULPT is an interactive machine learning platform combining UMAP, clustering, and adaptive confidence scoring for analyzing COLTRIMS multi-particle coincidence data.
Fine-tuning on annotated English and Japanese dialogues improves clustering of backchannels and fillers and makes generated utterances closer to human ones.
Optimizes ImageNet-pretrained AlexNet, UMAP, and a bagging multi-cluster voting scheme with K-means, Birch and Agg for unsupervised galaxy morphology classification, reporting improved stability and consistency with galaxy evolution expectations.
citing papers explorer
-
The bixplot: A variation on the boxplot suited for bimodal data
Presents the bixplot as an extension of the boxplot incorporating contiguous clustering to visualize bimodality and multimodality while displaying individual data points, with Python and R implementations.
-
A Robust Nonparametric Framework for Detecting Repeated Spatial Patterns
A nonparametric framework detects repeated spatial patterns via constrained clustering followed by MMD-based reassignment and block permutation under stationarity and mixing conditions.
-
Robust discriminant analysis
A review paper that identifies the outlier sensitivity of classical discriminant analysis and summarizes robust versions based on resistant location and scatter estimators plus diagnostic graphics.