pith. sign in

hub

International conference on machine learning , pages=

87 Pith papers cite this work. Polarity classification is still indexing.

87 Pith papers citing it

hub tools

citation-role summary

background 2 method 1

citation-polarity summary

claims ledger

  • method For augmented samples with positive advantage ( ˆAi,t >0 ), If ϵexp → ∞ is permitted, monotonic policy improvement cannot be guaranteed. Proof.When the advantage is positive, the objective seeks to increase ρi,t(θ)>1 . For Standard Samples (mi = 0), the upper bound is1 +ϵ std. For Augmented Samples (mi = 1), the upper bound is relaxed to1 +ϵ exp (whereϵ exp > ϵ std): LCLIP aug (θ) = min(ρi,t(θ) ˆAi,t,(1 +ϵ exp) ˆAi,t).(21) By relaxing the upper bound, we allow the policy to take larger gradient
  • background ¯Qmin(s,a) = min j∈{1,...,K} ¯Q¯θj(s,a),(10) which yields the backup operatorB π ¯Qmin. h-step (chunk) Bellman regression.Each critic is trained by regressing to the shared target: LTD(θi) =E (st,at,r(h) t ,st+h)∼D h Qθi(st,a t)− B π ¯Qmin(st,a t) 2i .(11) The correspondingh-step Bellman backup is Bπ ¯Qmin(st,a t) =r (h) t +γ h Ea′∼π(·|st+h)  ¯Qmin(st+h,a ′)  ,(12) where theh-step return is r(h) t = h−1X i=0 γi rt+i.(13) Cal-QL calibration regularizer.To enable a smooth transition from offlin

co-cited works

representative citing papers

A Unified Geometric Framework for Weighted Contrastive Learning

cs.LG · 2026-05-13 · unverdicted · novelty 8.0

Weighted InfoNCE objectives realize specific target geometries in embedding space, with SupCon producing size-dependent inter-class similarities under imbalance while Soft SupCon and certain continuous variants preserve regular simplex or unique optima.

Functionalization via Structure Completion and Motion Rectification

cs.CV · 2026-05-18 · unverdicted · novelty 7.0

Object functionalization is cast as neural graph completion over a functional graph of parts, contacts, and motions, followed by geometry realization that also rectifies erroneous motions, demonstrated on furniture with a new paired dataset.

R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow

cs.CV · 2026-05-13 · unverdicted · novelty 7.0 · 2 refs

R-DMesh generates high-fidelity 4D meshes aligned to video by disentangling base mesh, motion, and a learned rectification jump offset inside a VAE, then using Triflow Attention and rectified-flow diffusion.

ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

cs.CR · 2026-04-21 · unverdicted · novelty 7.0

ProjLens shows that backdoor parameters in MLLMs are encoded in low-rank subspaces of the projector and that embeddings shift toward the target direction with magnitude linear in input norm, activating only on poisoned samples.

Long-Text-to-Image Generation via Compositional Prompt Decomposition

cs.CV · 2026-04-20 · unverdicted · novelty 7.0

PRISM lets pre-trained text-to-image models handle long prompts by breaking them into compositional parts, predicting noise separately, and merging outputs via energy-based conjunction, matching fine-tuned models while generalizing better to prompts over 500 tokens.

Label-Efficient Dataset Pruning via Semi-Supervised Pseudo-Labeling

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

SemiPrune uses a small labeled subset and semi-supervised pseudo-labeling to enable supervised dataset pruning methods, achieving state-of-the-art results on domain-specific, image-corrupted, and long-tailed datasets.

Uncovering the Latent Potential of Deep Intermediate Representations

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Introduces LOES, a constructive spectral method to select task-discriminative subspaces from intermediate layer embeddings, and GeoReg for enforcing simplicial class geometry during fine-tuning, with reported gains increasing with model depth across modalities.

citing papers explorer

Showing 50 of 87 citing papers.