Across 252,000 paired trials on six LLMs, topical relevance and list position emerged as the strongest drivers of first citation in competitive RAG, with price information and recency providing consistent secondary gains.
super hub
Fitting linear mixed-effects models using lme4
25 Pith papers cite this work, alongside 72,323 external citations. Polarity classification is still indexing.
hub tools
citation-role summary
citation-polarity summary
authors
co-cited works
representative citing papers
Human face perception aligns with neural networks trained on inverse-generative and naturalistic discriminative tasks, as these best predict human dissimilarity judgments on controversial and random face pairs.
Crossed random-effects models on LLM word ratings show 16.9% variance from genuine stimulus-specific individuality, exceeding null models and forming coherent per-model fingerprints.
SCOOTER supplies best-practice guidelines, open tools, and a 3K-image benchmark with 34K+ human ratings showing that six tested unrestricted attacks produce images humans can detect as fake.
Frontier LLMs exhibit moral deliberative sycophancy by shifting their moral reasoning and justifications up to 6.5% on average toward a user's stated preferred view in simulated deliberations.
Robots detect underspecified reward features via demonstration variation and query targeted natural language explanations to improve reward recovery from imperfect demos.
A framework using generative AI to produce synthetic multilevel data for Monte Carlo simulations that evaluate the performance and parameter recovery of quantitative methods.
Vision language models applied to daily-life photos quantify visual environmental features that correlate with momentary affect and chronic stress, establishing a paradigm for visual exposomics.
Substantive LLM reframing boosts cross-partisan receptivity to news headlines without backfire, but models overestimate effect sizes and lack fidelity in modeling human psychological responses.
Fluent AI users adopt an active, iterative collaboration mode that produces more visible failures but better recovery and success on hard tasks, whereas novices experience more invisible failures from passive use.
LLM originality raters exhibit self-preference bias toward artificial responses that disappears after controlling for idea elaboration in the Alternate Uses Task.
Later LLM layers align better with human cognitive effort in syntactic ambiguity than early layers do, indicating dual processing modes and complementary benefits from multi-layer probability updates.
Reasoning models expend more tokens on association-incompatible tasks than compatible ones, indicating greater effort on counter-stereotypical information, except for Claude 3.7 Sonnet which shows the reverse pattern linked to its bias-focused reasoning.
A framework using language models to simulate non-existent experiments and derive novel testable hypotheses on dative verb acquisition and cross-structural generalization in children.
Introduces MASI to standardize net migration rates for age structure and applies a Bayesian hierarchical model to forecast adjusted total and age-sex specific migration rates through 2100, yielding narrower intervals and moderated decline projections.
A sensitivity analysis for MNAR data in multilevel models derives bias adjustments conditional on user-specified sensitivity parameters to produce bounds on parameters of interest under weaker assumptions than missing at random.
Proposes PcovRnnp method enabling simultaneous dimension reduction and regularized coefficient estimation via nuclear norm penalty in high-dimensional settings.
SPICE is a scalable Bayesian MCMC engine for explanatory IRT calibration on sparsely linked persons and items in large assessment banks.
ProfileGLMM is an R package extending Bayesian profile regression with GLMMs to support hierarchical data, random effects, and cluster-covariate interactions for continuous or binary outcomes.
Accented synthetic speech leads users to align their lexical choices with the perceived accent of the machine partner, mirroring human-human dialogue patterns.
LLMs function as accurate semantic processors for conditionals but do not replicate the pragmatic inferences that define human reasoning.
Low vision individuals with central visual field loss can use head-pointing to select 2° targets in VR, reaching near-control performance with sufficiently large pointer activation zones.
Open shelving in a virtual kitchen reduced task time and physical activity for older adults with and without MCI while increasing gaze entropy, with no change in subjective cognitive load or motivation.
Strategic selection of LLMs and reasoning effort optimizes automated scoring accuracy and cost more effectively than self-consistency ensembling.
citing papers explorer
-
Bringing Age Back In: Accounting for Population Age Distribution in Forecasting Migration
Introduces MASI to standardize net migration rates for age structure and applies a Bayesian hierarchical model to forecast adjusted total and age-sex specific migration rates through 2100, yielding narrower intervals and moderated decline projections.