Socratic Models compose zero-shot multimodal reasoning by prompting pretrained language and vision models to exchange information and enable new capabilities without finetuning.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
representative citing papers
A comprehensive benchmark study of offline imitation learning methods on multi-stage robot manipulation tasks identifies key sensitivities to algorithm design, data quality, and stopping criteria while releasing all datasets and code.
ThirdEye applies triplet convolutional neural networks directly to segmented iris images without normalization, reporting EERs of 1.32% on ND-0405, 9.20% on UbirisV2, and 0.59% on IITD, improving prior results on the constrained IITD set.
citing papers explorer
-
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Socratic Models compose zero-shot multimodal reasoning by prompting pretrained language and vision models to exchange information and enable new capabilities without finetuning.
-
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
A comprehensive benchmark study of offline imitation learning methods on multi-stage robot manipulation tasks identifies key sensitivities to algorithm design, data quality, and stopping criteria while releasing all datasets and code.
-
ThirdEye: Triplet Based Iris Recognition without Normalization
ThirdEye applies triplet convolutional neural networks directly to segmented iris images without normalization, reporting EERs of 1.32% on ND-0405, 9.20% on UbirisV2, and 0.59% on IITD, improving prior results on the constrained IITD set.