Mubarak Shah

Identifiers

  • name variant Mubarak Shah 0.60 · backfill

Papers (61)

  1. The Illusion of High Utility in Safety Alignment of Text-to-Image Diffusion Models cs.CV · 2026 · author #5
  2. ReasonCLIP-58M: Visually Grounded Commonsense Reasoning Supervision for CLIP cs.CV · 2026 · author #8
  3. CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models cs.AI · 2026 · author #6
  4. OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling cs.AI · 2026 · author #5
  5. Aero-World: Action-Conditioned Aerial Video Generation from Inertial Controls cs.CV · 2026 · author #4
  6. Attend Locally, Remember Linearly: Linear Attention as Cross-Frame Memory for Autoregressive Video Diffusion cs.CV · 2026 · author #2
  7. Weakly-Supervised Spatiotemporal Anomaly Detection cs.CV · 2026 · author #3
  8. Dystruct: Dynamically Structured Diffusion Language Model Decoding via Bayesian Inference cs.LG · 2026 · author #3
  9. VidTAG: Temporally Aligned Video to GPS Geolocalization with Denoising Sequence Prediction at a Global Scale cs.CV · 2026 · author #4
  10. ViLL-E: Video LLM Embeddings for Retrieval cs.CV · 2026 · author #6
  11. Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs cs.LG · 2026 · author #7
  12. MedRoute: RL-Based Dynamic Specialist Routing in Multi-Agent Medical Diagnosis eess.IV · 2026 · author #5
  13. Privacy Beyond Pixels: Latent Anonymization for Privacy-Preserving Video Understanding cs.CV · 2025 · author #3
  14. Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks cs.CV · 2025 · author #10
  15. HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation cs.CV · 2025 · author #8
  16. Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook cs.CV · 2024 · author #10
  17. Direct Preference Optimization for Primitive-Enabled Hierarchical RL: A Bilevel Approach cs.LG · 2024 · author #7
  18. An Efficient 3D CNN for Action/Object Segmentation in Video cs.CV · 2019 · author #4
  19. Deep Constrained Dominant Sets for Person Re-identification cs.CV · 2019 · author #3
  20. Crowd Transformer Network cs.CV · 2019 · author #2
  21. Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries cs.CV · 2018 · author #4
  22. Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision cs.CV · 2018 · author #4
  23. Time-Aware and View-Aware Video Rendering for Unsupervised Representation Learning cs.CV · 2018 · author #3
  24. Deep Affinity Network for Multiple Object Tracking cs.CV · 2018 · author #5
  25. Pay attention! - Robustifying a Deep Visuomotor Policy through Task-Focused Attention cs.RO · 2018 · author #3
  26. Enhancing camera surveillance using computer vision: a research note cs.CY · 2018 · author #2
  27. Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds cs.CV · 2018 · author #7
  28. Training Faster by Separating Modes of Variation in Batch-normalized Models cs.LG · 2018 · author #2
  29. VideoCapsuleNet: A Simplified Network for Action Detection cs.CV · 2018 · author #3
  30. Task-Agnostic Meta-Learning for Few-shot Learning cs.LG · 2018 · author #3
  31. Human Semantic Parsing for Person Re-identification cs.CV · 2018 · author #5
  32. Real-world Anomaly Detection in Surveillance Videos cs.CV · 2018 · author #3
  33. Visual Text Correction cs.CV · 2018 · author #2
  34. An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos cs.CV · 2017 · author #3
  35. Weighted Singular Value Thresholding and its Application to Background Estimation math.OC · 2017 · author #4
  36. Multi-Target Tracking in Multiple Non-Overlapping Cameras using Constrained Dominant Sets cs.CV · 2017 · author #5
  37. Improving Facial Attribute Prediction using Semantic Segmentation cs.CV · 2017 · author #3
  38. Video Fill In the Blank using LR/RL LSTMs with Spatial-Temporal Attentions cs.CV · 2017 · author #3
  39. ClusterNet: Detecting Small Objects in Large Scenes by Exploiting Spatio-Temporal Information cs.CV · 2017 · author #3
  40. Unsupervised Action Proposal Ranking through Proposal Recombination cs.CV · 2017 · author #3
  41. Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos cs.CV · 2017 · author #3
  42. Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network cs.CV · 2017 · author #3
  43. Cross-View Image Matching for Geo-localization in Urban Environments cs.CV · 2017 · author #3
  44. Large-scale Image Geo-Localization Using Dominant Sets cs.CV · 2017 · author #6
  45. Re-identification of Humans in Crowds using Personal, Social and Environmental Constraints cs.CV · 2016 · author #3
  46. Online Localization and Prediction of Actions and Interactions cs.CV · 2016 · author #3
  47. On Duality Of Multiple Target Tracking and Segmentation cs.CV · 2016 · author #2
  48. Video Fill in the Blank with Merging LSTMs cs.CV · 2016 · author #3
  49. Scene Labeling Through Knowledge-Based Rules Employing Constrained Integer Linear Programing cs.CV · 2016 · author #2
  50. Query-Focused Extractive Video Summarization cs.CV · 2016 · author #3
  51. Covariance of Motion and Appearance Features for Spatio Temporal Recognition Tasks cs.CV · 2016 · author #3
  52. Fast Zero-Shot Image Tagging cs.CV · 2016 · author #3
  53. Automatic Action Annotation in Weakly Labeled Videos cs.CV · 2016 · author #2
  54. A Framework for Human Pose Estimation in Videos cs.CV · 2016 · author #2
  55. The THUMOS Challenge on Action Recognition for Videos "in the Wild" cs.CV · 2016 · author #7
  56. Binary Quadratic Programing for Online Tracking of Hundreds of People in Extremely Crowded Scenes cs.CV · 2016 · author #2
  57. Autonomous navigation for low-altitude UAVs in urban areas cs.RO · 2016 · author #5
  58. Learning a Deep Model for Human Action Recognition from Novel Viewpoints cs.CV · 2016 · author #3
  59. Understanding Trajectory Behavior: A Motion Pattern Approach cs.CV · 2015 · author #5
  60. Face Verification Using Boosted Cross-Image Features cs.CV · 2013 · author #3
  61. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild cs.CV · 2012 · author #3

Mentions

  • 2607.00402 #5 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2411.19537 #10 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2606.26794 #8 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2505.11454 #8 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 1501.00614 #5 · backfill · confidence 0.70 Mubarak Shah
  • 1309.7434 #3 · backfill · confidence 0.70 Mubarak Shah
  • 2605.28792 #6 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2605.26322 #5 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2505.24876 #10 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 1212.0402 #3 · backfill · confidence 0.70 Mubarak Shah
  • 2605.19728 #4 · arxiv_oai · confidence 0.70 Mubarak Shah
  • 2605.16579 #2 · arxiv_oai · confidence 0.70 Mubarak Shah

Frequent Coauthors