MinNav: Minimalist Navigation Using Optical Flow For Active Tiny Aerial Robots

Aniket Patil; Mandeep Singh; Nitin J. Sanket; Uday Girish Maradana

arxiv: 2606.07813 · v1 · pith:QH2HPMNTnew · submitted 2026-06-05 · 💻 cs.RO · cs.CV

MinNav: Minimalist Navigation Using Optical Flow For Active Tiny Aerial Robots

Aniket Patil , Mandeep Singh , Uday Girish Maradana , Nitin J. Sanket This is my paper

Pith reviewed 2026-06-27 21:29 UTC · model grok-4.3

classification 💻 cs.RO cs.CV

keywords optical flowmonocular cameraaerial robot navigationobstacle avoidancegap navigationactive explorationtiny drones

0 comments

The pith

Optical flow from a monocular camera enables navigation through unknown obstacles and gaps for tiny aerial robots.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

MinNav uses optical flow and uncertainty estimates from a single camera, combined with the robot's active exploratory movements, to detect and avoid static and dynamic obstacles while passing through gaps of unknown shapes. This approach requires no prior knowledge of the environment or additional sensors like depth cameras. The system achieves a 70% success rate in real-world experiments across various settings and runs efficiently on tiny robots with much lower computation than depth-based alternatives.

Core claim

The central discovery is that optical flow computed from monocular images, along with its uncertainty, when used with active exploratory motion, supplies enough information to reliably navigate scenes containing static and dynamic obstacles and unknown-shaped gaps without any prior scene knowledge.

What carries the argument

Optical flow and its uncertainty estimates from monocular images, augmented by active exploratory motion of the robot to gather environmental information.

If this is right

Navigation becomes possible on tiny aerial robots using only a monocular camera without depth sensors.
The method handles both static and dynamic obstacles as well as gaps of arbitrary unknown shapes.
Overall success rate reaches 70% in diverse real-world environments.
Computation is reduced by factors of magnitude compared to depth-based methods while maintaining on-par performance.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Such minimalist approaches could lower the cost and size barriers for autonomous aerial navigation in constrained platforms.
The reliance on active motion suggests that passive vision alone may be insufficient for these tasks.

Load-bearing premise

That optical flow from monocular images and its uncertainty, when paired with active exploratory motion, provides reliable information to detect obstacles and navigate unknown gaps without scene priors or extra sensors.

What would settle it

Consistent failure of the robot to navigate successfully through a series of tests involving fast dynamic obstacles or highly irregular gap shapes using only the described optical flow method.

Figures

Figures reproduced from arXiv: 2606.07813 by Aniket Patil, Mandeep Singh, Nitin J. Sanket, Uday Girish Maradana.

**Figure 1.** Figure 1: MinNav handles navigation in unstructured and wild scenes including static obstacles, dynamic obstacles and unknown shaped gaps without any prior knowledge of location or scene ordering. All this is performed using a monocular camera and an active strategy using only onboard computation and sensing. All the images in this paper are best viewed in color on a computer screen at 200% zoom. Abstract— Navigatio… view at source ↗

**Figure 2.** Figure 2: Lightweight multi-scale pyramidal neural network architecture used to predict optical flow [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: A toy example showing how our control strategy will “wander” [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Sequence of images of quadrotor navigating through different scenarios. [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 5.** Figure 5: Comparison of various navigation methods: [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

read the original abstract

Navigation using a monocular camera is pivotal for autonomous operation on tiny aerial robots due to their perfect balance of versatility, cost and accuracy. In this paper, we introduce MinNav, a navigation stack based on optical flow and its uncertainty to fly through a scene with static and dynamic obstacles and unknown-shaped gaps without any prior knowledge of the scene components and/or their locations/ordering. We further improve success rate by using the activeness of the robot to move around in an exploratory way to find obstacles and navigate. We successfully evaluate and demonstrate the proposed approach in many real-world experiments in various environments with static and dynamic obstacles and unknown-shaped gaps with an overall success rate of 70%. To the best of our knowledge, this is the first solution to tackle all the aforementioned navigation cases without prior knowledge using a monocular camera. Our approach is on par in performance with depth based methods with factors of magnitude less computation required and can readily run onboard tiny aerial robots. The accompanying video, supplementary material, code and dataset can be found at https://pear.wpi.edu/research/minnav.html

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

MinNav shows a working monocular optical-flow system for tiny drones that handles obstacles and gaps via active motion, but the 70% success claim rests on thin evidence with no baselines or failure analysis.

read the letter

The paper delivers a complete navigation stack for size- and power-limited aerial robots that uses only monocular optical flow, its uncertainty, and deliberate exploratory motion to manage static obstacles, moving obstacles, and unknown-shaped gaps with no scene priors. They report real hardware runs on tiny platforms and claim this is the first method to cover the full set of cases.

The integration of uncertainty estimates with active viewpoint changes is the concrete step forward. Releasing code, dataset, and video lets others reproduce the onboard implementation, which runs at low compute compared with depth pipelines. The real-world trials across varied environments give some practical grounding that the approach can fly without extra sensors.

The evaluation stays light. The abstract states a 70% overall success rate but supplies no per-scenario breakdown, no statistical measures, and no head-to-head numbers against other monocular or depth methods. The stress-test point about low-texture regions holds weight here: standard flow estimators lose signal where gradients are weak, and active motion alone does not add texture. If the test scenes happened to be well-textured, the reported rate would not generalize to the broader problem the authors state.

This work is aimed at researchers building minimalist vision stacks for small robots. A reader who needs an onboard monocular baseline or wants to inspect the released implementation will find usable material. The hardware results are concrete enough that a serious editor should send it for peer review, though the authors will need to add quantitative comparisons and texture-related failure cases to strengthen the central claim.

Referee Report

2 major / 2 minor

Summary. The paper introduces MinNav, a navigation stack for tiny aerial robots that computes optical flow and its uncertainty from monocular images, augments this with active exploratory robot motion, and uses the combination to navigate scenes containing static and dynamic obstacles as well as gaps of unknown shape, all without any prior scene knowledge. The authors report an overall 70% success rate across real-world trials in varied environments and position the work as the first monocular-camera solution addressing all listed cases; they further claim performance on par with depth-based methods at orders-of-magnitude lower compute, enabling onboard execution. Code, dataset, video, and supplementary material are provided.

Significance. If the central claims hold after addressing validation gaps, the work would be significant for resource-constrained aerial robotics by showing that optical-flow-based sensing plus active motion can substitute for heavier depth sensors in complex navigation tasks. The explicit release of code, dataset, and experimental material is a clear strength that supports reproducibility and community verification.

major comments (2)

[Abstract / Results] Abstract and experimental evaluation: the reported 70% success rate is stated without the number of trials, per-scenario breakdown (static obstacles, dynamic obstacles, gap navigation), statistical measures, or quantitative comparisons to any baseline (depth-based or other monocular methods). This information is load-bearing for assessing whether the results support the reliability claims.
[Methods] Methods / approach description: the central assumption that optical flow uncertainty plus active exploratory motion suffices to detect obstacles and unknown-shaped gaps even when local image texture is low is not accompanied by an explicit mechanism for handling high-uncertainty regions or by experiments in deliberately low-texture environments. Standard estimators (e.g., Lucas-Kanade) produce no usable signal in such regions, directly affecting the “no prior knowledge” claim.

minor comments (2)

[Methods] Notation for optical-flow uncertainty is introduced without a clear equation reference or pseudocode showing how it is thresholded or fused with exploratory motion commands.
[Figures] Figure captions and axis labels in the experimental results could be expanded to indicate trial counts and success/failure criteria.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and indicate where revisions will be made to strengthen the manuscript.

read point-by-point responses

Referee: [Abstract / Results] Abstract and experimental evaluation: the reported 70% success rate is stated without the number of trials, per-scenario breakdown (static obstacles, dynamic obstacles, gap navigation), statistical measures, or quantitative comparisons to any baseline (depth-based or other monocular methods). This information is load-bearing for assessing whether the results support the reliability claims.

Authors: We agree that additional detail on the experimental validation would improve clarity and support for the claims. The current manuscript states the overall 70% success rate from real-world trials in varied environments but does not include the requested breakdown, trial counts, statistics, or baseline comparisons in the abstract or results summary. We will revise the abstract and results section to incorporate the total number of trials, per-scenario success rates, statistical measures, and quantitative comparisons to depth-based methods. revision: yes
Referee: [Methods] Methods / approach description: the central assumption that optical flow uncertainty plus active exploratory motion suffices to detect obstacles and unknown-shaped gaps even when local image texture is low is not accompanied by an explicit mechanism for handling high-uncertainty regions or by experiments in deliberately low-texture environments. Standard estimators (e.g., Lucas-Kanade) produce no usable signal in such regions, directly affecting the “no prior knowledge” claim.

Authors: The manuscript computes and uses optical flow uncertainty to identify unreliable regions, which is the explicit mechanism for handling high-uncertainty areas by de-emphasizing them and relying on active exploratory motion to obtain usable signals from other viewpoints. This supports the no-prior-knowledge claim by avoiding dependence on any single low-texture patch. However, dedicated experiments in deliberately low-texture environments are not included, which is a limitation. We will revise the methods section to describe the uncertainty handling more explicitly and add a limitations paragraph on low-texture cases. revision: partial

Circularity Check

0 steps flagged

No circularity: experimental system paper with no derivation chain

full rationale

The manuscript presents MinNav as an experimental navigation stack for tiny aerial robots that combines monocular optical flow, uncertainty estimates, and active exploratory motion. No equations, fitted parameters, predictions, or uniqueness theorems appear in the abstract or described claims. The central result is a reported 70% success rate across real-world trials; this is an empirical outcome, not a quantity derived from or forced by any self-referential input. No self-citation load-bearing steps, ansatz smuggling, or renaming of known results are identifiable. The work is therefore self-contained as a system demonstration.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Abstract-only review; the central assumption is the sufficiency of monocular optical flow for the stated navigation tasks. No free parameters or invented entities are mentioned.

axioms (1)

domain assumption Optical flow and its uncertainty from a monocular camera, augmented by active robot motion, provide sufficient information to detect and navigate static/dynamic obstacles and unknown gaps without prior scene knowledge.
This premise is required for the MinNav stack to function as described and is the load-bearing assumption extracted from the abstract.

pith-pipeline@v0.9.1-grok · 5731 in / 1326 out tokens · 31545 ms · 2026-06-27T21:29:09.825244+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

54 extracted references · 5 canonical work pages · 1 internal anchor

[1]

Design and analysis of an aerial pollination system for walnut trees.Biosystems Engineering, 225:83–98, 2023

Mozhdeh Mazinani, Payam Zarafshan, Mohammad Dehghani, Kourosh Vahdati, and Hamed Etezadi. Design and analysis of an aerial pollination system for walnut trees.Biosystems Engineering, 225:83–98, 2023

2023
[2]

PhD thesis, University of Maryland, College Park, 2021

Nitin Jagannatha Sanket.Active Vision Based Embodied-AI Design for Nano-UA V Autonomy. PhD thesis, University of Maryland, College Park, 2021

2021
[3]

Michael et al

N. Michael et al. Collaborative mapping of an earthquake-damaged building via ground and aerial robots.Journal of Field Robotics, 29(5):832–841, 2012

2012
[4]

Quadcloud: a rapid response force with quadrotor teams

Kartik Mohta, Matthew Turpin, Alex Kushleyev, Daniel Mellinger, Nathan Michael, and Vijay Kumar. Quadcloud: a rapid response force with quadrotor teams. InExperimental Robotics: The 14th International Symposium on Experimental Robotics, pages 577–590. Springer, 2016

2016
[5]

The current state and future outlook of rescue robotics.Journal of Field Robotics, 36(7):1171–1191, 2019

Jeffrey Delmerico, Stefano Mintchev, Alessandro Giusti, Boris Gromov, Kamilo Melo, Tomislav Horvat, Cesar Cadena, Marco Hutter, Auke Ijspeert, Dario Floreano, et al. The current state and future outlook of rescue robotics.Journal of Field Robotics, 36(7):1171–1191, 2019

2019
[6]

Özaslan et al

T. Özaslan et al. Inspection of penstocks and featureless tunnel-like environments using micro UA Vs. InField and Service Robotics, pages 123–136. Springer, 2015

2015
[7]

Edgeflownet: 100fps@ 1w dense optical flow for tiny mobile robots.IEEE Robotics and Automation Letters, 2024

Sai Ramana Kiran Pinnama Raju, Rishabh Singh, Manoj Velmurugan, and Nitin J Sanket. Edgeflownet: 100fps@ 1w dense optical flow for tiny mobile robots.IEEE Robotics and Automation Letters, 2024

2024
[8]

Prgflow: Unified swap-aware deep global optical flow for aerial robot navigation.Electronics Letters, 57(16):614–617, 2021

Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. Prgflow: Unified swap-aware deep global optical flow for aerial robot navigation.Electronics Letters, 57(16):614–617, 2021

2021
[9]

Ecological robotics: Controlling behavior with optical flow

Andrew P Duchon, William H Warren, and L Pack Kaelbling. Ecological robotics: Controlling behavior with optical flow. In Proceedings of the seventeenth annual conference of the Cognitive Science Society, volume 17, page 164. Psychology Press, 1995

1995
[10]

Optical flow based robot obstacle avoidance.International Journal of Advanced Robotic Systems, 4(1):2, 2007

Kahlouche Souhila and Achour Karim. Optical flow based robot obstacle avoidance.International Journal of Advanced Robotic Systems, 4(1):2, 2007

2007
[11]

Determining optical flow

Berthold KP Horn and Brian G Schunck. Determining optical flow. 1980

1980
[12]

Aloimonos et al

J. Aloimonos et al. Active vision.International journal of computer vision, 1(4):333–356, 1988

1988
[13]

Gapflyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, 2018

Nitin J Sanket, Chahat Deep Singh, Kanishka Ganguly, Cornelia Fermüller, and Yiannis Aloimonos. Gapflyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, 2018

2018
[14]

Ajna: Generalized deep uncertainty for minimal perception on parsimonious robots.Science Robotics, 8(81):eadd5139, 2023

Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. Ajna: Generalized deep uncertainty for minimal perception on parsimonious robots.Science Robotics, 8(81):eadd5139, 2023

2023
[15]

Nanoflownet: Real-time dense optical flow on a nano quadcopter

Rik J Bouwmeester, Federico Paredes-Vallés, and Guido CHE De Croon. Nanoflownet: Real-time dense optical flow on a nano quadcopter. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 1996–2003. IEEE, 2023

1996
[16]

Wind and obstacle motion affect honeybee flight strategies in cluttered environments.Journal of Experimental Biology, 223(14):jeb222471, 2020

Nicholas P Burnett, Marc A Badger, and Stacey A Combes. Wind and obstacle motion affect honeybee flight strategies in cluttered environments.Journal of Experimental Biology, 223(14):jeb222471, 2020

2020
[17]

Evdodgenet: Deep dynamic obstacle dodging with event cameras

Nitin J Sanket, Chethan M Parameshwara, Chahat Deep Singh, Ashwin V Kuruttukulam, Cornelia Fermüller, Davide Scaramuzza, and Yiannis Aloimonos. Evdodgenet: Deep dynamic obstacle dodging with event cameras. In2020 IEEE International Conference on Robotics and Automation (ICRA), pages 10651–10657. IEEE, 2020

2020
[18]

Neuromorphic control for optic-flow-based landing of mavs using the loihi processor

Julien Dupeyroux, Jesse J Hagenaars, Federico Paredes-Vallés, and Guido CHE de Croon. Neuromorphic control for optic-flow-based landing of mavs using the loihi processor. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 96–102. IEEE, 2021

2021
[19]

Vision based forward sensitive reactive control for a quadrotor vtol

Jean-Luc Stevens and Robert Mahony. Vision based forward sensitive reactive control for a quadrotor vtol. In2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5232–5238. IEEE, 2018

2018
[20]

Traditional and modern strategies for optical flow: an investigation.SN Applied Sciences, 3:1–14, 2021

Syed Tafseer Haider Shah and Xiang Xuezhi. Traditional and modern strategies for optical flow: an investigation.SN Applied Sciences, 3:1–14, 2021

2021
[21]

Online lidar and vision based ego-motion estimation and mapping.PhD thesis, 2017

Ji Zhang. Online lidar and vision based ego-motion estimation and mapping.PhD thesis, 2017

2017
[22]

Safety-assured high-speed navigation for mavs.Science Robotics, 10(98):eado6187, 2025

Yunfan Ren, Fangcheng Zhu, Guozheng Lu, Yixi Cai, Longji Yin, Fanze Kong, Jiarong Lin, Nan Chen, and Fu Zhang. Safety-assured high-speed navigation for mavs.Science Robotics, 10(98):eado6187, 2025

2025
[23]

Swarm of micro flying robots in the wild.Science Robotics, 7(66):eabm5954, 2022

Xin Zhou, Xiangyong Wen, Zhepei Wang, Yuman Gao, Haojia Li, Qianhao Wang, Tiankai Yang, Haojian Lu, Yanjun Cao, Chao Xu, et al. Swarm of micro flying robots in the wild.Science Robotics, 7(66):eabm5954, 2022

2022
[24]

Semantically-enhanced deep collision prediction for autonomous navigation using aerial robots

Mihir Kulkarni, Huan Nguyen, and Kostas Alexis. Semantically-enhanced deep collision prediction for autonomous navigation using aerial robots. In2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3056–3063. IEEE, 2023

2023
[25]

Flying on point clouds: Online trajectory generation and autonomous navigation for quadrotors in cluttered environments.Journal of Field Robotics, 36(4):710–733, 2019

Fei Gao, William Wu, Wenliang Gao, and Shaojie Shen. Flying on point clouds: Online trajectory generation and autonomous navigation for quadrotors in cluttered environments.Journal of Field Robotics, 36(4):710–733, 2019

2019
[26]

High speed navigation for quadrotors with limited onboard sensing

Sikang Liu, Michael Watterson, Sarah Tang, and Vijay Kumar. High speed navigation for quadrotors with limited onboard sensing. In2016 IEEE international conference on robotics and automation (ICRA), pages 1484–1491. IEEE, 2016

2016
[27]

Autonomous drone racing: A survey.arXiv e-prints, pp

Drew Hanover, Antonio Loquercio, Leonard Bauersfeld, Angel Romero, Robert Penicka, Yunlong Song, Giovanni Cioffi, Elia Kaufmann, and Davide Scaramuzza. Autonomous drone racing: A survey.arXiv e-prints, pp. arXiv–2301, 2023

2023
[28]

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning.Science Robotics, 8(82):eadg1462, 2023

Yunlong Song, Angel Romero, Matthias Müller, Vladlen Koltun, and Davide Scaramuzza. Reaching the limit in autonomous racing: Optimal control versus reinforcement learning.Science Robotics, 8(82):eadg1462, 2023

2023
[29]

Champion-level drone racing using deep reinforcement learning.Nature, 620(7976):982–987, 2023

Elia Kaufmann, Leonard Bauersfeld, Antonio Loquercio, Matthias Müller, Vladlen Koltun, and Davide Scaramuzza. Champion-level drone racing using deep reinforcement learning.Nature, 620(7976):982–987, 2023

2023
[30]

Learning to fly in seconds.arXiv preprint arXiv:2311.13081, 2023

Jonas Eschmann, Dario Albani, and Giuseppe Loianno. Learning to fly in seconds.arXiv preprint arXiv:2311.13081, 2023

work page arXiv 2023
[31]

End-to-end neural network based optimal quadcopter control.Robotics and Autonomous Systems, 172:104588, 2024

Robin Ferede, Guido de Croon, Christophe De Wagter, and Dario Izzo. End-to-end neural network based optimal quadcopter control.Robotics and Autonomous Systems, 172:104588, 2024

2024
[32]

Landing with time-to-contact and ventral optic flow estimates.Journal of Guidance, Control, and Dynamics, 35(4):1362–1367, 2012

Dario Izzo and Guido De Croon. Landing with time-to-contact and ventral optic flow estimates.Journal of Guidance, Control, and Dynamics, 35(4):1362–1367, 2012

2012
[33]

Sanket et al

N. Sanket et al. GapFlyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, Oct 2018

2018
[34]

Unsupervised deep epipolar flow for stationary or dynamic scenes

Yiran Zhong, Pan Ji, Jianyuan Wang, Yuchao Dai, and Hongdong Li. Unsupervised deep epipolar flow for stationary or dynamic scenes. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12095–12104, 2019

2019
[35]

Competitive collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation

Anurag Ranjan, Varun Jampani, Lukas Balles, Kihwan Kim, Deqing Sun, Jonas Wulff, and Michael J Black. Competitive collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12240–12249, 2019

2019
[36]

Revisiting active perception.Autonomous Robots, pages 1–20, 2017

Ruzena Bajcsy et al. Revisiting active perception.Autonomous Robots, pages 1–20, 2017

2017
[37]

Princeton University Press, 2023

Lars Chittka.The mind of a bee. Princeton University Press, 2023

2023
[38]

Gap perception in bumblebees.Journal of Experimental Biology, 222(2):jeb184135, 2019

Sridhar Ravi, Olivier Bertrand, Tim Siesenop, Lea-Sophie Manz, Charlotte Doussot, Alex Fisher, and Martin Egelhaaf. Gap perception in bumblebees.Journal of Experimental Biology, 222(2):jeb184135, 2019

2019
[39]

Evpropnet: Detecting drones by finding propellers for mid-air landing and following.arXiv preprint arXiv:2106.15045, 2021

Nitin J Sanket, Chahat Deep Singh, Chethan M Parameshwara, Cornelia Fermüller, Guido CHE de Croon, and Yiannis Aloimonos. Evpropnet: Detecting drones by finding propellers for mid-air landing and following.arXiv preprint arXiv:2106.15045, 2021

work page arXiv 2021
[40]

Nudgeseg: Zero-shot object segmentation by repeated physical interaction

Chahat Deep Singh, Nitin J Sanket, Chethan M Parameshwara, Cornelia Fermüller, and Yiannis Aloimonos. Nudgeseg: Zero-shot object segmentation by repeated physical interaction. In2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2714–2712. IEEE, 2021

2021
[41]

Optical flow estimation using a spatial pyramid network, 2016

2016
[42]

Dosovitskiy, P

A. Dosovitskiy, P. Fischer, E. Ilg, P. Häusser, C. Hazırba¸ s, V . Golkov, P. v.d. Smagt, D. Cremers, and T. Brox. Flownet: Learning optical flow with convolutional networks. InIEEE International Conference on Computer Vision (ICCV), 2015

2015
[43]

E. Ilg, T. Saikia, M. Keuper, and T. Brox. Occlusions, motion and depth boundaries with a generic network for disparity, optical flow or scene flow estimation. InEuropean Conference on Computer Vision (ECCV), 2018

2018
[44]

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

N. Mayer, E. Ilg, P. Häusser, P. Fischer, D. Cremers, A. Dosovitskiy, and T. Brox. A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. InIEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016. arXiv:1512.02134

work page internal anchor Pith review Pith/arXiv arXiv 2016
[45]

Bumblebees display characteristics of active vision during robust obstacle avoidance flight.Journal of Experimental Biology, 225(4):jeb243021, 2022

Sridhar Ravi, Tim Siesenop, Olivier J Bertrand, Liang Li, Charlotte Doussot, Alex Fisher, William H Warren, and Martin Egelhaaf. Bumblebees display characteristics of active vision during robust obstacle avoidance flight.Journal of Experimental Biology, 225(4):jeb243021, 2022

2022
[46]

Avoidbench: A high-fidelity vision-based obstacle avoidance benchmarking suite for multi-rotors.arXiv preprint arXiv:2301.07430, 2023

Hang Yu, Guido CH de Croon, and Christophe De Wagter. Avoidbench: A high-fidelity vision-based obstacle avoidance benchmarking suite for multi-rotors.arXiv preprint arXiv:2301.07430, 2023

work page arXiv 2023
[47]

Worldgen: A large scale generative simulator

Chahat Deep Singh, Riya Kumari, Cornelia Fermüller, Nitin J Sanket, and Yiannis Aloimonos. Worldgen: A large scale generative simulator. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 9147–9154. IEEE, 2023

2023
[48]

Autoflow: Learning a better training set for optical flow

Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T Freeman, and Ce Liu. Autoflow: Learning a better training set for optical flow. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10093–10102, 2021

2021
[49]

0-mms: Zero-shot multi-motion segmentation with a monocular event camera

Chethan M Parameshwara, Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. 0-mms: Zero-shot multi-motion segmentation with a monocular event camera. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 9594–9600. IEEE, 2021

2021
[50]

Morpheyes: Variable baseline stereo for quadrotor navigation

Nitin J Sanket, Chahat Deep Singh, Varun Asthana, Cornelia Fermüller, and Yiannis Aloimonos. Morpheyes: Variable baseline stereo for quadrotor navigation. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 413–419. IEEE, 2021

2021
[51]

Midas v3.1 – a model zoo for robust monocular relative depth estimation.arXiv preprint arXiv:2307.14460, 2023

Reiner Birkl, Diana Wofk, and Matthias Müller. Midas v3.1 – a model zoo for robust monocular relative depth estimation.arXiv preprint arXiv:2307.14460, 2023

work page arXiv 2023
[52]

Vision transformers for dense prediction.ICCV, 2021

René Ranftl, Alexey Bochkovskiy, and Vladlen Koltun. Vision transformers for dense prediction.ICCV, 2021

2021
[53]

Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 2022

René Ranftl, Katrin Lasinger, David Hafner, Konrad Schindler, and Vladlen Koltun. Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 2022

2022
[54]

Raft: Recurrent all-pairs field transforms for optical flow

Zachary Teed and Jia Deng. Raft: Recurrent all-pairs field transforms for optical flow. InComputer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16, pages 402–419. Springer, 2020

2020

[1] [1]

Design and analysis of an aerial pollination system for walnut trees.Biosystems Engineering, 225:83–98, 2023

Mozhdeh Mazinani, Payam Zarafshan, Mohammad Dehghani, Kourosh Vahdati, and Hamed Etezadi. Design and analysis of an aerial pollination system for walnut trees.Biosystems Engineering, 225:83–98, 2023

2023

[2] [2]

PhD thesis, University of Maryland, College Park, 2021

Nitin Jagannatha Sanket.Active Vision Based Embodied-AI Design for Nano-UA V Autonomy. PhD thesis, University of Maryland, College Park, 2021

2021

[3] [3]

Michael et al

N. Michael et al. Collaborative mapping of an earthquake-damaged building via ground and aerial robots.Journal of Field Robotics, 29(5):832–841, 2012

2012

[4] [4]

Quadcloud: a rapid response force with quadrotor teams

Kartik Mohta, Matthew Turpin, Alex Kushleyev, Daniel Mellinger, Nathan Michael, and Vijay Kumar. Quadcloud: a rapid response force with quadrotor teams. InExperimental Robotics: The 14th International Symposium on Experimental Robotics, pages 577–590. Springer, 2016

2016

[5] [5]

The current state and future outlook of rescue robotics.Journal of Field Robotics, 36(7):1171–1191, 2019

Jeffrey Delmerico, Stefano Mintchev, Alessandro Giusti, Boris Gromov, Kamilo Melo, Tomislav Horvat, Cesar Cadena, Marco Hutter, Auke Ijspeert, Dario Floreano, et al. The current state and future outlook of rescue robotics.Journal of Field Robotics, 36(7):1171–1191, 2019

2019

[6] [6]

Özaslan et al

T. Özaslan et al. Inspection of penstocks and featureless tunnel-like environments using micro UA Vs. InField and Service Robotics, pages 123–136. Springer, 2015

2015

[7] [7]

Edgeflownet: 100fps@ 1w dense optical flow for tiny mobile robots.IEEE Robotics and Automation Letters, 2024

Sai Ramana Kiran Pinnama Raju, Rishabh Singh, Manoj Velmurugan, and Nitin J Sanket. Edgeflownet: 100fps@ 1w dense optical flow for tiny mobile robots.IEEE Robotics and Automation Letters, 2024

2024

[8] [8]

Prgflow: Unified swap-aware deep global optical flow for aerial robot navigation.Electronics Letters, 57(16):614–617, 2021

Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. Prgflow: Unified swap-aware deep global optical flow for aerial robot navigation.Electronics Letters, 57(16):614–617, 2021

2021

[9] [9]

Ecological robotics: Controlling behavior with optical flow

Andrew P Duchon, William H Warren, and L Pack Kaelbling. Ecological robotics: Controlling behavior with optical flow. In Proceedings of the seventeenth annual conference of the Cognitive Science Society, volume 17, page 164. Psychology Press, 1995

1995

[10] [10]

Optical flow based robot obstacle avoidance.International Journal of Advanced Robotic Systems, 4(1):2, 2007

Kahlouche Souhila and Achour Karim. Optical flow based robot obstacle avoidance.International Journal of Advanced Robotic Systems, 4(1):2, 2007

2007

[11] [11]

Determining optical flow

Berthold KP Horn and Brian G Schunck. Determining optical flow. 1980

1980

[12] [12]

Aloimonos et al

J. Aloimonos et al. Active vision.International journal of computer vision, 1(4):333–356, 1988

1988

[13] [13]

Gapflyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, 2018

Nitin J Sanket, Chahat Deep Singh, Kanishka Ganguly, Cornelia Fermüller, and Yiannis Aloimonos. Gapflyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, 2018

2018

[14] [14]

Ajna: Generalized deep uncertainty for minimal perception on parsimonious robots.Science Robotics, 8(81):eadd5139, 2023

Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. Ajna: Generalized deep uncertainty for minimal perception on parsimonious robots.Science Robotics, 8(81):eadd5139, 2023

2023

[15] [15]

Nanoflownet: Real-time dense optical flow on a nano quadcopter

Rik J Bouwmeester, Federico Paredes-Vallés, and Guido CHE De Croon. Nanoflownet: Real-time dense optical flow on a nano quadcopter. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 1996–2003. IEEE, 2023

1996

[16] [16]

Wind and obstacle motion affect honeybee flight strategies in cluttered environments.Journal of Experimental Biology, 223(14):jeb222471, 2020

Nicholas P Burnett, Marc A Badger, and Stacey A Combes. Wind and obstacle motion affect honeybee flight strategies in cluttered environments.Journal of Experimental Biology, 223(14):jeb222471, 2020

2020

[17] [17]

Evdodgenet: Deep dynamic obstacle dodging with event cameras

Nitin J Sanket, Chethan M Parameshwara, Chahat Deep Singh, Ashwin V Kuruttukulam, Cornelia Fermüller, Davide Scaramuzza, and Yiannis Aloimonos. Evdodgenet: Deep dynamic obstacle dodging with event cameras. In2020 IEEE International Conference on Robotics and Automation (ICRA), pages 10651–10657. IEEE, 2020

2020

[18] [18]

Neuromorphic control for optic-flow-based landing of mavs using the loihi processor

Julien Dupeyroux, Jesse J Hagenaars, Federico Paredes-Vallés, and Guido CHE de Croon. Neuromorphic control for optic-flow-based landing of mavs using the loihi processor. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 96–102. IEEE, 2021

2021

[19] [19]

Vision based forward sensitive reactive control for a quadrotor vtol

Jean-Luc Stevens and Robert Mahony. Vision based forward sensitive reactive control for a quadrotor vtol. In2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5232–5238. IEEE, 2018

2018

[20] [20]

Traditional and modern strategies for optical flow: an investigation.SN Applied Sciences, 3:1–14, 2021

Syed Tafseer Haider Shah and Xiang Xuezhi. Traditional and modern strategies for optical flow: an investigation.SN Applied Sciences, 3:1–14, 2021

2021

[21] [21]

Online lidar and vision based ego-motion estimation and mapping.PhD thesis, 2017

Ji Zhang. Online lidar and vision based ego-motion estimation and mapping.PhD thesis, 2017

2017

[22] [22]

Safety-assured high-speed navigation for mavs.Science Robotics, 10(98):eado6187, 2025

Yunfan Ren, Fangcheng Zhu, Guozheng Lu, Yixi Cai, Longji Yin, Fanze Kong, Jiarong Lin, Nan Chen, and Fu Zhang. Safety-assured high-speed navigation for mavs.Science Robotics, 10(98):eado6187, 2025

2025

[23] [23]

Swarm of micro flying robots in the wild.Science Robotics, 7(66):eabm5954, 2022

Xin Zhou, Xiangyong Wen, Zhepei Wang, Yuman Gao, Haojia Li, Qianhao Wang, Tiankai Yang, Haojian Lu, Yanjun Cao, Chao Xu, et al. Swarm of micro flying robots in the wild.Science Robotics, 7(66):eabm5954, 2022

2022

[24] [24]

Semantically-enhanced deep collision prediction for autonomous navigation using aerial robots

Mihir Kulkarni, Huan Nguyen, and Kostas Alexis. Semantically-enhanced deep collision prediction for autonomous navigation using aerial robots. In2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 3056–3063. IEEE, 2023

2023

[25] [25]

Flying on point clouds: Online trajectory generation and autonomous navigation for quadrotors in cluttered environments.Journal of Field Robotics, 36(4):710–733, 2019

Fei Gao, William Wu, Wenliang Gao, and Shaojie Shen. Flying on point clouds: Online trajectory generation and autonomous navigation for quadrotors in cluttered environments.Journal of Field Robotics, 36(4):710–733, 2019

2019

[26] [26]

High speed navigation for quadrotors with limited onboard sensing

Sikang Liu, Michael Watterson, Sarah Tang, and Vijay Kumar. High speed navigation for quadrotors with limited onboard sensing. In2016 IEEE international conference on robotics and automation (ICRA), pages 1484–1491. IEEE, 2016

2016

[27] [27]

Autonomous drone racing: A survey.arXiv e-prints, pp

Drew Hanover, Antonio Loquercio, Leonard Bauersfeld, Angel Romero, Robert Penicka, Yunlong Song, Giovanni Cioffi, Elia Kaufmann, and Davide Scaramuzza. Autonomous drone racing: A survey.arXiv e-prints, pp. arXiv–2301, 2023

2023

[28] [28]

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning.Science Robotics, 8(82):eadg1462, 2023

Yunlong Song, Angel Romero, Matthias Müller, Vladlen Koltun, and Davide Scaramuzza. Reaching the limit in autonomous racing: Optimal control versus reinforcement learning.Science Robotics, 8(82):eadg1462, 2023

2023

[29] [29]

Champion-level drone racing using deep reinforcement learning.Nature, 620(7976):982–987, 2023

Elia Kaufmann, Leonard Bauersfeld, Antonio Loquercio, Matthias Müller, Vladlen Koltun, and Davide Scaramuzza. Champion-level drone racing using deep reinforcement learning.Nature, 620(7976):982–987, 2023

2023

[30] [30]

Learning to fly in seconds.arXiv preprint arXiv:2311.13081, 2023

Jonas Eschmann, Dario Albani, and Giuseppe Loianno. Learning to fly in seconds.arXiv preprint arXiv:2311.13081, 2023

work page arXiv 2023

[31] [31]

End-to-end neural network based optimal quadcopter control.Robotics and Autonomous Systems, 172:104588, 2024

Robin Ferede, Guido de Croon, Christophe De Wagter, and Dario Izzo. End-to-end neural network based optimal quadcopter control.Robotics and Autonomous Systems, 172:104588, 2024

2024

[32] [32]

Landing with time-to-contact and ventral optic flow estimates.Journal of Guidance, Control, and Dynamics, 35(4):1362–1367, 2012

Dario Izzo and Guido De Croon. Landing with time-to-contact and ventral optic flow estimates.Journal of Guidance, Control, and Dynamics, 35(4):1362–1367, 2012

2012

[33] [33]

Sanket et al

N. Sanket et al. GapFlyt: Active vision based minimalist structure-less gap detection for quadrotor flight.IEEE Robotics and Automation Letters, 3(4):2799–2806, Oct 2018

2018

[34] [34]

Unsupervised deep epipolar flow for stationary or dynamic scenes

Yiran Zhong, Pan Ji, Jianyuan Wang, Yuchao Dai, and Hongdong Li. Unsupervised deep epipolar flow for stationary or dynamic scenes. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12095–12104, 2019

2019

[35] [35]

Competitive collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation

Anurag Ranjan, Varun Jampani, Lukas Balles, Kihwan Kim, Deqing Sun, Jonas Wulff, and Michael J Black. Competitive collaboration: Joint unsupervised learning of depth, camera motion, optical flow and motion segmentation. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12240–12249, 2019

2019

[36] [36]

Revisiting active perception.Autonomous Robots, pages 1–20, 2017

Ruzena Bajcsy et al. Revisiting active perception.Autonomous Robots, pages 1–20, 2017

2017

[37] [37]

Princeton University Press, 2023

Lars Chittka.The mind of a bee. Princeton University Press, 2023

2023

[38] [38]

Gap perception in bumblebees.Journal of Experimental Biology, 222(2):jeb184135, 2019

Sridhar Ravi, Olivier Bertrand, Tim Siesenop, Lea-Sophie Manz, Charlotte Doussot, Alex Fisher, and Martin Egelhaaf. Gap perception in bumblebees.Journal of Experimental Biology, 222(2):jeb184135, 2019

2019

[39] [39]

Evpropnet: Detecting drones by finding propellers for mid-air landing and following.arXiv preprint arXiv:2106.15045, 2021

Nitin J Sanket, Chahat Deep Singh, Chethan M Parameshwara, Cornelia Fermüller, Guido CHE de Croon, and Yiannis Aloimonos. Evpropnet: Detecting drones by finding propellers for mid-air landing and following.arXiv preprint arXiv:2106.15045, 2021

work page arXiv 2021

[40] [40]

Nudgeseg: Zero-shot object segmentation by repeated physical interaction

Chahat Deep Singh, Nitin J Sanket, Chethan M Parameshwara, Cornelia Fermüller, and Yiannis Aloimonos. Nudgeseg: Zero-shot object segmentation by repeated physical interaction. In2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 2714–2712. IEEE, 2021

2021

[41] [41]

Optical flow estimation using a spatial pyramid network, 2016

2016

[42] [42]

Dosovitskiy, P

A. Dosovitskiy, P. Fischer, E. Ilg, P. Häusser, C. Hazırba¸ s, V . Golkov, P. v.d. Smagt, D. Cremers, and T. Brox. Flownet: Learning optical flow with convolutional networks. InIEEE International Conference on Computer Vision (ICCV), 2015

2015

[43] [43]

E. Ilg, T. Saikia, M. Keuper, and T. Brox. Occlusions, motion and depth boundaries with a generic network for disparity, optical flow or scene flow estimation. InEuropean Conference on Computer Vision (ECCV), 2018

2018

[44] [44]

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

N. Mayer, E. Ilg, P. Häusser, P. Fischer, D. Cremers, A. Dosovitskiy, and T. Brox. A large dataset to train convolutional networks for disparity, optical flow, and scene flow estimation. InIEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016. arXiv:1512.02134

work page internal anchor Pith review Pith/arXiv arXiv 2016

[45] [45]

Bumblebees display characteristics of active vision during robust obstacle avoidance flight.Journal of Experimental Biology, 225(4):jeb243021, 2022

Sridhar Ravi, Tim Siesenop, Olivier J Bertrand, Liang Li, Charlotte Doussot, Alex Fisher, William H Warren, and Martin Egelhaaf. Bumblebees display characteristics of active vision during robust obstacle avoidance flight.Journal of Experimental Biology, 225(4):jeb243021, 2022

2022

[46] [46]

Avoidbench: A high-fidelity vision-based obstacle avoidance benchmarking suite for multi-rotors.arXiv preprint arXiv:2301.07430, 2023

Hang Yu, Guido CH de Croon, and Christophe De Wagter. Avoidbench: A high-fidelity vision-based obstacle avoidance benchmarking suite for multi-rotors.arXiv preprint arXiv:2301.07430, 2023

work page arXiv 2023

[47] [47]

Worldgen: A large scale generative simulator

Chahat Deep Singh, Riya Kumari, Cornelia Fermüller, Nitin J Sanket, and Yiannis Aloimonos. Worldgen: A large scale generative simulator. In2023 IEEE International Conference on Robotics and Automation (ICRA), pages 9147–9154. IEEE, 2023

2023

[48] [48]

Autoflow: Learning a better training set for optical flow

Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T Freeman, and Ce Liu. Autoflow: Learning a better training set for optical flow. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 10093–10102, 2021

2021

[49] [49]

0-mms: Zero-shot multi-motion segmentation with a monocular event camera

Chethan M Parameshwara, Nitin J Sanket, Chahat Deep Singh, Cornelia Fermüller, and Yiannis Aloimonos. 0-mms: Zero-shot multi-motion segmentation with a monocular event camera. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 9594–9600. IEEE, 2021

2021

[50] [50]

Morpheyes: Variable baseline stereo for quadrotor navigation

Nitin J Sanket, Chahat Deep Singh, Varun Asthana, Cornelia Fermüller, and Yiannis Aloimonos. Morpheyes: Variable baseline stereo for quadrotor navigation. In2021 IEEE International Conference on Robotics and Automation (ICRA), pages 413–419. IEEE, 2021

2021

[51] [51]

Midas v3.1 – a model zoo for robust monocular relative depth estimation.arXiv preprint arXiv:2307.14460, 2023

Reiner Birkl, Diana Wofk, and Matthias Müller. Midas v3.1 – a model zoo for robust monocular relative depth estimation.arXiv preprint arXiv:2307.14460, 2023

work page arXiv 2023

[52] [52]

Vision transformers for dense prediction.ICCV, 2021

René Ranftl, Alexey Bochkovskiy, and Vladlen Koltun. Vision transformers for dense prediction.ICCV, 2021

2021

[53] [53]

Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 2022

René Ranftl, Katrin Lasinger, David Hafner, Konrad Schindler, and Vladlen Koltun. Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer.IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 2022

2022

[54] [54]

Raft: Recurrent all-pairs field transforms for optical flow

Zachary Teed and Jia Deng. Raft: Recurrent all-pairs field transforms for optical flow. InComputer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part II 16, pages 402–419. Springer, 2020

2020