Arsalan Mousavian

Address:

NVIDIA
4545 Roosevelt Way, 6th Floor
Seattle, WA, 98105

Email:

amousavian AT nvidia DOT com


2004 - 2010	2010 - 2013	June-Aug 2015	June-Aug 2016	June-Aug 2017	Aug 2017 - May 2018	2013 - 2018	2018 - now

I am a senior research scientist at NVIDIA Seattle Robotics Lab. I am interested in using computer vision and 3D vision for robotics tasks such as object manipulation. At the moment, I am mainly working on model-free object manipulation in unconstrained environments. In addition, I am interested in 6D pose estimation of objects, instance segmentation, and recognition from RGB-D images. Prior to NVIDIA, I finished my PhD in Computer Science department at George Mason University where I was working under the supervision of Prof. Jana Kosecka. During my PhD, I worked on a variety of computer vision problems for robot perception such as: vision based navigation, object pose estimation, object detection, semantic segmentation and image based localization. I was fortunate to work with amazing collaborators in different research groups in industry. I did internships at Google Brain Robotics, Zoox, and Google StreetView during my PhD. Prior to my PhD, I got my masters in AI and Robotics from University of Tehran.

News:

March 2022: 1 paper on zero-shot object rearrangement accepted to CVPR 2022.
September 2021: 3 papers accepted to CoRL.
July 2021: Got promoted to Senior Robotics Research Scientist.
June 2021: Our paper on Reactive Human-to-Robot Handovers of Arbitrary Objects won the best paper award on Human Robot Interaction at ICRA 2021.
June 2021: 1 paper accepted to IROS 2021.
May 2021: 1 paper accepted to RSS 2021.
March 2021: 1 paper accepted to CVPR 2021.
March 2021: 5 papers accepted to ICRA 2021.
September 2020: Our paper on instance sementation of unknown objects is accepted to CoRL 2020.
May 2020: Our paper on interpreting signals from touch sensors is accepted to RSS 2020.
May 2020: Our paper on grasping unkonwn objects in clutter is nominated for Best Robot Manipulation and Student Paper award.
February 2020: Our paper on zero-shot pose estimation is accepted to CVPR 2020.
January 2020: 2 papers accepted to ICRA 2020.
October 2019: got promoted to Senior Robotics Research Scientist.
September 2019: Our paper on evaluating different grasp sampling schemes is accepted to ISRR 2019.
September 2019: Our paper on instance segmentation for unknown object got accepted to CoRL 2019.
July 2019: Our paper on 6-DOF grasping of unknown objects got accepted to ICCV 2019.
May 2019: Our paper got accepted to RSS 2019.
January 2019: Our paper got accepted to ICRA 2019.
August 2018: Joined NVIDIA Robotics lab as Research Scientist.
June 2018: I defended my dissertation and graduated from PhD program.
May 2018: Received outstanding graduate student award in CS department.
I will be joining Google Brain team as visiting researcher for academic year of 2017.
April 2017: Our paper got accepted to RSS 2017 conference.
February 2017: Our paper is covered by IEEE Spectrum and Forbes.
February 2017: Our paper got accepted to CVPR 2017 conference.
September 2016: Two papers got accepted in 3DV 2016 conference.
March 2016: Second place in Computer Science PhD Symposium of George Mason University.

Publications:

		ProgPrompt: Generating Situated Robot Task Plans using Large Language Models Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg arXiv [Paper] [Project Page] [bibtex]

		MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare Yann Labbe, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic Conference on Robot Learning (CoRL) 2022 [OpenReview] [bibtex]

		Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation Zoey Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox Conference on Robot Learning (CoRL) 2022 [Paper] [Project Page] [bibtex]

		Deep Learning Approaches to Grasp Synthesis: A Review Rhys Newbury, Morris Gu, Lachlan Chumbley, Arsalan Mousavian, Clemens Eppner, Jurgen Leitner, Jeannette Bohg, Antonio Morales, Tamim Asfour, Danica Kragic, Dieter Fox, Akansel Cosgun Fox arXiv [Paper] [bibtex]

		IFOR: Iterative Flow Minimization for Robotic Object Rearrangement Ankit Goyal, Arsalan Mousavian, Chris Paxton, Yu-Wei Chao, Brian Okorn, Jia Deng, Dieter Fox Computer Vision and Pattern Recognition (CVPR) 2022 [Paper] [Project Page] [bibtex]

		RICE: Refining Instance Masks in Cluttered Environments with Graph Neural Networks Christopher Xie, Arsalan Mousavian, Yu Xiang, Dieter Fox Conference on Robot Learning (CoRL) 2021 [Paper] [Video][Project Page] [bibtex]

		STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots Conference on Robot Learning (CoRL) 2021 [Paper] [Project Page] [bibtex]

		Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds Lirui Wang, Yu Xiang, Wei Yang, Arsalan Mousavian, Dieter Fox Conference on Robot Learning (CoRL) 2021 [Paper] [Project Page] [bibtex]

		NeRP: Neural Rearrangement Planning for Unknown Objects Ahmed Qureshi, Arsalan Mousavian, Chris Paxton, Michael Yip, Dieter Fox Robotics: Science and Systems, (RSS), Virtual, 2021 [Paper] [Video] [bibtex]

		Reactive Long Horizon Task Execution via Visual Skill and Precondition Models Shohin Mukherjee, Chris Paxton, Arsalan Mousavian, Adam Fishman, Maxim Likhachev, Dieter Fox International Conference on Intelligent Robots and Systems (IROS) 2020 [Paper] [Video] [bibtex]

		RGB-D Local Implicit Function for Depth Completion of Transparent Objects Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021 [Paper] [Project Page] [bibtex]

		Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes Martin Sundermeyer, Arsalan Mousavian, Rudolph Triebel, Dieter Fox International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021 [Paper] [Video] [Project Page] [bibtex]

		Object Rearrangement Using Learned Implicit Collision Functions Michael Danielczuk, Arsalan Mousavian, Clemens Eppner, Dieter Fox International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021 [Paper] [Video] [Project Page] [bibtex]

Reactive Human-to-Robot Handovers of Arbitrary Objects
Wei Yang, Chris Paxton, Arsalan Mousavian, Yu-Wei Chao, Maya Cakmak, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [Video] [Project Page] [bibtex]

Best Human Robot Interaction Paper

		ACRONYM: A Large-Scale Grasp Dataset Based on Simulation Clemens Eppner, Arsalan Mousavian, Dieter Fox International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021 [Paper] [Dataset] [bibtex]

		Sim-to-Real for Robotic Tactile Sensing via Physics-Based Simulation and Learned Latent Projections Yashraj Narang, Balakumar Sunarlingam, Miles Macklin, Arsalan Mousavian, Dieter Fox International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021 [Paper] [bibtex]

		Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation Yu Xiang, Christopher Xie, Arsalan Mousavian, Dieter Fox Conference on Robot Learning (CoRL) Virtual, 2020 [Paper] [Video] [Code] [bibtex]

		Interpreting and Predicting Tactile Signals via a Physics-Based and Data-Driven Framework Yashraj Narang, Karl Van Wyk, Arsalan Mousavian, Dieter Fox Robotics: Science and Systems Conference (RSS), Corvalis OR, 2020 [Paper] [Video] [Project Page] [bibtex]

		LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, 2020 [Paper] [Video] [bibtex]

6-DOF Grasping for Target-driven Object Manipulation in Clutter
Adithya Murali, Arsalan Mousavian, Clemens Eppner, Dieter Fox
International Conference on Robotics and Automation (ICRA), Paris, France, 2020

Best Robot Manipulation Paper Finalist
Best Student Paper Finalist

[Paper] [Video] [bibtex] [blog post]

		Self-supervised 6D Object Pose Estimation for Robot Manipulation Xinke Deng, Yu Xiang, Arsalan Mousavian, Clemens Eppner, Timothy Bretl, Dieter Fox International Conference on Robotics and Automation (ICRA), Paris, France, 2020 [Paper] [Video] [bibtex]

		A Billion Ways to Grasps - An Evaluation of Grasp Sampling Schemes on a Dense, Physics-based Grasp Data Set Clemens Eppner, Arsalan Mousavian, Dieter Fox International Symposium on Robotics Research (ISRR), Hanoi, Vietnam, 2019 [Paper] [Dataset] [bibtex]

		The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation Christopher Xie, Yu Xiang, Arsalan Mousavian, Dieter Fox Conference on Robot Learning (CoRL), Osaka, Japan, 2019 [Paper] [Video] [Projet Page] [bibtex]

		6-DOF GraspNet: Variational Grasp Generation for Object Manipulation Arsalan Mousavian, Clemens Eppner, Dieter Fox International Conference on Computer Vision (ICCV), Seoul, South Korea, 2019 [Paper] [Video] [bibtex] [Code + Data] [Blog Post]

		PoseRBPF: A Rao-Blackwellized Particle Filter for 6D Object Pose Tracking Xinke Deng, Arsalan Mousavian, Yu Xiang, Fei Xia, Tim Bretl, Dieter Fox Robotics: Science and Systems Conference (RSS) Freiburg, Germany, 2019 [Paper] [Video] [TechXplore Article] [bibtex]

		Visual Represenatations for Semantic Target Driven Navigation Arsalan Mousavian, Alexander Toshev, Marek Fiser, Jana Kosecka, James Davidson International Conference on Robotics and Automation (ICRA) Montreal, Canada, 2019 [Paper] [bibtex]

		Synthesizing Training Data for Object Detection in Indoor Scenes Georgios Georgakis, Arsalan Mousavian, Alexander C. Berg, Jana Kosecka Robotics: Science and Systems Conference (RSS), Cambridge MA, 2017 [Paper] [bibtex]

		3D Bounding Box Estimation Using Deep Learning and Geometry Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Kosecka IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu HI, 2017 [Paper] [Supplementary Material] [bibtex] [IEEE Spectrum][Forbes] Estimated 3D Boxes on the split of 3DVP: [Download Link]

		Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks Arsalan Mousavian, Hamed Pirsiavash, Jana Kosecka International Conference on 3DVision(3DV), Stanford CA, 2016 [Paper] [bibtex]

		Multiview RGB-D Dataset for Object Instance Detection Georgios Georgakis, Md Alimoor Reza, Arsalan Mousavian, Phi-Hung Le, Jana Kosecka International Conference on 3DVision(3DV), Stanford CA, 2016 [Paper] [Dataset Link] [bibtex]

		Semantic Image Based Geolocation Given a Map Arsalan Mousavian, Jana Kosecka arXiv 2016 [Paper]

		Deep Convolutional Features for Image Based Retrieval and Scene Categorization Arsalan Mousavian, Jana Kosecka arXiv 2015 [Paper] [bibtex]

		Semantically Guided Location Recognition for Outdoors Scenes Arsalan Mousavian, Jana Kosecka, Jyh-Ming Lien. International Conference on Automation and Robotics (ICRA), Seattle WA, 2015 [Paper] [bibtex]

		Semantically Aware Bag-of-words for Localization Arsalan Mousavian, Jana Kosecka. Computer Vision and Pattern Recognition 2015 workshop on Semantic for Visual Reconstruction, Localization, and Mapping (CVPRW), Boston, MA, 2015 [Paper] [bibtex]