Arsalan Mousavian

Address: NVIDIA
4545 Roosevelt Way, 6th Floor
Seattle, WA, 98105
Email: amousavian AT nvidia DOT com

2004 - 2010 2010 - 2013 June-Aug 2015 June-Aug 2016 June-Aug 2017 Aug 2017 - May 2018 2013 - 2018 2018 - now


I am a senior research scientist at NVIDIA Seattle Robotics Lab. I am interested in using computer vision and 3D vision for robotics tasks such as object manipulation. At the moment, I am mainly working on model-free object manipulation in unconstrained environments. In addition, I am interested in 6D pose estimation of objects, instance segmentation, and recognition from RGB-D images. Prior to NVIDIA, I finished my PhD in Computer Science department at George Mason University where I was working under the supervision of Prof. Jana Kosecka. During my PhD, I worked on a variety of computer vision problems for robot perception such as: vision based navigation, object pose estimation, object detection, semantic segmentation and image based localization. I was fortunate to work with amazing collaborators in different research groups in industry. I did internships at Google Brain Robotics, Zoox, and Google StreetView during my PhD. Prior to my PhD, I got my masters in AI and Robotics from University of Tehran.

News:


Publications:


ProgPrompt: Generating Situated Robot Task Plans using Large Language Models
Ishika Singh, Valts Blukis, Arsalan Mousavian, Ankit Goyal, Danfei Xu, Jonathan Tremblay, Dieter Fox, Jesse Thomason, Animesh Garg
arXiv
[Paper] [Project Page] [bibtex]

 
     

MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare
Yann Labbe, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic
Conference on Robot Learning (CoRL) 2022
[OpenReview] [bibtex]

 
     

Learning Robust Real-World Dexterous Grasping Policies via Implicit Shape Augmentation
Zoey Chen, Karl Van Wyk, Yu-Wei Chao, Wei Yang, Arsalan Mousavian, Abhishek Gupta, Dieter Fox
Conference on Robot Learning (CoRL) 2022
[Paper] [Project Page] [bibtex]

 
     

Deep Learning Approaches to Grasp Synthesis: A Review
Rhys Newbury, Morris Gu, Lachlan Chumbley, Arsalan Mousavian, Clemens Eppner, Jurgen Leitner, Jeannette Bohg, Antonio Morales, Tamim Asfour, Danica Kragic, Dieter Fox, Akansel Cosgun Fox
arXiv
[Paper] [bibtex]

 
     

IFOR: Iterative Flow Minimization for Robotic Object Rearrangement
Ankit Goyal, Arsalan Mousavian, Chris Paxton, Yu-Wei Chao, Brian Okorn, Jia Deng, Dieter Fox
Computer Vision and Pattern Recognition (CVPR) 2022
[Paper] [Project Page] [bibtex]

 
     

RICE: Refining Instance Masks in Cluttered Environments with Graph Neural Networks
Christopher Xie, Arsalan Mousavian, Yu Xiang, Dieter Fox
Conference on Robot Learning (CoRL) 2021
[Paper] [Video][Project Page] [bibtex]

 
     

STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation
Mohak Bhardwaj, Balakumar Sundaralingam, Arsalan Mousavian, Nathan Ratliff, Dieter Fox, Fabio Ramos, Byron Boots
Conference on Robot Learning (CoRL) 2021
[Paper] [Project Page] [bibtex]

 
     

Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
Lirui Wang, Yu Xiang, Wei Yang, Arsalan Mousavian, Dieter Fox
Conference on Robot Learning (CoRL) 2021
[Paper] [Project Page] [bibtex]

 
     

NeRP: Neural Rearrangement Planning for Unknown Objects
Ahmed Qureshi, Arsalan Mousavian, Chris Paxton, Michael Yip, Dieter Fox
Robotics: Science and Systems, (RSS), Virtual, 2021
[Paper] [Video] [bibtex]

 
     

Reactive Long Horizon Task Execution via Visual Skill and Precondition Models
Shohin Mukherjee, Chris Paxton, Arsalan Mousavian, Adam Fishman, Maxim Likhachev, Dieter Fox
International Conference on Intelligent Robots and Systems (IROS) 2020
[Paper] [Video] [bibtex]

 
     

RGB-D Local Implicit Function for Depth Completion of Transparent Objects
Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox
Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 2021
[Paper] [Project Page] [bibtex]

 
     

Contact-GraspNet: Efficient 6-DoF Grasp Generation in Cluttered Scenes
Martin Sundermeyer, Arsalan Mousavian, Rudolph Triebel, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [Video] [Project Page] [bibtex]

 
     

Object Rearrangement Using Learned Implicit Collision Functions
Michael Danielczuk*, Arsalan Mousavian*, Clemens Eppner, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [Video] [Project Page] [bibtex]

 
     

Reactive Human-to-Robot Handovers of Arbitrary Objects
Wei Yang, Chris Paxton, Arsalan Mousavian, Yu-Wei Chao, Maya Cakmak, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [Video] [Project Page] [bibtex]

Best Human Robot Interaction Paper

 
     

ACRONYM: A Large-Scale Grasp Dataset Based on Simulation
Clemens Eppner, Arsalan Mousavian, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [Dataset] [bibtex]

 
     

Sim-to-Real for Robotic Tactile Sensing via Physics-Based Simulation and Learned Latent Projections
Yashraj Narang*, Balakumar Sunarlingam*, Miles Macklin, Arsalan Mousavian, Dieter Fox
International Conference on Robotics and Automation (ICRA), Xi'an, China, 2021
[Paper] [bibtex]

 
     

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation
Yu Xiang, Christopher Xie, Arsalan Mousavian, Dieter Fox
Conference on Robot Learning (CoRL) Virtual, 2020
[Paper] [Video] [Code] [bibtex]

 
     

Interpreting and Predicting Tactile Signals via a Physics-Based and Data-Driven Framework
Yashraj Narang, Karl Van Wyk, Arsalan Mousavian, Dieter Fox
Robotics: Science and Systems Conference (RSS), Corvalis OR, 2020
[Paper] [Video] [Project Page] [bibtex]

 
     

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation
Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle WA, 2020
[Paper] [Video] [bibtex]

 
     

6-DOF Grasping for Target-driven Object Manipulation in Clutter
Adithya Murali, Arsalan Mousavian, Clemens Eppner, Dieter Fox
International Conference on Robotics and Automation (ICRA), Paris, France, 2020

Best Robot Manipulation Paper Finalist
Best Student Paper Finalist

[Paper] [Video] [bibtex] [blog post]

 
     

Self-supervised 6D Object Pose Estimation for Robot Manipulation
Xinke Deng, Yu Xiang, Arsalan Mousavian, Clemens Eppner, Timothy Bretl, Dieter Fox
International Conference on Robotics and Automation (ICRA), Paris, France, 2020
[Paper] [Video] [bibtex]

 
     

A Billion Ways to Grasps - An Evaluation of Grasp Sampling Schemes on a Dense, Physics-based Grasp Data Set
Clemens Eppner, Arsalan Mousavian, Dieter Fox
International Symposium on Robotics Research (ISRR), Hanoi, Vietnam, 2019
[Paper] [Dataset] [bibtex]

 
     

The Best of Both Modes: Separately Leveraging RGB and Depth for Unseen Object Instance Segmentation
Christopher Xie, Yu Xiang, Arsalan Mousavian, Dieter Fox
Conference on Robot Learning (CoRL), Osaka, Japan, 2019
[Paper] [Video] [Projet Page] [bibtex]

 
     

6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
Arsalan Mousavian, Clemens Eppner, Dieter Fox
International Conference on Computer Vision (ICCV), Seoul, South Korea, 2019
[Paper] [Video] [bibtex] [Code + Data] [Blog Post]

 
     

PoseRBPF: A Rao-Blackwellized Particle Filter for 6D Object Pose Tracking
Xinke Deng, Arsalan Mousavian, Yu Xiang, Fei Xia, Tim Bretl, Dieter Fox
Robotics: Science and Systems Conference (RSS) Freiburg, Germany, 2019
[Paper] [Video] [TechXplore Article] [bibtex]

 
     

Visual Represenatations for Semantic Target Driven Navigation
Arsalan Mousavian, Alexander Toshev, Marek Fiser, Jana Kosecka, James Davidson
International Conference on Robotics and Automation (ICRA) Montreal, Canada, 2019
[Paper] [bibtex]

 
     

Synthesizing Training Data for Object Detection in Indoor Scenes
Georgios Georgakis, Arsalan Mousavian, Alexander C. Berg, Jana Kosecka
Robotics: Science and Systems Conference (RSS), Cambridge MA, 2017
[Paper] [bibtex]

 
     

3D Bounding Box Estimation Using Deep Learning and Geometry
Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Kosecka
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu HI, 2017
[Paper] [Supplementary Material] [bibtex] [IEEE Spectrum][Forbes]
Estimated 3D Boxes on the split of 3DVP: [Download Link]

 
     

Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
Arsalan Mousavian, Hamed Pirsiavash, Jana Kosecka
International Conference on 3DVision(3DV), Stanford CA, 2016
[Paper] [bibtex]

 
     

Multiview RGB-D Dataset for Object Instance Detection
Georgios Georgakis, Md Alimoor Reza, Arsalan Mousavian, Phi-Hung Le, Jana Kosecka
International Conference on 3DVision(3DV), Stanford CA, 2016
[Paper] [Dataset Link] [bibtex]

 
     

Semantic Image Based Geolocation Given a Map
Arsalan Mousavian, Jana Kosecka
arXiv 2016
[Paper]

 
     

Deep Convolutional Features for Image Based Retrieval and Scene Categorization
Arsalan Mousavian, Jana Kosecka
arXiv 2015
[Paper] [bibtex]

 
     

Semantically Guided Location Recognition for Outdoors Scenes
Arsalan Mousavian, Jana Kosecka, Jyh-Ming Lien.
International Conference on Automation and Robotics (ICRA), Seattle WA, 2015
[Paper] [bibtex]

 
     

Semantically Aware Bag-of-words for Localization
Arsalan Mousavian, Jana Kosecka.
Computer Vision and Pattern Recognition 2015 workshop on Semantic for Visual Reconstruction, Localization, and Mapping (CVPRW), Boston, MA, 2015
[Paper] [bibtex]