Sai Abhishek Siddhartha Namburu

Education

Master of Science in Mechanical Engineering, Carnegie Mellon University, Concentration: Robotics and Control, 2021-2023 [current GPA: 4.0/4.0]
B.E. (Hons.) Mechanical Engineering, Birla Institute of Technology and Science (BITS) Pilani - India, 2015-2020 [GPA: 8.74/10]

Work experience

May 2022 - August 2022: Computer Vision Intern
- Location: SRI International, Menlo Park, CA
- Description:
  - Developed an Attention-based Visual Dossier matching module to recognize whole-body ID with an accuracy of 91% based on 3D mesh and view angles generated using VIBE from pixel enhanced videos.
  - Generated pixel enhanced videos (2x better SNR) from U-Net based network by removing pixel shift and blur in videos.
  - Trained VIBE with an additional silhouette-based loss to generate shape aware 3D meshes of humans from videos.
August 2020 - August 2021: Machine Learning Engineer
- Location: DeepEdge, Hyderabad - India
- Description
  - Collaborated to develop a face recognition solution(RetinaFace + ArcFace + SORT) for body cameras worn by security officials. Integrated the pipeline by developing an application in K7 Camera(Ambarella CV25 chip) to inference on real time data. [code] [code]
  - Modified and trained a Mask-RCNN to detect custom objects, draw masks and identify keypoints of interest in pytorch. Improved inference speed by 3x by converting the model into TensorRT using TensorRT wrappers.
  - Accelerated Iterative Point Cloud algorithm (based on open3d) by 2.5x using cuda kernels and thrust library.
  - Analysed and calculated GAIT Characteristic parameters like stepcount, stance, swing from a pre-trained openpose pose detection model for an autonomous walker.
July 2018 - December 2018: Fintech Intern
- Location: Nomura Services India Private Limited, Mumbai - India
- Description
  - Assisted Nomura’s FinTech division: Responsible for conceptualization and execution of Nomura technology development program - Implemented AI/ML, NLP technologies to automate use-cases across various divisions.

Research/Academic experience

September 2021 - Present: Graduate Research Assistant
- Location: Human Sensing Lab, Carnegie Mellon University
- Description:
  - Modified current deep mind based I3D action recognition model(with non-local blocks) to localize activity and achieved an accuracy of 72% (in real domain) training on combination of synthetic and real robot cognition data.
  - Implemented Gradient Reversal Layer for I3D improving accuracy in real domain by 2% without using labels in real domain while training.
- Code
- Supervisor: Professor Fernando De La Torre and Prof. Jessica K. Hodgins
August 2020 - August 2021: Project Assistant
- Location: Indian Institute of Science, Bangalore - India
- Description
  - Devised optmisation algorithms in python for efficient heat transfer.
- Supervisor: Professor Anathasuresh, Prof. Pramod Kumar

Projects

Inter-Vehicular Depth and Velocity Estimation using a Monocular Camera (CMU): February 2022 – May 2022
- Description:
  - Modified Monoflex to regress for 3D key points from single image to predict depth of vehicles. Reduced dependence on data by implicitly learning for uncoupled representations (3D orientations) of a car maintaining similar AP.
  - Integrated kalman filter based AB3MOT with the Monoflex (using its depth predictions) achieving 3% less error in velocity prediction compared to using PointRCNN backbone.
  - Attained comparable accuracy by regressing for velocity and depth directly from a Multi-scale Vision Transformer.
- Tech Stack: Python, PyTorch
- Code and Report
MyTorch Framework and Deep Learning (CMU): February 2022 – May 2022
- Description:
  - Developed a deep learning framework with functional modules like Conv1D, Conv2D, LSTM, GRU, Attention and iterative optimization algorithms including Adam, SGD and RMSprop.
- Tech Stack: Python
- Code and Report
Visual Learning and Computer Vision Assignments (CMU): August 2021 – May 2022
- Description:
  - Implemented a weakly supervised object localization network and a deep detection network with AlexNet backbone on Pascal dataset and obtained comparable accuracy with supervised object detection methods.
  - Trained a GAN on CUB 2011 dataset to generate realistic looking birds with standard GAN, LSGAN and WGAN-GP losses and compared the FID scores. Tested disentanglement of latent space by interpolating it.
  - Implemented visual object tracking with Lucas-Kanade forward additive image alignment algorithm and Matthew-Bakers Inverse Compositional alignment algorithm.
  - Performed 3-D Reconstruction of a temple estimating Fundamental matrix using with 8/7-point algorithm and RANSAC.
  - Worked on stitching panoramas utilizing homography and RANSAC.
  - Implemented MNIST character detection recognition using neural nets and acheived an accuracy greater than 90%.
  - Used Spatial Pyramid Matching of image word maps to recognise the location of an image.
- Tech Stack: Python, PyTorch
- Code and Report
Emotion Recognition in Multi-Party Conversations using Multi-Modal Approach (CMU): October 2021 – November 2021
- Description:
  - Achieved an accuracy of 60% in predicting sentiment and emotion by combining text and audio modules of multi-party conversations in friends using shallow machine learning models.
  - Deployed various feature engineering methods to improve the accuracy by 5%. Compared the results with t-CNN, d-RNN and bc-LSTM and reasoned the accuracies.
- Tech Stack: Python, PyTorch, Tensorflow
- Code and Report
Robot Planner Visualization (CMU): October 2021 – November 2021
- Description:
  - Implemented RRT, RRT, RRT Connect, A, Weighted A* and Dijikstra in 2D using object oriented paradigm.
  - Visualized the search of these planner using openGL in 2D. Developed a GUI to make the product market ready.
- Tech Stack: C++, openGL
- Code and Report
Home Service Robot: June 2021 – July 2021
- Description:
  - Deployed a turtlebot mobile robot in a designed custom environment in gazebo to pick-up and drop-off objects in the environment.
  - Mapped the environment using gmapping package, localized it and used ROS navigation stack (Dijkstra’s algorithm) to plan a safe path to pick-up and drop-off objects (designed with rviz marker).
- Tech Stack: C++, ROS
Fashion Retail forecasting using ENN: February 2019 – April 2019
- Description:
  - Modelled custom evolutionary neural network using genetic algorithm to forecast demand for two different apparels.
  - Employed a pre-search algorithm along with BIC metric to effectively increase computation speed.
  - Compared with a Non-linear Auto Regressive exogenous input network to pick a better forecasting method.
- Tech Stack: Python, Keras, Scikit-learn, Matlab DL tool
- Code
Classification of YouTube comments to detect advertisements: November 2017 - December 2017
- Description:
  - Implemented an end-end approach with Support Vector Machine and a Naive Bayes classifier to classify YouTube comments and detect advertisements based on words used.
  - Used RoC-AuC metric to tune hyperparameters in the model.
- Tech Stack: Python
- Code
Algorithm to reduce epicyclic geartrains by Isomorphism test: February 2018 – April 2018
- Description:
  - Devised an algorithm to reduce the number of epicyclic gear trains after checking structural and rotational isomorphism.
  - Algorithm first generates all possible Epi cyclic gear trains(EGT) and associates each them to a matrix using hammingmethod, then checks for translational and rotational isomorphism retaining non-isomorphic gear trains.
- Tech Stack: Matlab

Skills

Programming Languages: Python, C/C++ (Cuda kernels and thrust library), R
Deep-Learning Frameworks: PyTorch, TensorRT, Keras, Tensorflow(1 and 2)
Tools: Docker, Scikit learn, OpenCV, SQL, MS Office, LaTeX, GitHub
Application Software: ROS, MATLAB, Gazebo

Publications

Submitted paper to CVPR 2023 for review - Multi-view and Multi-domain Human Activity Recognition
Constitutive modelling and processing map analysis of tungsten heavy alloy (92.5W-5.25Ni-2.25Fe) at elevated temperatures, Second author, International Journal of Refractory Metals and Hard Materials
Algorithm to reduce epicyclic gear trains by testing for Isomorphism

Research & Teaching

Undergraduate Research Intern

Graduate Research Assistant (IISc)

Graduate Research Assistant (CMU)

Teaching Assistant (Bio-Inspired Robotics)

Teaching Assistant (Engineering Computation)

Certificates & Achievements

Robotics Software Engineer : Udacity [In Progress]
Deep Learning Specialization: Deeplearning.ai, Coursera [December 2018]
Machine Learning: Coursera [December 2018]
C, C++ programming: NIIT [May 2019]
Excel to My SQL Analytics techniques: Coursera [Jun 2018]
Bloomberg Market Concepts: Bloomberg [Aug 2017]
R programming training: Innodatatics USA [Aug 2017]

Activities and Interests

Secretary at Brindavanam Telugu Samithi at BITS-Hyderabad
Member of VFX Club at BITS-Hyderabad
Member of Department of Visual Effects at BITS-Hyderabad
Teaching Volunteer for nearby schools in under-developed areas as a part of Nirmann, BITS Hyderabad