Hi, I'm Vatsal!

Contemplative coder, inspired by simplifying lives.

A little bit about myself : I was born and raised in New Delhi - the perfect place for a foodie and an explorer such as me! For as long as I can remember, I have been driven by a love for learning and I am constantly on the lookout for new ideas and concepts. As a tech-savvy and Machine Learning enthusiast, I absolutely love undertaking and solving challenging real-world problems. And when I am not coding, I am either hitting the gym for some intense weight-lifting, taking my beagle for a stroll in the park, or simply exploring my artistic side!

What do I do?

Machine Learning

Deep Learning

Software Engineering

Full Stack Web Development

Algorithm Design

Competitve Programming

Algorithmic Challenge Curation

Education

University of Pennsylvania - School of Engineering and Applied Science (2018 - 2020)
  • GPA : 4.0/4.0
  • Relevant Courses:
    • Deep Learning for Data Science
    • Vision and Learning
    • Machine Learning
    • Reinforcement Learning
    • Machine Perception
    • Big Data Analytics
    • Internet and Web Systems
    • Analysis of Algorithms
    Head Teaching Assistant CIS 545 - Big Data Analytics
    Teaching Assistant CIS 522 - Deep Learning for Data Science
    Graduate Research Assistant: GRASP Laboratory under the guidance of Bernadette Bucher and Prof. Kostas Daniilidis in the machine perception group.
    Penn Data Science group: Collaborating with Dymaxion Labs on the AP-LATAM Project - mapping roofs in slums with Mask R-CNN trained on 50cm satellite data from Rio.
    University of Delhi - Netaji Subhas Institue of Technology (2012 - 2016)
  • Graduated first class with distinction
  • Percentage : 80.57%
  • Relevant Courses:
    • Pattern Recognition
    • Operating Systems
    • Distributed Systems and Computing
    • Design and Analysis of Algorithms
    • Theory of Computation
    • Computer Networks
    • Relational Database Management Systems
    • Engineering Mathematics I,II and III
    • Software Engineering
    • Object Oriented Technology
    • Digital Circuits and Systems
    • Analog Electronics
    Bachelors Project: Intelligent online clothing store using Deep Learning under the guidance of Dr. Apoorvi Sood.
    Competitive Programming:
    ACM ICPC & APC Contests:
    • Achieved 24th rank in the ACM- Asian Programming Contest.
    • Qualified thrice for the ACM-ICPC regional contests. Achieved ranks of 50, 105, 112 & 130 in the online and regional rounds.
    Topcoder, Codeforces and Codechef:
    Consistently achieved good ranks in contests held regularly on these platforms.
    Codechef best ranking : 252 Global Rank / 128 Country Rank

    Work Experience

    Software Engineering Intern - Prediction at Uber ATG May 2019- August 2019

    Contributed to the continuous trajectories subteam which develops algorithms for trajectory waypoint predictions for dynamic actors on the road.
    • Developed a policy guided machine learnt model for replacing a physics guided approach for trajectory prediction in a particular scenario.
    • Responsibilities included data extraction, cleaning and preprocessing from AV logs, hypothesizing and validating policies for prediction, developing models for predicting policy parameters, evaluating and comparing to the existing approach and finally deploying the learnt model onboard the autonomous vehicle.

    Software Engineer at July 2016- July 2018

    Full stack developer for the front office discretionary group

    • Redesigned and rewrote a legacy application for recruiting functionalities and analytics with RESTful services developed using Java Spring and UI built using ReactJS and NodeJS.
    • Developed ETLs for migration and cleaning of the data from the older system to the newer system using Spring-Batch.
    • Designed and developed generic React components for the firm's react component library - DJS.
    • Responsible for enhancements and maintenance of financial dashboards.
    • Consistently received the highest performance rating

    Software Engineering Intern at HackerRank May 2015- July 2015

    • Developed the articles module for adding and updating articles to the HackerRank community website.
    • Developed a forum classifier that classifies comments posted on the discussion forum as appreciative, help required, problem, negative and miscellaneous.
    • Developed and tested algorithmic problems of varying difficulties for the community website

    Problem Curator Intern at HackerRank Dec 2015 - Jan 2016

    • Developed 50+ problems for the community website and HackerRank for work domain of varying difficulties for the data structure, algorithms, artificial intelligence and programming language specific domains.

    Software Engineering Intern at IBM India Software Labs May 2014- June 2014

    Worked on transit management services for Singapore Metro Rail Transit

    • Developed utility functions for cleaning, wrangling and integrating the data from multiple data files received from SMRT.
    • Developed services for estimating arrival times, finding the stations impacted by cascading delays and computing polygons corresponding to routes.

    External Contributor at CodeRoulette August 2016

    • Developed algorithmic challenges for a pair programming game for developers where they can code in realtime with peers from all around the world

    Projects

    Visual Question Answering

    A VQA system takes as input an image and a free-form, open-ended, natural language question about the image and produces a natural language answer as the output

    • Dataset: Toronto COCO-QA, VQA dataset. Both of these datasets contain images from MS COCO and Abstract Scenes with ~5 questions per image and 3 plausible answers per question
    • Model architecture comprises of 4 parts - Image Encoder, Question Encoder, Fusion technique and feed forward classifier
    • VGG16 encoder combined with concatenated word embeddings from BERT combined using a stacked attention network based fusion technique was our best performing model with a top 5 accuracy of 70.32%

    Deep Learning for Chest X Ray Diagnosis

    Automated chest radiograph interpretation at the level of practicing radiologists using CheXpert - a large dataset of chest X-rays

    • Trained two separate DenseNets - one trained only on lateral images and one trained only on frontal images
    • Combined the results from these two models at a study level by taking the maximum predictions for two views
    • Acheived AUC Score of 0.861 on the test set and the model performed better than predictions by a single radiologist.
    • Interpreted the model using Class Activation Mapping - highlighting features/sections in the xray most important for the classification.
    • Also tried guided backpropagation to identify the locations in the image that activate the final layer neurons.

    SearchSearchGo

    SearchSearchGo is a academic version of a search engine consisting of a distributed crawler, hadoop map-reduce based indexer and pagerank, a query engine with Berkley DB as the store and UI built on ReactJS.

    • Crawled ~1 million pages using a Apache storm based crawler distributed across 10 EC2 machines.
    • Created the inverted indexes, tf-idf values for the crawled corpus using a hadoop map-reduce based Indexer
    • Generated pageranks for the webpages using a map-reduce pagerank algorithm executed on Amazon EMR
    • Query engine combined the results from cosine similarity between the query and the documents and the pagerank values.
    • Integrated shopping and weather results from ebay and weather.com apis/li>
    • The UI was built using ReactJS

    Roof Identification

    Used Mask RCNNs fine tuned on SpaceNet dataset to identify roofs in satellite imagery to identify potential slums and informal settlements in Latin American Countries.

    • Exploratory data analysis of SpaceNet Rio data and the data from dymaxion labs for latin american countries.
    • Converting data from different formats from SpaceNet Rio and Dynamion data to a common format consistent with MS COCO input format
    • Fine tuning the non backbone layers of the Mask RCNN on SpaceNet Rio Data using multiGPU training on Google Cloud
    • Used several data augmentation techniques for regularization and performance improvements on the slums data

    Face Antispoofing

    Developed a technique that can distinguish the spoof face accesses from the genuine ones. This will reduce the possibility of a facial biometric system from being deceived or spoofed by non-real faces.

    • Collected face data using digital and 3d printed masks based spoofing.
    • Used a combination of hand engineered textural features using Local Binary Patterns and Haralick texture features and deep features from a pretrained Resnet50 CNN model on VGG-Face dataset.

    Intelligent Online Clothing Store

    Developed a system to recognise clothing style and colour in apparel images using deep features and recommend visually similar products.
    The system has the functionality to search for products available in the store by clicking photographs of the desired products.

    • Automated product tagging using a color X category classifier trained on images scraped from google
    • Searching products with images
    • Review based product recommendation for users

    Healthcare Outcomes

    Developed a system to Predict health behaviors and outcomes for US counties as a function of

    • Demographics
    • Socio-Economic Status (SES)
    • Urban/rural environment
    • Words in tweets from that county

    Game of Hex

    Hex is a strategy board game usually played on a 11x11 hexagonal grid. Wrote the code for both a Player vs Player and Player vs CPU hex game. The AI Bot used Monte Carlo Simulation to determine the next best move.

    Movie Rating Predictor

    Predict success for films measured by their gross earnings and average user rating using support vector machine trained and tuned on IMDb movie data for 80,000 movies (Pruned to include only movies for which the desired information is present)

    Languages and Frameworks

    Online Courses

    • Deep Learning: 5-course specialization | deeplearning.ai
    • Machine Learning | Stanford University 
    • Machine Learning Foundations: A Case Study Approach | Univerity of Washington 
    • Machine Learning Regression | Univerity of Washington 
    • Machine Learning Classification | Univerity of Washington 
    • Web Application Architectures | University of New Mexico 
    • HTML, CSS and JavaScript | HKUST
    • Front-End Web UI Frameworks and Tools | HKUST
    • Algorithms Design and Analysis (Part 1 and Part 2) | Stanford University
    • C++ For C Programmers | University of California, Santa Cruz  
    • CS50x: Introduction to computer science | Harvard University 
    • LFS101x.2: Introduction to Linux | The Linux Foundation 

    The Fun Part

    Contact

    328 N Wiota Street, Philadelphia, PA 19104