Featured Projects

Dynamic Hand Gesture Recognition

Python
Keras

Recognising Dynamic Hand Gesture movements and mapping them to Human Computer Interaction based acitivites in real time.

Check it out

Generative Modelling of Images from Speech

Python
PyTorch

On-going project to reconstruct a person's face from a sample of his/her audio using.

Check it out

AI Assistant

Python
Keras

An AI assistant for differently-abled people to recognize static hand gestures and converting them into sign language. Can also convert Text/Voice into a live motion sequence of gestures and read out newspapers/articles.

Check it out

Object Detection

Python
Keras

Object detection for Flipkart Grid Challenge in which I secured an All India rank of 29.

Check it out

GLMC

C
Python

A specialized linear algebra library for OpenGL made in C with a python wrapper.

Check it out

Space Invaders

Python
Pygame

A modern implementation of classic arcade game Space Invaders made using Pygame.

Check it out

Work Experience

Prasaurus Sports Analytics Pvt. Ltd. :

Worked as a Deep Learning / Computer Vision intern.

My task was to process the video datasets and create deep learning models to perform various tasks such as detect players, classify actions, infer other statistics such as dominance etc.

MyWays Life Layouts Pvt. Ltd. :

Worked as a Machine Learning Intern.

My work was based on creating a part of the pipeline for a hybrid recommendation systems for career planning and recruitment automation.

Cateina Technologies Pvt. Ltd.:

Worked as a software development intern.

My project was to create an customized recommendation engine using machine learning techniques for the company's use case

Z-connect:

Worked as an Android developer / intern.

My project here was to create a restaurant automation application that performs various tasks to enrich user experience and revolutionarise the dine-in process.

Blog

Vesit AI All India Hackathon:

Created an AI assistant as a part of the hackathon for differently-abled people and was the second runner-up in the competition.

I started off with gesture classification on ASL dataset as gesture recogntion was the base of the project.

On testing model with live images, i found out that accuracy was not very great on real-life images even after using optimization techniques.

I realised that real-life images have a lot of noise in the background and thus accuracy is low.

After some research, i came up with a solution to use background subtraction to eliminate the noise in the background and my accuracy in real life images significantly improved to accuractely classify 28/29 gestures with over a 98% accuracy which is better than most models available.

I then processed and added more functionalities like smart word and sentence recommendations and converting text/audio to a live motion sequence of gestures for differently abled people to understand in their sign language and also added functionality to read out text from articles and newspapers from their images.

In the end, I was able to secure 3rd position in the competition.

Project link

Generative Modelling (Speech2Face) :

Creating a model for reconstructing a person's face from his/her voice sample.

The project started off with understanding the workflow and creating a timeline of the tasks to be done.

Firstly, I extracted the voice segments and face of the people from youtube videos using the AVSpeech dataset.

Next, I augmented the audio segments with themselves until they reached a fixed clip size to make the inputs uniform for the encoder network.

After that, I extracted the facial features from the extracted faces using VGG vace which would be the output ground truth values for my encoder network and proceeded with building and training the encoder network.

The next task was to reconstruct the image of a person from the output of the encoder using a Face decoder network.

I built the decoder network to do the same usign transpose convolution layers and sm currently in the process of optimization of hyper-parameters.

In the end, I was able to create a model that can give an approximate of a person's facial looks from his / her audio samples.

Project link

Flipkart Grid Challenge:

The problem statement was to create a model to detect objects in images.

I started this project by doing a detailed study of the concepts of object detection and computer vision which helped me prepare a plan of action.

Since the competition was a 2 month long competion involving three stages, my primary focus was to create a base model for the second round and then optimize it.

I cleared round two and proceeded to round three in which i started using advanced optimization techniques to improve my model performance and finally acheived a nationwide rank of 29.

Project link

GLMC:

GLMC is a specialized linear algebra library prepared with C along with a python wrapper.

I started this project by doing a detailed study of the concepts of linear algebra that i had previously learnt and leaning some new ones which helped me set up a strong foundation to code optimally.

The next step was to decide a timeline and create a rough model for referrance.

And finally came the coding part. The coding part of the project was made relatively simple due to the strong base set up before it by the rough model and revision of concepts of linear algebra. After a few weeks of coding, the C library was completed and it could perform functions for 2x2,3x3,4x4 matrices and 2D,3D,4D vectors. It could also perform functions between matrices and vectors. The matrices were column major and vectors were column vectors. Note that the exmphasis was on speed and most of the functions were hence hard coded.

Project link

Ashwin Vaswani Software Developer

My Expertise

Machine Learning / Deep Learning

Competitive Coding

App Development

Featured Projects

Dynamic Hand Gesture Recognition

Generative Modelling of Images from Speech

AI Assistant

Object Detection

GLMC

Space Invaders

Work Experience

Prasaurus Sports Analytics Pvt. Ltd. :

MyWays Life Layouts Pvt. Ltd. :

Cateina Technologies Pvt. Ltd.:

Z-connect:

Blog

Vesit AI All India Hackathon:

Generative Modelling (Speech2Face) :

Flipkart Grid Challenge:

GLMC: