Past Affiliations

Projects

  • image

    Descriptive Image Captioning

    Prof. V.Namboodiri, CSE, IIT Kanpur

    Replicated the torch implementation results of Dense Captioning model to obtain region-specific captions.Refined the captions by putting filter on the overlap area and caption similarity to reduce redundancy. Categorized captions through topic based clustering and fed them into anencoder-decoder model trained to generate sentences from phrases obtaining successful results in producing paragraphs for images in Visual Genome Dataset.[Report] [Slides]

  • image

    Video Captioning

    Prof.Gaurav Sharma, CSE, IIT Kanpur

    Implemented the State-of-the-art model of Sequence to Sequence-Video to Text which exploits the temporal information of videos. Incorporated audio features to improve the confidence in activity prediction which refined captions. Also Combined the Deep Compositional Captioning model to use a language model trained on DBpedia reducing dependency on annotated dataset.[Report] [Slides]

  • image

    Pedestrian and Vehicle Classification

    Prof.Harish Karnick, CSE, IIT Kanpur

    Refined the data obtained from surveillance camera videos of the campus and carried out background subtraction for object detection which were then tracked using box overlap in consecutive frames.Employed grey-scale, HOG, hierarchial HOG and SIFT features for clustering as well as classification.Experimented with SVM, Random Forest, Adaboost, Convolutional neural nets for classification of vehiclesand pedestrians obtaining the highest accuracy of 95.37%.[Report] [Slides]

  • image

    Gaussian Processes for Regression

    Prof.Piyush Rai, CSE, IIT Kanpur

    Studied Gaussian Processes which involves inferring a distribution rather than giving a point estimate. Assumed a Gaussian prior with a zero mean and squared exponential covariance over the predictingfunction and used Bayes Theorem to obtain a posterior which is also a Gaussian Distribution for forecasting. Tried various covariance functions to predict forest fires using GPML libraries in MATLAB.[Slides]