
Aayan Yadav
BTech Student
IIT Roorkee
About Me
I am a senior at the Mehta Family School of Data Science and Artificial Intelligence at IIT Roorkee. My main interest is computer vision. I have worked on segmentation, datasets, and AI-generated image detection, and I am currently working on diffusion and 3D scene generation. I am also intrigued by 3D reconstruction and SLAM, which I explore in my free time. I am looking for exciting projects in 3D. If you have any ideas or openings, please reach out!
Publications



Provenance Detection for AI-Generated Images: Combining Perceptual Hashing, Homomorphic Encryption, and AI Detection Models
Shree Singhi, Aayan Yadav, Aayush Gupta, Shariar Ebrahimi, Parisa Hassanizadeh
Projects
Sirius
An agentic RAG system built on state-of-the-art techniques such as AdaRAG, PlanRAG, HyDE, SPLADE, MetRAG, and RRF.
GitHub
MedMatcher
Template matching of similar documents for a medical dataset. I fine-tuned a LayoutLMv3 model on a custom medical document dataset using weighted cross-entropy loss and mini-batch gradient descent.
GitHub
Image Captioning Model
Built an image captioning model using transfer learning on the Flickr8k dataset. We fine-tuned a combination of a pretrained InceptionV3 and an LSTM, with regularization.
GitHub
Blogs
Dismantling Disentanglement in VAEs
In this blog post, I give a brief introduction to variational autoencoders and then explain how disentanglement can be achieved in the latent space. It is an explanation of this paper.
Read more
Activation Functions
This is a beginner's introduction to activation functions. It was my first-ever blog post, written for the Blogathon organised by DSG IITR!
Read more