Rasesh2005/README.md

👋 Hi, I'm Rasesh Udayakumar Shetty!

Welcome to my GitHub profile! I'm a passionate student pursuing an Integrated Dual Degree in Mathematics and Computing at IIT (BHU), Varanasi, with a strong interest in AI/ML research, computer systems, and algorithm design.

๐Ÿ› ๏ธ Projects

๐Ÿ” AI-Generated Image Detection with Explanation (October - December 2024)

  • Built a dataset of 3.4M+ images, with reasoning generated for 1M+ fake images using Gemini-1.5-Pro.
  • Designed a novel architecture based on Variational Autoencoders, reaching 99% accuracy on images with a similar latent space and 75%+ accuracy on unseen datasets (Adobe Firefly).
  • Fine-tuned Qwen2-VL with Class Activation Mapping for image reasoning.
  • Tools: PyTorch, PyTorch Lightning, Gemini API.

🎥 GIF Question Answering Multimodal ML Training (September - October 2024)

  • Developed an architecture for GIF Visual Question Answering using the Q-Former from BLIP-2 and Llama 3.2 1B.
  • Explored Vision-Language Models (VLMs) and their fine-tuning potential on GIF data.
  • Tools: PyTorch.

🔢 Differential Equation Solver (August - November 2024)

  • Trained a neural network to solve Ordinary Differential Equations under the guidance of Dr. Santwana Mukhopadhyaya.
  • Explored Graph-based ML and how neural networks learn functions.
  • Tools: PyTorch.

💻 Technologies

  • Languages: C++, C, Python, JavaScript
  • Frameworks & Tools: PyTorch, TensorFlow, LangChain, Cirq, Qiskit, MERN Web Development

📰 Publications

  • Understanding the World's Museums through Vision-Language Reasoning
    Curated a large-scale dataset of 65M images and 200M question-answer pairs for benchmarking vision-language models across visual question answering tasks.

🔗 Connect with Me

Feel free to explore my repositories and connect!


Pinned

  1. file-explorer-shell (Public)

    C++

  2. quickbloom (Public)

    Rust

  3. Traffic-Control-Quantum-Annealing (Public)

    Jupyter Notebook

  4. AI-Tic-Tac-Toe-Website (Public)

    An AI-based Tic-Tac-Toe game built as a website in HTML, CSS, and vanilla JavaScript

    JavaScript · 11 stars · 16 forks

  5. GDB-UI (Public)

    Forked from c2siorg/GDB-UI

    GDB stands for GNU Debugger. It's a powerful and popular debugger for various programming languages, including C, C++, Ada, and others. It allows developers to observe what a program is doing while…

    JavaScript