About Me
Supratim Manna is deeply fascinated by generative models such as Large Language Models (LLMs), Generative Adversarial Networks (GANs), and diffusion models. These advanced techniques offer powerful applications from generating realistic images to enhancing data augmentation in machine learning. He also has a strong interest in quantum computing, recognizing its potential to revolutionize fields like Cryptography and AI. Through a combination of coursework, self-study, and hands-on projects, Supratim actively explores these cutting-edge domains to stay at the forefront of technological advancement.
Education
Defence Institute of Advanced Technology (DIAT-DRDO)
M.Sc. Data Science (Applied Mathematics)
2023 - 2025
Midnapore College (Autonomous)
B.Sc. Mathematics (Hons.)
2020 - 2023
Research Interests
Supratim Manna’s research interests lie in generative models, including LLMs, GANs, and diffusion models, with applications in data synthesis and augmentation. He is also intrigued by the transformative potential of quantum computing in areas like AI and cryptography. Through continuous learning and hands-on experimentation, he explores these frontier technologies to drive innovation.
Publications
1. Isolation Forest-based Anomaly Detection of Vessels using AIS Data
S. Manna, Dr. B. Ramkrishna
ICDMAI 2025 (To be published soon)
2. An ML Approach for Predicting Maritime Traffic
S. Manna, Dr. B. Ramkrishna
2025(In process)
Projects
This project investigates the detection of exoplanets using machine learning, specifically employing the K-Nearest Neighbors (k-NN) algorithm to analyze light intensity data. Utilizing a dataset downloaded from Kaggle, which includes light curves of various celestial bodies, we focus on identifying patterns indicative of exoplanets. The k-NN model is trained on labeled data to distinguish between exoplanets and non-exoplanetary candidates based on fluctuations in light intensity. Our findings demonstrate that k-NN can effectively enhance classification accuracy, providing a promising avenue for future research in exoplanet detection and contributing to our understanding of planetary systems beyond our solar system.
This project encompasses innovative components developed for the CyberAICup 2024 competition, focusing on advancements in network security. The Network Intrusion Detection project employs a machine learning approach to enhance cybersecurity. By using interpolation techniques on payload data, the model is trained with LightGBM to effectively identify potential network intrusions. This innovative solution aims to improve detection accuracy (98.71%) and response times in real-time network monitoring.
This project focuses on 3D object detection for autonomous vehicles, leveraging a pre-trained model sourced from GitHub.The objective is to enhance the safety and reliability of autonomous driving systems by accurately detecting and classifying objects in real-time. We utilized camera-recorded video for testing, allowing the model to analyze its surroundings and produce rectangular 2D bounding boxes around detected objects. Through comprehensive evaluation, we assess the model’s performance in terms of detection accuracy and processing speed. The results demonstrate the model’s effectiveness in real-world scenarios, contributing to advancements in object recognition systems and improving the overall functionality of autonomous vehicles.
This project presents a Movie Recommender System utilizing data from The Movie Database (TMDB) to provide personalized movie suggestions. By integrating datasets containing movie details and credits, we preprocess and extract relevant features such as genres, keywords, cast, and crew. Using a bag-of-words approach with Count Vectorization, we create a similarity matrix based on text data from these features. The system employs cosine similarity to recommend movies similar to a given title, enhancing user experience through tailored suggestions. The results demonstrate the effectiveness of this method in generating relevant recommendations, contributing to advancements in recommendation algorithms in the entertainment industry.
This project focuses on face mask detection using Convolutional Neural Networks (CNNs), utilizing a dataset sourced from Kaggle’s open-source repositories. The objective is to develop an automated system capable of accurately identifying individuals wearing face masks in real-time, a critical task in promoting public health and safety. The dataset comprises images categorized into masked and unmasked individuals, which are preprocessed and augmented to improve model robustness. The CNN model is trained to distinguish between the two classes, achieving high accuracy in detection. This project underscores the potential of deep learning techniques in addressing real-world challenges, particularly in monitoring compliance with health guidelines in public spaces.
This project predicts stock price movement based on news headlines using Natural Language Processing (NLP), utilizing a dataset sourced from Kaggle. By analyzing the sentiment of financial news, we assess its impact on stock market behavior. The data is preprocessed and sentiment scores are calculated, which are then correlated with subsequent stock price movements. My findings demonstrate the effectiveness of NLP techniques in financial forecasting, providing valuable insights for investors.
This project focuses on generating CIFAR-10 small color photographs using Generative Adversarial Networks (GANs). The CIFAR-10 dataset, comprising 60,000 32x32 color images across 10 classes, serves as the training ground for our GAN model. By leveraging the adversarial training process, my aim is to create realistic images that closely resemble the original dataset. The results showcase the GAN’s ability to generate diverse and high-quality color photographs, contributing to advancements in image synthesis and demonstrating the potential of GANs in various applications, including data augmentation and creative content generation.
This project focuses on image generation using Denoising Diffusion Probabilistic Models (DDPMs), with the code and dataset sourced from GitHub. By utilizing diffusion processes to iteratively refine noise into coherent images, we aim to capture intricate details and textures across various datasets. The model is trained on a collection of images, allowing it to learn complex distributions and produce visually appealing outputs. My findings demonstrate the effectiveness of DDPM in generating realistic images, showcasing their potential applications in areas such as art, design, and data augmentation.
We have implemented a military aircraft classification system using pretrained deep learning models—ResNet50, VGG19, and VGG16—to compare their performance. Conducted extensive evaluations, where ResNet50 outperformed the others, delivering the best classification accuracy. This study highlights the effectiveness of transfer learning in defense applications, ensuring robust and efficient aircraft recognition.
I have developed an AI-Agent using DeepSeek prompt engineering to answer user queries based on a predefined Knowledge Base (KB), ensuring data privacy by preventing external data sharing. If the answer is unavailable in the KB, the agent dynamically queries an LLM prompt to generate a response. This approach enables secure model training without exposing sensitive information while optimizing knowledge retrieval efficiency.
Virtual Internship
1. Innovate Intern - AI/ML
AICTE Virtual Internship
July 2024 - November 2024
Skills
Technical Skills
- Programming Languages: Python, R, SQL.
- Libraries: Scikit-learn, TensorFlow, Pytorch.
- Technical Skills: ANN, CNN, RNN, LSTM, GRU, Transformer, Attention Mechanism, Encoder-Decoder, GAN, DDPM.
- Mathematical Expertise: Operation Research and Optimization Techniques, Linear Algebra, Calculus, Discrete Mathematics, Graph Theory, Number Theory, Numerical Analysis, Probability & Statistics, Mathematical Cryptography.
- AI Skills: Machine Learning, Deep Learning, LLM, Data structures & Algorithms, DBMS, Time Series Analysis.
- Data Science Tools: Pandas, NumPy, Seaborn, Matplotlib, VS-Code, R-Studio, Google-Colab, ChatGPT, Kaggle, Hugging-Face, Git, Github, Cursor, WixStudio, Pycharm, Ollama, LM-Studio.
Soft Skills:
Problem-solving, Good Communicator, Analytical Thinking, Creativity, Team Collaboration.
Certifications
Certifications
- 1. Artificial Intelligence Fundamentals - IBM
- 2. Python Essentials - Cisco
- 3. Power of Python in Data Analysis - Ramkrishna Mission Vidyamandira
- 4. ACMAGRADE Certification - ACMAGRADE
- 5. Next2Tech Certification - Next2Tech
Achievements
1. GATE Qualified in Data Science & AI, 2024 & 2025
2. Selected for Masters from IISER Kolkata, IISER TVM & South Asian University(International University).
Volunteering & Leadership Roles
1. Volunteer, PUNECON - IEEE Pune Section International Conference 2024
Languages & Hobbies
Languages: English, Hindi, German
Hobbies: Badminton