I Ask AI Models 'But Why?' So You Don't Have To
I'm Vishal Pramanik, a PhD student in Computer and Information Sciences at the University of Florida, where I conduct research at the intersection of AI safety, explainable AI, and machine learning security under the guidance of Prof. Dr. Sumit Kumar Jha. My work focuses on understanding the inner mechanisms of deep learning models—from vision transformers to large language models—using mechanistic interpretability, attribution methods, and circuit-based approaches.
My research spans multiple critical areas in AI safety: developing novel attribution techniques for model interpretability, exploring adversarial robustness and jailbreak vulnerabilities, advancing machine unlearning frameworks, and investigating hyperdimensional computing for efficient representation learning. I'm particularly interested in moving beyond surface-level model analysis to understand the fundamental mechanisms that drive neural network behavior and decision-making.
Before joining UF, I earned my Master's degree in Computer Science Engineering from the Indian Institute of Technology Bombay, where I received the Certificate of Excellence in Research for my thesis work on natural language processing using large language models. I also spent two years at Intel Bangalore as a Silicon Firmware Development Engineer, contributing to memory initialization and optimization for DDR5-based server systems.
University of Florida
2025 - Present
Indian Institute of Technology Bombay
2020 - 2022
GPA: 9.01/10.0
West Bengal University of Technology
2015 - 2019
GPA: 8.9/10.0
University of Florida
2025 - Present
Indian Institute of Technology Bombay
2020 - 2022
Intel Bangalore, India
July 2022 - July 2024
NeurIPS 2025 Workshop: First Workshop on Foundations of Reasoning in Language Models
AAAI-26 Workshop on Artificial Intelligence for Cyber Security (AICS), 2026
AAAI-26 Workshop on Artificial Intelligence for Cyber Security (AICS), 2026
AAAI-26 Workshop on Artificial Intelligence for Cyber Security (AICS), 2026
ACL Anthology | PDF
AAAI 2026 Workshop on Trust and Control in Agentic AI
IEEE National Aerospace and Electronics Conference (NAECON), 2024 | PDF
Main technologies I work with:
Explore interactive webpages for my recent research work: