cv
General Information
Full Name | Himanshu Thakur |
Location | Cupertino, CA |
scihimanshu10 [AT] gmail [DOT] com | |
Languages | English, Hindi |
Work Experience
- Feb 2024 - Present
Senior Machine Learning Engineer
J.P Morgan Chase & Co.
- Devised a deep-cross multimodal ranking model for offer category recommendation serving 90M Chase customers, increasing clicks by 12% in A/B tests and driving $100M in revenue; outperformed BAU by 4x and random by 2x in F1 score.
- Conceived an agentic framework to build digital twins of Chase customers for offline A/B testing our models — fine-tuned LLaMA-3 70B (using RAFT) on GPT-4 distilled tabular interaction data to hyper-personalize agents—first-authored a NeurIPS 2025 paper.
- Pioneered a distributed LLM inference and fine-tuning framework using vLLM and torchtune on team’s infrastructure, enabling 2 brand new projects, cutting setup time by 8 weeks, and establishing the team’s first GenAI environment.
- Eliminated feature errors and boilerplate by building a PySpark feature transformation library, enabling 20+ developers to define new jobs in 1 day instead of 7 using simple configs and functional primitives, reducing production model failures by 20%.
- Aug 2020 - July 2022
Senior Machine Learning Engineer
Morgan Stanley
- Spearheaded an 8-member team to build a distributed batch-job failure prediction model using Spark MLlib & Scala, achieved 94% accuracy, and reduced downtime by 100+ hours annually—recognized as the top contributor to global firm resiliency.
- Revolutionized graph search by creating a natural language-to-Cypher query translation framework, empowering 50+ teams' ML models.
- Researched and developed a domain-agnostic recommendation engine (using large-scale and distributed training of deep learning models) powered by graph databases, recognized as most innovative firmwide project.
- Dec 2019 - July 2020
Data Scientist Intern
Locale.AI
- Founding Member of the Data Science team, developed low-latency, real-time geospatial ML frameworks to power intelligent location-based decision-making.
- Invented a novel clustering algorithm for human activity discovery from raw geospatial ping data using semi-supervised representation learning, enabling the automated identification of 18% more business areas from previously untapped data.
- Productionized and scaled a high-throughput timeseries anomaly detection framework using temporal convolutional networks (TCN), increasing request processing speed 6x and empowering 10 companies to predict vehicle vandalism and trip delays with greater reliability.
- Jan 2020 - June 2020
Machine Learning and Web Development Intern (Part-Time)
S4S Technologies
- Designed and built the entire internal operations dashboard, streamlining team workflows and decision-making.
- Developed a computer vision solution for automated food quality assurance, enhancing accuracy and efficiency in defect detection.
- May 2019 - July 2019
Machine Learning Intern
Sigmaways Inc
- Developed an AI-driven portfolio recommendation engine using an LSTM-based predictor and a Meta-Heuristic Optimizer, optimizing personalized investment strategies.
- Built and applied NLP techniques in a seamless chatbot interface, improving user interaction and accessibility to financial analysts.
- May 2018 - July 2018
Software Engineering Intern
Nova (P & D)
- As a Summer Intern, I enhanced the bookstore's software capabilities by developing a new Edge-Billing system, enabling multiple staff members to finalize bills directly on their smartphones, which significantly increased transaction speed through simultaneous billing.
- Leveraged data analytics to introduce new software features that provided better insights into sales and purchases, further optimizing operations.
- Streamlined the transaction process by implementing a barcode scanner app linked to Google Docs, enabling staff to independently process orders, which saved months of hiring efforts and helped the bookstore meet its sales target.
Research Experience
- May 2023 - Aug 2023
Research Scientist Intern
Abacus.AI
- Led research on LLM adapter linearity (soft prompts, LoRAs), developing a gradient-based algorithm that approximates full fine-tuning performance with only (depth × num of LoRA) parameters, achieving <10% F1 error on Bloom for NewQA and SQuAD.
- Devised coding and reasoning LLM evaluation benchmark to debunk staleness in popular NLP benchmarks due to memorization in LLMs (GPT4, GPT3.5-turbo), open-sourced library, contributed to writing research paper accepted at ICLR 2024 (contributed talk). Code - https://github.com/abacusai/to-the-cutoff
- Jan 2023 - July 2023
Graduate Research Assistant
Robotics Institute, Carnegie Mellon University
- Led a team of 8 graduate researchers in ML research at the Biorobotics Lab to enhance the generalization and robustness of models for e-waste recycling. Focus areas included robust segmentation, out-of-distribution detection, and reinforcement learning.
- Invented a novel area-relaxation-based loss function that improved state-of-the-art segmentation on X-ray images of e-waste materials, increasing IoU by 15% and boosting small object detection accuracy by 80%.
- Developed and deployed a real-time distribution shift detection algorithm, achieving 99% accuracy in out-of-distribution detection for e-waste material classification.
- Feb 2021 - Feb 2022
Research Intern
SketchX Lab, University of Surrey
- Invented a novel active learning algorithm for fine-grained cross-modal instance-level retrieval tasks,
- Increased mean top1 accuracy by 5% over state-of-the-art technique, published a first-authored paper to a top-tier conference (BMVC).
Education
- 2023
Master of Computational Data Science
Carnegie Mellon University
- Select Courses
- Large Language Models
- Multimodal Machine Learning
- Deep Reinforcement Learning
- Introdution to Deep Learning
- Introduction to Machine Learning (PhD)
- Cloud Computing
- Foundations of Computational Data Science
- Academic Services
- Reviewer for EMNLP 2023
- Core Committee Member, LTI Computing Insfrastructure
- Select Courses
- 2020
Bachelor of Computer Science and Engineering
Vellore Institute of Technology, Vellore
- Select Courses
- Machine Learning
- Artificial Intelligence
- Image Processing
- Neural Networks and Fuzzy Logic
- Parallel and Distributed Computing
- Virualization
- Data Structures and Algorithms
- Operating Systems
- Databases
- Networks and Communications
- Calculus
- Linear Algebra
- Statics and Probability
- Discrete Mathematics and Graph Theory
- Theory of Computation and Compiler Design
- Extracurricular Experiences
- Researcher at ACM-VIT
- Core Committee Member, SEDS-VIT
- Founding Member at Team Qubits
- Achievements
- G. D. Naidu Young Scientist Award, 2020
- Sir. M. Visvesaraya Award, 2019
- Secured 1st position in 7 national hackathons
- Ranked in Top 3 teams at 3 national hackathons
- Secured 2 patents
- Select Courses
Open Source Projects
- 2020
AI on the Beach
- Developed a high-performance library for animating millions of geospatial pings using Dask and DataShader, enabling rapid visualization.
- Analyzed shark movement patterns with geospatial intelligence and machine learning, identifying potential attack sites in advance.
Honors and Awards
- 2020
- G. D. Naidu Young Scientist Award
- 2019
- Sir. M. Visvesaraya Award
Other Interests
- Hobbies: Reading, Music Composition, Hiking, Travelling