cv | Himanshu Thakur

General Information

Full Name	Himanshu Thakur
Location	Cupertino, CA
Email	scihimanshu10 [AT] gmail [DOT] com
Languages	English, Hindi

Work Experience

Feb 2024 - Present
Senior Machine Learning Engineer

J.P Morgan Chase & Co.
- Devised a deep-cross multimodal ranking model for offer category recommendation serving 90M Chase customers, increasing clicks by 12% in A/B tests and driving $100M in revenue; outperformed BAU by 4x and random by 2x in F1 score.
- Conceived an agentic framework to build digital twins of Chase customers for offline A/B testing our models — fine-tuned LLaMA-3 70B (using RAFT) on GPT-4 distilled tabular interaction data to hyper-personalize agents—first-authored a NeurIPS 2025 paper.
- Pioneered a distributed LLM inference and fine-tuning framework using vLLM and torchtune on team’s infrastructure, enabling 2 brand new projects, cutting setup time by 8 weeks, and establishing the team’s first GenAI environment.
- Eliminated feature errors and boilerplate by building a PySpark feature transformation library, enabling 20+ developers to define new jobs in 1 day instead of 7 using simple configs and functional primitives, reducing production model failures by 20%.
Aug 2020 - July 2022
Senior Machine Learning Engineer

Morgan Stanley
- Spearheaded an 8-member team to build a distributed batch-job failure prediction model using Spark MLlib & Scala, achieved 94% accuracy, and reduced downtime by 100+ hours annually—recognized as the top contributor to global firm resiliency.
- Revolutionized graph search by creating a natural language-to-Cypher query translation framework, empowering 50+ teams' ML models.
- Researched and developed a domain-agnostic recommendation engine (using large-scale and distributed training of deep learning models) powered by graph databases, recognized as most innovative firmwide project.
Dec 2019 - July 2020
Data Scientist Intern

Locale.AI
- Founding Member of the Data Science team, developed low-latency, real-time geospatial ML frameworks to power intelligent location-based decision-making.
- Invented a novel clustering algorithm for human activity discovery from raw geospatial ping data using semi-supervised representation learning, enabling the automated identification of 18% more business areas from previously untapped data.
- Productionized and scaled a high-throughput timeseries anomaly detection framework using temporal convolutional networks (TCN), increasing request processing speed 6x and empowering 10 companies to predict vehicle vandalism and trip delays with greater reliability.
Jan 2020 - June 2020
Machine Learning and Web Development Intern (Part-Time)

S4S Technologies
- Designed and built the entire internal operations dashboard, streamlining team workflows and decision-making.
- Developed a computer vision solution for automated food quality assurance, enhancing accuracy and efficiency in defect detection.
May 2019 - July 2019
Machine Learning Intern

Sigmaways Inc
- Developed an AI-driven portfolio recommendation engine using an LSTM-based predictor and a Meta-Heuristic Optimizer, optimizing personalized investment strategies.
- Built and applied NLP techniques in a seamless chatbot interface, improving user interaction and accessibility to financial analysts.
May 2018 - July 2018
Software Engineering Intern

Nova (P & D)
- As a Summer Intern, I enhanced the bookstore's software capabilities by developing a new Edge-Billing system, enabling multiple staff members to finalize bills directly on their smartphones, which significantly increased transaction speed through simultaneous billing.
- Leveraged data analytics to introduce new software features that provided better insights into sales and purchases, further optimizing operations.
- Streamlined the transaction process by implementing a barcode scanner app linked to Google Docs, enabling staff to independently process orders, which saved months of hiring efforts and helped the bookstore meet its sales target.

Research Experience

May 2023 - Aug 2023
Research Scientist Intern

Abacus.AI
- Led research on LLM adapter linearity (soft prompts, LoRAs), developing a gradient-based algorithm that approximates full fine-tuning performance with only (depth × num of LoRA) parameters, achieving <10% F1 error on Bloom for NewQA and SQuAD.
- Devised coding and reasoning LLM evaluation benchmark to debunk staleness in popular NLP benchmarks due to memorization in LLMs (GPT4, GPT3.5-turbo), open-sourced library, contributed to writing research paper accepted at ICLR 2024 (contributed talk). Code - https://github.com/abacusai/to-the-cutoff
Jan 2023 - July 2023
Graduate Research Assistant

Robotics Institute, Carnegie Mellon University
- Led a team of 8 graduate researchers in ML research at the Biorobotics Lab to enhance the generalization and robustness of models for e-waste recycling. Focus areas included robust segmentation, out-of-distribution detection, and reinforcement learning.
- Invented a novel area-relaxation-based loss function that improved state-of-the-art segmentation on X-ray images of e-waste materials, increasing IoU by 15% and boosting small object detection accuracy by 80%.
- Developed and deployed a real-time distribution shift detection algorithm, achieving 99% accuracy in out-of-distribution detection for e-waste material classification.
Feb 2021 - Feb 2022
Research Intern

SketchX Lab, University of Surrey
- Invented a novel active learning algorithm for fine-grained cross-modal instance-level retrieval tasks,
- Increased mean top1 accuracy by 5% over state-of-the-art technique, published a first-authored paper to a top-tier conference (BMVC).

Education

2023
Master of Computational Data Science

Carnegie Mellon University
- Select Courses
  - Large Language Models
  - Multimodal Machine Learning
  - Deep Reinforcement Learning
  - Introdution to Deep Learning
  - Introduction to Machine Learning (PhD)
  - Cloud Computing
  - Foundations of Computational Data Science
- Academic Services
  - Reviewer for EMNLP 2023
  - Core Committee Member, LTI Computing Insfrastructure
2020
Bachelor of Computer Science and Engineering

Vellore Institute of Technology, Vellore
- Select Courses
  - Machine Learning
  - Artificial Intelligence
  - Image Processing
  - Neural Networks and Fuzzy Logic
  - Parallel and Distributed Computing
  - Virualization
  - Data Structures and Algorithms
  - Operating Systems
  - Databases
  - Networks and Communications
  - Calculus
  - Linear Algebra
  - Statics and Probability
  - Discrete Mathematics and Graph Theory
  - Theory of Computation and Compiler Design
- Extracurricular Experiences
  - Researcher at ACM-VIT
  - Core Committee Member, SEDS-VIT
  - Founding Member at Team Qubits
- Achievements
  - G. D. Naidu Young Scientist Award, 2020
  - Sir. M. Visvesaraya Award, 2019
  - Secured 1st position in 7 national hackathons
  - Ranked in Top 3 teams at 3 national hackathons
  - Secured 2 patents

Open Source Projects

2020
AI on the Beach
- Developed a high-performance library for animating millions of geospatial pings using Dask and DataShader, enabling rapid visualization.
- Analyzed shark movement patterns with geospatial intelligence and machine learning, identifying potential attack sites in advance.

Honors and Awards

2020
- G. D. Naidu Young Scientist Award
2019
- Sir. M. Visvesaraya Award

Other Interests

Hobbies: Reading, Music Composition, Hiking, Travelling

General Information

Work Experience

Senior Machine Learning Engineer

J.P Morgan Chase & Co.

Senior Machine Learning Engineer

Morgan Stanley

Data Scientist Intern

Locale.AI

Machine Learning and Web Development Intern (Part-Time)

S4S Technologies

Machine Learning Intern

Sigmaways Inc

Software Engineering Intern

Nova (P & D)

Research Experience

Research Scientist Intern

Abacus.AI

Graduate Research Assistant

Robotics Institute, Carnegie Mellon University

Research Intern

SketchX Lab, University of Surrey

Education

Master of Computational Data Science

Carnegie Mellon University

Bachelor of Computer Science and Engineering

Vellore Institute of Technology, Vellore

Open Source Projects

AI on the Beach

Honors and Awards

Other Interests