>> himanshu bhenwal

currently, i'm a machine learning engineer at PrudentBit, where i build secure document intelligence systems — PII detection & redaction, multilingual NER pipelines, and the hybrid ML + rule-based stack behind them.

previously, i was a researcher at Lossfunk working on reducing compounding rollout error and its induced distributional shift, and studying RL generalizability on benchmarks such as Atari-100K.

i love deep learning, LLMs and world models, and am always looking to build cool projects on these — so ping me on any of my socials if you'd like to collaborate on something!

now

things i'm spending my time on:

GPU programming
reinforcement learning & world models
some interpretability and ai safety stuff
teaching ML and math
running small experiments on my own time — they end up on my github

experience

machine learning engineer @ PrudentBit jan 2026 — present
architecting a secure document intelligence system for PII detection & redaction at enterprise scale — multilingual NER across structured and unstructured data (10k+ docs/day), pairing transformer models with rule-based validation for a ~10% F1 bump.
researcher intern @ Lossfunk jul 2025 — jan 2026
research on model-based RL for better generalization and lower sample complexity — prototyping world models for high-dimensional environments and chasing more stable long-horizon predictions.
machine learning engineer intern @ PrudentBit jun 2024 — jul 2025
built core modules of Immunefiles-PII, a production system for sensitive data detection, over the last 2 years of working at PrudentBit; helped ship scalable pipelines for real-world document workloads while tightening the precision-recall tradeoff on noisy data.

projects

playing MsPacMan with RL + VQ-VAE world models

end-to-end pipeline for learning compact latent policies on Ms Pac-Man: a VQ-VAE compresses raw frames into a discrete latent space, DQN agents (with/without PER) learn directly on those latents, plus value / world-model / action-mapping nets for model-based planning.

Python, PyTorch, WandB

sushi-GGUF

a minimalist framework for GGUF quantization of SDXL models: quantize Stable Diffusion to precision levels like Q4_K_S, Q5_K_S and Q8_0 using llama.cpp binaries.

Python, llama.cpp, SDXL

paper implementations

a repo collecting several research paper implementations i've worked through — mostly reproductions to actually understand the ideas end to end.

Python, PyTorch, WandB

talks & community

Transformers to Mamba — AI4Bharat, IIT Madras (jul 2024) watch here
An Introduction to Reasoning Capabilities in Language Models — IEEE BPIT (feb 2025)
chairperson, IEEE BPIT Student Branch — mentored a community of 80+ students (2021–25)
co-creator & lead instructor of IEEE BPIT's SIG on ML — semester-long classes + weekly paper reading sessions
jury member at Datadive: The Ultimate Datathon (IEEE WIE BPIT)

writing

i don't write much; most of what i become fascinated by, i share on my twitter. the longer pieces live on the blogs page.

toolbox

Python
C++
C
PyTorch
JAX
TensorFlow
Keras
scikit-learn
Transformers
LangChain
LlamaIndex
NumPy
Pandas
FastAPI
Django
Celery
PostgreSQL
Redis
Docker
GitHub Actions
AWS

elsewhere

// twitter @retr0jirachi
// github nerdlab53
// discord @himanshuhasnoenemies.
// mail retr0sushi.04@gmail.com