Machine Learning Research Engineer · Independent Researcher · Founder, DeepBrain Labs
Building systems at the intersection of neuromorphic computation, efficient sequence modeling, and adaptive decision-making.
I work on methods for efficient, adaptive, and interpretable machine learning, with a particular focus on where biological computation meets modern deep learning. My recent work spans neuromorphic sequence models, stable long-context attention, interpretable clinical ML, and multimodal sensing for real-world systems. I publish as an independent researcher and run DeepBrain Labs, a small effort building adaptive cognitive systems.
- Neuromorphic and spiking architectures: energy-efficient inference via spike-based computation and biologically motivated learning rules
- Attention mechanisms and long-context modeling: stable associative memory, linear-time recurrent formulations, principled memory management
- Reinforcement learning and sequential decision-making: return-conditioned sequence models, adaptive control, and agentic pipelines
- Multimodal and applied ML: fusing visual and structured signals; interpretable clinical prediction with SHAP-grounded analysis
- Systems-level efficiency: JAX and PyTorch implementations tuned for research-to-deployment fidelity
Preprints on arXiv. Click a badge to open the paper.
Abstract highlights
Variational Linear Attention (2605.11196): Linear attention reduces softmax attention's quadratic cost to O(T), but its memory state grows as O(T) in Frobenius norm, causing progressive interference between stored associations. VLA reframes the memory update as an online regularised least-squares problem with an adaptive penalty matrix maintained via the Sherman-Morrison rank-1 formula. Provably stable recurrence: Jacobian spectral norm equals exactly 1 for all sequence lengths and head dimensions.
Diabetes Detection Framework (2605.13464): A reproducible three-stage ML pipeline covering binary detection (SVM-RBF / LR achieving ROC-AUC 0.825), silhouette-validated subtype clustering without ground-truth labels, and statistical analysis of the Ohio Longitudinal Cognitive Dataset (n=373) revealing a significant glycaemic-cognitive association (rho=0.208, p<0.0001) surviving Holm correction. SHAP identifies Glucose, BMI, and Age as dominant biomarkers.
Spiking Decision Transformers (2508.21505): Return-conditioned sequence modeling with Leaky Integrate-and-Fire neurons in each self-attention block, trained end-to-end via surrogate gradients. Matches or exceeds standard DT performance on CartPole-v1, MountainCar-v0, Acrobot-v1, and Pendulum-v1 while emitting fewer than ten spikes per decision — suggesting over four orders-of-magnitude reduction in per-inference energy.
AgroSense (2509.01344): Multimodal framework integrating visual and tabular signals for crop and soil analysis. Designed for precision agriculture workflows where heterogeneous sensor streams must be fused reliably across conditions.
| Repository | Description |
|---|---|
| neuromorphic_decision_transformer | Code for Spiking Decision Transformers (arXiv:2508.21505). Spike-based self-attention with surrogate gradient training, LIF neurons, and three-factor plasticity. |
| AgroSense | Multimodal vision + tabular fusion for precision agriculture (arXiv:2509.01344). |
| variational-linear-attention | Code for Variational Linear Attention (arXiv:2605.11196). Sherman-Morrison rank-1 memory update with provably unit Jacobian spectral norm. |
| diabetes-type-prediction | Code for the three-stage diabetes ML framework (arXiv:2605.13464). Detection, subtype clustering, and cognitive-metabolic analysis with SHAP explainability. |
| Repository | Description |
|---|---|
| SmartTraffic-RL | Gym-style RL environment for adaptive urban traffic signal control with macroscopic flow models and optional SUMO integration. |
| AgenticCyberOps | Autonomous pipeline for security analysis, penetration testing, and threat triage using LLM-backed agentic orchestration. |
56+ repositories total, including paper re-implementations, RL experiments, generative model prototypes, and systems utilities.
Languages
Frameworks
Infrastructure
Research domains: Transformers · Spiking Neural Networks · Reinforcement Learning · Multimodal Fusion · RAG Systems · Linear Attention · Interpretable ML · Neuromorphic Computing
DeepBrain Labs is a small independent research effort focused on building adaptive cognitive systems and computational models of intelligence. Current directions include associative memory architectures, neuromorphic control, and learned world models for planning under uncertainty.


