Skip to content
View thunderock's full-sized avatar
😎
Active!!!!
😎
Active!!!!

Block or report thunderock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
thunderock/README.md

Hey there! πŸ‘‹ I'm Ashutosh Tiwari

LinkedIn Facebook Twitter Kaggle Resume

πŸš€ About Me

Senior ML Engineer specializing in large-scale inference platforms, foundation-model training frameworks, and MLOps infrastructure for generative AI. Currently building ML platform at Adobe Firefly for generative AI workloads over billions of images and videos. Published researcher in Graph Neural Networks and bias mitigation.

Education πŸŽ“

  • MS in Computational Data Science β€” Indiana University, Bloomington (2023) β€’ GPA: 3.87/4.0
  • BS in Computer Science β€” National Institute of Technology, Patna (2015) β€’ GPA: 8.32/10.0

Current Work πŸ’Ό

  • Senior Machine Learning Engineer (ML Platform & Frameworks) @ Adobe Firefly (Jul 2024 - Present)
    • Designed and led the org's online + offline inference systems β€” vLLM + Ray for LLMs (Falcon-40B, Llama 2 70B); Ray Data + PyTorch Lightning offline inference hitting 3–8K images/sec on 32 GPUs (10–50Γ— faster image loading).
    • Leading development of a PyTorch foundation-model training framework adopted across the org β€” plugin-based FSDP/FSDP2 strategies, Flash Attention 2/3, and context/distributed attention for DiT text-to-image and text-to-video training.
    • Co-built the underlying Kubernetes platform: plugin pipelines (KEDA-autoscaled on SQS), BuildKit/ECR image builds, a Rust control plane, and an MCP interface so AI agents can drive build/deploy/inference end-to-end.
    • Built the org's first data quality framework on Cerberus, used across enrichment pipelines to validate the correctness of feature-generation outputs.

Previous Experience

  • Senior Software Engineer (Data Platform) @ EvolutionIQ (2023-2024) β€” Data and model-training pipelines at the intersection of generative AI, fintech & healthcare; fairness/bias evaluation framework.
  • Software Dev Engineer II (ML Platform) @ Swiggy (2019-2021) β€” Online inference platform serving 18M+ MTU; feature store (4Bn rows, 10K QPS); founding member of forecasting & correlation platform; DAQ collecting 15M rows/day.
  • Software Dev Engineer (Search Relevance) @ Flipkart (2017-2019) β€” Search intent models (CRF/NN); Flipkart's first automated ML training & deployment workflow (Luigi β†’ Airflow); Hackday 9 Winner.
  • Software Engineer @ Groupon, NetSpeed Systems (2015-2017) β€” Backend engineering & Network-on-Chip modules (Polarity-based Arbitration, Multi-Cast Filtering).

Research πŸ“„

  • First Author β€” "Biased Contrastive Learning debiases Graph Neural Networks" (NetSci 2023, IC2S2 2023)

πŸ› οΈ Tech Stack

Inference & Serving: vLLM β€’ Ray β€’ Ray Data β€’ Ray Serve β€’ PyTorch Lightning β€’ TensorRT-LLM β€’ Triton Inference Server

Training Frameworks: PyTorch β€’ FSDP / FSDP2 β€’ Flash Attention 2/3 β€’ Diffusion Transformers (DiT) β€’ Context / Distributed Attention

ML/AI: PyTorch Geometric β€’ TensorFlow β€’ Scikit-learn β€’ LLMs β€’ GNNs β€’ NLP β€’ Computer Vision

Data Engineering: Apache Spark β€’ Apache Kafka β€’ Apache Flink β€’ Hadoop HDFS β€’ Airflow β€’ Feature Stores β€’ Cerberus

Platform & Infra: Kubernetes β€’ KEDA β€’ BuildKit β€’ Docker β€’ AWS (S3, SQS, ECR, SageMaker, DynamoDB) β€’ Model Context Protocol (MCP)

Languages: Python β€’ Rust β€’ Scala β€’ C++ β€’ SQL β€’ Java

🌐 Portfolio: thunderock.github.io

πŸ“Š GitHub Analytics

Pinned Loading

  1. graph_ml graph_ml Public

    sklearn for machine learning on graphs

    Python 2

  2. BiasNet BiasNet Public

    Reinforcement Learning for Street Fighter using GNNs

    HTML

  3. DeepFoodie DeepFoodie Public archive

    Deep Learning Systems Project Files

    Jupyter Notebook

  4. residual2vec_ residual2vec_ Public

    Jupyter Notebook

  5. rollout rollout Public

    A high-performance, multi-node reinforcement-learning framework for large language models. Written in Rust. Pluggable in Python.

    Rust

  6. vector vector Public

    fast terminal emulator

    Rust