Latest

News

Paul E. Torgersen Graduate Research Excellence Award finalist

Selected as a PhD finalist for Virginia Tech’s Paul E. Torgersen Graduate Student Research Excellence Award. See the awardees list for more information.

Metadata-aware RAG work accepted at ECIR 2026

Utilizing Metadata for Better Retrieval-Augmented Generation was accepted to the 48th European Conference on Information Retrieval. Read the paper here.

Product provenance verification paper accepted at AAAI 2026

Research on data valuation for product provenance verification was accepted to the AAAI Conference on Artificial Intelligence. Read the paper here.

Best Paper at IEEE BigData 2024

LLM Augmentations to Support Analytical Reasoning over Multiple Documents received the Best Paper Award.

Impact Highlights

Research & Technical Expertise

LLMs & Generative AI: Large Language Models, Prompt Engineering, Retrieval-Augmented Generation, Model Context Protocol (MCP), Long-context Reasoning, LLM Evaluation & Quality Assessment, Reward Modeling, Synthetic Data Generation, Quantization, Model Serving & Deployment.
Model Adaptation & Learning: Fine-tuning, SFT, RLHF, PEFT, LoRA/QLoRA, Representation Learning, Recommender Systems, Supervised Learning, Feature Engineering, Explainability, Graph-based Modeling, Transfer Learning, Data Valuation, Uncertainty-aware Learning.
Systems & Applications: Search and Retrieval, Embeddings & Vector Databases, Information Extraction, Structured Data Reasoning, Forecasting, Spatiotemporal Modeling, Data Engineering, Evaluation Pipelines, Human-in-the-loop Systems, Decision-support Workflows.
Frameworks & Tools: PyTorch, Transformers, PEFT, TRL, LangChain, OpenAI, Claude, Gemini, Elasticsearch, FastAPI, AWS SageMaker, AWS EC2, GCP Compute Engine, Azure, scikit-learn, spaCy, NetworkX, Pandas, NumPy, Streamlit, Docker, Git.
Programming: Python, Java, C++, MATLAB, R, SQL/NoSQL.

Selected Projects

Selected work highlighting system design, evaluation, deployment context, and measurable outcomes.

RAGMate — Metadata-Aware Retrieval for RAG

  • Designed metadata-aware dual-encoder retrieval methods that incorporate structured disambiguation signals into embedding and ranking objectives.
  • Improved retrieval performance by 70% on average over text-only baselines in retrieval-augmented generation settings.
  • Developed in collaboration with Vectorize.io, with attention to practical retrieval concerns including schema ambiguity, metadata use, source grounding, ranking quality, and evaluation.

Speculatores: Memory-Augmented RAG for Multi-Document Reasoning

  • Built memory-augmented architectures for LLM-based reasoning across multiple documents, using persistent context representations and cross-document evidence linking.
  • Combined retrieval, structured memory, and generation to support analytical reasoning over long, distributed information spaces.
  • Achieved 25% relative improvement in classification performance and 50% improvement in generation quality for multi-document analytical reasoning tasks.

MemoryDrift: Benchmarking Long-Context Reliability in LLMs

  • Developed a graph-based benchmark to evaluate whether LLMs can maintain stable structured memory as context length increases.
  • Studied how models induce, update, and preserve graph-like representations over long contexts, exposing reliability failures that are difficult to detect with standard long-context tests.
  • Revealed memory-drift onset at only 1–6% of advertised context capacity, showing that nominal context length can substantially overstate reliable reasoning capacity.

DataWeave — Human–LLM System for Structured Data Analysis

  • Built an interactive human–LLM system for exploratory structured data analysis and analytical reasoning over complex datasets.
  • Designed workflows for schema understanding, user-guided analysis, grounded generation, and iterative refinement.
  • Enabled a human-in-the-loop agentic structured data analysis system with 86% schema-alignment reliability across hundreds of newsroom sessions.
  • Developed in collaboration with The Chronicle of Higher Education as a deployed system and research platform for studying AI-assisted data analysis.

Newsroom LLM Systems — The Washington Post

  • Fine-tuned and evaluated LLMs on AWS EC2 and SageMaker for newsroom applications including subheadline generation, summarization, and question answering.
  • Contributed to the early development of the Ask the Post chatbot, advising on RAG design, grounding, and evaluation in collaboration with newsroom stakeholders.
  • Presented LLM evaluation, fine-tuning, and model development work to engineering leadership and newsroom audiences.
  • Focused on editorial quality, reliability, answer grounding, and model behavior in newsroom use cases.

Product Provenance Verification — ML for Supply-Chain Traceability

  • Led development of ML systems for product provenance verification, combining probabilistic modeling, data valuation, and spatiotemporal reasoning for compliance workflows.
  • Supported assessment of 59 wood products, identification of 260+ tons of allegedly illegal timber, and 9+ enforcement investigations.
  • Developed in collaboration with World Forest ID on real-world traceability problems involving noisy data, uncertain labels, and regulatory decision-support needs.
  • Contributed to research on optimizing product provenance verification using data valuation methods.

Migration Forecasting — Policy-Relevant Predictive ML

  • Built large-scale forecasting pipelines for migration patterns and land border encounters using applied ML, statistical modeling, and spatiotemporal data.
  • Developed real-time and policy-facing predictive workflows designed for decision support under uncertainty.
  • Worked on projects funded by government and research partners, emphasizing scalable pipelines, evaluation, and decision-support relevance.

Scientific Information Extraction — Domain-Adapted Transformers

  • Designed a full-text scientific information extraction system using domain-adapted transformer models and task-specific representation learning objectives.
  • Achieved 11% improvement over prior baselines and 26% higher accuracy in salient task and method extraction from scholarly documents.
  • Worked with collaborators at CSET, Georgetown University, to extract structured signals from scholarly documents for science-of-science analysis.

Data-Centric LLM Fine-Tuning & Synthetic Table Generation

  • Introduced permutation-aware tabular data generation methods to reduce invalid synthetic data generation by LLMs.
  • Reduced synthetic table rule violations by 70% through structured generation constraints and evaluation.
  • Contributed to Fisher information-guided regularization for language model fine-tuning, improving generalization in low-data regimes across 9/10 GLUE tasks with no added computational overhead.

Software, Systems & Open Source

Publications

Peer-reviewed papers, manuscripts under review, and extended abstracts. See Google Scholar for citation details and updates.

LLMs, RAG, Structured Reasoning & Information Extraction

Applied ML, Forecasting & Decision Support

Experience

Virginia Tech Transportation Institute — AI Researcher 2026–Present

  • Build data-grounding layers for heterogeneous traffic and mobility data by standardizing schemas, metadata, and data-access patterns.
  • Develop LLM-based traffic analysis tools that answer natural-language questions through grounded querying, analysis, and summaries.
  • Create benchmarks to evaluate and improve the reliability of AI tools over complex traffic datasets and external context.

The Washington Post — Machine Learning Intern 2023

  • Fine-tuned and evaluated LLMs on AWS EC2 and SageMaker to explore newsroom applications including subheadline generation, summarization, and question answering.
  • Contributed to early development of the Ask the Post chatbot, advising on RAG design and evaluation in collaboration with newsroom stakeholders.
  • Presented LLM evaluation, fine-tuning, and model development for newsroom applications to engineering leadership and newsroom audiences.

Virginia Tech, Sanghani Center for AI — Graduate Research Assistant 2019–2025

Built LLM, applied ML, NLP, retrieval, and forecasting systems for data-aware reasoning and decision support across funded research projects. Selected work:

  • Developed memory-augmented architectures for LLM-based multi-document reasoning with persistent context representation and cross-document evidence linking.
  • Proposed metadata-aware retrieval methods for RAG, incorporating structured disambiguation signals into embedding and ranking objectives in collaboration with Vectorize.io.
  • Built human-in-the-loop agentic data analysis workflows for real-world newsroom use cases in collaboration with The Chronicle of Higher Education.
  • Introduced permutation-aware tabular generation and Fisher information-guided regularization for LLM fine-tuning to improve data efficiency and generalization.
  • Designed full-text scientific information extraction systems using domain-adapted transformer models and task-specific representation learning.
  • Led development of ML systems for product provenance verification and large-scale migration forecasting, deploying regulatory and real-time pipelines used in policy and compliance settings, including provenance work in collaboration with World Forest ID.
  • Contributed to funded research proposals for projects supported by DARPA, NSF, and external partners, including technical section writing.

Eastern University Bangladesh — Lecturer 2018

  • Led advanced programming and digital logic design courses, managed labs, exams, student supervision, and administrative responsibilities.
  • Served on academic and administrative committees and contributed to departmental teaching and curriculum activities.

Talks & Presentations

Teaching & Communication

Awards & Service

Education

Bio

I am an AI/ML researcher and applied ML engineer working on language models, retrieval, representation learning, information extraction, forecasting, recommender systems, and data-centric learning. My work centers on building reliable systems that reason over structured and unstructured data, with attention to grounding, evaluation, adaptation, and practical deployment.

I recently completed my PhD in Computer Science at Virginia Tech, where I developed data-aware AI systems for reasoning and decision-making over complex real-world data. I build systems that operate over multi-source, structured, unstructured, and noisy information, with an emphasis on grounding, uncertainty, evaluation, and practical deployment.

I have worked on AI and ML systems for generative AI, search, newsroom applications, supply-chain traceability, agricultural product verification, migration forecasting, scientific information extraction, and structured-data analysis. Across these projects, I focus on system design, robust evaluation, scalable pipelines, and translating research ideas into useful tools for analysts, researchers, policy teams, and domain experts.