AP

Hey, I'm

Anket Patil

Full-Stack Engineer / AI Data Platform

My Story

I am a Computer Science graduate student at George Washington University, focused on full-stack systems for AI data platforms, LLM pipelines, and applied machine learning infrastructure.

At LAiSER, I work on AI systems that extract and align skill intelligence from unstructured data. I shipped an open-source Extract Module used by production organizations, and owned a Dockerized FastAPI service on AWS handling 50K+ requests/day with sub-200ms p99 latency.

Previously at EDUJO Labs, I engineered a full-stack AI-powered LMS and a course-grounded RAG chatbot. That work supported 200+ generated courses, 25K+ learner interactions, and a 4.2/5 average chatbot rating.

I like building at the boundary between product engineering and AI infrastructure: LangTrailExtract ModuleNLX PipelineHackathons. My toolkit spans Python, PyTorch, LangChain, React, TypeScript, FastAPI, Rust, Docker, Kubernetes, PostgreSQL, Redis, Kafka, FAISS, AWS, and graph-backed data systems.

Experience

Research, engineering, and education.

Jun 2025 - Present now

Research Assistant

LAiSER
  • Work at a Gates and Walmart backed research lab using AI to extract and align skill intelligence from unstructured data.
  • Shipped the open-source Extract Module, an LLM-powered pipeline that parses resumes, syllabi, and job postings into structured Knowledge, Skills, and Tasks for 5+ production organizations.
  • Built with PyTorch, Hugging Face Transformers, Sentence-Transformers, FAISS, Neo4j, vLLM, and GitLab CI/CD.
  • Architected and owned a Dockerized FastAPI service on AWS ECR/EC2 that extracts skills from credential content and handles 50K+ requests/day with sub-200ms p99 latency.
Jan 2023 - May 2024

Research Engineer

EDUJO Labs
  • Engineered a full-stack AI-powered LMS that enabled instructors to generate courses on the fly.
  • Built a RAG chatbot that answered learner questions grounded in course materials.
  • Used Next.js, Node.js, PostgreSQL, GraphQL, OpenAI GPT-3.5/4, embeddings, Pinecone, and Azure TTS.
  • Supported 200+ generated courses, 25K+ learner interactions, and a 4.2/5 average chatbot rating across 1K+ learner interactions.
Aug 2024 - May 2026

M.S. Computer Science

George Washington University
  • GPA: 3.7/4.0, Magna Cum Laude.
  • Focused on systems, AI infrastructure, and data-intensive software engineering.

Projects

AI systems, data infrastructure, and hackathon builds.

LangTrail

Reverse proxy and review layer for monitoring LLM agent systems without SDKs or code instrumentation.

  • Architected a Rust-based reverse proxy that captures AI agent traffic transparently without modifying existing codebases.
  • Designed an LLM-assisted review layer that generates critiques, suggested labels, failure signals, and confidence scores while keeping human approval as the final decision.
  • Added a human-in-the-loop pipeline that converts agent traces into training datasets for RLHF and AI agent evaluation.
  • Built with Rust, TypeScript, React, PostgreSQL, Vite, Docker, Kubernetes, and Prometheus.

NLX Job Knowledge Graph Pipeline

High-throughput job intelligence pipeline that extracts skills, knowledge, and tasks into a graph.

  • Designed and shipped a data ingestion pipeline that processes 2M job postings/day to extract and structure skills, knowledge, and tasks.
  • Scaled LLM inference to 800K unique jobs/day using vLLM with async batch workers on serverless GPU cloud.
  • Built with Python, vLLM, Llama 3.1, FAISS, Neo4j, Kafka, and Modal.

LAiSER Extract Module

Open-source LLM pipeline for turning resumes, syllabi, and job postings into structured KST data.

  • Parses unstructured text into Knowledge, Skills, and Tasks for downstream skill intelligence workflows.
  • Serves 5+ organizations in production as part of LAiSER research and deployment work.
  • Uses PyTorch, Hugging Face Transformers, Sentence-Transformers, FAISS, Neo4j, vLLM, and GitLab CI/CD.

UN Ahead of the Storm Hackathon

1st place geospatial child vulnerability system for Cyclone Amphan risk response.

  • Built a dashboard quantifying 494K+ children at risk and 666 critical facilities across Bangladesh.
  • Used Python, GeoPandas, Rasterio, and Plotly to deliver an interactive tool for UN officials.

GW George Hacks

1st place LLM-powered metadata automation system for open-source GitHub projects.

  • Built an automation workflow using Python, LangChain, and the GitHub API.
  • Generated useful repository metadata to reduce manual project maintenance work.