+
Skip to content
View aryanj10's full-sized avatar

Highlights

  • Pro

Block or report aryanj10

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aryanj10/README.md
Typing SVG

Aryan's Banner Azure DS Azure AI Profile Views

Building systems that remove manual work • LLMs, RAG, Agents • Open to collaboration

💫 About Me

I build AI systems that turn messy docs and repetitive workflows into clean data and automated actions. From payroll parsing at scale to RAG pipelines and inference servers, I ship end-to-end solutions that save time, cut costs, and actually get used.

    🎯 What I Do

    • 🧠 Document AI for PDFs/RTFs/Excels (payroll, statements, reports)
    • 🤖 Agentic automation with Browser/Playwright + LLMs
    • 🔎 RAG systems, vector search, re-ranking, and evaluation
    • 🚀 ML deployment with Triton, FastAPI, Docker, GitHub Actions
Coding

🏆 Impact Highlights

Savings

$20K+

Labor Saved

End-to-end payroll document automation used by real clients

Docs

1,000+

Docs Parsed

Robust parsing of PDFs/RTFs → schema-validated JSON

Stores

25M+

Rows Analyzed

P&L analytics, EBITDA drivers, and forecasting dashboards

Deploy

Prod

Deployments

Triton + FastAPI inference services with CI/CD


🌐 Connect With Me

🧠 My Tech Stack

💻 Languages & Frameworks

Python R SQL JavaScript TypeScript C++ FastAPI Dash Streamlit

🤖 AI & ML

PyTorch TensorFlow scikit-learn HuggingFace LangChain FAISS RAG OpenAI vLLM XGBoost

🛠 Infra, DevOps & Cloud

Triton Docker Linux GitHub Actions AWS Azure PostgreSQL

🧠 Learning & Exploring

Agentic AI Retrieval Evaluation Geospatial AI Self-hosted LLMs

🎯 Featured Projects

Payroll LLM Extractor

Payroll


  • Parsed 1,000+ payroll PDFs/RTFs → structured JSON
  • Auto-uploads to Accountant's World; saved $20K+ labor
  • Modular earnings/tax mapping + review UI
CW-RIS

Wind


  • Overlays wind hazards with child population exposure
  • Top-5 vulnerability zones + trend time-series
  • Built for UN Hackathon challenge
Triton Deployment

Gears


  • vLLM + Python backend preprocessors
  • Batch inference (100k images) pipeline
  • Metrics + GitHub Actions CI
PriceCompare Agent

Search


  • Structured JSON outputs with schema validation
  • Amazon vs Walmart scraping + normalization
Philly Crime

Crime


  • Public dataset (2006–2025) with dashboards
  • Homicide nowcast + cluster analysis
Resume Editor

Memo


  • Upload JSON + LaTeX → export polished PDF
  • Supports per-user storage + versions

🔬 Current Research & Explorations

Reasoning

RAG

Agents

Geo

⚡ Development Metrics


📝 Recent Activity Highlights

🏆 GitHub Achievements

GitHub Trophies

💡 Open to collaborating on impactful AI projects that remove repetitive work.


Last updated: September 2025

Pinned Loading

  1. Payroll-LLM-Extractor Payroll-LLM-Extractor Public

    This repository contains a production-grade payroll automation system that extracts, parses, and uploads payroll data using LLMs (Google Gemini) and browser automation (Playwright)

    Python 2

  2. UN-Hackathon-Child-Wind-Risk-Intelligence-System UN-Hackathon-Child-Wind-Risk-Intelligence-System Public

    CW-RIS is a child-centered wind risk analysis app that overlays real-time ECMWF ERA5 wind forecast data with child population exposure (e.g., from WorldPop), school density, and hospital accessibil…

    Python

  3. resume-editor-generate resume-editor-generate Public

    Customizable resume builder with Streamlit, LaTeX, and JSON. Edit content, preview live PDFs, generate professional resumes, and track changes with timestamped logs and diffs.

    TeX

  4. Triton-Implementation Triton-Implementation Public

    This repo walks through deploying an image classification pipeline using an ensemble model in NVIDIA Triton Inference Server. The architecture offloads preprocessing to a Python backend and classif…

    Jupyter Notebook

  5. XAI-Gradcam-Twin-System XAI-Gradcam-Twin-System Public

    Explainable AI project using ResNet-18 to classify real vs. diffusion-generated cat images with Grad-CAM heatmaps and Twin System nearest-neighbor explanations for interpretability.

    Jupyter Notebook

  6. Browser-Use-Price-Comp-Bot Browser-Use-Price-Comp-Bot Public

    AI-powered browser automation tool to compare Amazon and Walmart prices. Extracts product details, avoids sponsored listings, and returns structured JSON highlighting the cheaper source with differ…

    Python 1

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载