Skip to content
View aksh-ay06's full-sized avatar

Highlights

  • Pro

Block or report aksh-ay06

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
aksh-ay06/README.md

Akshay Patel

Data Scientist & ML Engineer

I build production-focused machine learning systems with strengths in modeling, MLOps, LLM/RAG pipelines, and data engineering. My work emphasizes clean architecture, reliable data pipelines, and measurable impact.


Summary

  • Graduate Research Assistant at WVU leading energy analytics, automation, and ML-driven decision support tools.
  • Strong background in applied machine learning, LLM-based retrieval systems, and Python software engineering.
  • Skilled at turning ambiguous business problems into scalable, maintainable solutions.

Technical Skills

Languages: Python, SQL, TypeScript, JavaScript
ML/AI: XGBoost, CatBoost, Random Forest, Logistic Regression, SVM, Optuna, SMOTE, SHAP
LLM/RAG: FAISS, pgvector, Docling, embedding pipelines, retrieval optimization
MLOps: Docker, FastAPI, MLflow, GitHub Actions, Streamlit, Poetry, uv
Data Engineering: ETL processes, PostgreSQL, Elastic Stack, data modeling
Frontend (for tools): React, shadcn/ui, TailwindCSS, DataTables.net


Selected Projects

P2RAG — Technical Document Retrieval System

Local, privacy-preserving RAG pipeline for energy-audit PDFs using Docling, FAISS, pgvector, FastAPI, and Streamlit.

RAGvix — Research Literature Assistant

Modular ingestion → embedding → indexing → retrieval workflow using FAISS and local LLMs.

NIST Log-to-Control ML Scanner (Signed a NDA for it)

Python + Elastic pipeline classifying log events into NIST 800-53 controls, with FastAPI backend and a lightweight React dashboard.

BRFSS 2023 Cancer Prediction

End-to-end ML workflow: preprocessing, feature engineering, SMOTE, Optuna tuning, SHAP explainability, and reporting.


Interests

  • Machine Learning & MLOps
  • LLM/RAG systems and AI agents
  • Data engineering for analytics and automation
  • Applied ML in energy, manufacturing, and enterprise AI

Contact


“You don't need to be a genius to change the world. Just be consistently curious.”

Pinned Loading

  1. Automated-Defect-Detection-System-for-Quality-Assurance-in-Manufacturing Automated-Defect-Detection-System-for-Quality-Assurance-in-Manufacturing Public

    Jupyter Notebook

  2. Bank-Credit-Card-Default-Predictions Bank-Credit-Card-Default-Predictions Public

    This repository covers an end-to-end machine learning pipeline, from data ingestion to model deployment, for credit card default prediction. Steps include data cleaning, EDA, feature engineering (d…

    Jupyter Notebook

  3. Operational-Efficiency-Analysis-Machine-Learning-for-Heating-Tunnel-Optimization Operational-Efficiency-Analysis-Machine-Learning-for-Heating-Tunnel-Optimization Public

    Jupyter Notebook

  4. Autonomous-Mobile-Robot-Path-Planner Autonomous-Mobile-Robot-Path-Planner Public

    A comprehensive Python package for autonomous mobile robot (AMR) path planning with dynamic obstacle avoidance and real-time simulation capabilities.

    Python 1