Skip to content
View 01PrathamS's full-sized avatar

Block or report 01PrathamS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
01PrathamS/README.md

Pratham Savaliya

Machine Learning Practitioner

Building: Helpish — meeting intelligence, solo-built, production-deployed.


Hacks

Helpish · AI Assistant for tech product demos

Layered async pipeline — audio capture → transcription → analysis → persistence, each stage swappable.
Browser recorder with mic + tab audio mixing, live transcription via Deepgram, speaker diarisation, voice profile embeddings.
Live sales coaching, cross-session memory with contradiction detection, post-call summaries that adapt per user.


Open Source

Chronicle · Self-hostable wearable AI backend · 60+ ⭐ · Active contributor

  • Added Google Drive, Dropbox, and WebSocket as first-class audio sources; co-designed plugin architecture for third-party apps
  • Built speaker diarisation + identification benchmark suite — measures accuracy, latency, speaker confusion across diverse recordings
  • Implementing MCP connector support enabling plugins to interact with external services via natural language tool calls

Deep Learning from Scratch

I implement papers to actually understand them. Each repo has unit tests validating against reference implementations.

Repo What it covers
Stable_Diffusion_from_scratch Full diffusion architecture in PyTorch
Transformers_from_scratch Attention, positional encoding, encoder-decoder
LoRA-implementation Low-rank adaptation from paper to training loop
BERT_from_scratch Masked LM + NSP pretraining
Swin_Transformer_from_scratch Shifted window attention
GPU_programming CUDA kernels, memory hierarchy, parallel patterns
SonicSpeech Deep learning architectures for speech and audio
Dillusion SOTA diffusion paper implementations
Backprop Scalar autograd from scratch → micrograd

Cross-lingual embedding alignment (English ↔ Hindi/Gujarati): Procrustes alignment on FastText embeddings — word translation and multilingual similarity search in a shared vector space.


Stack

ML & Training: PyTorch, LoRA, SFT, fine-tuning, RAG, GraphRAG, agentic systems, embeddings, eval pipelines
Speech & Audio: Deepgram, Whisper, VAD, speaker diarisation, voice-profile embeddings
Infra: FastAPI, Redis, Docker, AWS, queue-based async architectures
Low-level: CUDA, C/C++, parallel programming


More

  • Hackathon winner ×10
  • IEEE: APBTMS — computer vision for real-time bus passenger tracking and overcrowding detection
  • Cohere AI Gujarati LLM — top contributor to dataset creation and curation for a low-resource Indian language model
  • Currently exploring: concurrency-chaos — threads, async, queues, Redis, event-driven systems
  • Reading: Goodreads
  • Hack demos: YouTube

pratham.savaliya01@gmail.com · LinkedIn · helpish.xyz

Pinned Loading

  1. chronicle_ chronicle_ Public

    Forked from SimpleOpenSoftware/chronicle

    A project to provide the simplest, self hosted route to get "I want AI to help me by passively observing me"

    Python

  2. Backprop Backprop Public

    Jupyter Notebook 4

  3. Stable_Diffusion_from_scratch Stable_Diffusion_from_scratch Public

    implementation of Stable Diffusion Architecture from scratch

    Jupyter Notebook 3

  4. concurrency-chaos concurrency-chaos Public

    Experiments in concurrency — threads, async, queues, Redis, and event-driven systems, built from scratch

    Python

  5. GPU_programming GPU_programming Public

    learning and building with cuda

    Jupyter Notebook 3

  6. SonicSpeech SonicSpeech Public

    Implementaiton of deep learning architectures for speech and audio processing.

    Jupyter Notebook 1