Skip to content
View MirMustafaAli's full-sized avatar

Highlights

  • Pro

Block or report MirMustafaAli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MirMustafaAli/README.md

Mir Mustafa Ali

Data Scientist & Machine Learning Engineer

I'm a Data Scientist with 4+ years of experience building scalable, production-grade AI and deep learning systems. My expertise spans across insurance, retail, and media industries, with a focus on optimizing model performance and integrating end-to-end machine learning pipelines.

πŸŽ“ Education

  • M.S. in Computer Science | University of Texas at Arlington (2023 - 2024)
  • B.E. in Information Science and Engineering | Visvesvaraya Technological University (2013 - 2018)

🌟 Core Competencies

  • Machine Learning & Deep Learning: Production-grade ML systems, BERT, GRU, LSTM
  • Distributed Computing: Parallel Processing, Multi-node Systems, GPU Programming
  • Big Data Processing: Apache Spark, Kafka, Redis, MongoDB
  • MLOps: Docker, Kubernetes, AWS, GCP, Azure
  • Languages & Frameworks: Python, C/C++, PyTorch, TensorFlow, CUDA

πŸš€ Featured Projects

COAT Optimizer for TorchAO

Python, PyTorch, Float8 Optimization(In Progress)

  • Contributed to open-source development of COAT (Compressing Optimizer States and Activations)
  • Implemented memory-efficient FP8 training utilizing dynamic range expansion
  • Enhanced float8 data type utilization during training processes
  • View Project

3D Heat Equation Solver

C, MPI, OpenMP

  • Developed a high-performance parallel solver for 3D heat equations
  • Implemented multi-node computation across X, Y, and Z dimensions
  • Conducted comprehensive scaling analysis on UTA cluster
  • Achieved significant performance improvements through parallel processing

Matrix Multiplication Optimization

C, Cache Optimization

  • Optimized single-core matrix multiplication using advanced algorithms
  • Achieved 50%+ of theoretical GFLOPS through cache optimization
  • Implemented Goto and Van de Geijn algorithm with six loop orderings

Distributed Systems Implementation

Python, Node.js, gRPC

  • Implemented various distributed consensus algorithms:
    • Paxos Algorithm Simulation
    • Raft Consensus Protocol
    • Decentralized & Distributed Locking
    • Vector Clocks & Lamport's Logical Clock
  • Created cross-language compatible distributed store
  • View Project

πŸ’Ό Professional Experience

Research Assistant | UTA Research Institute

(Sep 2023 - Jan 2024)

  • Engineered LSTM models for smartwatch analytics
  • Implemented robust fall detection systems using PyTorch Lightning and MLFlow

Senior Data Scientist | Aptus Data Labs

(Mar 2022 - Jan 2023)

  • Led a team on automating pharmaceutical report analysis via combination of heuristics with the collaboration of subject matter experts and BERT Zero-Shot Classification.
  • Reduced the report evaluation process from 1 month to 1 week.
  • Implemented dynamic PROMPT/LABEL techniques

Associate Consultant | CoffeeBeans Consulting

(Jan 2019 - Jan 2022)

  • Deployed prediction model on News Articles predicting the top 10 articles published on platform for the day with 80% accuracy.
  • Reducing load on Redis, Kafka, and MongoDB by developing FAISS vector-similarity API with a scalable design providing sub 8ms response time.
  • Leveraged SparkML clustering algorithms for segmented recommendations on News article platform which boosted the average CTR from 1.5 to 1.8.
  • Implemented real time preprocessing services using Faust( a python streaming library) which reduced preprocessing cost by 50%.
  • Consulted a title insurance organization as deep learning engineer to Train and Deployed a GRU-based NER model on Property legal docs by collaborating with domain experts, achieving 82% accuracy and reduced training time by 30% with PyTorch DDP on multi GPU system.
  • Built scalable model-serving pipelines using TorchServe

πŸ“« Connect With Me

Currently seeking opportunities in Machine Learning Engineering and Data Science roles

Popular repositories Loading

  1. Halcyon-17 Halcyon-17 Public

    A Techno-Cultural website

    HTML

  2. learnopencv learnopencv Public

    Forked from spmallick/learnopencv

    Learn OpenCV : C++ and Python Examples

    Jupyter Notebook

  3. char-rnn char-rnn Public

    Forked from karpathy/char-rnn

    Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch

    Lua

  4. TDD_Academy_Classes TDD_Academy_Classes Public

    Java

  5. cs224n-assignment cs224n-assignment Public

    Jupyter Notebook

  6. google-interview-university google-interview-university Public

    Forked from prakhar21/google-interview-university

    A complete computer science study plan to become a software engineer.