Muhammad Mahwiz Khalil

Building AI for languages the world forgot to teach its machines


About

I'm an AI engineer and researcher based in Karachi, Pakistan, working on models for low-resource languages, primarily Urdu and Sindhi, and on Pakistani NLP more broadly. My focus is speech synthesis, automatic speech recognition, and large language models for languages that have been systematically underserved by the field.

I work at the intersection of audio ML, LLMs, and RAG systems. I'm currently building at Proxima AI and 2DamnWav, and have published 33 models and 25 datasets on HuggingFace.

Research & Writing

Selected Work

Models

Orpheus Urdu TTS

TTS · 3B 179K ↓

Kani TTS 400M Urdu

TTS · 400M 12 ♥

TTS · 1.6B 66 ↓

Qalb 1.0 — 8B Instruct

LLM · 8B Q4 Quant

Text Gen · 350M

Urdu Wav2Vec2 94M

Checkpoint Latest

View all 33 models →

Datasets

Urdu 208h Audio

ASR · Audio 319K ↓

Pak Multilang 2025

Corpus 480K ↓

OCR · Vision 40.8K ↓

Sindhi TTS Haveli

TTS · Sindhi 3.7K ↓

Urdu Medical SFT

Synthetic Urdu V2

Synthetic 85K ↓

Urdu Nano Codec

Audio Codec 31K ↓

View all 25 datasets →

Connect

HuggingFace · GitHub · Twitter · LinkedIn · Book a Call