Blog
I love to write about programming, machine learning, NLP, LLMs and things that motivates me.
Read more on Medium →Explore the architecture and training techniques for building robust question answering systems, from extractive to generative approaches.
Read MoreLearn how to efficiently scale your ML models across multiple GPUs and machines using data parallelism, model parallelism, and distributed training frameworks.
Read MoreA comprehensive guide to understanding the transformer architecture, self-attention mechanisms, and their evolution into modern large language models.
Read More