Hi, I’m Aflah, a research software engineer at the Max Planck Institute for Software Systems. My work centers on deepening our understanding of large language models (LLMs) and rigorously evaluating their capabilities. I’m also passionate about the systems side of LLMs, with hands-on experience in large-scale pretraining and inference. In the past, I’ve contributed to projects targeting hate speech reduction and other NLP applications for social good.
Open to roles in research, research engineering, or backend engineeringJun 2026 📝 Position: Don't Just "Fix it in Post": A Science of AI Must Study Learning Dynamics accepted as an oral at ICML 2026 and accepted at the Mechanistic Interpretability Workshop @ ICML 2026.
Jun 2026 💼 Started as a part-time Research Scientist at EleutherAI.
Apr 2026 📝 Fine-tuning vs. In-context Learning in Large Language Models: A Formal Language Learning Perspective accepted as an oral at ACL 2026.
Apr 2026 ✈️ Catch me in Rio de Janeiro, Brazil for ICLR 2026.
Mar 2026 📄 Fractional Rotation, Full Potential? Investigating Performance and Convergence of Partial RoPE released as a preprint.
Working under Dr Krishna Gummadi to explore different aspects of LLMs. Some areas we've explored/are exploring are
Currently working on the Multilingual Natural Instructions project to build a massive instruction tuning corpus for Hindi. Previously worked on -
I've worked on a variety of projects, from hate speech normalization to designing recommendations for fine-tuning improved hate speech detectors. I also led the QUENCH project, a benchmark aimed at evaluating advanced reasoning abilities in large language models, with a particular emphasis on Indic contexts.
Worked in the Finance, Planning & Analysis Engineering division towards revamping the central hub of the department. Also built POCs based on user feedback to improve the search and access experience on the webapp. Also recieved a return offer to join full time as an Analyst.
Worked with Matthew Watson & Chen Qian towards adding support for data augmentation layers to KerasNLP a library under the Keras/TensorFlow Ecosystem which aims to build industry oriented NLP Solutions. I also contributed to several bug fixes and other utilities such as tokenizers and transformer encoder & decoder.
ICLR 2026 - The Fourteenth International Conference on Learning Representations
IASEAI 2026 - The International Association for Safe & Ethical AI (Non-Archival)
(An earlier version of this work was presented at R2-FM, ICML 2025)
[Oral] ICLR 2026 - The Fourteenth International Conference on Learning Representations
Data-FM @ ICLR 2026 - Workshop on Navigating and Addressing Data Problems for Foundation Models (Non-Archival)
ICLR 2026 - The Fourteenth International Conference on Learning Representations
IASEAI 2026 - The International Association for Safe & Ethical AI (Non-Archival)
(An earlier version of this work was presented at MemFM, ICML 2025)
IASEAI 2026 - The International Association for Safe & Ethical AI (Non-Archival)
[Oral] ICML 2026 (Position Paper Track) - Forty-Third International Conference on Machine Learning
Mechanistic Interpretability Workshop @ ICML 2026 (Non-Archival)
[Oral] ACL 2026 - The 64th Annual Meeting of the Association for Computational Linguistics
Under Review
EMNLP 2025 System Demonstrations - The 2025 Conference on Empirical Methods in Natural Language Processing
ICLR 2025 - The Thirteenth International Conference on Learning Representations
WSDM 2025 - Proceedings of the 18th ACM International Conference on Web Search and Data Mining
COLING 2025 - Proceedings of the 31st International Conference on Computational Linguistics
R2-FM @ ICML 2025 - Workshop on Reliable and Responsible Foundation Models (Non-Archival)
MemFM @ ICML 2025 - The Impact of Memorization on Trustworthy Foundation Models (Non-Archival)
MemFM @ ICML 2025 - The Impact of Memorization on Trustworthy Foundation Models (Non-Archival)
The Second Tiny Papers Track at ICLR 2024
EACL 2024 - Findings of the Association for Computational Linguistics
FIRE 2023 - Proceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation
In Working Notes of FIRE 2023 - Forum for Information Retrieval Evaluation
[Oral] ICML 2023 - The Fortieth International Conference on Machine Learning
The First Tiny Papers Track at ICLR 2023
The First Tiny Papers Track at ICLR 2023
KDD 2022 - Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (Applied Data Science Track)