Michal Lukasik | Papers & Talks

REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge
Yasi Zhang, Tianyu Chen, Mingyuan Zhou, Oscar Leong, Ying Nian Wu, Michal Lukasik. In ICML, 2026.

TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge
Cheng-Han Chiang, Hung-yi Lee, Michal Lukasik. In ACL main, 2025.

Bipartite Ranking From Multiple Labels On Loss Versus Label Aggregation
Michal Lukasik, Lin Chen, Harikrishna Narasimhan, Aditya Krishna Menon, Wittawat Jitkrittum, Felix X. Yu, Sashank J. Reddi, Gang Fu, Mohammadhossein Bateni, Sanjiv Kumar. In ICML, 2025.

Better Autoregressive Regression via Regression-aware Fine-tuning
Michal Lukasik, Zhao Meng, Harikrishna Narasimhan, Yin-Wen Chang, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar. In ICLR (spotlight), 2025.

Regression Aware Inference with LLMs
Michal Lukasik, Harikrishna Narasimhan, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar. In EMNLP (findings), 2024.

What do larger image classifiers memorise?
Michal Lukasik, Vaishnavh Nagarajan, Ankit Singh Rawat, Aditya Krishna Menon, Sanjiv Kumar. In TMLR, 2024.

It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models
Lin Chen, Michal Lukasik, Wittawat Jitkrittum, Chong You, Sanjiv Kumar. In ICLR (spotlight), 2024.

Two-stage LLM Fine-tuning with Less Specialization and More Generalization
Yihan Wang, Si Si, Daliang Li, Michal Lukasik, Felix Yu, Cho-Jui Hsieh, Inderjit S Dhillon, Sanjiv Kumar. In ICLR, 2024.

ResMem: Learn what you can and memorize the rest
Zitong Yang, Michal Lukasik, Vaishnavh Nagarajan, Zonglin Li, Ankit Rawat, Manzil Zaheer, Aditya Menon, Sanjiv Kumar. In NEURIPS, 2023.

Large language models with controllable working memory
Daliang Li, Ankit Singh Rawat, Manzil Zaheer, Xin Wang, Michal Lukasik, Andreas Veit, Felix Yu, Sanjiv Kumar. In ACL (findings), 2023.

Robust distillation for worst-class performance on the interplay between teacher and student objectives
Serena Wang, Harikrishna Narasimhan, Yichen Zhou, Sara Hooker, Michal Lukasik, Aditya Krishna Menon. In UAI, 2023.

Teacher's pet: understanding and mitigating biases in distillation
Michal Lukasik, Srinadh Bhojanapalli, Aditya Krishna Menon, Sanjiv Kumar. In TMLR, 2022.

Semantic Label Smoothing for Sequence to Sequence Problems
Michal Lukasik, Himanshu Jain, Aditya Krishna Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar. In EMNLP, 2020.

Text Segmentation by Cross Segment Attention
Michal Lukasik, Boris Dadachev, Gonçalo Simões, Kishore Papineni. In EMNLP, 2020.

Does label smoothing mitigate label noise?
Michal Lukasik, Srinadh Bhojanapalli, Aditya Krishna Menon, Sanjiv Kumar. In ICML, 2020.

Scaling Graph Neural Networks with Approximate PageRank
Aleksandar Bojchevski, Johannes Klicpera, Bryan Perozzi, Amol Kapoor, Martin Blais, Benedek Rózemberczki, Michal Lukasik, Stephan Günnemann. In KDD, 2020.

Discourse-aware rumour stance classification in social media using sequential classifiers
Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik, Kalina Bontcheva, Trevor Cohn, Isabelle Augenstein. In Information Processing \& Management, 2018.

Content explorer Recommending novel entities for a document writer
Michal Lukasik, Richard Zens. In EMNLP, 2018.

Longitudinal Modeling of Social Media with Hawkes Process based on Users and Networks
P.K. Srijith, Michal Lukasik, Kalina Bontcheva and Trevor Cohn. In The IEEE/ACM International Conference on Social Networks Analysis and Mining, ASONAM, 2017.
Abstract

Computational approach to dendritic spine taxonomy and shape transition analysis
Grzegorz Bokota, Marta Magnowska, Tomasz Kusmierczyk, Michal Lukasik, Matylda Roszkowska, Dariusz Plewczynski. In Frontiers in Computational Neuroscience, 2017.
Abstract Code

Stance classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations
Arkaitz Zubiaga, Elena Kochkina, Maria Liakata, Rob Procter, Michal Lukasik. In 26th International Conference on Computational Linguistics, COLING, 2016.
Abstract

Hawkes Processes for Continuous Time Sequence Classification an Application to Rumour Stance Classification in Twitter
Michal Lukasik, P. K. Srijith, Duy Vu, Kalina Bontcheva, Arkaitz Zubiaga, Trevor Cohn. In Proceedings of the 54th annual meeting of the Association for Computational Linguistics, ACL, 2016.
Abstract Code

Metrics for Evaluation of Word-level Machine Translation Quality Estimation
Varvara Logacheva, Michal Lukasik and Lucia Specia. In Proceedings of the 54th annual meeting of the Association for Computational Linguistics, ACL, 2016.
Abstract

Convolution Kernels for Discriminative Learning from Streaming Text
Michal Lukasik, Trevor Cohn. In Proceedings of the Thirtieth AAAI Conference. AAAI, 2016.
Abstract

Classifying Tweet Level Judgements of Rumours in Social Media
Michal Lukasik, Trevor Cohn and Kalina Bontcheva. In Proceedings of Empirical Methods of Natural Language Processing, EMNLP, 2015.
Abstract Code

Modeling Tweet Arrival Times using Log-Gaussian Cox Processes
Michal Lukasik, Srijith Prabhakaran Nair Kusumam, Trevor Cohn and Kalina Bontcheva. In Proceedings of Empirical Methods of Natural Language Processing, EMNLP, 2015.
Abstract

Point process modelling of rumour dynamics in social media
Michal Lukasik, Trevor Cohn and Kalina Bontcheva. In Proceedings of the 53rd annual meeting of the Association for Computational Linguistics, ACL, 2015.
Abstract

Hierarchical, Multi-label Classification of Scholarly Publications Modifications of ML-KNN Algorithm
Michal Lukasik, Tomasz Kusmierczyk, Lukasz Bolikowski, Hung Son Nguyen. In Intelligent Tools for Building a Scientific Information Platform, 2013.
Abstract Code

Evaluation of Features for Author Name Disambiguation Using Linear Support Vector Machines
Piotr Jan Dendek, Lukasz Bolikowski, Michal Lukasik. In Document Analysis Systems, DAS, 2012.
Abstract