Rahmani, H. A., Siro, C., Aliannejadi, M., Craswell, N., Clarke, C. L. A., Faggioli, G., Mitra, B., Thomas, P., & Yilmaz, E. (2024). LLM4Eval: Large Language Model for Evaluation in IR. In SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 14-18, 2024, Washington, DC, USA (pp. 3040-3043). Association for Computing Machinery. https://doi.org/10.1145/3626772.3657992
Siro, C., Aliannejadi, M., & de Rijke, M. (2024). Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems. In K. Duh, H. Gomez, & S. Bethard (Eds.), Findings of the Association for Computational Linguistics: NAACL 2024: June 16-21, 2024 (pp. 1258-1273). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.findings-naacl.80
Siro, C., Aliannejadi, M., & de Rijke, M. (2024). Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs. In SIGIR '24: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 14-18, 2024, Washington, DC, USA (pp. 1952-1962). Association for Computing Machinery. https://doi.org/10.1145/3626772.3657712
Siro, C., Aliannejadi, M., & de Rijke, M. (2024). Understanding and Predicting User Satisfaction with Conversational Recommender Systems. ACM Transactions on Information Systems, 42(2), Article 55. https://doi.org/10.1145/3624989
Yuan, Y., Siro, C., Aliannejadi, M., de Rijke, M., & Lam, W. (2024). Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search. In WWW '24: Proceedings of the ACM Web Conference 2024: May 13-17, 2024, Singapore, Singapore (pp. 1474-1485). Association for Computing Machinery. https://doi.org/10.48550/arXiv.2402.07742, https://doi.org/10.1145/3589334.3645483
Siro, C., Aliannejadi, M., & de Rijke, M. (2022). Understanding User Satisfaction with Task-oriented Dialogue Systems. In SIGIR '22: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval: July 11-15, 2022, Madrid, Spain (pp. 2018-2023). Association for Computing Machinery. https://doi.org/10.1145/3477495.3531798
2022
Srivastava, A., Siro, C., Shutova, E., Jumelet, J., ter Hoeve, M., Giulianelli, M., Lewis, M., Schubert, M., Tong, X., & BIG-bench authors (2022). Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. (v2 ed.) ArXiv. https://doi.org/10.48550/arXiv.2206.04615
Siro, C. N. (2025). Rethinking the human-centered evaluation of conversational systems. [Thesis, fully internal, Universiteit van Amsterdam].