@techreport{TR-IC-PFG-24-38, number = {IC-PFG-24-38}, author = {Gabriela Bittencourt and Islene Calciolari {Garcia}}, title = {{Enhancing Linux Kernel Test Result Analysis: Automated Log Clustering in the KernelCI Database}}, month = {December}, year = {2024}, institution = {Institute of Computing, University of Campinas}, note = {In English, 20 pages. \par\selectlanguage{english}\textbf{Abstract} The Linux kernel is one of the largest collaborative efforts of software development in the world, powering a large majority of the infrastructure that runs computing workloads of all scales -- from embedded systems to HPC clusters. As such, improving the testing ecosystem for the Linux kernel is critical to ensure the longevity of the project. The KernelCI project is a recent initiative that looks to provide a unified testing infrastructure for all the kernel subsystems; this work aims to improve the automatic evaluation and labeling of test results for the kernel in the context of the KernelCI project, through the use of modern clustering and data aggregation techniques; in particular, we propose a frequency-based algorithm for filtering and labeling logs from kernel tests as a way to facilitate their analysis by kernel maintainers, greatly improving the efficiency of the review process. } }