Initiative Aims to Enable Ethical Coding LLMs
IEEE Spectrum
Nonprofit Software Heritage has launched the CodeCommons project with the goal of creating the largest repository of ethically sourced code for training AI models. CodeCommons will focus on developing a unified data platform that gives researchers access to pre-cleaned code collections with license information, links to related research papers, and other metadata. […]