Alert button
Picture for Foutse Khomh

Foutse Khomh

Alert button

Jack

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Bookmark button
Alert button
Apr 18, 2024
Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller, Ram Gandikota, Agasthya Gangavarapu, Ananya Gangavarapu, James Gealy, Rajat Ghosh, James Goel, Usman Gohar, Sujata Goswami, Scott A. Hale, Wiebke Hutiri, Joseph Marvin Imperial, Surgan Jandial, Nick Judd, Felix Juefei-Xu, Foutse Khomh, Bhavya Kailkhura, Hannah Rose Kirk, Kevin Klyman, Chris Knotz, Michael Kuchnik, Shachi H. Kumar, Chris Lengerich, Bo Li, Zeyi Liao, Eileen Peters Long, Victor Lu, Yifan Mai, Priyanka Mary Mammen, Kelvin Manyeki, Sean McGregor, Virendra Mehta, Shafee Mohammed, Emanuel Moss, Lama Nachman, Dinesh Jinenhally Naganna, Amin Nikanjam, Besmira Nushi, Luis Oala, Iftach Orr, Alicia Parrish, Cigdem Patlak, William Pietri, Forough Poursabzi-Sangdeh, Eleonora Presani, Fabrizio Puletti, Paul Röttger, Saurav Sahay, Tim Santos, Nino Scherrer, Alice Schoenauer Sebag, Patrick Schramowski, Abolfazl Shahbazi, Vin Sharma, Xudong Shen, Vamsi Sistla, Leonard Tang, Davide Testuggine, Vithursan Thangarasa, Elizabeth Anne Watkins, Rebecca Weiss, Chris Welty, Tyler Wilbers, Adina Williams, Carole-Jean Wu, Poonam Yadav, Xianjun Yang, Yi Zeng, Wenhui Zhang, Fedor Zhdanov, Jiacheng Zhu, Percy Liang, Peter Mattson, Joaquin Vanschoren

Viaarxiv icon

Machine Learning Robustness: A Primer

Add code
Bookmark button
Alert button
Apr 01, 2024
Houssem Ben Braiek, Foutse Khomh

Viaarxiv icon

Bugs in Large Language Models Generated Code: An Empirical Study

Add code
Bookmark button
Alert button
Mar 18, 2024
Florian Tambon, Arghavan Moradi Dakhel, Amin Nikanjam, Foutse Khomh, Michel C. Desmarais, Giuliano Antoniol

Figure 1 for Bugs in Large Language Models Generated Code: An Empirical Study
Figure 2 for Bugs in Large Language Models Generated Code: An Empirical Study
Figure 3 for Bugs in Large Language Models Generated Code: An Empirical Study
Figure 4 for Bugs in Large Language Models Generated Code: An Empirical Study
Viaarxiv icon

Trained Without My Consent: Detecting Code Inclusion In Language Models Trained on Code

Add code
Bookmark button
Alert button
Feb 14, 2024
Vahid Majdinasab, Amin Nikanjam, Foutse Khomh

Viaarxiv icon

ChatGPT vs LLaMA: Impact, Reliability, and Challenges in Stack Overflow Discussions

Add code
Bookmark button
Alert button
Feb 13, 2024
Leuson Da Silva, Jordan Samhi, Foutse Khomh

Viaarxiv icon

Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends

Add code
Bookmark button
Alert button
Jan 24, 2024
Mina Taraghi, Gianolli Dorcelus, Armstrong Foundjem, Florian Tambon, Foutse Khomh

Viaarxiv icon

Towards Enhancing the Reproducibility of Deep Learning Bugs: An Empirical Study

Add code
Bookmark button
Alert button
Jan 05, 2024
Mehil B. Shah, Mohammad Masudur Rahman, Foutse Khomh

Viaarxiv icon

Refining GPT-3 Embeddings with a Siamese Structure for Technical Post Duplicate Detection

Add code
Bookmark button
Alert button
Dec 22, 2023
Xingfang Wu, Heng Li, Nobukazu Yoshioka, Hironori Washizaki, Foutse Khomh

Viaarxiv icon

Characterizing and Classifying Developer Forum Posts with their Intentions

Add code
Bookmark button
Alert button
Dec 21, 2023
Xingfang Wu, Eric Laufer, Heng Li, Foutse Khomh, Santhosh Srinivasan, Jayden Luo

Viaarxiv icon

Studying the Practices of Testing Machine Learning Software in the Wild

Add code
Bookmark button
Alert button
Dec 19, 2023
Moses Openja, Foutse Khomh, Armstrong Foundjem, Zhen Ming, Jiang, Mouna Abidi, Ahmed E. Hassan

Viaarxiv icon