We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
-
Updated
Jul 6, 2024 - Python
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Code evaluation with *30-seconds-of-code* examples. Inspired by "Evaluating Large Language Models Trained on Code"
Add a description, image, and links to the humaneval topic page so that developers can more easily learn about it.
To associate your repository with the humaneval topic, visit your repo's landing page and select "manage topics."