Spaces:

hage2000
/

code_eval_stdio

Sleeping

Kewen Zhao commited on Nov 22, 2024

Commit

f5dea60

1 Parent(s): 0c0f1e7

update readme and description

Files changed (2) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ description: >-
   (https://arxiv.org/abs/2107.03374).
 ---
-# Metric Card for Code Eval
 ## Metric description

   (https://arxiv.org/abs/2107.03374).
 ---
+# Metric Card for Code Eval StdIO
 ## Metric description

code_eval_stdio.py CHANGED Viewed

@@ -76,10 +76,11 @@ Returns:
     pass_at_k: dict with pass rates for each k
     results: dict with granular results of each unittest
 Examples:
-    >>> code_eval = evaluate.load("code_eval")
-    >>> test_cases = ["assert add(2,3)==5"]
-    >>> candidates = [["def add(a,b): return a*b", "def add(a, b): return a+b"]]
-    >>> pass_at_k, results = code_eval.compute(references=test_cases, predictions=candidates, k=[1, 2])
     >>> print(pass_at_k)
     {'pass@1': 0.5, 'pass@2': 1.0}
 """

     pass_at_k: dict with pass rates for each k
     results: dict with granular results of each unittest
 Examples:
+    >>> code_eval_stdio = evaluate.load("hage2000/code_eval_stdio")
+    >>> inputs = ["2 3"]
+    >>> references = ["5"]
+    >>> candidates = [[ "nums = list(map(int, input().split()))\nprint(sum(nums))"]]
+    >>> pass_at_k, results = code_eval_stdio.compute(references=references, predictions=candidates, inputs = inputs, k=[1, 2])
     >>> print(pass_at_k)
     {'pass@1': 0.5, 'pass@2': 1.0}
 """