Mbpp/305

❌ Both failed (Base: FAIL, Plus: FAIL)
Test evaluation
Metric        Result
Base tests    FAIL
Plus tests    FAIL
Code length   400 characters
Model-generated code (400 characters)
def start_withp(phrases):
    """
    Return the first two words starting with the letter 'p' (case-insensitive)
    from a list of phrases. Each phrase may contain multiple words.
    
    Args:
        phrases (list of str): List of strings, each containing one or more words.
        
    Returns:
        tuple: A tuple containing the first two words that start with 'p'.
    """
    p_words = []
Base failed test cases
  • [['Python PHP', 'Java JavaScript', 'c c++']]
Plus failed test cases
  • [['Python Programming', 'Java Programming', 'Perl Programming', 'Ruby Programming', 'PHP Programming', 'C Programming']]
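
For comparison, here is a minimal sketch of a function that does what the docstring above describes: it scans the phrases in order, splits each on whitespace, and returns the first two words beginning with 'p' or 'P' as a tuple. It is not the model's original output (which is cut off in this report) and not the benchmark's reference solution. Returning None when fewer than two matching words exist is an assumption, since the report does not say what should happen in that case.

```python
def start_withp(phrases):
    """Return the first two words starting with 'p' (case-insensitive).

    Sketch based only on the docstring shown in this report.
    """
    p_words = []
    for phrase in phrases:
        # Split each phrase on whitespace and inspect every word in order.
        for word in phrase.split():
            if word.lower().startswith("p"):
                p_words.append(word)
                if len(p_words) == 2:
                    return tuple(p_words)
    # Assumption: fewer than two matches yields None.
    return None
```

Called on the Base input listed above, this sketch returns ('Python', 'PHP'); on the Plus input it returns ('Python', 'Programming'). The report does not show the expected values for either case, so these outputs describe the sketch only.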