huntr: microsoft/promptbench

microsoft / promptbench

A unified evaluation framework for large language models

Submit a report

FIRST INTERACTION

WITHIN61 DAYS

REVIEW

WITHIN61 DAYS

FIX

WITHINN/A DAYS

microsoft

Arbitrary Code Execution via Unsafe eval() in promptbench.MMMU Dataset Loader

May 6th 2025

duplicate

High

microsoft

Arbitrary Code Execution via Unchecked eval() in PromptBench (MMMU Dataset)

May 6th 2025

informative

High

CRITICAL

$1500

HIGH

$750

MEDIUM

$125

LOW

$20