promptfoo/promptfoo

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, DeepSeek, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

51.5

Score

22,717

Stars

2,020

Forks

0.0

Trend

Details

Language: TypeScript
License: MIT
Category: AI/ML
Open Issues: 368
Contributors: 0
Archived: No

Security

OpenSSF Score: N/A
Dependency Risk: Unknown
Activity Health: Unknown

Topics

cici-cdcicdevaluationevaluation-frameworkllmllm-evalllm-evaluationllm-evaluation-frameworkllmopspentestingprompt-engineeringprompt-testingpromptsragred-teamingtestingvulnerability-scanners

View on GitHub ↗