Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
promptfoo æ¯ä¸ä¸ªLLM æµè¯åå®å ¨æ«æå·¥å ·ï¼ç¨äºæµè¯æç¤ºè¯ãAI 代çå RAG ç³»ç»ãå®æä¾ AI çº¢éæµè¯ãæ¼æ´æ«æåè½ï¼æ¯ææ¯è¾ GPTãClaudeãGeminiãLlama çå¤ä¸ªæ¨¡åçæ§è½ã
æµè¯åæ¯è¾ä¸åæç¤ºè¯çææï¼æ¾å°æä¼æ¹æ¡ã
èªå¨æ£æµ LLM æ¼æ´åå®å ¨éæ£ã
å¹¶è¡æµè¯å¤ä¸ªæ¨¡åï¼æ¯è¾è¾åºè´¨éåææ¬ã
å½ä»¤è¡å·¥å ·å CI/CD éæï¼èªå¨åæµè¯æµç¨ã
æµè¯æ£ç´¢å¢å¼ºçæç³»ç»çåç¡®æ§åç¸å ³æ§ã
æµè¯ AI 代ççè¡ä¸ºåå³çé»è¾ã