Evaluations, Anthropic support, Agents
What's Changed
- Evaluations - Basic metrics such as match, fuzzy, includes as well as a C Compiler to test if LLM generated code compiles
- Anthropic Claude LLM support
- Agents - CAMEL agents, Scrape summarizer agent