Yifan-Song793
🉐
  • Peking University

Highlights

  • Pro

Organizations

@PKU-TANGENT

Pinned

  1. GoodBadGreedy (Public)

    The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

    Python, 25 stars, 1 fork

  2. VisualWebBench/VisualWebBench (Public)

    Evaluation framework for the paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"

    Python, 47 stars, 1 fork

  3. ETO (Public)

    Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)

    Python, 99 stars, 11 forks

  4. RestGPT (Public)

    An LLM-based autonomous agent controlling real-world applications via RESTful APIs

    Python, 1.3k stars, 99 forks