Skip to content
View liushz's full-sized avatar
  • Shanghai

Block or report liushz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. open-compass/opencompass open-compass/opencompass Public

    OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

    Python 6k 650

  2. open-compass/MathBench open-compass/MathBench Public

    [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset

    105 1

  3. open-compass/GPassK open-compass/GPassK Public

    [ACL 2025] Are Your LLMs Capable of Stable Reasoning?

    Python 30 2

  4. open-compass/CompassVerifier open-compass/CompassVerifier Public

    [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

    Jupyter Notebook 43 1