LLM Leaderboard Sites
This revision is from 2024/07/25 17:17. You can Restore it.
- https://huggingface.co/collections/open-llm-leaderboard/the-big-benchmarks-collection-64faca6335a7fc7d4ffe974a
- Open LLM Leaderboard: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
- lmsys: https://chat.lmsys.org/?leaderboard=
- https://huggingface.co/spaces/occiglot/euro-llm-leaderboard
- https://huggingface.co/spaces/openlifescienceai/open_medical_llm_leaderboard
- https://www.vellum.ai/llm-leaderboard
- https://openlm.ai/leaderboard/
- https://evalplus.github.io/leaderboard.html
- https://scale.com/leaderboard
- https://livebench.ai/
Models:
- https://build.nvidia.com/nvidia/nemotron-4-340b-instruct
- https://huggingface.co/spaces/Qwen/Qwen2-72B-Instruct
- https://huggingface.co/Qwen
- https://chat.deepseek.com/coder
- https://huggingface.co/spaces/Qwen/CodeQwen1.5-7b-Chat-demo
Uncensored:
- https://llm.extractum.io/list/?uncensored=
- https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
Propriety Models:
Repository LLM mdoels
VLM Leaderboards
Coding
Model we run and no longer run.
Current
- pivot-0.1-evil-a.Q8_0.gguf:latest: has an independent, unique personality not found in other models. Says girls prefer donkey dicks because they are more common in rural areas and that donkeys are more gentle than horses which can be unpredictable.
- Casual-Autopsy/L3-Umbral-Mind-RP-v1.0-8B - https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard
No Longer
- Qwen: has it own web interface, was replaced with 01-ai_Yi-1.5-9B-Chat which ranked higher in the https://oobabooga.github.io/benchmark.html
- LLava 7B: Multi-modal to accurately describe image and was not advanced enough, taking these leaderboards:
- https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models/tree/Evaluation
- https://huggingface.co/spaces/ucla-contextual/contextual_leaderboard
- https://mmbench.opencompass.org.cn/leaderboard
- closex/neuraldaredevil-8b-abliterated:latest: one of the highest rank abliterated uncensored models. Beaten by L3-Umbral-Mind-RP-v1.0-8B