VLMVibeEval

A lightweight leaderboard for evaluating Vision Language Models (VLMs) — based on vibes. 🌞

Traditional benchmarks don't give concrete signal for your use case and models are often saturated over them. Instead, we let you vibe test models across curated, in-the-wild examples:

Predefined categories with images and prompts.
Check any model on these examples.
Explore the generations and judge for yourself, as different models have different styles and strengths. 🗣️

This is not about scores — it's about how it feels. You can submit new models in the community tab and we'll shortly update the app! 🤗