VLMVibeEval

A lightweight leaderboard for evaluating Vision Language Models (VLMs), based on vibes. 🌞

Traditional benchmarks rarely give a concrete signal for your use case, and models often saturate them. Instead, we let you vibe-test models across curated, in-the-wild examples:

  1. Browse predefined categories with images and prompts.
  2. Check any model on these examples.
  3. Explore the generations and judge for yourself, as different models have different styles and strengths. 🗣️

This is not about scores; it's about how it feels. You can submit new models in the community tab, and we'll update the app shortly! 🤗
