16 models indexed
LLM Reference Library

Every model that matters, one place.

A curated index of language models — open source and proprietary. Compare context windows, use cases, and benchmarks. Updated as the field moves.

16 Models
9 Open Source
7 Proprietary
9 Run Locally

Benchmarks

Model | Provider | Type | MMLU | HumanEval | Context

MMLU = Massive Multitask Language Understanding (% accuracy). HumanEval = coding pass@1 (%). Scores are sourced from published model cards and third-party evals and may vary by methodology.
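For readers unfamiliar with pass@1, here is a minimal Python sketch of the commonly used unbiased pass@k estimator, where n samples are generated per problem and c of them pass the unit tests. The function name and the example numbers are illustrative assumptions, and individual model cards may compute their published scores differently.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate the probability that at least one of k samples passes.

    n: total samples generated for a problem
    c: number of those samples that passed the tests
    k: sample budget being scored (k=1 for pass@1)
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # If fewer than k samples failed, every draw of k contains a passing one.
    if n - c < k:
        return 1.0
    # 1 minus the chance that all k drawn samples come from the failing set.
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical example: 10 samples, 3 correct. For k=1 this reduces to c / n.
score = pass_at_k(n=10, c=3, k=1)
```

A benchmark-level score is then the mean of this value over all problems, reported as a percentage.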

Run Locally with Ollama

Quick Start

Ollama lets you run open-source models locally with a single command. Install from ollama.com, then pull any compatible model.

ollama pull llama3.3
ollama pull deepseek-r1
ollama pull mistral
ollama pull gemma3
ollama pull qwen3
ollama run llama3.3
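Once a model is pulled, it can also be queried from a script. The sketch below assumes the Ollama server is running locally on its default port (11434) and that llama3.3 has already been pulled. The helper names build_generate_request and generate are illustrative, not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model: str, prompt: str,
                           url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    request = build_generate_request(model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["response"]


# Usage (requires a running Ollama server with the model pulled):
#   print(generate("llama3.3", "Explain a context window in one sentence."))
```

Setting "stream" to False asks the server for a single JSON reply, which keeps the example short. Larger models may need a longer timeout on slower hardware.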

Hardware Guide

Required VRAM depends on model size. Most consumer GPUs handle 7–14B models comfortably. CPU inference is slower but works.

7B model → ~5 GB VRAM
13B model → ~9 GB VRAM
30B model → ~20 GB VRAM
70B model → ~40 GB VRAM, or split across GPUs
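The figures above appear consistent with roughly 4-bit quantized weights plus a small runtime overhead. As a rough rule of thumb (not the formula used to produce this guide), the Python sketch below estimates VRAM from parameter count. The defaults of 4.5 bits per weight and 1 GB of overhead are assumptions chosen for illustration, and real usage also grows with context length.

```python
def estimate_vram_gb(params_billions: float,
                     bits_per_weight: float = 4.5,
                     overhead_gb: float = 1.0) -> float:
    """Rough VRAM estimate in GB for a quantized model.

    Weights: 1 billion parameters at 8 bits each is about 1 GB,
    so weights take params_billions * bits_per_weight / 8 GB.
    overhead_gb is an assumed allowance for the KV cache and runtime buffers.
    """
    if params_billions <= 0 or bits_per_weight <= 0 or overhead_gb < 0:
        raise ValueError("sizes must be positive")
    weights_gb = params_billions * bits_per_weight / 8
    return weights_gb + overhead_gb


def fits_on_gpu(params_billions: float, vram_gb: float) -> bool:
    """Return True if the estimate fits within the given VRAM budget."""
    return estimate_vram_gb(params_billions) <= vram_gb


# Estimates for the model sizes listed in the guide above.
for size in (7, 13, 30, 70):
    print(f"{size}B model: about {estimate_vram_gb(size):.1f} GB")
```

The estimates land near the guide's numbers without matching them exactly, so treat both as ballpark figures. Passing bits_per_weight=16 gives an approximate figure for unquantized half-precision weights.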

Contact

Got a model to suggest?
Found an error?

This library is maintained manually and updated as the field moves. If you know a model worth indexing, spotted outdated data, or just want to say hello — reach out.
