LLM Reference Library

Every model that matters,
one place.

A curated index of language models — open source and proprietary. Compare context windows, use cases, and benchmarks. Updated as the field moves.

Models

Model	Provider	Type	MMLU	HumanEval	Context

MMLU = Massive Multitask Language Understanding (% accuracy). HumanEval = coding pass@1 (%). Scores are sourced from published model cards and third-party evals, so they may vary by methodology.

Quick Start

Ollama lets you run open-source models locally with a single command. Install from ollama.com, then pull any compatible model.

ollama pull llama3.3
ollama pull deepseek-r1
ollama pull mistral
ollama pull gemma3
ollama pull qwen3
ollama run llama3.3
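
Once a model has been pulled, it can also be queried from a script instead of the interactive prompt. The following is a minimal sketch, assuming Ollama's local HTTP server is running on its default port (11434) and that llama3.3 has already been pulled. The helper names build_generate_request and generate are hypothetical, chosen for illustration only.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model: str, prompt: str, timeout: float = 120.0) -> str:
    """Send the prompt to the local server and return the generated text."""
    request = build_generate_request(model, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.loads(response.read().decode("utf-8"))
    return body["response"]


# Example (requires a running Ollama server with the model pulled):
# print(generate("llama3.3", "Explain context windows in one sentence."))
```

Setting "stream" to False asks the server for one JSON object rather than a stream of partial chunks, which keeps the parsing to a single json.loads call.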

Hardware Guide

Required VRAM depends mainly on model size. Going by the estimates below, a consumer GPU with about 12 GB of VRAM handles 7–14B models comfortably. CPU inference is slower but works.

7B  model  → ~5 GB VRAM
13B model  → ~9 GB VRAM
30B model  → ~20 GB VRAM
70B model  → ~40 GB VRAM
            or split across GPUs
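
For sizes not listed above, a back-of-the-envelope estimate can be scripted. This is a rough sketch, not a measurement: it assumes the figures above correspond to roughly 4-bit quantized weights (about 4.5 bits per weight) plus a fixed 1 GB allowance for runtime overhead. Neither assumption comes from this page, and the function names are hypothetical. Real usage also grows with context length.

```python
def estimate_vram_gb(params_billion: float,
                     bits_per_weight: float = 4.5,
                     overhead_gb: float = 1.0) -> float:
    """Rough VRAM estimate in GB: quantized weights plus a fixed allowance.

    bits_per_weight and overhead_gb are illustrative assumptions,
    not values published by any model provider.
    """
    # 1e9 parameters * (bits / 8) bytes per parameter = GB (decimal)
    weights_gb = params_billion * bits_per_weight / 8
    return weights_gb + overhead_gb


def fits_on_gpu(params_billion: float, vram_gb: float) -> bool:
    """Return True if the rough estimate fits within the given VRAM."""
    return estimate_vram_gb(params_billion) <= vram_gb


for size in (7, 13, 30, 70):
    print(f"{size}B model -> ~{estimate_vram_gb(size):.1f} GB")
```

With these defaults the estimates land in the same ballpark as the table above. Raising bits_per_weight to 16 shows why unquantized models need far more memory.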

Got a model to suggest?
Found an error?

This library is maintained manually and updated as the field moves. If you know a model worth indexing, spotted outdated data, or just want to say hello — reach out.
