AI-Portal

Model Leaderboard

Top AI models ranked by benchmark · manually updated

14 models

Model	Vendor	Tier	Context	SWE-bench	MMLU	Input $/1M	Output $/1M
Llama 4 Maverick	Meta	balanced	1000K	-	85.5%	OSS	OSS
Claude Fable 5	Anthropic	flagship	1000K	80.3%	-	$10.00	$50.00
Claude Opus 4.8	Anthropic	flagship	1000K	88.6%	-	$5.00	$25.00
GPT-5.5	OpenAI	flagship	1049K	-	-	$5.00	$30.00
Gemini 3.1 Pro	Google	flagship	1000K	80.6%	-	$2.00	$12.00
Grok 4.3	xAI	flagship	1000K	74.9%	-	$1.25	$2.50
DeepSeek V4 Pro	DeepSeek	flagship	1000K	80.6%	-	$1.74	$3.48
Gemini 2.5 Pro Deep Think	Google	reasoning	1000K	-	-	$3.50	$10.50
Claude Sonnet 4.6	Anthropic	balanced	200K	-	-	$3.00	$15.00
Qwen 3.5 235B	Alibaba	balanced	262K	-	-	OSS	OSS
Claude Haiku 4.5	Anthropic	fast	200K	-	-	$0.80	$4.00
Gemini 2.5 Flash	Google	fast	1000K	-	-	$0.15	$0.60
DeepSeek V4 Flash	DeepSeek	fast	1000K	-	-	$0.28	$0.42
Llama 4 Scout	Meta	fast	10000K	-	-	OSS	OSS

SWE-bench = real GitHub coding tasks (higher = better). MMLU = general knowledge (higher = better). Context = context window in thousands of tokens. Prices are in USD per 1M tokens, taken from provider APIs. OSS models: self-host or use via API providers. Last updated Jun 2026.
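To make the per-1M pricing concrete, here is a minimal sketch of a per-request cost estimate. The three price pairs are copied from the leaderboard; the function name `request_cost` and the token counts in the usage loop are illustrative assumptions, not part of any provider SDK.

```python
# Prices are USD per 1M tokens (input, output), copied from the leaderboard rows.
PRICES_PER_1M = {
    "Claude Opus 4.8": (5.00, 25.00),
    "Gemini 3.1 Pro": (2.00, 12.00),
    "DeepSeek V4 Pro": (1.74, 3.48),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    input_price, output_price = PRICES_PER_1M[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 20,000 prompt tokens and 2,000 completion tokens.
    for name in PRICES_PER_1M:
        print(f"{name}: ${request_cost(name, 20_000, 2_000):.4f}")
```

Output tokens are priced several times higher than input tokens for most listed models, so the completion length often matters more than the prompt length.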

Tools Directory

Frameworks, platforms, and infra for GenAI development

LangChain · framework

Most popular framework for building LLM-powered applications, chains, and agents.

OSS · 95k stars

LlamaIndex · framework

Data framework for LLM applications: RAG, structured extraction, and query pipelines.

OSS · 37k stars

Vercel AI SDK · framework

TypeScript toolkit for building AI-powered UIs with streaming, tool use, and multi-modal support.

OSS · 12k stars

Haystack · framework

Production-ready NLP framework for building custom pipelines with RAG, search, and agents.

OSS · 17k stars

Semantic Kernel · framework

Microsoft's SDK for integrating LLMs into C#, Python, and Java enterprise applications.

OSS · 22k stars

CrewAI · multi-agent

Framework for orchestrating role-playing, autonomous AI agents working as a team.

OSS · 23k stars

AutoGen · multi-agent

Microsoft's framework for building multi-agent applications where agents converse to solve tasks.

OSS · 34k stars

LangGraph · multi-agent

Build stateful, multi-actor agent applications as controllable graphs with cycles and branches.

OSS · 8k stars

Dify · platform

Visual platform for building and operating AI applications with drag-and-drop workflows.

OSS · 45k stars

Flowise · platform

Drag-and-drop UI for building LLM flows and AI agents with LangChain components.

OSS · 31k stars

n8n AI · platform

Workflow automation platform with native AI nodes for building agentic workflows without code.

OSS · 47k stars

Ollama · local

Run LLMs locally with a simple CLI. Supports Llama, Mistral, Gemma, Phi, and hundreds more.

OSS · 90k stars

LM Studio · local

Desktop app to discover, download, and run local LLMs with a ChatGPT-like UI.

vLLM · local

High-throughput, memory-efficient inference engine for LLMs. The standard for production self-hosting.

OSS · 31k stars

Model Context Protocol · mcp

Anthropic's open standard for connecting AI assistants to data sources and tools.

OSS · 12k stars

OpenAI Assistants · mcp

Hosted platform for building AI assistants with persistent threads, code interpreter, and file search.

Weights & Biases · mlops

ML experiment tracking, model versioning, and dataset management for the full ML lifecycle.

9k stars

LangSmith · mlops

Observability and testing platform for LLM applications built with LangChain.

Langfuse · mlops

Open-source LLM engineering platform for tracing, evaluation, and prompt management.

OSS · 7k stars

Pinecone · vector

Managed vector database built for production AI applications with low latency at scale.

Chroma · vector

Open-source embedding database for AI applications. Simple API, runs embedded or as a server.

OSS · 15k stars

Qdrant · vector

High-performance vector search engine with rich filtering, payload indexing, and a Rust core.

OSS · 20k stars

Weaviate · vector

Open-source vector database with built-in ML model integrations and GraphQL API.

OSS · 12k stars

AI Vendor Map

34 vendors · 6 categories · Updated June 2026

Foundation Models

(8)

Anthropic

leader

Claude Fable 5, Opus 4.8, Sonnet 4.6. Leader in safety-focused frontier AI. API + AWS/GCP.

OpenAI

leader

GPT-5.5, o-series reasoning. Most widely adopted API. Microsoft integration.

Google Gemini

leader

Gemini 3.1 Pro, 2.5 Flash. 1M context, native multimodal, Vertex AI. Strongest on benchmarks Jun 2026.

Meta

Open weights. Llama 4 Scout (10M context), Maverick (MoE). Self-host or via API providers.

DeepSeek V4

leader · OSS

MIT-licensed MoE. $1.74/$3.48 per 1M tokens, near-frontier quality at a fraction of the cost.

Mistral AI

challenger

European frontier AI. Mistral Large 2, Codestral. Strong multilingual + code. GDPR-friendly.

xAI

Grok 4.3: $1.25/$2.50 per 1M, 1M context, native video. Great price/performance ratio.

Cohere

niche

Command R+ for enterprise RAG. Embed v3 for best-in-class embeddings. Strong enterprise focus.

Cloud Platforms

(6)

AWS Bedrock

leader

Managed access to Anthropic, Meta, Mistral, Cohere. Guardrails, Agents, Knowledge Bases built-in.

Microsoft Azure AI

OpenAI + 1,700+ models from HuggingFace. Deep enterprise integration, RBAC, compliance.

Google Vertex AI

Gemini + OSS models. Agent Builder, Grounding, RAG Engine. Best for Gemini-heavy workloads.

Together.ai

challenger

Fastest OSS inference (Llama, Qwen, Mixtral). 200+ models, competitive pricing.

Replicate

challenger

Run any OSS model via API. Pay-per-prediction. Great for image, audio, and niche models.

Cloudflare Workers AI

niche

Edge inference across 300 PoPs. Low latency, zero cold starts. OSS models at edge.

Frameworks

(5)

LangChain

leader · OSS

Most popular LLM framework. Chains, agents, RAG. Python + TypeScript. 95k GitHub stars.

LlamaIndex

leader · OSS

Data framework for RAG. Best for complex document ingestion and retrieval pipelines.

Vercel AI SDK

challenger · OSS

TypeScript-first AI for web. Streaming, tool use, multi-provider. Best for Next.js.

Haystack

challenger · OSS

Production-ready pipelines. Deepset-backed. Strong for search and enterprise RAG.

DSPy

niche · OSS

Stanford project that compiles prompts into optimized programs. Best for systematic prompt optimization.

Agentic / Auto

(6)

CrewAI

leader · OSS

Multi-agent orchestration with role-based agents. 30k stars. Best for collaborative agent teams.

LangGraph

leader · OSS

Graph-based agentic workflows from LangChain. Supports cycles, conditional branches, state.

AutoGen

challenger · OSS

Microsoft multi-agent framework. AutoGen Studio for no-code agent building.

Manus AI

challenger

Autonomous AI agent that completes complex tasks end-to-end. Browser, code, files.

n8n

leader · OSS

Workflow automation with 400+ integrations + AI nodes. Self-hostable. Strong EU adoption.

Dify

challenger · OSS

LLM app development platform. Drag-and-drop agent builder, RAG, workflow. 60k stars.

MLOps / Observ.

(4)

Weights & Biases

leader

Experiment tracking, model registry, LLM monitoring. De facto standard for ML teams.

LangSmith

leader

LLM observability from LangChain. Trace, evaluate, and optimize LLM applications.

Langfuse

challenger · OSS

Open-source LLM observability. Traces, evals, prompt management. Self-hostable.

Arize AI

challenger

ML + LLM monitoring. Drift detection, explainability, A/B testing for AI models.

Vector DBs

(5)

Pinecone

leader

Managed vector DB. Fastest time-to-production. Serverless tier for small workloads.

Weaviate

leader · OSS

Open-source vector DB with hybrid search. GraphQL API, multi-tenancy, on-prem option.

Chroma

challenger · OSS

Simplest OSS vector DB. Zero config for local dev. Perfect for prototyping.

Qdrant

challenger · OSS

Rust-based, high-performance. Payload filtering, named vectors, cloud + self-host.

pgvector

niche · OSS

PostgreSQL extension for vector similarity search. Zero new infra for Postgres shops.

AI Architecture Patterns

7 patterns · When to use each · Common pitfalls · June 2026

Basic Prompt Engineering

Low

Simple Q&A, text transformation, classification, summarization. No external data needed. Fastest to ship.
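A minimal sketch of this pattern for the classification case: one prompt template plus strict parsing of the reply. The label set and the function names `build_classification_prompt` and `parse_label` are hypothetical, and the model call itself is left out because it depends on the provider.

```python
from typing import List, Optional

LABELS = ["billing", "bug report", "feature request"]  # hypothetical label set


def build_classification_prompt(text: str, labels: List[str]) -> str:
    """Assemble a single-turn classification prompt with a fixed output format."""
    options = "\n".join(f"- {label}" for label in labels)
    return (
        "Classify the message into exactly one of these categories:\n"
        f"{options}\n\n"
        f"Message: {text.strip()}\n"
        "Answer with the category name only."
    )


def parse_label(reply: str, labels: List[str]) -> Optional[str]:
    """Map a raw model reply back to a known label, or None if it does not match."""
    cleaned = reply.strip().lower().rstrip(".")
    for label in labels:
        if cleaned == label.lower():
            return label
    return None
```

Constraining the output format in the prompt and rejecting anything outside the label set keeps downstream code from acting on free-form text.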

RAG (Retrieval-Augmented Generation)

Medium

Q&A over company documents, knowledge bases, or any proprietary content. Reduces hallucination on factual queries by grounding answers in retrieved text.
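The retrieve-then-generate flow can be sketched with a toy retriever. This example assumes a bag-of-words cosine similarity in place of a real embedding model and vector database, and the documents, `retrieve`, and `build_prompt` are illustrative names only.

```python
import math
import re
from collections import Counter

# Hypothetical knowledge base; a real system would chunk and embed documents.
DOCS = [
    "Refunds are issued within 14 days of a return request.",
    "Support is available on weekdays from 9:00 to 17:00.",
    "Enterprise plans include single sign-on and audit logs.",
]


def tokenize(text):
    """Lowercase word counts; a stand-in for an embedding vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, docs, k=1):
    """Return the k documents most similar to the query."""
    q = tokenize(query)
    ranked = sorted(docs, key=lambda d: cosine(q, tokenize(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, docs, k=1):
    """Place the retrieved passages in the prompt so the model answers from them."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )
```

The prompt returned by `build_prompt` would then be sent to the model; retrieval quality, not the model, is usually the first thing to tune.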

Tool Use / Function Calling

Medium

LLM needs to interact with external systems: search the web, query a database, call an API, execute code.
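On the application side, tool use comes down to a registry of callable functions and a dispatcher for the model's structured request. The sketch below assumes the model emits JSON of the form `{"name": ..., "arguments": {...}}`; that shape, the two tools, and `dispatch` are illustrative and do not match any specific provider's wire format.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical tool: a real one would call a weather API."""
    return f"Sunny in {city}"


def add(a: float, b: float) -> float:
    """Hypothetical tool: adds two numbers."""
    return a + b


# Registry of tools the model is allowed to call.
TOOLS = {"get_weather": get_weather, "add": add}


def dispatch(tool_call_json: str) -> str:
    """Run one tool call and return a JSON string to feed back to the model."""
    call = json.loads(tool_call_json)
    name = call.get("name")
    if name not in TOOLS:
        return json.dumps({"error": f"unknown tool: {name}"})
    try:
        result = TOOLS[name](**call.get("arguments", {}))
    except TypeError as exc:
        # Wrong or missing arguments are reported instead of crashing the app.
        return json.dumps({"error": str(exc)})
    return json.dumps({"result": result})
```

Returning errors as data lets the model see what went wrong and retry with corrected arguments.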

Agentic Loop (ReAct)

High

Multi-step tasks where the next action depends on previous results. The LLM acts as a planner and executor.
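A rough sketch of the loop: the model proposes an action, the application executes it, and the observation is appended to the transcript before the next step. `scripted_model` is a stand-in for a real LLM call, and the `Action: tool[arg]` / `Final:` text protocol is an assumption made for illustration.

```python
from typing import Callable, Dict, List


def lookup_population(city: str) -> str:
    """Hypothetical tool backed by a tiny in-memory table."""
    return {"Paris": "2.1 million", "Oslo": "0.7 million"}.get(city, "unknown")


TOOLS: Dict[str, Callable[[str], str]] = {"lookup_population": lookup_population}


def scripted_model(transcript: List[str]) -> str:
    """Stand-in for an LLM: acts once, then answers from the last observation."""
    observations = [line for line in transcript if line.startswith("Observation:")]
    if not observations:
        return "Action: lookup_population[Paris]"
    return "Final: Paris has about " + observations[-1][len("Observation: "):]


def run_agent(question: str, model: Callable[[List[str]], str], max_steps: int = 5) -> str:
    """Alternate model steps and tool executions until a final answer or the step cap."""
    transcript = [f"Question: {question}"]
    for _step in range(max_steps):
        step = model(transcript)
        transcript.append(step)
        if step.startswith("Final:"):
            return step[len("Final:"):].strip()
        # Parse "Action: tool_name[argument]" and execute the named tool.
        name, _sep, arg = step[len("Action:"):].strip().partition("[")
        tool = TOOLS.get(name)
        result = tool(arg.rstrip("]")) if tool else f"unknown tool {name}"
        transcript.append(f"Observation: {result}")
    return "Stopped: step limit reached"
```

The `max_steps` cap matters in practice: without it, a model that never emits a final answer would loop and accumulate cost indefinitely.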

Multi-Agent System

High

Parallel subtasks, specialization, or peer-review between agents. Scale beyond single-agent context limits.
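One way to picture the parallel-workers-plus-reviewer shape is the sketch below. Both agents are plain functions standing in for LLM-backed agents, and `summarizer_agent`, `reviewer_agent`, and `run_team` are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def summarizer_agent(chunk: str) -> str:
    """Stand-in for a specialist agent: keeps the first sentence of its chunk."""
    return chunk.split(".")[0].strip() + "."


def reviewer_agent(summaries: List[str]) -> List[str]:
    """Stand-in for a peer-review agent: drops duplicates and empty results."""
    seen, approved = set(), []
    for summary in summaries:
        if len(summary) > 1 and summary not in seen:
            seen.add(summary)
            approved.append(summary)
    return approved


def run_team(chunks: List[str], max_workers: int = 4) -> List[str]:
    """Fan chunks out to parallel workers, then pass all results to one reviewer."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in input order, regardless of completion order.
        summaries = list(pool.map(summarizer_agent, chunks))
    return reviewer_agent(summaries)
```

Each worker sees only its own chunk, which is how this pattern sidesteps a single agent's context limit; the reviewer sees only the much shorter summaries.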

Fine-tuned Model

High

Consistent domain-specific behavior at scale. Style, format, or domain knowledge the base model lacks. High-volume inference where prompt size matters for cost.
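The cost argument for high-volume inference is simple arithmetic, sketched here with made-up numbers. The request volume, prompt sizes, and price are all hypothetical; the sketch also assumes the fine-tuned model is billed at the same per-token rate and ignores training cost, neither of which holds for every provider.

```python
def monthly_input_cost(requests: int, prompt_tokens: int, price_per_1m: float) -> float:
    """USD spent on input tokens per month at a given per-1M-token price."""
    return requests * prompt_tokens * price_per_1m / 1_000_000


# Hypothetical workload: 1M requests per month at $2.00 per 1M input tokens.
base_cost = monthly_input_cost(1_000_000, 3_000, 2.00)  # long few-shot prompt
tuned_cost = monthly_input_cost(1_000_000, 300, 2.00)   # short prompt after tuning
savings = base_cost - tuned_cost

if __name__ == "__main__":
    print(f"base=${base_cost:,.0f} tuned=${tuned_cost:,.0f} savings=${savings:,.0f}")
```

Whether the savings justify fine-tuning depends on the one-off training cost and on any price premium for serving a tuned model.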

Guardrails & Safety Layer

Medium

Any production deployment. Add input/output validation to reduce the risk of prompt injection, PII leakage, and off-topic responses.
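A minimal sketch of such a layer, assuming a simple phrase blocklist on input and regex-based email redaction on output. The phrase list and the names `check_input` and `redact_output` are illustrative; production systems typically combine checks like these with classifier-based detection.

```python
import re

# Matches common email address shapes; not a full RFC 5322 validator.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Illustrative blocklist only; real injection attempts are far more varied.
INJECTION_PHRASES = ("ignore previous instructions", "reveal your system prompt")


def check_input(user_text: str):
    """Return (allowed, reason) for a user message before it reaches the model."""
    lowered = user_text.lower()
    for phrase in INJECTION_PHRASES:
        if phrase in lowered:
            return False, f"blocked: matched '{phrase}'"
    return True, "ok"


def redact_output(model_text: str) -> str:
    """Mask email addresses in the model's reply before it is shown to the user."""
    return EMAIL_RE.sub("[REDACTED_EMAIL]", model_text)
```

Running the input check before the model call and the redaction after it keeps both halves independent of the model in between.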