Models
12 AI tools in this category, sorted by community votes.
Specialized OCR model for complex tables, forms, handwriting, and multi-page PDFs with structured output.
Anthropic's most powerful model with Adaptive Thinking, Agent Teams, and 1M context window. 80.8% SWE-bench.
Anthropic's flagship model with extended thinking, optimized for agents, software engineering, and computer use.
Google's frontier multimodal reasoning model with 1M context window, advanced coding, and natively multimodal capabilities.
Open-source multimodal vision-language model with native function calling and image-text generation.
Lightweight 9B-parameter open-source multimodal model for local deployment. Free to use.
Enhanced GLM-4.6V-Flash with higher capacity and stability for production multimodal workloads.
Zhipu AI's flagship MoE model for agentic engineering, complex reasoning, and advanced coding.
GLM-5's coding specialist — optimized for advanced programming and agentic dev workflows.
OpenAI's latest model with 400K context, 100% AIME math score, and 6.2% hallucination rate.
Mistral's flagship open-weight multimodal model — 256K context, multilingual, strong reasoning and coding.
Enterprise-optimized multimodal model with high-speed reasoning and long-context document understanding.