Model Information: Explore the capabilities, specifications, and pricing of all available models in Bot Scanner.

Alibaba

Qwen3 14B
Fast and cost-effective version of Qwen3, the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support
Parameters 14.8B
Context 32,768 tokens
Released 2025-04-29
API Type nebius
This model is available for answering and ranking.
qwen alibaba cost-effective
Learn more Model Terms of Service
Qwen3 30B A3B
Thinking
Tier Free
Mid-sized version of Qwen3, the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support
Parameters 29.9B
Context 32,768 tokens
Released 2025-04-29
API Type nebius
This model is available for answering and ranking.
qwen alibaba cost-effective
Learn more Model Terms of Service
Qwen3 235B A22B
Thinking
Tier 1
The most powerful version of Qwen3, the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groundbreaking advancements in reasoning, instruction-following, agent capabilities, and multilingual support
Parameters 234B
Context 32,768 tokens
Released 2025-04-29
API Type nebius
This model is available for answering and ranking.
qwen alibaba high-performance
Learn more Model Terms of Service

Amazon

Nova Micro v1.0
Amazon Bedrock Nova Micro model. Ultra-fast, low-cost LLM for lightweight tasks.
Parameters Unknown
Context 8,192 tokens
Released 2024-05-01
API Type bedrock
This model is available for answering and ranking.
nova amazon bedrock ultra-fast
Learn more Model Terms of Service
Nova Lite v1.0
Amazon Bedrock Nova Lite model. Lightweight, cost-effective LLM for general tasks.
Parameters Unknown
Context 8,192 tokens
Released 2024-05-01
API Type bedrock
This model is available for answering and ranking.
nova amazon bedrock cost-effective
Learn more Model Terms of Service
Nova Pro v1.0
Amazon Bedrock Nova Pro model. Advanced LLM for demanding tasks.
Parameters Unknown
Context 32,768 tokens
Released 2024-05-01
API Type bedrock
This model is available for answering and ranking.
nova amazon bedrock advanced
Learn more Model Terms of Service

Anthropic

Claude 3.5 Haiku
Faster and cheaper version of Claude 3.5 Sonnet. Enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic tasks such as chat interactions and immediate coding suggestions.
Parameters Unknown
Context 200,000 tokens
Released 2024-10-22
API Type anthropic
This model is available for answering and ranking.
claude anthropic cost-effective
Learn more Model Terms of Service
Claude 3.7 Sonnet
Top of the line model by Anthropic for complex tasks. Advanced large language model with improved coding and problem-solving capabilities, particularly in front-end development and full-stack updates.
Parameters Unknown
Context 200,000 tokens
Released 2025-02-15
API Type anthropic
This model is available for answering and ranking.
claude anthropic High Performance
Learn more Model Terms of Service
Claude 4 Sonnet
Claude 4 Sonnet is the most cost-effective high-performance model in the Claude family with better control at coding.
Parameters Unknown
Context 200,000 tokens
Released 2025-05-23
API Type anthropic
This model is available for answering and ranking.
claude anthropic High Performance
Learn more Model Terms of Service
Claude 4 Opus
Claude 4 Opus is the most powerful high-performance model in the Claude family. Highest level of intelligence and capability.
Parameters Unknown
Context 200,000 tokens
Released 2025-05-23
API Type anthropic
This model is available for answering.
claude anthropic High Performance
Learn more Model Terms of Service

DeepSeek

DeepSeek V3
DeepSeek's advanced language model with strong performance. A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
Parameters 671B
Context 128,000 tokens
Released 2024-06-01
API Type nebius
This model is available for answering and ranking.
deepseek high-performance
Learn more Model Terms of Service
DeepSeek V3 0324
The most recent version of DeepSeek's advanced language model V3. It demonstrates notable improvements over its predecessor, DeepSeek-V3, in several key aspects.
Parameters 671B
Context 128,000 tokens
Released 2024-06-01
API Type nebius
This model is available for answering and ranking.
deepseek high-performance
Learn more Model Terms of Service
DeepSeek R1
Thinking
Tier 1
R1 is a state of the art reasoning model trained with reinforcement learning. It delivers strong performance on math, code, and logic tasks. It is especially good at tasks like code review, document analysis, planning, information extraction, and coding.
Parameters 671B
Context 128,000 tokens
Released 2025-02-15
API Type nebius
This model is available for answering.
deepseek reasoning thinking
Learn more Model Terms of Service
DeepSeek R1 0528
Thinking
Tier 1
R1 0528 is the latest version of the celebrated DeepSeek R1 model. It delivers top performance on math, code, and logic tasks. It is especially good at tasks like code review, document analysis, planning, information extraction, and coding.
Parameters 671B
Context 163,840 tokens
Released 2025-05-28
API Type nebius
This model is available for answering.
deepseek reasoning thinking
Learn more Model Terms of Service

Google

Gemma 2 9B
Google's 9B parameter open source model, fast and performing.
Parameters 9B
Context 8,192 tokens
Released 2024-05-15
API Type nebius
This model is available for answering and ranking.
gemma open source google fast
Learn more Model Terms of Service
Gemma 3N E4B
Gemma 3n models use selective parameter activation technology to reduce resource requirements. This technique allows the models to operate at an effective size of 2B and 4B parameters, which is lower than the total number of parameters they contain.
Parameters 8B
Context 32,000 tokens
Released 2025-07-03
API Type together
This model is available for answering and ranking.
gemma open source google fast
Learn more Model Terms of Service
Gemini 2.0 Flash
Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, native tool use, multimodal generation, and a 1M token context window.
Parameters Unknown
Context 1,048,576 tokens
Released 2025-01-20
API Type gemini
This model is available for answering and ranking.
gemini google cost-effective
Learn more Model Terms of Service
Gemini 2.0 Flash Lite
Faster version of Gemini 2.0 Flash
Parameters Unknown
Context 1,048,576 tokens
Released 2025-01-20
API Type gemini
This model is available for answering and ranking.
gemini google fast cheap
Learn more Model Terms of Service
Gemini 2.5 Pro Preview
Thinking
Tier 1
Current top of the line in the Gemini family. Improved quality, especially for world knowledge, code, and long context
Parameters Unknown
Context 2,097,152 tokens
Released 2025-04-02
API Type gemini
This model is available for answering.
gemini google high-performance
Learn more Model Terms of Service

Meta

Llama 3.1 8B
Fast and cheap version of Llama 3.1 70B. Llama 3.1 is an auto-regressive language model that uses an optimized transformer architecture.
Parameters 8B
Context 128,000 tokens
Released 2024-07-24
API Type nebius
This model is available for answering and ranking.
llama meta cheap
Learn more Model Terms of Service
Llama 3.1 405B
Llama 3.1 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
Parameters 405B
Context 128,000 tokens
Released 2024-07-23
API Type nebius
This model is available for answering and ranking.
llama meta high-performance
Learn more Model Terms of Service
Llama 3.3 70B
Llama 3.3 multilingual LLM is a pretrained and instruction tuned generative model in 70B. The Llama 3.3 instruction tuned text only model is optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.
Parameters 70B
Context 128,000 tokens
Released 2024-04-18
API Type nebius
This model is available for answering and ranking.
llama meta high-performance
Learn more Model Terms of Service
Llama 4 Scout 17B (17Bx16E)
Fastest version of the Llama 4 collection of models. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
Parameters 109B
Context 10,000,000 tokens
Released 2024-4-5
API Type together
This model is available for answering and ranking.
llama meta high-performance
Learn more Model Terms of Service
Llama 4 Maverick 17B (17Bx128E)
The most powerful version of the Llama 4 collection of models. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
Parameters 400B
Context 1,000,000 tokens
Released 2024-4-5
API Type together
This model is available for answering and ranking.
llama meta high-performance
Learn more Model Terms of Service

Mistral AI

Mixtral 8x7B Instruct v0.1
Mixtral 8x7B Instruct v0.1 is a Mixture of Experts model with just 8x7B parameters. It is a good model for quick tasks and low cost.
Parameters 56B
Context 128,000 tokens
Released 2024-5-22
API Type together
This model is available for answering and ranking.
mistral very small cheap mixture-of-experts
Learn more Model Terms of Service
Mistral Small 24B
Mistral Small ( 2501 ) claims to be a high performance model in the small Large Models category below 70B, boasting 24B parameters and achieving state-of-the-art capabilities comparable to larger models.
Parameters 24B
Context 65,536 tokens
Released 2025-01-11
API Type together
This model is available for answering and ranking.
mistral small high-performance
Learn more Model Terms of Service

MoonShot AI

Kimi K2 Instruct
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model with 32 billion activated parameters and 1 trillion total parameters. Trained with the Muon optimizer, Kimi K2 achieves exceptional performance across frontier knowledge, reasoning, and coding tasks while being meticulously optimized for agentic capabilities
Parameters 1T
Context 128,000 tokens
Released 2025-07-10
API Type together
This model is available for answering and ranking.
Moonshot AI open source high-performance
Learn more Model Terms of Service

NVIDIA

Llama 3.1 Nemotron Super 49B
Thinking
Tier Free
Llama 3.1 Nemotron Super 49B v1 is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses in the coding, science, and technology domains.
Parameters 49B
Context 128,000 tokens
Released 2025-03-30
API Type nebius
This model is available for answering and ranking.
nemotron nvidia high-performance
Learn more Model Terms of Service
Llama 3.1 Nemotron Ultra 253B
Thinking
Tier 1
Nemotron Ultra 253B is an LLM which is a derivative of Llama 3.1 70B. It is a reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as RAG and tool calling.
Parameters 253B
Context 131,072 tokens
Released 2025-4-7
API Type nebius
This model is available for answering and ranking.
nemotron nvidia high-performance thinking
Learn more Model Terms of Service

OpenAI

GPT-4.1
The new top of the line OpenAi non-reasoning model, excelling at complex instructions and software engineering. With a 1 million token context window, it is optimized for precise code diffs, agent reliability, and high recall in large documents, suiting it for agents, IDE tools, and enterprise knowledge retrieval.
Parameters Unknown
Context 1,047,576 tokens
Released 2025-04-14
API Type openai
This model is available for answering and ranking.
gpt openai high-performance
Learn more Model Terms of Service
GPT-4.1 Mini
Fast and affordable version of GPT-4.1, the new top OpenAi non-reasoning model. With a 1,047,576 tokens context window, it is optimized for complex instructions and software engineering.
Parameters Unknown
Context 1,047,576 tokens
Released 2025-04-14
API Type openai
This model is available for answering and ranking.
gpt openai high-performance
Learn more Model Terms of Service
GPT-4.1 Nano
Fastest and cheapest version in the GPT-4.1 family of models. Ideal for cost-effective complex instructions and software engineering. With a 1 million token context window, it is optimized for precise code diffs, agent reliability, and high recall in large documents, suiting it for agents, IDE tools, and enterprise knowledge retrieval.
Parameters Unknown
Context 1,047,576 tokens
Released 2025-04-14
API Type openai
This model is available for answering and ranking.
gpt openai cost-effective
Learn more Model Terms of Service
o1
Thinking
Tier 1
The full reasoning model from OpenAi's o1 family of reasoning models, the o1-series. Still powerful (and expensive), it has since been succeeded by the more advanced o3 and o4 series.
Parameters Unknown
Context 200,000 tokens
Released 2024-12-17
API Type openai
This model is available for answering.
o3 openai high-performance thinking expensive
Learn more Model Terms of Service
o3
Thinking
Tier 1
The top reasoning model from OpenAi's family of reasoning models. Extremely powerful (and expensive) in all tasks.
Parameters Unknown
Context 128,000 tokens
Released 2025-01-22
API Type openai
This model is available for answering.
o3 openai high-performance thinking expensive
Learn more Model Terms of Service
o3 Mini
Thinking
Tier 1
Smaller, faster, and less expensive version of o3, the most powerful and currently available reasoning model in OpenAi's o-series
Parameters Unknown
Context 200,000 tokens
Released 2025-01-31
API Type openai
This model is available for answering and ranking.
o3 openai high-performance thinking
Learn more Model Terms of Service
o4 Mini
Thinking
Tier 1
A fast, cost-efficient reasoning model that balances performance with affordability. O4-mini shows strong performance in coding, math, and visual tasks, making it ideal for high-volume applications needing good-enough logic and faster response times without the higher cost of larger models.
Parameters Unknown
Context 200,000 tokens
Released 2025-04-16
API Type openai
This model is available for answering and ranking.
o3 openai high-performance thinking
Learn more Model Terms of Service

xAI

Grok-4
Thinking
Tier 1
Grok-4 is the most performing model in the Grok family. Ranking top in almost all bencmarks, it delivers strong performance on math, code, and logic tasks. It is especially good at tasks like code review, document analysis, planning, information extraction, and coding.
Parameters Unknown
Context 256,000 tokens
Released 2025-06-10
API Type grok
This model is available for answering and ranking.
grok xai thinking top-performing
Learn more Model Terms of Service
Grok-3
Thinking
Tier 1
Grok-3 is a powerful reasoning model from the Grok family. It is a state of the art reasoning model trained with reinforcement learning. It delivers strong performance on math, code, and logic tasks. It is especially good at tasks like code review, document analysis, planning, information extraction, and coding.
Parameters Unknown
Context 131,072 tokens
Released 2025-03-27
API Type grok
This model is available for answering and ranking.
grok xai high-performance
Learn more Model Terms of Service
Grok-3 Mini
Thinking
Tier Free
Grok-3 mini is the more affordable version of Grok-3 by xAI.
Parameters Unknown
Context 131,072 tokens
Released 2025-03-27
API Type grok
This model is available for answering and ranking.
grok xai high-performance
Learn more Model Terms of Service