LLMs include OpenAI's GPT-4/4o/4-mini, Anthropic's Claude (Sonnet, Opus, Haiku), Google's Gemini Pro/Flash, Meta's Llama, Mistral, and many others. As of 2026, frontier models (Claude Opus 4.7, GPT-5, Gemini Ultra) approach human expert-level on many tasks.
Key dimensions: context window (how much text the model can see at once — typically 8K to 1M tokens), pricing (per 1K input/output tokens), latency (response speed), and capability (reasoning, code, instruction-following).
For Indian SMBs, LLM choice depends on use case + budget: GPT-4o-mini or Gemini Flash for cheap high-volume tasks; Claude Sonnet or GPT-4o for quality-critical; Llama 3 for self-hosted privacy needs.