AI & Voice

LLM (Large Language Model)

AI model trained on vast text data — generates human-like text, answers questions, follows instructions.

LLMs include OpenAI's GPT-4/4o/4-mini, Anthropic's Claude (Sonnet, Opus, Haiku), Google's Gemini Pro/Flash, Meta's Llama, Mistral, and many others. As of 2026, frontier models (Claude Opus 4.7, GPT-5, Gemini Ultra) approach human expert-level on many tasks.

Key dimensions: context window (how much text the model can see at once — typically 8K to 1M tokens), pricing (per 1K input/output tokens), latency (response speed), and capability (reasoning, code, instruction-following).

For Indian SMBs, LLM choice depends on use case + budget: GPT-4o-mini or Gemini Flash for cheap high-volume tasks; Claude Sonnet or GPT-4o for quality-critical; Llama 3 for self-hosted privacy needs.

India context

Indian SaaS founders increasingly use OpenRouter (model aggregator) to mix-and-match LLMs without managing 5+ provider integrations. Free models on OpenRouter handle 80% of bulk content generation needs at zero cost.

Examples

A blog generator uses Gemini Flash for cheap drafts, Claude Sonnet for premium edits.
A customer support bot uses GPT-4o-mini for quick answers, escalates to Claude Sonnet for complex queries.

FAQ

Which LLM should I use?

Depends on use case. Cheap + fast: GPT-4o-mini, Gemini Flash. Quality: Claude Sonnet, GPT-4o. Self-hosted privacy: Llama 3 / Mistral. Test 2-3 on your specific tasks.

What's a context window?

Max tokens an LLM can read at once. GPT-4o = 128K tokens. Claude Sonnet = 200K. Gemini = 1M. Bigger windows enable more context-rich responses.

How expensive are LLMs?

Varies wildly. GPT-4o-mini ~₹0.15/1M input tokens. Claude Opus ~₹15/1M input tokens. Free tiers exist on OpenRouter for many models.

Related concepts

RAGembeddingfine-tuningcontext windowOpenRouterhallucination

Doggu handles LLM (Large Language Model) compliance for you.

Whether it's automating the workflow above, Doggu was built specifically for the Indian SMB regulatory environment. One platform, all the requirements.

Try Doggu free for 14 days

Related glossary entries

AOV (Average Order Value) — Average rupee value per customer transaction — key revenue lever for ecommerce.RAG (Retrieval-Augmented Generation) — Technique where an LLM is given relevant documents/data at query time, instead of relying solely on its training data.Hallucination (AI) — When an LLM generates plausible-sounding but factually incorrect or fabricated content.

More in AI & Voice

STT (Speech-to-Text)TTS (Text-to-Speech)RAG (Retrieval-Augmented Generation)Hallucination (AI)

← All glossary entries Blog WhatsApp Templates Free tools