RAG grounds LLM outputs in retrieved documents via sparse, dense, or hybrid search, improving factuality, citations, and freshness…
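The retrieval step can be sketched with a toy sparse (term-overlap) scorer over a hypothetical three-document corpus; real RAG systems would use BM25 or dense embeddings and send the built prompt to an actual LLM.

```python
# Minimal RAG sketch: sparse term-overlap retrieval plus prompt stuffing.
# The corpus, scorer, and prompt template here are illustrative assumptions.
corpus = [
    "the eiffel tower is in paris",
    "llms generate text from prompts",
    "paris is the capital of france",
]

def sparse_score(query: str, doc: str) -> float:
    # Fraction of query terms that appear in the document (toy stand-in for BM25).
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / (len(q) or 1)

def retrieve(query: str, k: int = 2) -> list[str]:
    # Rank every document by overlap score and keep the top-k.
    return sorted(corpus, key=lambda doc: sparse_score(query, doc), reverse=True)[:k]

def build_prompt(query: str) -> str:
    # Ground the model by placing retrieved passages ahead of the question.
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
```

A hybrid setup would combine this sparse score with a dense-embedding similarity before ranking.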
Grouped-Query Attention shares keys/values across groups of query heads, shrinking KV caches and bandwidth to speed LLM inference…
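The key/value sharing can be shown in a small NumPy sketch: query heads are split into groups, and every head in a group attends against the same K/V head, so the cache holds `n_kv_heads` rather than `n_q_heads` key/value tensors. Shapes and names are illustrative assumptions.

```python
import numpy as np

def gqa(q, k, v, n_kv_heads):
    # q: (n_q_heads, seq, d); k, v: (n_kv_heads, seq, d) with n_kv_heads < n_q_heads.
    n_q_heads = q.shape[0]
    group = n_q_heads // n_kv_heads
    out = np.empty_like(q)
    for h in range(n_q_heads):
        kv = h // group  # all query heads in a group share one K/V head
        scores = q[h] @ k[kv].T / np.sqrt(q.shape[-1])
        w = np.exp(scores - scores.max(-1, keepdims=True))
        w /= w.sum(-1, keepdims=True)  # row-wise softmax
        out[h] = w @ v[kv]
    return out
```

With `n_kv_heads == n_q_heads` this reduces to standard multi-head attention; with `n_kv_heads == 1` it is multi-query attention.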
ReAct prompting interleaves reasoning with tool actions and observations (Thought → Action → Observation), letting LLM agents plan,…
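The Thought → Action → Observation loop can be sketched with a hand-scripted "policy" standing in for the LLM and a single toy calculator tool; the tool registry and script below are assumptions for illustration.

```python
# Toy ReAct loop: a scripted policy replaces the LLM's next-step generation.
tools = {"calc": lambda expr: str(eval(expr, {"__builtins__": {}}))}

# Each step is (thought, action); action None means the agent stops and answers.
script = [
    ("I should compute 6*7 with the calculator.", ("calc", "6*7")),
    ("The observation gives the final answer.", None),
]

def react(script):
    trace = []
    for thought, action in script:
        trace.append(f"Thought: {thought}")
        if action is None:
            break
        name, arg = action
        trace.append(f"Action: {name}[{arg}]")
        trace.append(f"Observation: {tools[name](arg)}")  # feed result back
    return trace
```

In a real agent, each Observation is appended to the prompt and the LLM generates the next Thought/Action pair.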
Toolformer teaches LMs to autonomously invoke external tools during generation by training on interleaved tool-call traces, boosting factuality…
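The interleaved tool-call traces can be illustrated by executing inline call markers embedded in text; the `[Calc(...)]` marker syntax below is an assumption loosely modeled on the paper's API-call format, not its exact implementation.

```python
import re

def execute_tool_calls(text: str) -> str:
    # Turn "[Calc(expr)]" markers into "[Calc(expr) -> result]" spans,
    # mimicking the interleaved tool-call traces Toolformer-style training uses.
    def run(match: re.Match) -> str:
        expr = match.group(1)
        result = eval(expr, {"__builtins__": {}})  # sandboxed toy calculator
        return f"[Calc({expr}) -> {result}]"
    return re.sub(r"\[Calc\(([^)]*)\)\]", run, text)
```

Training on such augmented text teaches the model where a call helps predict the following tokens.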
LoRA fine-tunes LLMs by training small low-rank adapters on top of frozen weights, slashing memory and compute while…
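The low-rank update can be sketched in NumPy: the frozen weight `W` is augmented with `B @ A`, where only the small matrices `A` (r×in) and `B` (out×r) would be trained. Shapes and the `alpha` scaling convention follow the LoRA paper; the forward function name is an assumption.

```python
import numpy as np

def lora_forward(x, W, A, B, alpha):
    # W: frozen (out, in); A: (r, in); B: (out, r). Only A and B are trainable.
    r = A.shape[0]
    W_eff = W + (alpha / r) * (B @ A)  # low-rank delta scaled by alpha/r
    return x @ W_eff.T
```

Since `B` is zero-initialized in LoRA, the adapted model starts out exactly equal to the base model.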
FlashAttention is an IO-aware, exact attention algorithm that tiles work into GPU SRAM and fuses kernels to cut…
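The tiling idea can be sketched with the online-softmax trick in NumPy: scores are computed one K/V block at a time while running maxima and denominators are maintained, so the full score matrix is never materialized. This is a single-query-block CPU sketch of the math, not the fused GPU kernel.

```python
import numpy as np

def flash_attention(q, k, v, block=2):
    # Exact attention computed blockwise over K/V using online softmax.
    d = q.shape[-1]
    out = np.zeros_like(q, dtype=float)
    m = np.full(q.shape[0], -np.inf)  # running row-wise max of scores
    l = np.zeros(q.shape[0])          # running softmax denominator
    for s in range(0, k.shape[0], block):
        kb, vb = k[s:s + block], v[s:s + block]
        scores = q @ kb.T / np.sqrt(d)
        m_new = np.maximum(m, scores.max(-1))
        p = np.exp(scores - m_new[:, None])
        scale = np.exp(m - m_new)               # rescale previous partial sums
        l = l * scale + p.sum(-1)
        out = out * scale[:, None] + p @ vb
        m = m_new
    return out / l[:, None]
```

The result matches naive `softmax(qkᵀ/√d)·v` exactly; the speedup on GPUs comes from keeping each tile in SRAM rather than round-tripping through HBM.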
PagedAttention organizes LLM KV caches into fixed-size pages to reduce fragmentation, enable continuous batching, and support long…
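The paging scheme can be sketched as a small allocator: each sequence gets a page table mapping logical cache blocks to fixed-size physical pages that are grabbed from and returned to a free list. The class and method names are illustrative assumptions, not vLLM's API.

```python
class PagedKVCache:
    # Toy KV-cache page allocator in the spirit of PagedAttention.
    def __init__(self, num_pages: int, page_size: int):
        self.page_size = page_size
        self.free = list(range(num_pages))  # free physical page ids
        self.tables = {}   # seq_id -> list of physical page ids (page table)
        self.lengths = {}  # seq_id -> number of cached tokens

    def append_token(self, seq_id: str) -> None:
        n = self.lengths.get(seq_id, 0)
        if n % self.page_size == 0:  # current page full: allocate a new one
            self.tables.setdefault(seq_id, []).append(self.free.pop())
        self.lengths[seq_id] = n + 1

    def free_seq(self, seq_id: str) -> None:
        # Return all of a finished sequence's pages to the free list.
        self.free.extend(self.tables.pop(seq_id, []))
        self.lengths.pop(seq_id, None)
```

Because pages need not be contiguous, memory waste is bounded by at most one partially filled page per sequence.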
A Vision-Language Model (VLM) jointly learns from images and text to understand and generate multimodal content, enabling captioning,…
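One common fusion recipe (used, for example, in LLaVA-style models) projects image patch features into the language model's embedding space and prepends them to the text token embeddings. The sketch below assumes hypothetical shapes and a learned linear projection `W_proj`.

```python
import numpy as np

def build_multimodal_input(patches, text_ids, W_proj, text_emb):
    # patches: (n_patches, d_img) vision-encoder features.
    # W_proj: (d_img, d_model) learned projection into the LM embedding space.
    # text_emb: (vocab, d_model) token embedding table; text_ids: token indices.
    img_tokens = patches @ W_proj       # image patches become pseudo-tokens
    txt_tokens = text_emb[text_ids]
    # The decoder then attends over image and text tokens as one sequence.
    return np.concatenate([img_tokens, txt_tokens], axis=0)
```

From the decoder's perspective the projected patches are just extra prefix tokens, which is what lets one model handle captioning and visual question answering.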
ICL lets LLMs infer tasks from prompt-only examples—no weight updates—enabling zero/few-shot classification, extraction, and reasoning with schema-following in…
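Since in-context learning needs no weight updates, the whole mechanism lives in prompt construction. A minimal few-shot prompt builder, with an illustrative sentiment task as the assumed use case:

```python
def few_shot_prompt(examples, query, instruction="Classify the sentiment as positive or negative"):
    # examples: list of (text, label) demonstrations; the model infers the
    # task and output schema purely from these in-prompt pairs.
    parts = [instruction + ".", ""]
    for text, label in examples:
        parts.append(f"Text: {text}\nLabel: {label}\n")
    parts.append(f"Text: {query}\nLabel:")  # model completes the final label
    return "\n".join(parts)
```

The trailing `Label:` cue is what nudges the model to follow the demonstrated schema; with zero examples the same template becomes a zero-shot prompt.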
Mixture of Experts (MoE) scales model capacity by routing each token to a small subset of expert networks,…
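The routing step can be sketched in NumPy: a gating network scores every expert per token, each token is sent only to its top-k experts, and their outputs are combined with renormalized gate weights. The per-token loop and linear experts are simplifying assumptions.

```python
import numpy as np

def moe_layer(x, gate_W, experts, k=2):
    # x: (tokens, d); gate_W: (d, n_experts); experts: list of (W, b) with W: (d, d).
    logits = x @ gate_W                   # gating scores per token per expert
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        top = np.argsort(logits[t])[-k:]  # route token t to its top-k experts
        w = np.exp(logits[t][top])
        w /= w.sum()                      # renormalize gates over chosen experts
        for wi, e in zip(w, top):
            W, b = experts[e]
            out[t] += wi * (x[t] @ W + b)
    return out
```

Only k of the experts run per token, which is how MoE grows parameter count without a proportional increase in per-token compute.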