📘 Notes on “Intro to Large Language Models”

1. Introduction to LLMs

Definition: A large neural network trained on vast text data to predict the next word in a sequence.
Scale: Billions of parameters, trained on hundreds of terabytes of text.
Examples: GPT series (OpenAI), Claude (Anthropic), Bard (Google).

Training Objective: Next-word prediction across large text corpora.
Capabilities: Language generation, reasoning, summarization, translation, coding, etc.
Emergent Properties: With scale, models gain reasoning, planning, and even “chain-of-thought” like abilities.

Custom Instructions: Modify system prompts to steer outputs.
File Uploads (RAG): Retrieval Augmented Generation — allows grounding responses in user data instead of internet alone.
Fine-Tuning (future): Adapt models for specialized domains/tasks.
“App Store of GPTs”: A marketplace of task-specific models/agents.

LLM ≈ Kernel Process orchestrating resources:
- Memory: Context window = working memory.
- Knowledge: Internet + local files via RAG.
- Tools: Python, calculator, APIs, external software.
- Modalities: Text, images, video, audio, music.
Parallel to traditional OS:
- User space ↔ prompts
- Kernel ↔ LLM core
- RAM ↔ context window
- Multi-threading, speculative execution analogies.
Ecosystem:
- Proprietary OS (GPT, Claude, Gemini).
- Open-source OS (LLaMA family, Falcon, Mistral).

Like traditional OS, LLMs face adversarial threats:

Attackers hide malicious instructions inside inputs.
Example: Hidden white-on-white text in an image → model obeys new instructions (e.g., fake ads).
Web browsing assistants (like Bing) vulnerable to malicious webpages injecting fraudulent links.
Google Docs injection attacks → attempts at exfiltrating private data.

Promise: LLMs as a new computing paradigm (LLM-OS).
Risks: Security attacks, trust issues, safety concerns.
Trend: Cat-and-mouse game between attacks and defenses (like traditional cybersecurity).

LLMs are not just chatbots → they are evolving into general-purpose problem-solving systems.
Ecosystem will mirror operating systems: mix of closed-source giants and open-source communities.
Security research is critical: jailbreaks, prompt injections, and poisoning are real threats.
The field is young, rapidly evolving, and exciting to follow.