Generative AI¶

LLM¶

Large Language Models

Limitations¶

Bias
Hallucinations
Expensive to build & run

ChatGPT¶

Train supervised policy
Provide prompt
Labeler demonstrates desired output behavior
Fine-tune model
Collect comparaison data & train reward model
Prompt and several model outputs are samples
Labeler ranks outputs from best to worst
Data used to train reward model
Policy optimization

Fine-Tuning¶

Process of training model using specific data, usually with a significantly smaller learning rate

Disadvantages¶

requires copy of the model
associated costs of hosting it
Risk of “catastrophic forgetting”: model forgets previously learnt information

RAG¶

Retrieval Augmented Generation

Makes use of a source of knowledge, usually vector store of embeddings and associated texts

By comparing predicted embeddings of query to embeddings in the vector store, we can form a prompt for the LLM that fits inside its context and contains the information needed to answer the question

Advantages¶

Does not require re-training
No need to deal with internal workings of model
Just adjust the data that the model “cheats” off
Reduces the amount a model “hallucinates”

Difficulties¶

Finding relevant data to give the model

Keywords¶

Data organization:
Vector creation: Unique index that points right to a chunk of information
Querying: Prompting
Retrieval
Prompt goes through embedding model and transforms into a vector
Systems uses this to get the chunks most relevant to the question
Prepending the context: Most relevant chunks are served up as context
Answer generation

Types of Questions¶


	No special skills required to answer Just need right reference material	What is the capital of France?
		Write a poem in German Write a computer program to calculate the first n natural numbers

Last Updated: 2024-12-26 ; Contributors: AhmedThahir, web-flow