[Add] chapter 8 decoding and rename other chapters
MikeySaw committed Aug 12, 2024
1 parent f8abb5c commit 0adfe6b
Showing 15 changed files with 93 additions and 49 deletions.
16 changes: 16 additions & 0 deletions content/chapters/08_decoding/08_01_intro.md
@@ -0,0 +1,16 @@
---
title: "Chapter 08.01: What is Decoding?"
weight: 8001
---



<!--more-->

### Lecture Slides

{{< pdfjs file="https://github.com/slds-lmu/lecture_dl4nlp/blob/main/slides/chapter12-decoding/slides-121-intro.pdf" >}}

### References

- [1] [Radford et al., 2018](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
16 changes: 16 additions & 0 deletions content/chapters/08_decoding/08_02_determ.md
@@ -0,0 +1,16 @@
---
title: "Chapter 08.02: Greedy & Beam Search"
weight: 8002
---



<!--more-->

### Lecture Slides

{{< pdfjs file="https://github.com/slds-lmu/lecture_dl4nlp/blob/main/slides/chapter12-decoding/slides-122-determ.pdf" >}}

### References

- [1] [Radford et al., 2018](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
16 changes: 16 additions & 0 deletions content/chapters/08_decoding/08_03_sampling.md
@@ -0,0 +1,16 @@
---
title: "Chapter 08.03: Stochastic Decoding & CS/CD"
weight: 8003
---



<!--more-->

### Lecture Slides

{{< pdfjs file="https://github.com/slds-lmu/lecture_dl4nlp/blob/main/slides/chapter12-decoding/slides-123-sampling.pdf" >}}

### References

- [1] [Radford et al., 2018](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
16 changes: 16 additions & 0 deletions content/chapters/08_decoding/08_04_hyper_param.md
@@ -0,0 +1,16 @@
---
title: "Chapter 08.04: Decoding Hyperparameters & Practical considerations"
weight: 8004
---



<!--more-->

### Lecture Slides

{{< pdfjs file="https://github.com/slds-lmu/lecture_dl4nlp/blob/main/slides/chapter12-decoding/slides-124-hyper-param.pdf" >}}

### References

- [1] [Radford et al., 2018](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
16 changes: 16 additions & 0 deletions content/chapters/08_decoding/08_05_eval_metrics.md
@@ -0,0 +1,16 @@
---
title: "Chapter 08.05: Decoding Hyperparameters & Practical considerations"
weight: 8005
---



<!--more-->

### Lecture Slides

{{< pdfjs file="https://github.com/slds-lmu/lecture_dl4nlp/blob/main/slides/chapter12-decoding/slides-125-eval_metrics.pdf" >}}

### References

- [1] [Radford et al., 2018](https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf)
5 changes: 5 additions & 0 deletions content/chapters/08_decoding/_index.md
@@ -0,0 +1,5 @@
---
title: "Chapter 8: Decoding Strategies"
---

This chapter covers various decoding strategies. You will learn about deterministic methods (greedy search, beam search, contrastive search, contrastive decoding) and stochastic methods (top-k sampling, top-p sampling, sampling with temperature). The chapter also covers evaluation metrics for open-ended text generation.
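
As a rough, framework-agnostic sketch of the two families (the function names and toy logits below are assumptions, not code from the lecture), here is greedy decoding next to top-k sampling with temperature on a single next-token distribution:

```python
import numpy as np

def greedy(logits):
    """Deterministic: always pick the single most likely token."""
    return int(np.argmax(logits))

def top_k_sample(logits, k=5, temperature=1.0, seed=None):
    """Stochastic: renormalize over the k most likely tokens and sample."""
    rng = np.random.default_rng(seed)
    scaled = np.asarray(logits, dtype=float) / temperature  # <1 sharpens, >1 flattens
    top = np.argsort(scaled)[-k:]                           # indices of the k largest logits
    probs = np.exp(scaled[top] - scaled[top].max())
    probs /= probs.sum()                                    # softmax restricted to the top-k
    return int(rng.choice(top, p=probs))

logits = [2.0, 1.0, 0.5, 0.1, -1.0]   # toy next-token logits
print(greedy(logits))                 # always token 0
print(top_k_sample(logits, k=3))      # token 0, 1, or 2, chosen at random
```

Beam search extends the greedy idea by keeping the b highest-scoring partial sequences at each step, while top-p (nucleus) sampling replaces the fixed k with the smallest set of tokens whose cumulative probability exceeds p.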
@@ -1,6 +1,6 @@
---
-title: "Chapter 08.01: Instruction Fine-Tuning"
-weight: 8001
+title: "Chapter 09.01: Instruction Fine-Tuning"
+weight: 9001
---

Instruction fine-tuning aims to enhance the adaptability of large language models (LLMs) by providing explicit instructions or task descriptions, enabling more precise control over model behavior and adaptation to diverse contexts.
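
As a hedged sketch of the idea (the template and field names are assumptions modeled on common instruction-tuning setups, not the chapter's own code), each training example wraps an explicit instruction around an input-output pair:

```python
def format_example(instruction: str, model_input: str, output: str) -> str:
    """Render one supervised instruction-tuning example as a single training string."""
    return (
        f"### Instruction:\n{instruction}\n\n"
        f"### Input:\n{model_input}\n\n"
        f"### Response:\n{output}"
    )

example = format_example(
    instruction="Translate the sentence into German.",
    model_input="The weather is nice today.",
    output="Das Wetter ist heute schön.",
)
# The LLM is then fine-tuned with the usual next-token loss on such strings,
# often computed only on the response tokens.
```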
@@ -1,6 +1,6 @@
---
-title: "Chapter 08.02: Chain-of-thought Prompting"
-weight: 8002
+title: "Chapter 09.02: Chain-of-thought Prompting"
+weight: 9002
---

Chain-of-thought (CoT) prompting [1] is a prompting method that encourages large language models (LLMs) to explain their reasoning. It contrasts with standard prompting by not only seeking an answer but also requiring the model to lay out the steps it takes to arrive at that answer. By guiding the model through a logical chain of thought, CoT prompting encourages more structured and cohesive text, enabling LLMs to produce more accurate and informative outputs across various tasks and domains.
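
A hedged illustration of the contrast (the exemplar follows the well-known tennis-ball example from the CoT literature, but the exact wording here is an assumption):

```python
# Standard few-shot exemplar: question and final answer only.
standard_exemplar = (
    "Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?\n"
    "A: The answer is 11.\n\n"
)

# CoT exemplar: the same question, but the answer spells out the reasoning.
cot_exemplar = (
    "Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 balls. "
    "5 + 6 = 11. The answer is 11.\n\n"
)

question = "Q: A cafeteria had 23 apples. It used 20 and bought 6 more. How many apples do they have?\nA:"
prompt = cot_exemplar + question  # the model now tends to reason step by step before answering
```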
@@ -1,6 +1,6 @@
---
-title: "Chapter 08.03: Emergent Abilities"
-weight: 8003
+title: "Chapter 09.03: Emergent Abilities"
+weight: 9003
---
Various researchers have reported that large language models (LLMs) seem to have emergent abilities: new capabilities that appear suddenly as models are scaled up. In this section we introduce the concept of emergent abilities and discuss a potential counterargument to the concept of emergence.
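
One common counterargument holds that apparent emergence can be an artifact of discontinuous evaluation metrics. As a toy sketch (the numbers are assumptions, not experimental results): suppose per-token accuracy improves smoothly with scale, while the task is scored by exact match over a multi-token answer:

```python
k = 10  # length of the target answer in tokens
print("per-token acc -> exact-match acc")
for p in [0.50, 0.60, 0.70, 0.80, 0.90, 0.95, 0.99]:
    # Exact match requires all k tokens to be correct, so accuracy is p**k:
    # it stays near zero for a long time and then rises sharply, even though
    # the underlying per-token skill improves smoothly.
    print(f"{p:.2f}          -> {p**k:.4f}")
```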

@@ -1,5 +1,5 @@
---
-title: "Chapter 8: Large Language Models (LLMs)"
+title: "Chapter 9: Large Language Models (LLMs)"
---

In this chapter we cover LLM concepts such as instruction fine-tuning and chain-of-thought prompting, and discuss the possibility of emergent abilities of LLMs.
12 changes: 0 additions & 12 deletions content/chapters/10_multilingual/10_01_why_multilinguality.md

This file was deleted.

This file was deleted.

This file was deleted.

5 changes: 0 additions & 5 deletions content/chapters/10_multilingual/_index.md

This file was deleted.

@@ -1,5 +1,5 @@
---
-title: "Chapter 9: Reinforcement Learning from Human Feedback (RLHF)"
+title: "Chapter 10: Reinforcement Learning from Human Feedback (RLHF)"
---

In the context of natural language processing (NLP), RLHF (Reinforcement Learning from Human Feedback) trains language models to generate text or perform tasks based on evaluative signals, such as ratings or corrections, provided by human annotators or users. By iteratively adjusting model parameters to maximize the reward signal derived from this feedback, RLHF enables models to adapt to specific preferences or requirements, leading to more accurate and contextually relevant outputs in various NLP applications.
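
As a minimal, hedged sketch of one ingredient of this pipeline (the reward-model step; names, shapes, and the toy linear model are assumptions), human preferences between pairs of responses are typically distilled into a scalar reward model with a pairwise loss:

```python
import torch
import torch.nn.functional as F

def preference_loss(reward_model, chosen, rejected):
    """Bradley-Terry style loss: the human-preferred response should score higher."""
    r_chosen = reward_model(chosen)      # scalar reward per example, shape (batch,)
    r_rejected = reward_model(rejected)  # shape (batch,)
    return -F.logsigmoid(r_chosen - r_rejected).mean()

# Toy "reward model": a linear head over fixed-size response features.
model = torch.nn.Sequential(torch.nn.Linear(16, 1), torch.nn.Flatten(start_dim=0))
chosen, rejected = torch.randn(4, 16), torch.randn(4, 16)
loss = preference_loss(model, chosen, rejected)
loss.backward()
```

In the full pipeline, the trained reward model then scores sampled generations while a policy-gradient method such as PPO updates the language model to maximize that reward.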