Effective Long-Context Scaling of Foundation Models Paper Notes
This article includes my notes on “Effective Long-Context Scaling of Foundation Models” paper. All images if not stated oherwise are from the paper.
This article includes my notes on “Effective Long-Context Scaling of Foundation Models” paper. All images if not stated oherwise are from the paper.
Paper explores inscruction fine-tuning with focus on: Scaling number of tasks Scaling model size Fine-tuning of Chain of Thougts data
Techinques
Introduction
Introduction