Skip to content

Latest commit

 

History

History
15 lines (10 loc) · 511 Bytes

File metadata and controls

15 lines (10 loc) · 511 Bytes

LLAMA3 Architecture

This repo hosts a Jupyter Notebook that demonstrates the architecture, training, and inference process of the LLAMA3 language model. It includes sections for model initialization, training, and inference, with detailed explanations of the implementation. The test dataset is the tiny shakespeare dataset, included in the repo.


Prerequisites

  • Python 3.8 or higher
  • Required libraries:
    • torch
    • transformers
    • numpy
    • matplotlib (optional for visualizations)