diff --git a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/16node-BF16-GBS1024/recipe/README.md b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/16node-BF16-GBS1024/recipe/README.md index 58e32ac8..85ef56c1 100644 --- a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/16node-BF16-GBS1024/recipe/README.md +++ b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/16node-BF16-GBS1024/recipe/README.md @@ -1,7 +1,7 @@ # Pretrain qwen3-235b-a22b-bf16-gbs1024-gpus64 workloads on a4x GKE Node pools with Megatron-Bridge -This recipe outlines the steps for running a qwen3-30b pretraining +This recipe outlines the steps for running a qwen3-235b-a22b pretraining workload on [a4x GKE Node pools](https://cloud.google.com/kubernetes-engine) by using the [NVIDIA Megatron-Bridge framework](https://github.com/NVIDIA-NeMo/Megatron-Bridge). diff --git a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/32node-BF16-GBS2048/recipe/README.md b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/32node-BF16-GBS2048/recipe/README.md index 5d13b9c5..b34eff1a 100644 --- a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/32node-BF16-GBS2048/recipe/README.md +++ b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/32node-BF16-GBS2048/recipe/README.md @@ -1,7 +1,7 @@ # Pretrain qwen3-235b-a22b-bf16-gbs2048-gpus128 workloads on a4x GKE Node pools with Megatron-Bridge -This recipe outlines the steps for running a qwen3-30b pretraining +This recipe outlines the steps for running a qwen3-235b-a22b pretraining workload on [a4x GKE Node pools](https://cloud.google.com/kubernetes-engine) by using the [NVIDIA Megatron-Bridge framework](https://github.com/NVIDIA-NeMo/Megatron-Bridge). diff --git a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/64node-BF16-GBS4096/recipe/README.md b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/64node-BF16-GBS4096/recipe/README.md index 48b99f79..44148ec0 100644 --- a/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/64node-BF16-GBS4096/recipe/README.md +++ b/training/a4x/qwen3-235b-a22b/megatron-bridge-pretraining-gke/64node-BF16-GBS4096/recipe/README.md @@ -1,7 +1,7 @@ # Pretrain qwen3-235b-a22b-bf16-gbs4096-gpus256 workloads on a4x GKE Node pools with Megatron-Bridge -This recipe outlines the steps for running a qwen3-30b pretraining +This recipe outlines the steps for running a qwen3-235b-a22b pretraining workload on [a4x GKE Node pools](https://cloud.google.com/kubernetes-engine) by using the [NVIDIA Megatron-Bridge framework](https://github.com/NVIDIA-NeMo/Megatron-Bridge).