Skip to content

Latest commit

 

History

History
25 lines (21 loc) · 466 Bytes

parallel.md

File metadata and controls

25 lines (21 loc) · 466 Bytes

3D parallelism parameters.

megatron

  • Tensor Parallel
  • Pipeline Parallel
  • GPU Num = DP * TP * PP
  • sequence_parallel Input alignment needs to be a multiple of the tensor processing size.
runtime:
  seed: &seed 42
  tensor_model_parallel_size: 4  # TP
  pipeline_model_parallel_size: 2  # PP
  ...

sequence parallel config example

model:
  type: llama2_70b
  kwargs:
     use_flash_attn: True
     sequence_parallel: True