荣懿's Academic Website
Dynamic Programming
DAPPLE: A Pipelined Data Parallel Approach for Training Large Models
We propose DAPPLE, a synchronous training framework which combines data parallelism and pipeline parallelism for large DNN models. It features a novel parallelization strategy planner to solve the partition and placement problems, and explores the optimal hybrid strategy of data and pipeline parallelism. We also propose a new runtime scheduling algorithm…
Shiqing Fan, 荣懿, Chen Meng, Zongyan Cao, Siyu Wang, Zhen Zheng, Chuan Wu, Guoping Long, Jun Yang, Lixue Xia, LanSong Diao, Xiaoyong Liu, Wei Lin