ccah-500 第5题 How will the Fair Scheduler handle these two jobs?
2016-06-01 10:11
429 查看
5.You
have a cluster running with the fair Scheduler enabled. There are currently no jobs running on the cluster, and you submit a job A, so that only job A is running on the cluster. A while later, you submit Job B. now Job A and Job B are running on the cluster
at the same time. How will the Fair Scheduler handle these two jobs?
A. When Job B gets submitted, it will get assigned tasks, while job A continues to run with fewer tasks.
B. When Job B gets submitted, Job A has to finish first, before job B can gets scheduled.
C. When Job A gets submitted, it doesn't
consumes all the task slots.
D. When Job A gets submitted, it consumes all the task slots.
Answer: B -->
A
解析: A
oreily:
With the Fair Scheduler (iii in Figure 4-3), there is no need to reserve a set amount of
capacity, since it will dynamically balance resources between all running jobs. Just after
the first (large) job starts, it is the only job running, so it gets all the resources in the
cluster. When the second (small) job starts, it is allocated half of the cluster resources so
that each job is using its fair share of resources.
Note that there is a lag between the time the second job starts and when it receives its fair
share, since it has to wait for resources to free up as containers used by the first job
complete. After the small job completes and no longer requires resources, the large job
goes back to using the full cluster capacity again. The overall effect is both high cluster
utilization and timely small job completion.
Fair scheduling is a method of assigning resources to jobs such that all jobs get, on average, an equal share of resources over time. When there is a single job running,
that job uses the entire cluster. When other jobs are submitted, tasks slots that free up are ass
aed4
igned to the new jobs, so that each job gets roughly the same amount of CPU time. Unlike the default Hadoop scheduler, which forms a queue of jobs, this lets short
jobs finish in reasonable time while not starving long jobs. It is also a reasonable way to share a cluster between a number of users.
have a cluster running with the fair Scheduler enabled. There are currently no jobs running on the cluster, and you submit a job A, so that only job A is running on the cluster. A while later, you submit Job B. now Job A and Job B are running on the cluster
at the same time. How will the Fair Scheduler handle these two jobs?
A. When Job B gets submitted, it will get assigned tasks, while job A continues to run with fewer tasks.
B. When Job B gets submitted, Job A has to finish first, before job B can gets scheduled.
C. When Job A gets submitted, it doesn't
consumes all the task slots.
D. When Job A gets submitted, it consumes all the task slots.
Answer: B -->
A
解析: A
oreily:
With the Fair Scheduler (iii in Figure 4-3), there is no need to reserve a set amount of
capacity, since it will dynamically balance resources between all running jobs. Just after
the first (large) job starts, it is the only job running, so it gets all the resources in the
cluster. When the second (small) job starts, it is allocated half of the cluster resources so
that each job is using its fair share of resources.
Note that there is a lag between the time the second job starts and when it receives its fair
share, since it has to wait for resources to free up as containers used by the first job
complete. After the small job completes and no longer requires resources, the large job
goes back to using the full cluster capacity again. The overall effect is both high cluster
utilization and timely small job completion.
Fair scheduling is a method of assigning resources to jobs such that all jobs get, on average, an equal share of resources over time. When there is a single job running,
that job uses the entire cluster. When other jobs are submitted, tasks slots that free up are ass
aed4
igned to the new jobs, so that each job gets roughly the same amount of CPU time. Unlike the default Hadoop scheduler, which forms a queue of jobs, this lets short
jobs finish in reasonable time while not starving long jobs. It is also a reasonable way to share a cluster between a number of users.
相关文章推荐
- Cloudera 推动即时通讯巨头 LINE 实现数据驱动的创新
- Build Your Own Cluster
- Cloudera大数据培训开班信息
- Cloudera Hadoop管理员培训课程—成都站
- Cloudera Hadoop Developer开发者课程免费听
- Cloudera Hadoop Developer数据分析师—上海
- Cloudera 开发者 北京
- “暖心回馈”Cloudera全套课程价值11000元免费送!!
- Cloudera Hadoop 管理员 广州
- Cloudera Hadoop数据分析师课程—广州
- Hadoop Sentry 学习
- Troubleshooting: java.lang.RuntimeException: HMaster Aborted
- 离线安装cloudera manager5.0.2
- 【免费咨询】 Cloudera Hadoop大数据培训:管理员、开发者、数据分析
- 大数据在金融行业的应用——银行大数据
- 关于Cloudera Hadoop认证改革
- Hadoop技术预览
- Impala简明调优手册
- trying new thing
- 方案四成功了奥