When troubleshooting, you generally need the following logs:
${FATE_PROJECT_BASE}/fateflow/logs/$job_id/fate_flow_schedule.log
  The internal scheduling log of a specific job.
${FATE_PROJECT_BASE}/fateflow/logs/$job_id/*
  All execution logs of a specific job.
${FATE_PROJECT_BASE}/fateflow/logs/fate_flow/fate_flow_stat.log
  Logs that are not tied to any particular job.
${FATE_PROJECT_BASE}/fateflow/logs/fate_flow/fate_flow_schedule.log
  The overall scheduling log for all jobs.
${FATE_PROJECT_BASE}/fateflow/logs/fate_flow/fate_flow_detect.log
  The overall anomaly detection log for all jobs.
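In deployments where the logs directory sits directly under ${FATE_PROJECT_BASE} (no fateflow/ level), the same logs are at:
${FATE_PROJECT_BASE}/logs/$job_id/fate_flow_schedule.log
  The internal scheduling log of a specific job.
${FATE_PROJECT_BASE}/logs/$job_id/*
  All execution logs of a specific job.
${FATE_PROJECT_BASE}/logs/fate_flow/fate_flow_stat.log
  Logs that are not tied to any particular job.
${FATE_PROJECT_BASE}/logs/fate_flow/fate_flow_schedule.log
  The overall scheduling log for all jobs.
${FATE_PROJECT_BASE}/logs/fate_flow/fate_flow_detect.log
  The overall anomaly detection log for all jobs.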
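As a convenience when gathering these files, the paths listed above can be generated for a given job ID. The following is a minimal sketch, not part of FATE itself: the function names `candidate_logs` and `existing_logs` and the job ID in the usage section are hypothetical, and the script simply checks both directory layouts because it does not know which one a deployment uses.

```python
import os
from pathlib import Path

# Log files relative to a logs root; "{job_id}" is filled in per job.
LOG_FILES = [
    "{job_id}/fate_flow_schedule.log",   # internal scheduling log of one job
    "fate_flow/fate_flow_stat.log",      # logs not tied to any job
    "fate_flow/fate_flow_schedule.log",  # overall scheduling log for all jobs
    "fate_flow/fate_flow_detect.log",    # overall anomaly detection log for all jobs
]


def candidate_logs(base, job_id):
    """Return every candidate log path under both directory layouts."""
    roots = [Path(base) / "fateflow" / "logs", Path(base) / "logs"]
    return [root / name.format(job_id=job_id) for root in roots for name in LOG_FILES]


def existing_logs(base, job_id):
    """Keep only the candidate paths that are present on disk."""
    return [path for path in candidate_logs(base, job_id) if path.is_file()]


if __name__ == "__main__":
    # FATE_PROJECT_BASE is read from the environment; "." is only a fallback.
    base = os.environ.get("FATE_PROJECT_BASE", ".")
    for path in existing_logs(base, "202101010000000000001"):  # hypothetical job ID
        print(path)
```

The remaining execution logs of a job are whatever else sits in the same `$job_id` directory as its `fate_flow_schedule.log`.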
fate-servings is not deployed.
FATE Flow could not obtain the fate-servings address.
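FATE Flow looks up the fate-servings address in the following order of priority:
1. Read it from ZooKeeper (zk).
2. If ZooKeeper is not enabled, read it from the FATE service configuration file, located at:
   Version 1.5 and later: ${FATE_PROJECT_BASE}/conf/service_conf.yaml
   Versions earlier than 1.5: ${FATE_PROJECT_BASE}/arch/conf/server_conf.json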
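The lookup order above can be sketched as follows. This illustrates the described priority only and is not FATE Flow's actual code: `config_path`, `resolve_servings`, and the two lookup callables are hypothetical names, and the sketch assumes that "1.5+" includes 1.5 itself. What happens when ZooKeeper is enabled but returns no address is not covered by the text, so the sketch does not fall back in that case.

```python
from pathlib import Path


def config_path(base, version):
    """Pick the service config file for a FATE version string such as "1.4.2"."""
    major, minor = (int(part) for part in version.split(".")[:2])
    if (major, minor) >= (1, 5):
        return Path(base) / "conf" / "service_conf.yaml"
    return Path(base) / "arch" / "conf" / "server_conf.json"


def resolve_servings(zk_lookup, file_lookup):
    """Return the servings addresses following the documented priority.

    zk_lookup:   callable returning addresses from ZooKeeper, or None when
                 ZooKeeper is not enabled (hypothetical interface).
    file_lookup: callable returning addresses from the config file.
    """
    if zk_lookup is not None:
        # ZooKeeper is enabled, so it takes priority over the config file.
        return zk_lookup()
    # ZooKeeper is not enabled: use the service configuration file.
    return file_lookup()


if __name__ == "__main__":
    print(config_path("/data/projects/fate", "1.4.5"))
    print(resolve_servings(None, lambda: ["127.0.0.1:8000"]))
```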
${FATE_PROJECT_BASE}/conf/service_conf.yaml
servings:
  hosts:
    - 127.0.0.1:8000
${FATE_PROJECT_BASE}/arch/conf/server_conf.json
{
  "servers": {
    "servings": ["127.0.0.1:8000"]
  }
}
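To confirm that a configuration file actually contains a fate-servings address, the two layouts shown above can be parsed and inspected. The sketch below is an illustration under assumptions: the function names are hypothetical, the key layout is taken only from the two examples above, and the YAML variant relies on the third-party PyYAML package (`yaml.safe_load`), which is imported lazily so that the JSON variant runs with the standard library alone.

```python
import json


def servings_from_json(text):
    """Extract the servings addresses from server_conf.json content."""
    conf = json.loads(text)
    return conf.get("servers", {}).get("servings", [])


def servings_from_yaml(text):
    """Extract the servings addresses from service_conf.yaml content."""
    import yaml  # PyYAML (third-party), only needed for the YAML layout

    conf = yaml.safe_load(text) or {}
    return (conf.get("servings") or {}).get("hosts") or []


def split_host_port(address):
    """Split "host:port" into (host, port) with the port as an integer."""
    host, _, port = address.rpartition(":")
    return host, int(port)


if __name__ == "__main__":
    sample = '{"servers": {"servings": ["127.0.0.1:8000"]}}'
    for address in servings_from_json(sample):
        print(split_host_port(address))
```

An empty result from either function means the file holds no address for FATE Flow to read.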