Uploads the input data for a modeling job to a storage system supported by FATE.
flow data upload -c ${conf_path}
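Note: conf_path is the path to the configuration file. Its parameters are listed below.
Options
Parameter | Required | Type | Description |
---|---|---|---|
file | Yes | string | Path to the data file |
id_delimiter | Yes | string | Data delimiter, e.g. "," |
head | No | int | Whether the data has a header row |
partition | Yes | int | Number of data partitions |
storage_engine | No | string | Storage engine type; defaults to "EGGROLL"; "HDFS", "LOCALFS", "HIVE", and others are also supported |
namespace | Yes | string | Table namespace |
table_name | Yes | string | Table name |
storage_address | No | object | Storage address for the selected storage engine |
use_local_data | No | int | Defaults to 1, which uses data on the client machine; 0 uses data on the machine hosting the FATE Flow service |
drop | No | int | Whether to overwrite an existing upload |
extend_sid | No | bool | Whether to add a new uuid id column; defaults to False |
auto_increasing_sid | No | bool | Whether the new id column auto-increments (takes effect only when extend_sid is True); defaults to False |
with_meta | No | bool | Whether meta data is supplied; defaults to False |
meta | No | object | Metadata; empty by default; takes effect only when with_meta is true |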
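Meta information
Parameter | Required | Type | Description |
---|---|---|---|
input_format | No | string | Data format (dense, svmlight, tag:value) |
delimiter | No | string | Data delimiter; defaults to "," |
tag_with_value | No | bool | Applies to the tag data format: whether tags carry a value |
tag_value_delimiter | No | string | Delimiter between tag and value; defaults to ":" |
with_match_id | No | bool | Whether the data carries a match id |
id_list | No | object | Names of the id columns; takes effect when extend_sid is enabled, e.g. ["imei", "phone"] |
id_range | No | object | For tag/svmlight format data, which columns are ids |
exclusive_data_type | No | string | Column format for special data types |
data_type | No | string | Column data type; defaults to "float64" |
with_label | No | bool | Whether the data has a label; defaults to False |
label_name | No | string | Label name; defaults to "y" |
label_type | No | string | Label type; defaults to "int" |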
Note: in version 1.9.0 and later, passing the meta parameter causes anonymous information for the features to be generated.
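As a minimal sketch of how the meta block attaches to an upload configuration, the Python snippet below sets with_meta alongside meta, since meta only takes effect when with_meta is true. The helper name `build_upload_conf_with_meta` is hypothetical, and the meta values (a labelled dataset with label column "y") are assumptions chosen for illustration; only the parameter names come from the tables above.

```python
import json


def build_upload_conf_with_meta(base_conf, meta):
    """Return a copy of base_conf with the meta block attached.

    Hypothetical helper, not part of FATE Flow.
    """
    conf = dict(base_conf)
    conf["with_meta"] = True  # meta only takes effect when with_meta is true
    conf["meta"] = dict(meta)
    return conf


base_conf = {
    "file": "examples/data/breast_hetero_guest.csv",
    "id_delimiter": ",",
    "head": 1,
    "partition": 10,
    "namespace": "experiment",
    "table_name": "breast_hetero_guest",
}

# Illustrative meta values (assumed for this sketch).
meta = {
    "delimiter": ",",
    "with_label": True,
    "label_name": "y",
    "label_type": "int",
}

conf = build_upload_conf_with_meta(base_conf, meta)
print(json.dumps(conf, indent=2))
```

The printed JSON can be saved to a file and passed to `flow data upload -c`.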
Examples
{
"file": "examples/data/breast_hetero_guest.csv",
"id_delimiter": ",",
"head": 1,
"partition": 10,
"namespace": "experiment",
"table_name": "breast_hetero_guest",
"storage_engine": "EGGROLL"
}
{
"file": "examples/data/breast_hetero_guest.csv",
"id_delimiter": ",",
"head": 1,
"partition": 10,
"namespace": "experiment",
"table_name": "breast_hetero_guest",
"storage_engine": "HDFS"
}
{
"file": "examples/data/breast_hetero_guest.csv",
"id_delimiter": ",",
"head": 1,
"partition": 4,
"namespace": "experiment",
"table_name": "breast_hetero_guest",
"storage_engine": "LOCALFS"
}
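The configurations above can also be generated from a script. The following Python sketch checks that the parameters marked as required are present, writes the configuration to a JSON file, and builds the matching `flow data upload -c` argument list. The helper names and the file name `upload_conf.json` are hypothetical; the sketch only constructs the command and does not run it.

```python
import json
import tempfile
from pathlib import Path

# Parameters marked as required in the options table.
REQUIRED_UPLOAD_KEYS = ("file", "id_delimiter", "partition", "namespace", "table_name")


def write_upload_conf(conf, directory):
    """Validate required keys and write conf as JSON; return the file path."""
    missing = [key for key in REQUIRED_UPLOAD_KEYS if key not in conf]
    if missing:
        raise ValueError(f"missing required upload parameters: {missing}")
    path = Path(directory) / "upload_conf.json"  # hypothetical file name
    path.write_text(json.dumps(conf, indent=2), encoding="utf-8")
    return path


def upload_command(conf_path):
    """Build the argument list for `flow data upload -c <conf_path>`."""
    return ["flow", "data", "upload", "-c", str(conf_path)]


conf = {
    "file": "examples/data/breast_hetero_guest.csv",
    "id_delimiter": ",",
    "head": 1,
    "partition": 10,
    "namespace": "experiment",
    "table_name": "breast_hetero_guest",
    "storage_engine": "EGGROLL",
}

with tempfile.TemporaryDirectory() as tmp:
    conf_path = write_upload_conf(conf, tmp)
    command = upload_command(conf_path)
    # Read the file back to confirm it round-trips.
    saved = json.loads(conf_path.read_text(encoding="utf-8"))

print(" ".join(command))
```

On a machine where the flow client is installed and initialized, the list could be handed to `subprocess.run(command, capture_output=True, text=True)` while the configuration file still exists.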
Returns
Parameter | Type | Description |
---|---|---|
jobId | string | Job id |
retcode | int | Return code |
retmsg | string | Return message |
data | object | Returned data |
Example
{
"data": {
"board_url": "http://xxx.xxx.xxx.xxx:8080/index.html#/dashboard?job_id=202111081218319075660&role=local&party_id=0",
"code": 0,
"dsl_path": "/data/projects/fate/jobs/202111081218319075660/job_dsl.json",
"job_id": "202111081218319075660",
"logs_directory": "/data/projects/fate/logs/202111081218319075660",
"message": "success",
"model_info": {
"model_id": "local-0#model",
"model_version": "202111081218319075660"
},
"namespace": "experiment",
"pipeline_dsl_path": "/data/projects/fate/jobs/202111081218319075660/pipeline_dsl.json",
"runtime_conf_on_party_path": "/data/projects/fate/jobs/202111081218319075660/local/0/job_runtime_on_party_conf.json",
"runtime_conf_path": "/data/projects/fate/jobs/202111081218319075660/job_runtime_conf.json",
"table_name": "breast_hetero_host",
"train_runtime_conf_path": "/data/projects/fate/jobs/202111081218319075660/train_runtime_conf.json"
},
"jobId": "202111081218319075660",
"retcode": 0,
"retmsg": "success"
}
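A script that calls the command will usually want to check retcode and pull out the job id. The Python sketch below parses a response shaped like the example above; the function name `parse_flow_response` is hypothetical, the sample text is a shortened copy of that example, and treating any non-zero retcode as a failure is an assumption based on the example's `"retcode": 0` paired with `"retmsg": "success"`.

```python
import json


def parse_flow_response(text):
    """Parse a flow CLI JSON response and return the commonly used fields."""
    response = json.loads(text)
    if response.get("retcode") != 0:
        # Assumption: a non-zero retcode signals failure.
        raise RuntimeError(f"flow command failed: {response.get('retmsg')}")
    data = response.get("data", {})
    return {
        "job_id": response.get("jobId"),
        "board_url": data.get("board_url"),
        "namespace": data.get("namespace"),
        "table_name": data.get("table_name"),
    }


# Shortened copy of the example response.
sample = """
{
  "data": {
    "board_url": "http://xxx.xxx.xxx.xxx:8080/index.html#/dashboard?job_id=202111081218319075660&role=local&party_id=0",
    "job_id": "202111081218319075660",
    "namespace": "experiment",
    "table_name": "breast_hetero_host"
  },
  "jobId": "202111081218319075660",
  "retcode": 0,
  "retmsg": "success"
}
"""

result = parse_flow_response(sample)
print(result["job_id"], result["namespace"], result["table_name"])
```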
Queries the upload history.
flow data upload-history -l 20
flow data upload-history --job-id $JOB_ID
Options
Parameter | Required | Type | Description |
---|---|---|---|
-l --limit | No | int | Number of records to return (default: 10) |
-j --job-id | No | string | Job ID |
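For scripted use, the two invocations shown above can be assembled from optional arguments. This Python sketch builds the argument list only; the function name `upload_history_command` is hypothetical, and the flags are the ones used in the two example commands.

```python
def upload_history_command(limit=None, job_id=None):
    """Build the argument list for `flow data upload-history`."""
    command = ["flow", "data", "upload-history"]
    if job_id is not None:
        # Look up a single upload job by its id.
        command += ["--job-id", str(job_id)]
    if limit is not None:
        # Cap the number of records returned.
        command += ["-l", str(int(limit))]
    return command


print(" ".join(upload_history_command(limit=20)))
print(" ".join(upload_history_command(job_id="202111081218319075660")))
```

Omitting both arguments leaves the bare command, which relies on the default limit.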
Brief description:
Downloads data from a FATE storage engine into a data file.
flow data download -c ${conf_path}
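Note: conf_path is the path to the configuration file. Its parameters are listed below.
Options
Parameter | Required | Type | Description |
---|---|---|---|
output_path | Yes | string | Download path |
table_name | Yes | string | FATE table name |
namespace | Yes | string | FATE table namespace |
Example: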
{
"output_path": "/data/projects/fate/breast_hetero_guest.csv",
"namespace": "experiment",
"table_name": "breast_hetero_guest"
}
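A download configuration can be derived from the namespace and table name used at upload time. The Python sketch below builds one and rejects non-string values, since all three fields are strings in the example above. The helper name `build_download_conf` and the file name `download_conf.json` are hypothetical.

```python
import json


def build_download_conf(namespace, table_name, output_path):
    """Return a download configuration with the three required string fields."""
    fields = (
        ("namespace", namespace),
        ("table_name", table_name),
        ("output_path", output_path),
    )
    for label, value in fields:
        if not isinstance(value, str) or not value:
            raise TypeError(f"{label} must be a non-empty string")
    return {
        "output_path": output_path,
        "namespace": namespace,
        "table_name": table_name,
    }


download_conf = build_download_conf(
    "experiment",
    "breast_hetero_guest",
    "/data/projects/fate/breast_hetero_guest.csv",
)
# Hypothetical file name; the JSON below would be saved there first.
download_command = ["flow", "data", "download", "-c", "download_conf.json"]
print(json.dumps(download_conf, indent=2))
```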
Returns
Parameter | Type | Description |
---|---|---|
jobId | string | Job id |
retcode | int | Return code |
retmsg | string | Return message |
data | object | Returned data |
Example
{
"data": {
"board_url": "http://xxx.xxx.xxx.xxx:8080/index.html#/dashboard?job_id=202111081457135282090&role=local&party_id=0",
"code": 0,
"dsl_path": "/data/projects/fate/jobs/202111081457135282090/job_dsl.json",
"job_id": "202111081457135282090",
"logs_directory": "/data/projects/fate/logs/202111081457135282090",
"message": "success",
"model_info": {
"model_id": "local-0#model",
"model_version": "202111081457135282090"
},
"pipeline_dsl_path": "/data/projects/fate/jobs/202111081457135282090/pipeline_dsl.json",
"runtime_conf_on_party_path": "/data/projects/fate/jobs/202111081457135282090/local/0/job_runtime_on_party_conf.json",
"runtime_conf_path": "/data/projects/fate/jobs/202111081457135282090/job_runtime_conf.json",
"train_runtime_conf_path": "/data/projects/fate/jobs/202111081457135282090/train_runtime_conf.json"
},
"jobId": "202111081457135282090",
"retcode": 0,
"retmsg": "success"
}
Brief description:
Writes data from a FATE storage engine to an external engine, or saves the data as a new table.
flow data writer -c ${conf_path}
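Note: conf_path is the path to the configuration file. Its parameters are listed below.
Options
Parameter | Required | Type | Description |
---|---|---|---|
table_name | Yes | string | FATE table name |
namespace | Yes | string | FATE table namespace |
storage_engine | No | string | Storage type, e.g. MYSQL |
address | No | object | Storage address |
output_namespace | No | string | Namespace of the FATE table to save as |
output_name | No | string | Name of the FATE table to save as |
Note: storage_engine and address are used together to store the data in a specified engine; output_namespace and output_name are also used together to save the data as a new table in the same engine.
Example: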
{
"table_name": "name1",
"namespace": "namespace1",
"output_name": "name2",
"output_namespace": "namespace2"
}
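Because the optional parameters come in pairs, a configuration can be checked before it is submitted. The Python sketch below reports which targets a writer configuration selects and rejects a half-specified pair. The function name `writer_targets` and the target labels are hypothetical, and whether FATE Flow itself enforces this pairing, or accepts both pairs at once, is not stated here, so the check is only a client-side assumption.

```python
def writer_targets(conf):
    """Return the targets selected by a writer configuration.

    Raises ValueError if a required key is missing or a pair is incomplete.
    """
    for key in ("table_name", "namespace"):
        if key not in conf:
            raise ValueError(f"missing required parameter: {key}")

    pairs = {
        "external_engine": ("storage_engine", "address"),
        "new_table": ("output_namespace", "output_name"),
    }
    targets = []
    for target, (first, second) in pairs.items():
        present = [key for key in (first, second) if key in conf]
        if len(present) == 1:
            # Assumption: each pair must be supplied as a whole.
            raise ValueError(f"{first} and {second} must be given together")
        if len(present) == 2:
            targets.append(target)
    return targets


conf = {
    "table_name": "name1",
    "namespace": "namespace1",
    "output_name": "name2",
    "output_namespace": "namespace2",
}
print(writer_targets(conf))
```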
Returns
Parameter | Type | Description |
---|---|---|
jobId | string | Job id |
retcode | int | Return code |
retmsg | string | Return message |
data | object | Returned data |
Example
{
"data": {
"board_url": "http://xxx.xxx.xxx.xxx:8080/index.html#/dashboard?job_id=202201121235115028490&role=local&party_id=0",
"code": 0,
"dsl_path": "/data/projects/fate/fateflow/jobs/202201121235115028490/job_dsl.json",
"job_id": "202201121235115028490",
"logs_directory": "/data/projects/fate/fateflow/logs/202201121235115028490",
"message": "success",
"model_info": {
"model_id": "local-0#model",
"model_version": "202201121235115028490"
},
"pipeline_dsl_path": "/data/projects/fate/fateflow/jobs/202201121235115028490/pipeline_dsl.json",
"runtime_conf_on_party_path": "/data/projects/fate/fateflow/jobs/202201121235115028490/local/0/job_runtime_on_party_conf.json",
"runtime_conf_path": "/data/projects/fate/fateflow/jobs/202201121235115028490/job_runtime_conf.json",
"train_runtime_conf_path": "/data/projects/fate/fateflow/jobs/202201121235115028490/train_runtime_conf.json"
},
"jobId": "202201121235115028490",
"retcode": 0,
"retmsg": "success"
}