Distributed copy lets users efficiently copy data from one under file system (UFS) to another. It can be used to relocate data or to keep data synchronized between two UFSes.
## Usage
There are two recommended ways to trigger distributed copy:
### job copy CLI
The CLI sends a copy request to the Alluxio coordinator, which subsequently distributes the copy operation to all worker nodes.
For detailed CLI usage, refer to the [job copy](../reference/User-CLI.md) documentation.
### REST API
Similar to the CLI, the REST API can also be used to copy data.
Requests are sent directly to the coordinator.
```shell
curl -H "Content-Type: application/json" -v -X POST http://coordinator_host:19999/api/v1/master/submit_job/copy -d '{
    "src": "s3://alluxiow/testm/dir-1/dir1",
    "dst": "s3://alluxiow/testm/dir-1/dir2",
    "options": {
        "batchSize": "300",
        "check_content": "true"
    }
}'
```
Progress can be checked by sending a GET request to the same path.
Copy jobs can be listed by sending a POST request. By default, the results include only copy tasks from the past seven days; the retention time for historical tasks can be configured through `alluxio.job.retention.time`.
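
For scripted use, the REST calls described in this section can be sketched in Python with only the standard library. This is a minimal sketch, not an official client: the URL, paths, and option names are copied from the curl example, while the helper names `build_copy_request` and `send` are hypothetical. It also assumes that the progress check identifies the job by the same `src`/`dst` JSON body, which this page does not confirm.

```python
import json
import urllib.request

# Endpoint copied from the curl example; replace coordinator_host with a real host.
COPY_JOB_URL = "http://coordinator_host:19999/api/v1/master/submit_job/copy"


def build_copy_request(src, dst, options=None, method="POST"):
    """Build (but do not send) a JSON request for the copy job endpoint."""
    body = {"src": src, "dst": dst}
    if options:
        body["options"] = options
    return urllib.request.Request(
        COPY_JOB_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method=method,
    )


def send(request, timeout=30):
    """Send a prepared request and return the response body as text."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


SRC = "s3://alluxiow/testm/dir-1/dir1"
DST = "s3://alluxiow/testm/dir-1/dir2"

# Submit request, mirroring the curl example (option values are strings there).
submit = build_copy_request(
    SRC, DST, options={"batchSize": "300", "check_content": "true"}
)

# Progress request: GET on the same path.
# Assumption: the job is identified by the same src/dst body.
progress = build_copy_request(SRC, DST, method="GET")
```

Nothing is sent until `send(submit)` or `send(progress)` is called against a reachable coordinator, so the requests can be inspected or logged first.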