Copying files and directories

This documents the expected behavior of the fsspec file and directory copying functions. There are three functions of interest here: copy(), get() and put(). Each of these copies files and/or directories from a source to a target location. If we refer to our filesystem of interest, derived from AbstractFileSystem, as the remote filesystem (even though it may be local) then the difference between the three functions is:

copy() copies from a remote source to a remote target

get() copies from a remote source to a local target

put() copies from a local source to a remote target

The source and target are the first two arguments passed to these functions, and each consists of one or more files, directories and/or glob (wildcard) patterns. The behavior of the fsspec copy functions is intended to be the same as that obtained using POSIX command line cp but fsspec functions have extra functionality because:

They support more than one target whereas command line cp is restricted to one.

They can create new directories, either automatically or via the auto_mkdir=True keyword argument, whereas command line cp only does this as part of a recursive copy.

Expected behavior

There follows a comprehensive list of the expected behavior of the fsspec copying functions that also forms the basis of a set of tests that all classes that derive from AbstractFileSystem can be tested against to check that they conform. For all scenarios the source filesystem contains the following directories and files:

📁 source
├── 📄 file1
├── 📄 file2
└── 📁 subdir
    ├── 📄 subfile1
    ├── 📄 subfile2
    └── 📁 nesteddir
        └── 📄 nestedfile

and before each scenario the target directory exists and is empty unless otherwise noted:

📁 target

All example code uses cp() which is an alias of copy(); equivalent behavior is expected by get() and put(). Forward slashes are used for directory separators throughout.

Copying files and directories

Expected behavior

1. Single source to single target

2. Multiple source to single target