dlt.destinations.impl.filesystem.filesystem
FilesystemLoadJob Objects
class FilesystemLoadJob(RunnableLoadJob)
make_remote_file_path
def make_remote_file_path(file_name: str) -> str
Returns the path on the remote filesystem that the file is moved to, without the scheme and dataset.
make_remote_path
def make_remote_path() -> str
Returns the path on the remote filesystem that the file(s) are moved to, without the scheme but with the dataset.
For a local filesystem, a native path is used.
make_remote_url
def make_remote_url() -> str
Returns the path on the remote filesystem as a full URL, including the scheme.
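The three methods above return progressively more complete locations: a file path without scheme and dataset, a path that adds the dataset, and a full URL that adds the scheme. The sketch below only illustrates that relationship. The helper names, the `<table>/<file>` layout, and the position of the bucket are assumptions made for the example and are not taken from the dlt source.

```python
import posixpath


def remote_file_path(table_name: str, file_name: str) -> str:
    # Hypothetical layout "<table>/<file>": no scheme, no dataset.
    return posixpath.join(table_name, file_name)


def remote_path(dataset: str, file_path: str) -> str:
    # Adds the dataset in front of the file path, still without a scheme.
    return posixpath.join(dataset, file_path)


def remote_url(scheme: str, bucket: str, path: str) -> str:
    # Adds the scheme (and, as an assumption, a bucket) to form a full URL.
    return f"{scheme}://{posixpath.join(bucket, path)}"


file_path = remote_file_path("events", "1234.parquet")
path = remote_path("my_dataset", file_path)
url = remote_url("s3", "my-bucket", path)
```

With these illustrative inputs, each value is a strict suffix of the next one, which mirrors the "without scheme and dataset", "with dataset", and "including scheme" wording of the docstrings.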
HfFilesystemUploadJob Objects
class HfFilesystemUploadJob(HasFollowupJobs, FilesystemLoadJob)
Pre-uploads a single file to HF LFS storage without creating a commit.
File content is uploaded to HF's content-addressed storage via preupload_lfs_files. The actual git commit is created later by HfFilesystemCommitJob once all uploads in the table chain complete.
HfFilesystemCommitJob Objects
class HfFilesystemCommitJob(ReferenceFollowupJob)
Commits pre-uploaded files to a HF dataset repo.
Files are already in HF's content-addressed storage (uploaded by HfFilesystemUploadJob). create_commit internally detects pre-uploaded blobs and only creates the git commit. Retries on 409/412 commit conflicts with exponential backoff via tenacity. Other errors propagate to the dlt load engine for dlt-level retry.
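The retry behavior described above (retry only on 409/412 conflicts, back off exponentially, let everything else propagate) can be sketched as follows. This is a standard-library stand-in and not the tenacity-based code the job uses. `HttpError`, `commit_with_backoff`, and the default delays are hypothetical names and values chosen for the example.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")

# Status codes treated as retryable commit conflicts, per the description above.
RETRYABLE_STATUS = {409, 412}


class HttpError(Exception):
    """Hypothetical error type carrying an HTTP status code."""

    def __init__(self, status_code: int) -> None:
        super().__init__(f"HTTP {status_code}")
        self.status_code = status_code


def commit_with_backoff(
    commit: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `commit`, retrying only on 409/412 with exponentially growing delays."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return commit()
        except HttpError as exc:
            # Non-conflict errors and the final failed attempt propagate to the caller.
            if exc.status_code not in RETRYABLE_STATUS or attempt == max_attempts:
                raise
            # Delay doubles on every retry: base, 2*base, 4*base, ...
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

Passing `sleep` as a parameter keeps the sketch testable without real waiting. In this sketch, an error that is not a 409/412 is re-raised immediately, which corresponds to handing the failure to an outer retry layer.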
make_path_in_repo
def make_path_in_repo(file_path: str) -> str
Returns path relative to repo root, without namespace and repo name.
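As an illustration of what "relative to repo root, without namespace and repo name" means, the sketch below strips a `namespace/repo` prefix from a path. The function `path_in_repo`, its extra `repo_id` parameter, and the sample paths are hypothetical. The real method takes only `file_path`, and how it obtains the repo identifier is not shown here.

```python
def path_in_repo(file_path: str, repo_id: str) -> str:
    """Strip a leading "<namespace>/<repo>/" prefix from a path (illustrative only)."""
    prefix = repo_id.strip("/") + "/"
    normalized = file_path.lstrip("/")
    if not normalized.startswith(prefix):
        raise ValueError(f"{file_path!r} is not inside repo {repo_id!r}")
    # What remains is the path relative to the repo root.
    return normalized[len(prefix):]


relative = path_in_repo(
    "my-org/my-dataset/data/events/1234.parquet",
    "my-org/my-dataset",
)
```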
FilesystemClient Objects
class FilesystemClient(FSClientBase, WithSqlClient, JobClientBase,
WithStagingDataset, WithStateSync, SupportsOpenTables)