FigshareDataLoader#

class polpo.preprocessing.load.FigshareDataLoader(figshare_id, remote_path, data_dir=None, use_cache=True, local_basename=None, version=1, remove_id=True)[source]#

Bases: PreprocessingStep, CacheableDataLoader

Transfer files and directories from figshare.

Parameters:
  • figshare_id (int) – Id of figshare article.

  • remote_path (str) – Path to retrieve from remote host.

  • data_dir (str) – Directory where to store data.

  • use_cache (bool) – Whether to verify if data is already available locally.

  • local_basename (str) – Basename of transferred file/folder if different from remote host.

  • version (int) – Dataset version.

  • remove_id (bool) – Whether to remove figshare added id when downloading items that are within a folder.

Notes

  • If remote_path does not exist in figshare, it automatically downloads

the full dataset. Code will fail after download. Warning will give information about downloaded zip. Can’t know in advance if this will happen.

apply(data=None)[source]#

Apply step.

load()[source]#

Load data.