Stores
In SDK there is two types of stores: Store
and DataStore
. The first one is used to store/collect files, the second one is used to store/collect dataframes. Every DataStore
has a Store
as parent.
We support natively the following stores:
s3
for MinIO (default store)local
for local filesremote
for remote http/https filessql
for PostgreSQL database tables
Entity paths
When declaring an Artifact
, a Dataitem
or a Model
, you must specify a path
parameter to declare where the files are stored. There are some rules to follow when specifying the path:
- The
path
parameter is asstr
. - It can have a scheme declared in the beginning of the path.
- Based on the scheme, the SDK will create a
Store
object to interact with the files/tables in the path. - The path can point to a single file, a directory, a partition or a table. See below for more details.
Supported path types
Local paths
A local path is declared by not providing a scheme. For example:
S3 paths
To declare an S3 path, you need to provide the scheme s3://
. The first element of the path must be the bucket name and the second must be the key/partition. If you provide a partition path, you need to append a final /
at the end. For example:
Remote paths
To declare a remote path, you need to provide the scheme http://
or https://
. For example:
SQL paths
To declare a SQL path, you need to provide the scheme sql://
. An SQL path will always point to a single table. The path is composed by three parts, delimited by /
. The first part is the database name, the second (optional) is the schema and the third is the table name. For example: