Private Datasets: Checking dataset accessibility in `kamu` subcommands (multi-tenant workspace) #1055

s373r · 2025-02-04T12:01:52Z

Preamble

We have implemented dataset availability checks in API (HTTP, GQL), so, for example, we cannot access another user's private datasets.
However, if we work locally with a workspace (multi-tenant) using kamu CLI, some of the checks are missing.

The purpose of this ticket is to achieve consistency/parity with the API when accessing other users' datasets.

Points of technical interest/patterns

In general, it is possible to identify generalized patterns (P) for obtaining datasets that are currently in use:

P1) Addressing a single dataset:

let dataset_handle = self
    .dataset_registry
    .resolve_dataset_handle_by_ref(&self.dataset_ref)
    .await
    .map_err(CLIError::failure)?;

P2) Obtaining datasets by pattern:

let dataset_handles = kamu::utils::datasets_filtering::filter_datasets_by_local_pattern(
        self.dataset_registry.as_ref(),
        self.dataset_ref_patterns.clone(),
    )
    .try_collect::<Vec<odf::DatasetHandle>>()
    .await?;

P3) Retrieving all datasets, followed by checking by arbitrary conditions

let dataset_handles = self.dataset_registry
    .all_dataset_handles()
    .try_collect::<Vec<odf::DatasetHandle>>()
    .await?;

Possible approaches when implementing checks:

A1) Straightforward: we can apply DatasetActionAuthorizer to the received dataset_handle(s), which will sift through the datasets, identifying only those to which the current user has access.

Optionally, the sifting can be hidden inside the filtering utilities: kamu::utils::datasets_filtering::*.

A2) Continue the trend of extracting (generalizing) use cases that may absorb the access checks.

As a reference implementation, it is suggested to check: ViewDatasetUseCase, EditDatasetUseCase. In some cases, these use cases will be enough to use.

The choice of proportion between the two approaches (A1, A2) is determined on the spot.

The text was updated successfully, but these errors were encountered:

s373r · 2025-02-04T14:08:15Z

It's worth considering using the tweaks from #1057 to simplify the job

s373r added enhancement New feature or request good first issue Good for newcomers rust Pull requests that update Rust code labels Feb 4, 2025

s373r mentioned this issue Feb 4, 2025

Tracking issue: Private Datasets, backend #676

Open

22 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Private Datasets: Checking dataset accessibility in `kamu` subcommands (multi-tenant workspace) #1055

Private Datasets: Checking dataset accessibility in `kamu` subcommands (multi-tenant workspace) #1055

s373r commented Feb 4, 2025

s373r commented Feb 4, 2025

Private Datasets: Checking dataset accessibility in kamu subcommands (multi-tenant workspace) #1055

Private Datasets: Checking dataset accessibility in kamu subcommands (multi-tenant workspace) #1055

Comments

s373r commented Feb 4, 2025

Preamble

Points of technical interest/patterns

s373r commented Feb 4, 2025

Private Datasets: Checking dataset accessibility in `kamu` subcommands (multi-tenant workspace) #1055

Private Datasets: Checking dataset accessibility in `kamu` subcommands (multi-tenant workspace) #1055