Add UnavailableDataset as a default dataset for lite mode #2083

ravi-kumar-pilla · 2024-09-05T20:19:29Z

Description

Related to #2058

Development notes

Created a custom dataset UnavailableDataset
Updated the default dataset from MemoryDataset to UnavailableDataset in --lite mode.
Updated filepath to show N/A instead of - when it is empty, as suggested by @stephkaiser
Updated tests
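
For readers who want a picture of the idea, here is a minimal sketch of what a placeholder dataset could look like. It is not the code from this PR: the class below is a plain-Python stand-in (a real version would presumably subclass Kedro's dataset base class), and `display_filepath` is a hypothetical helper added only to illustrate the N/A fallback.

```python
class UnavailableDataset:
    """Hypothetical stand-in for a catalog entry whose real dataset
    class could not be imported (for example, kedro-datasets is missing)."""

    def __init__(self, data=None, metadata=None):
        self.data = data
        self.metadata = metadata

    def load(self):
        # Nothing to load: the real dataset implementation is not installed.
        return None

    def save(self, data):
        # Saving is a no-op for the same reason.
        return None

    def exists(self):
        # The placeholder never points at real data.
        return False

    def describe(self):
        # Report the placeholder's own class name as the dataset type.
        return {"type": type(self).__name__}


def display_filepath(filepath):
    # Hypothetical helper: show "N/A" when no filepath is known.
    return filepath if filepath else "N/A"
```

With this sketch, `UnavailableDataset().describe()` reports the type name that the metadata panel would display, and an empty or missing filepath is rendered as N/A.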

QA notes

When you run kedro viz --lite with kedro-datasets missing from the conda env, the metadata panel should show the dataset type as UnavailableDataset for any dataset that is registered in catalog.yml (i.e., not a MemoryDataset)

All tests should pass

Checklist

Read the contributing guidelines
Opened this PR as a 'Draft Pull Request' if it is work-in-progress
Updated the documentation to reflect the code changes
Added new entries to the RELEASE.md file
Added tests to cover my changes

astrojuanlu · 2024-09-09T16:33:45Z

Thanks @ravi-kumar-pilla - could you clarify what was the situation before this PR?

ravi-kumar-pilla · 2024-09-09T17:30:39Z

> Thanks @ravi-kumar-pilla - could you clarify what was the situation before this PR?

Hi @astrojuanlu, thanks for reviewing. Before this change, the default was MemoryDataset. This PR is part of making users aware of the missing dependency. MemoryDataset would be misleading here, as the dataset is registered in the catalog.

stephkaiser · 2024-09-10T11:18:45Z

thanks Ravi! using Gitpod, filepath is showing as N/A for me 👍 but still shows MemoryDataset, maybe i did something wrong?

ravi-kumar-pilla · 2024-09-10T14:41:33Z

> thanks Ravi! using Gitpod, filepath is showing as N/A for me 👍 but still shows MemoryDataset, maybe i did something wrong?

Thanks for reviewing. Yes, these are actually MemoryDatasets (as they are not registered in the DataCatalog). We show UnavailableDataset for datasets that are registered in the DataCatalog but whose dependency is not installed.

For example, companies is registered in the DataCatalog as below, but its type, pandas.CSVDataset, is a missing dependency, so the companies metadata will show the dataset type as UnavailableDataset. I hope this is clear. Thank you.

companies:
  type: pandas.CSVDataset
  filepath: ${_base_location}/01_raw/companies.csv
  metadata:
    kedro-viz:
      layer: raw
      preview_args:
        nrows: 5
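
The rule described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the kedro-viz implementation: `displayed_dataset_type` and its `available_types` parameter are hypothetical names, and the catalog is modelled as a plain dict.

```python
def displayed_dataset_type(name, catalog_config, available_types):
    """Return the dataset type that the metadata panel would show.

    name            -- dataset name used in the pipeline
    catalog_config  -- dict of catalog entries (hypothetical plain-dict model)
    available_types -- set of dataset types whose dependencies are installed
                       (hypothetical stand-in for a real import check)
    """
    entry = catalog_config.get(name)
    if entry is None:
        # Not registered in the catalog: it really is an in-memory dataset.
        return "MemoryDataset"
    if entry["type"] not in available_types:
        # Registered, but the dataset's dependency is not installed.
        return "UnavailableDataset"
    # Registered and importable: show the declared type.
    return entry["type"]


catalog = {"companies": {"type": "pandas.CSVDataset"}}

# Dependency missing: companies is shown as UnavailableDataset.
print(displayed_dataset_type("companies", catalog, available_types=set()))
# A dataset that is not in the catalog stays a MemoryDataset.
print(displayed_dataset_type("model_input", catalog, available_types=set()))
```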

stephkaiser · 2024-09-10T15:13:25Z

@ravi-kumar-pilla understood, thank you!

rashidakanchwala

LGTM. thanks! <3

ravi-kumar-pilla added 10 commits July 28, 2024 11:02

sync remote

8830cab

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

0a5d1ae

merge main

bf47b47

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

13e905b

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

3d6326c

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

3d3bdda

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

1abfc43

Merge branch 'main' of https://github.com/kedro-org/kedro-viz

1461a3c

fallback dataset for lite

106cc5e

update file permissions

7d37475

ravi-kumar-pilla changed the title ~~Chore/lite metatdata change~~ Add UnavailableDataset as a default dataset for lite mode Sep 5, 2024

ravi-kumar-pilla added 3 commits September 5, 2024 15:22

update release note

af8f06d

fix tests

ade2cb3

Merge branch 'main' into chore/lite-metatdata-change

5bb8a86

ravi-kumar-pilla marked this pull request as ready for review September 5, 2024 21:05

ravi-kumar-pilla requested review from rashidakanchwala, astrojuanlu and jitu5 as code owners September 5, 2024 21:05

ravi-kumar-pilla requested review from stephkaiser and removed request for astrojuanlu September 5, 2024 21:05

Merge branch 'main' into chore/lite-metatdata-change

a327044

stephkaiser approved these changes Sep 10, 2024

Merge branch 'main' into chore/lite-metatdata-change

0f14720

rashidakanchwala approved these changes Sep 10, 2024

ravi-kumar-pilla merged commit c90e723 into main Sep 10, 2024
25 checks passed

ravi-kumar-pilla deleted the chore/lite-metatdata-change branch September 10, 2024 21:04
