Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add store: true flag on each Singer tap to allow storing extracted data in internal storage for use in downstream task (i.e. without requiring a target task) #41

Open
anna-geller opened this issue Oct 11, 2023 · 0 comments
Labels
area/plugin Plugin-related issue or feature request enhancement New feature or request good first issue Great issue for new contributors

Comments

@anna-geller
Copy link
Member

anna-geller commented Oct 11, 2023

Extend each tap to allow storing data extracted from a tap (without schema information) to internal storage.

id: postgres_to_ion
namespace: dev
  
tasks:
  - id: extract
    type: io.kestra.plugin.singer.taps.PipelinewisePostgres
    store: true # ⬅️ THIS 
    docker:
      image: python:3.8
    host: localhost
    port: 5432
    dbName: postgres
    username: postgres
    password: "{{ secret('DB_PASSWORD') }}"
    streamsConfigurations:
      - replicationMethod: FULL_TABLE # FULL_TABLE, INCREMENTAL, LOG_BASED
        selected: true

This task might either return:

  1. the schema + data in separate files
  2. one file with data converted to the proper types.

The latter is preferable.

Ofc, this example is silly, as for Postgres there are already dedicated Query and CopyOut tasks, but for more complex source systems, this opens up the doors to useful patterns e.g. retrieving data from a specific system and then running tasks ForEachItem.

Not a high priority for now, but worth keeping in mind the possibility.

@anna-geller anna-geller added the enhancement New feature or request label Oct 11, 2023
@anna-geller anna-geller changed the title Add Ion target task to allow storing extracted data in Kestra's internal storage and using it in downstream task (i.e. without requiring a Singer target) Add store: true flag on each Singer tap to allow storing extracted data in internal storage for use in downstream task (i.e. without requiring a target task) Oct 11, 2023
@anna-geller anna-geller added this to the v0.19.0 milestone Dec 5, 2023
@anna-geller anna-geller added kind/pending-feedback Idea waiting for user feedback and removed enhancement New feature or request labels Mar 26, 2024
@anna-geller anna-geller removed this from the v0.19.0 milestone Mar 26, 2024
@anna-geller anna-geller added enhancement New feature or request and removed kind/pending-feedback Idea waiting for user feedback labels Aug 13, 2024
@anna-geller anna-geller added the area/plugin Plugin-related issue or feature request label Aug 21, 2024
@tchiotludo tchiotludo added good first issue Great issue for new contributors and removed kind/good-first-issue labels Oct 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/plugin Plugin-related issue or feature request enhancement New feature or request good first issue Great issue for new contributors
Projects
Status: Backlog
Status: Planned
Development

No branches or pull requests

2 participants