Relational directives #641

saimukkamala · 2023-05-24T06:06:07Z

No description provided.

iam-divya · 2023-05-24T13:53:13Z

wrangler-transform/src/main/java/io/cdap/wrangler/Wrangler.java

+      }
+      // currently supporting only drop column
+      // SQL will be returned as "DROP COLUMN col1, col2"
+      String sql = ((RelationalDirective) directive).getSQL();


Can we avoid invoking execute on directives here and offload directive execution to RecipePipelineExecutor instead? We could introduce a new function there for executing a RelationalDirective.

saimukkamala · 2023-05-25

That's a good point. I will look into offloading directive execution to RecipePipelineExecutor.
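The suggestion in this thread can be pictured with a minimal sketch. It is not the actual Wrangler code: `Relation`, `RelationalDirective.transform`, `Drop`, and `PipelineExecutor.executeRelational` are hypothetical stand-ins, chosen only to show the transform no longer parsing SQL or calling directives itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

class RelationalExecutorSketch {

  // Hypothetical stand-in for a relation: it only tracks column names.
  static final class Relation {
    private final List<String> columns;

    Relation(List<String> columns) {
      this.columns = Collections.unmodifiableList(new ArrayList<>(columns));
    }

    // Returns a new relation without the given column; the input is unchanged.
    Relation dropColumn(String name) {
      List<String> next = new ArrayList<>(columns);
      next.remove(name);
      return new Relation(next);
    }

    List<String> columns() {
      return columns;
    }
  }

  // Hypothetical interface: the directive applies itself to a relation.
  interface RelationalDirective {
    Relation transform(Relation input);
  }

  // Assumed shape of a drop directive that owns its own column list.
  static final class Drop implements RelationalDirective {
    private final List<String> toDrop;

    Drop(String... toDrop) {
      this.toDrop = Arrays.asList(toDrop);
    }

    @Override
    public Relation transform(Relation input) {
      Relation current = input;
      for (String col : toDrop) {
        current = current.dropColumn(col);
      }
      return current;
    }
  }

  // Stand-in for the executor: it owns the loop over directives,
  // so the calling transform only hands over the input relation.
  static final class PipelineExecutor {
    private final List<RelationalDirective> directives;

    PipelineExecutor(List<RelationalDirective> directives) {
      this.directives = directives;
    }

    Relation executeRelational(Relation input) {
      Relation current = input;
      for (RelationalDirective directive : directives) {
        current = directive.transform(current);
      }
      return current;
    }
  }
}
```

With this shape the caller would reduce to a single `executor.executeRelational(relation)` call, and helpers such as `getColumnsOfDropSQL` would no longer be needed outside the directive.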

iam-divya · 2023-05-24T14:00:27Z

wrangler-transform/src/main/java/io/cdap/wrangler/Wrangler.java

+      String sql = ((RelationalDirective) directive).getSQL();
+      List<String> cols = getColumnsOfDropSQL(sql);
+      for (String col : cols) {
+        filteredRelation = filteredRelation.dropColumn(col);


We should move this logic into the directive. Each directive should return a SparkSQLDataset that is passed on to the next one, and each directive should invoke SparkSQLEngine.transform to execute.
Building just one SQL statement per directive should be good enough. My understanding is that SparkSQLEngine internally chains the SQL execution into one map operation; please verify that.
There is already a comment about this: https://github.com/cdapio/cdap/blob/1cc26e664a22977fb39f29ea03a46ee4ab531f92/cdap-app-templates/cdap-etl/hydrator-spark-core-base/src/main/java/io/cdap/cdap/etl/spark/batch/SparkSQLEngine.java#L193

saimukkamala · 2023-05-25

Agreed. These changes are only a POC. I will move the logic into the directives.
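As a rough illustration of "one SQL expression per directive, chained", here is a sketch under stated assumptions. It does not use the real SparkSQLEngine or CDAP relational API: `Relation`, `mapColumn`, `toSelectSql`, and the `drop`/`uppercase`/`trim` helpers are hypothetical. Each directive contributes a single expression, and the relation composes them so that one SELECT could be rendered at the end.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;
import java.util.function.UnaryOperator;

class ChainedDirectiveSketch {

  // Hypothetical relation: maps each output column to the SQL expression producing it.
  static final class Relation {
    private final Map<String, String> exprs;

    private Relation(Map<String, String> exprs) {
      this.exprs = exprs;
    }

    // Starts with every column mapped to itself.
    static Relation of(String... columns) {
      Map<String, String> m = new LinkedHashMap<>();
      for (String c : columns) {
        m.put(c, c);
      }
      return new Relation(m);
    }

    Relation dropColumn(String column) {
      Map<String, String> m = new LinkedHashMap<>(exprs);
      m.remove(column);
      return new Relation(m);
    }

    // Wraps the column's current expression, so successive directives compose.
    Relation mapColumn(String column, UnaryOperator<String> wrap) {
      Map<String, String> m = new LinkedHashMap<>(exprs);
      m.computeIfPresent(column, (name, expr) -> wrap.apply(expr));
      return new Relation(m);
    }

    // Renders the accumulated expressions as one SELECT statement.
    String toSelectSql(String table) {
      StringJoiner cols = new StringJoiner(", ");
      exprs.forEach((name, expr) -> cols.add(expr.equals(name) ? name : expr + " AS " + name));
      return "SELECT " + cols + " FROM " + table;
    }
  }

  // Hypothetical directive contract: relation in, relation out.
  interface Directive {
    Relation transform(Relation input);
  }

  static Directive drop(String col) {
    return r -> r.dropColumn(col);
  }

  static Directive uppercase(String col) {
    return r -> r.mapColumn(col, e -> "UPPER(" + e + ")");
  }

  static Directive trim(String col) {
    return r -> r.mapColumn(col, e -> "TRIM(" + e + ")");
  }

  // Passes the output of each directive to the next one.
  static Relation apply(Relation input, Directive... recipe) {
    Relation current = input;
    for (Directive d : recipe) {
      current = d.transform(current);
    }
    return current;
  }
}
```

For example, applying `drop("tmp")`, `uppercase("name")`, and `trim("name")` to a relation over `id, name, tmp` would render as a single statement selecting `id` and `TRIM(UPPER(name)) AS name`. Whether the real engine collapses chained transforms in the same way is the part the reviewer asks to verify.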

iam-divya · 2023-05-24T14:04:11Z

wrangler-transform/widgets/Wrangler-transform.json

@@ -73,6 +73,24 @@
        }
      ]
    },
+    {
+      "label": "RelationalDirectives",


Do you want to keep these changes behind a feature flag? Otherwise we will have to move this code to a branch and keep syncing that branch with develop and other future changes.

saimukkamala · 2023-05-25

We will add a feature flag. Until then, we won't merge to develop.
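A feature-flag gate of the kind discussed here could look roughly like the sketch below. The flag name, the `Map`-based flag lookup, and the path labels are all assumptions made for illustration; the PR's actual flag and the way CDAP exposes feature flags are not shown in this thread.

```java
import java.util.Map;

class FeatureFlagSketch {

  // Hypothetical flag name; the real one is not given in this thread.
  static final String RELATIONAL_FLAG = "wrangler.relational.directives.enabled";

  // Treats a missing flag as disabled, so the existing behavior stays the default.
  static boolean isRelationalEnabled(Map<String, String> flags) {
    return Boolean.parseBoolean(flags.getOrDefault(RELATIONAL_FLAG, "false"));
  }

  // Uses the relational path only when the flag is on and every directive supports it.
  static String chooseExecutionPath(Map<String, String> flags, boolean allDirectivesRelational) {
    return isRelationalEnabled(flags) && allDirectivesRelational ? "relational" : "row-by-row";
  }
}
```

Keeping the check in one function means the new code path can live on develop without changing behavior until the flag is switched on.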

saimukkamala added 5 commits May 19, 2023 13:32

testing

a161c7c

poc for running on spark sql

e1fbdd5

add RelationalDirective interface to system directive registry

0efb2d3

fixing bug

e6603da

fixing registry nullPointer exception

400892a

iam-divya reviewed May 24, 2023

saimukkamala and others added 24 commits May 30, 2023 13:39

move relation execution into directive

b4dd875

Case transform directives

aa99ccd

Implement trim space directives

6a736c9

Implement rename directive

9e492e3

implement keep and copy directives

ddf5f7a

Implement merge directive

1e0eda0

Undo changes in Lower.execute

2cdc2b3

Implement set-type directive

cd4c680

Add utility class for set-type sql expression

b25accf

Implement ChangeColumnCase Directive

2bb3fe2

Move getExpressionfactory() to the Directive interface

f7c5bd3

Move generateColumnExpMap to Directive.java

6d4093b

Clean up code

376e313

Move sql expression generator functions to a new util class

c3002ef

Change util class name

51d3d05

Fix checkstyle errors

31e6f84

Changes to set type util function

e217358

Fix rename directive implementation

c5165f1

Merge branch 'develop' into relational-directives

67755cf

Add UI toggle to wrangler

7cb8aa3

Fix checkstyle error

1adbb2d

Implement swap directive

2a1c2b7

Refactor execution logic

19518e6

Implement filter directives

35fbf93

shrverma and others added 27 commits July 14, 2023 15:08

Change UI toggle

0158b38

Move feature flag checks to separate function

cb5eb7f

Implement SetRecordDelimiter directive

8f926a5

Implement split-email directive

68785b5

Implement transformation directives

6404c8d

Refactor code

ca89034

Implement UUID, split-rows and JSON-object directives

09aa10e

Remove row filter directive implementation

278a30e

Implement fill-null-or-empty

480f2aa

Implement URL encoding and decoding directives

7f3f3cb

Move partially supported directives

63a5932

Implement fixed-length-parser

69e8527

Update Directive.java

3ee71d7

Update ChangeColCaseNames.java

016c989

Merge pull request #646 from data-integrations/sql-directives

5899112

Column Transform directives

Merge branch 'relational-directives' into UI-change-wrangler

ff57c83

Merge pull request #648 from data-integrations/UI-change-wrangler

1e1d898

UI change wrangler

Add directiverelationaltransform interface

7ebc4c3

Refactor execution logic

6f7d14d

Refactor code

29eda87

Add sql directive validation

a656f47

Refactor code

5ce676d

Refactor code

b5ce529

Fix class not found error

ed0e9a4

Merge pull request #653 from data-integrations/Directive-validation

36326f4

Directive validation

Remove extra function

bec9679

Merge pull request #651 from data-integrations/sql-temp

39e360f

Transformation and Row Directives

saimukkamala commented May 24, 2023

iam-divya May 24, 2023

saimukkamala May 25, 2023

iam-divya May 24, 2023

saimukkamala May 25, 2023

iam-divya May 24, 2023

saimukkamala May 25, 2023
